GenomeNet

Database: UniProt
Entry: A0A084WQP7_ANOSI
LinkDB: A0A084WQP7_ANOSI
Original site: A0A084WQP7_ANOSI 
ID   A0A084WQP7_ANOSI        Unreviewed;      1764 AA.
AC   A0A084WQP7;
DT   29-OCT-2014, integrated into UniProtKB/TrEMBL.
DT   29-OCT-2014, sequence version 1.
DT   27-MAR-2024, entry version 37.
DE   SubName: Full=Collagen IV NC1 domain-containing protein {ECO:0000313|EnsemblMetazoa:ASIC020791-PA};
GN   ORFNames=ZHAS_00020791 {ECO:0000313|EMBL:KFB52541.1};
OS   Anopheles sinensis (Mosquito).
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC   Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC   Anophelinae; Anopheles.
OX   NCBI_TaxID=74873 {ECO:0000313|EMBL:KFB52541.1};
RN   [1] {ECO:0000313|EMBL:KFB52541.1, ECO:0000313|Proteomes:UP000030765}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=24438588; DOI=10.1186/1471-2164-15-42;
RA   Zhou D., Zhang D., Ding G., Shi L., Hou Q., Ye Y., Xu Y., Zhou H.,
RA   Xiong C., Li S., Yu J., Hong S., Yu X., Zou P., Chen C., Chang X., Wang W.,
RA   Lv Y., Sun Y., Ma L., Shen B., Zhu C.;
RT   "Genome sequence of Anopheles sinensis provides insight into genetics basis
RT   of mosquito competence for malaria parasites.";
RL   BMC Genomics 15:42-42(2014).
RN   [2] {ECO:0000313|EnsemblMetazoa:ASIC020791-PA}
RP   IDENTIFICATION.
RG   EnsemblMetazoa;
RL   Submitted (MAY-2020) to UniProtKB.
CC   -!- SUBCELLULAR LOCATION: Membrane {ECO:0000256|ARBA:ARBA00004370}.
CC       Secreted, extracellular space, extracellular matrix, basement membrane
CC       {ECO:0000256|ARBA:ARBA00004302}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; ATLV01025642; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; KE525396; KFB52541.1; -; Genomic_DNA.
DR   STRING; 74873.A0A084WQP7; -.
DR   EnsemblMetazoa; ASIC020791-RA; ASIC020791-PA; ASIC020791.
DR   VEuPathDB; VectorBase:ASIC020791; -.
DR   VEuPathDB; VectorBase:ASIS002310; -.
DR   OMA; SNNESCG; -.
DR   Proteomes; UP000030765; Unassembled WGS sequence.
DR   GO; GO:0005604; C:basement membrane; IEA:UniProtKB-SubCell.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR   GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR   GO; GO:0048856; P:anatomical structure development; IEA:UniProt.
DR   Gene3D; 2.170.240.10; Collagen IV, non-collagenous; 1.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR001442; Collagen_IV_NC.
DR   InterPro; IPR036954; Collagen_IV_NC_sf.
DR   InterPro; IPR016187; CTDL_fold.
DR   PANTHER; PTHR37456:SF5; -; 1.
DR   PANTHER; PTHR37456; SI:CH211-266K2.1; 1.
DR   Pfam; PF01413; C4; 2.
DR   Pfam; PF01391; Collagen; 16.
DR   SMART; SM00111; C4; 2.
DR   SUPFAM; SSF56436; C-type lectin-like; 2.
DR   PROSITE; PS51403; NC1_IV; 1.
PE   4: Predicted;
KW   Basement membrane {ECO:0000256|ARBA:ARBA00022869};
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW   Reference proteome {ECO:0000313|Proteomes:UP000030765};
KW   Secreted {ECO:0000256|ARBA:ARBA00022530};
KW   Signal {ECO:0000256|ARBA:ARBA00022729}.
FT   DOMAIN          1541..1764
FT                   /note="Collagen IV NC1"
FT                   /evidence="ECO:0000259|PROSITE:PS51403"
FT   REGION          23..59
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          79..716
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          743..952
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          968..1102
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1154..1187
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1211..1231
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1271..1533
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        240..257
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1159..1180
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1764 AA;  174156 MW;  EBD37763617C7A23 CRC64;
     MYAVGQHMQQ FWSDLSNTGG VGLFKRADDD HPRHTQPRID PSYSIIDPAS GPQGPPSKNC
     TSGGCCLPKC FAEKGNRGLP GPAGLKGGKG VRGFPGSEGL PGEKGSKGEP GPMGLMGPKG
     DRGRDGLPGY PGIPGTNGVP GNPGTPGLPG RDGCNGTDGL PGLPGLAGSP GPRGYAGSPG
     SKGDKGEPAR HPENYNKGQK GEPGLEGLEG LPGPVGEPGV RGYPGTPGDK GVPGLPGAKG
     ERGEKGECIK GAKGEKGAKG TEVYGLESPP SSTTGPKGDK GDRGEPGEPG RPGDKGQAGD
     RGQVGDRGHK GEKGLPGQPG PRGRDGNFGP VGLPGQKGDR GSEGLHGLKG SIGPKGDAGR
     DGYPGQPGIA GPPGAPGGGE GRPGAPGPKG PRGYEGPPGP KGMDGFEGEK GERGQMGPKG
     GQGLPGRPGP EGMPGDKGDK GEPGSVGMPG PMGLRGYPGQ PGQDGLRGEP GQPGIGAPGP
     KGNPGMAGFP GLKGQKGERG FKGVMGVPGD AKEGRPGASG MPGRDGEKGE PGRPGLPGGK
     GERGQKGELG GRCTDCRPGL KGDKGERGYA GEPGRPGVSG APGERGYPGV PGEDGPPGQR
     GEPGPKGEQG LIGPPGPSGE PGRDAEITLA QLKPIKGDKG NMGERGLQGV KGEKGYSGPV
     GPEGKPGLPG PKGEKGRPGE MGIDGTPGTP GNDGAPGRHG QTIKGEPGLK GNVGYTGDKG
     DKGYAGLKGE AGKCADIPQN IVDAIRGPPG AQGDKGAPGI QGERGDKGEM GLQGRTGLPG
     NAGPPGAPGP VGPRGLTGNR GEKGNTGPVG PPGSPGRDGM PGMPGVPGPK GSKGDPGLSM
     VGPPGPKGNT GFRGPKGDRG GMGDRGDPGL PGSVGYPGEK GDAGVPGQPG FPGEVGPKGE
     PGPAGPAGHT GAPGRPGMDG VKGLPGLKGD VGAPGVIGLP GQKGETGQAG NDGLKGFKGH
     KGMMGLPGVQ GMRGPQGAKG EPGEKGDRGE IGVKGLPGPS GPPGLLGPKG DKGLSGLPGP
     TCLPGVSGEK GDKGYTGPEG PPGEPGAASE KGQKGEPGVP GLRGNDGLPG LAGPTGPKGD
     AGVPGYGRPG PQGEKGDVGL TGINGFPGLN GVKGDMGVPG FPGVKGDKGM TGLPGIPGAP
     CVDGLPGVEG PMGPRGYDGE KGFKGEPGRI GERGERGEKG DQGLTGPVGL VGLKGDRGLP
     GTPGPSATVT AIKGDKGEPG FPGAVGRPGK VGAPGLPGEM GLKGEAGFQG LPGLPGPPGL
     NGLPGMKGDM GPIGEKGDTC PVVKGEKGLP GRPGKTGRDG PPGLTGEKGD KGIAGLPGPT
     GPPGPPGPLG RQGEKGDRGD TGLIGRPGKD GFPGAPGQRG LPGPQGEKGD QGPPGFLGPK
     GDKGERGRDG MNGMNGPQGL KGERGLPGLE GVAGLPGMVG EKGDRGLPGM AGLSGPPGEK
     GQKGETPQLP PQRKGPPGPP GYNGQKGDKG LPGLAGPPGI PGAPGAPGEM GLRGFDGARG
     LQGLRGDVGL EGRPGRDGAP GLPGPKGEPG RDCESAPYWS GILVVRHSQT EDIPMCEPGH
     LKLWDGYSLL YVDGNDFPHN QDLGSAGSCV RKFSTLPILS CGQNNVCNYA SRNDRTFWLS
     TTAPIPMMPV SENEMRPYIS RCAVCEAPAN VIAVHSQSLH IPNCPNGWES MWIGYSFLMH
     TAVGHGGGGQ SLSGSGSCLE DFRATPFIEC NGGKGHCHYY ETQTSFWMAT IEDHQQFQRP
     EQQTIKAGNL LSRVSRCQVC IRRP
//
DBGET integrated database retrieval system