ID A0A084WQP7_ANOSI Unreviewed; 1764 AA.
AC A0A084WQP7;
DT 29-OCT-2014, integrated into UniProtKB/TrEMBL.
DT 29-OCT-2014, sequence version 1.
DT 27-MAR-2024, entry version 37.
DE SubName: Full=Collagen IV NC1 domain-containing protein {ECO:0000313|EnsemblMetazoa:ASIC020791-PA};
GN ORFNames=ZHAS_00020791 {ECO:0000313|EMBL:KFB52541.1};
OS Anopheles sinensis (Mosquito).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Anophelinae; Anopheles.
OX NCBI_TaxID=74873 {ECO:0000313|EMBL:KFB52541.1};
RN [1] {ECO:0000313|EMBL:KFB52541.1, ECO:0000313|Proteomes:UP000030765}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=24438588; DOI=10.1186/1471-2164-15-42;
RA Zhou D., Zhang D., Ding G., Shi L., Hou Q., Ye Y., Xu Y., Zhou H.,
RA Xiong C., Li S., Yu J., Hong S., Yu X., Zou P., Chen C., Chang X., Wang W.,
RA Lv Y., Sun Y., Ma L., Shen B., Zhu C.;
RT "Genome sequence of Anopheles sinensis provides insight into genetics basis
RT of mosquito competence for malaria parasites.";
RL BMC Genomics 15:42-42(2014).
RN [2] {ECO:0000313|EnsemblMetazoa:ASIC020791-PA}
RP IDENTIFICATION.
RG EnsemblMetazoa;
RL Submitted (MAY-2020) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Membrane {ECO:0000256|ARBA:ARBA00004370}.
CC Secreted, extracellular space, extracellular matrix, basement membrane
CC {ECO:0000256|ARBA:ARBA00004302}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ATLV01025642; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; KE525396; KFB52541.1; -; Genomic_DNA.
DR STRING; 74873.A0A084WQP7; -.
DR EnsemblMetazoa; ASIC020791-RA; ASIC020791-PA; ASIC020791.
DR VEuPathDB; VectorBase:ASIC020791; -.
DR VEuPathDB; VectorBase:ASIS002310; -.
DR OMA; SNNESCG; -.
DR Proteomes; UP000030765; Unassembled WGS sequence.
DR GO; GO:0005604; C:basement membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005201; F:extracellular matrix structural constituent; IEA:InterPro.
DR GO; GO:0048856; P:anatomical structure development; IEA:UniProt.
DR Gene3D; 2.170.240.10; Collagen IV, non-collagenous; 1.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR001442; Collagen_IV_NC.
DR InterPro; IPR036954; Collagen_IV_NC_sf.
DR InterPro; IPR016187; CTDL_fold.
DR PANTHER; PTHR37456:SF5; -; 1.
DR PANTHER; PTHR37456; SI:CH211-266K2.1; 1.
DR Pfam; PF01413; C4; 2.
DR Pfam; PF01391; Collagen; 16.
DR SMART; SM00111; C4; 2.
DR SUPFAM; SSF56436; C-type lectin-like; 2.
DR PROSITE; PS51403; NC1_IV; 1.
PE 4: Predicted;
KW Basement membrane {ECO:0000256|ARBA:ARBA00022869};
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Extracellular matrix {ECO:0000256|ARBA:ARBA00022530};
KW Reference proteome {ECO:0000313|Proteomes:UP000030765};
KW Secreted {ECO:0000256|ARBA:ARBA00022530};
KW Signal {ECO:0000256|ARBA:ARBA00022729}.
FT DOMAIN 1541..1764
FT /note="Collagen IV NC1"
FT /evidence="ECO:0000259|PROSITE:PS51403"
FT REGION 23..59
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 79..716
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 743..952
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 968..1102
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1154..1187
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1211..1231
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1271..1533
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 240..257
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1159..1180
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1764 AA; 174156 MW; EBD37763617C7A23 CRC64;
MYAVGQHMQQ FWSDLSNTGG VGLFKRADDD HPRHTQPRID PSYSIIDPAS GPQGPPSKNC
TSGGCCLPKC FAEKGNRGLP GPAGLKGGKG VRGFPGSEGL PGEKGSKGEP GPMGLMGPKG
DRGRDGLPGY PGIPGTNGVP GNPGTPGLPG RDGCNGTDGL PGLPGLAGSP GPRGYAGSPG
SKGDKGEPAR HPENYNKGQK GEPGLEGLEG LPGPVGEPGV RGYPGTPGDK GVPGLPGAKG
ERGEKGECIK GAKGEKGAKG TEVYGLESPP SSTTGPKGDK GDRGEPGEPG RPGDKGQAGD
RGQVGDRGHK GEKGLPGQPG PRGRDGNFGP VGLPGQKGDR GSEGLHGLKG SIGPKGDAGR
DGYPGQPGIA GPPGAPGGGE GRPGAPGPKG PRGYEGPPGP KGMDGFEGEK GERGQMGPKG
GQGLPGRPGP EGMPGDKGDK GEPGSVGMPG PMGLRGYPGQ PGQDGLRGEP GQPGIGAPGP
KGNPGMAGFP GLKGQKGERG FKGVMGVPGD AKEGRPGASG MPGRDGEKGE PGRPGLPGGK
GERGQKGELG GRCTDCRPGL KGDKGERGYA GEPGRPGVSG APGERGYPGV PGEDGPPGQR
GEPGPKGEQG LIGPPGPSGE PGRDAEITLA QLKPIKGDKG NMGERGLQGV KGEKGYSGPV
GPEGKPGLPG PKGEKGRPGE MGIDGTPGTP GNDGAPGRHG QTIKGEPGLK GNVGYTGDKG
DKGYAGLKGE AGKCADIPQN IVDAIRGPPG AQGDKGAPGI QGERGDKGEM GLQGRTGLPG
NAGPPGAPGP VGPRGLTGNR GEKGNTGPVG PPGSPGRDGM PGMPGVPGPK GSKGDPGLSM
VGPPGPKGNT GFRGPKGDRG GMGDRGDPGL PGSVGYPGEK GDAGVPGQPG FPGEVGPKGE
PGPAGPAGHT GAPGRPGMDG VKGLPGLKGD VGAPGVIGLP GQKGETGQAG NDGLKGFKGH
KGMMGLPGVQ GMRGPQGAKG EPGEKGDRGE IGVKGLPGPS GPPGLLGPKG DKGLSGLPGP
TCLPGVSGEK GDKGYTGPEG PPGEPGAASE KGQKGEPGVP GLRGNDGLPG LAGPTGPKGD
AGVPGYGRPG PQGEKGDVGL TGINGFPGLN GVKGDMGVPG FPGVKGDKGM TGLPGIPGAP
CVDGLPGVEG PMGPRGYDGE KGFKGEPGRI GERGERGEKG DQGLTGPVGL VGLKGDRGLP
GTPGPSATVT AIKGDKGEPG FPGAVGRPGK VGAPGLPGEM GLKGEAGFQG LPGLPGPPGL
NGLPGMKGDM GPIGEKGDTC PVVKGEKGLP GRPGKTGRDG PPGLTGEKGD KGIAGLPGPT
GPPGPPGPLG RQGEKGDRGD TGLIGRPGKD GFPGAPGQRG LPGPQGEKGD QGPPGFLGPK
GDKGERGRDG MNGMNGPQGL KGERGLPGLE GVAGLPGMVG EKGDRGLPGM AGLSGPPGEK
GQKGETPQLP PQRKGPPGPP GYNGQKGDKG LPGLAGPPGI PGAPGAPGEM GLRGFDGARG
LQGLRGDVGL EGRPGRDGAP GLPGPKGEPG RDCESAPYWS GILVVRHSQT EDIPMCEPGH
LKLWDGYSLL YVDGNDFPHN QDLGSAGSCV RKFSTLPILS CGQNNVCNYA SRNDRTFWLS
TTAPIPMMPV SENEMRPYIS RCAVCEAPAN VIAVHSQSLH IPNCPNGWES MWIGYSFLMH
TAVGHGGGGQ SLSGSGSCLE DFRATPFIEC NGGKGHCHYY ETQTSFWMAT IEDHQQFQRP
EQQTIKAGNL LSRVSRCQVC IRRP
//