ID A0A084WMK4_ANOSI Unreviewed; 887 AA.
AC A0A084WMK4;
DT 29-OCT-2014, integrated into UniProtKB/TrEMBL.
DT 29-OCT-2014, sequence version 1.
DT 28-JAN-2026, entry version 43.
DE SubName: Full=AGAP006516-PA-like protein {ECO:0000313|EMBL:KFB51448.1};
GN ORFNames=ZHAS_00019929 {ECO:0000313|EMBL:KFB51448.1};
OS Anopheles sinensis (Mosquito).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Anophelinae; Anopheles.
OX NCBI_TaxID=74873 {ECO:0000313|EMBL:KFB51448.1};
RN [1] {ECO:0000313|EMBL:KFB51448.1, ECO:0000313|Proteomes:UP000030765}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=24438588; DOI=10.1186/1471-2164-15-42;
RA Zhou D., Zhang D., Ding G., Shi L., Hou Q., Ye Y., Xu Y., Zhou H.,
RA Xiong C., Li S., Yu J., Hong S., Yu X., Zou P., Chen C., Chang X., Wang W.,
RA Lv Y., Sun Y., Ma L., Shen B., Zhu C.;
RT "Genome sequence of Anopheles sinensis provides insight into genetics basis
RT of mosquito competence for malaria parasites.";
RL BMC Genomics 15:42-42(2014).
RN [2] {ECO:0000313|EnsemblMetazoa:ASIC019929-PA}
RP IDENTIFICATION.
RG EnsemblMetazoa;
RL Submitted (MAY-2020) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ATLV01024475; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; ATLV01024476; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; ATLV01024477; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; ATLV01024478; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; KE525352; KFB51448.1; -; Genomic_DNA.
DR STRING; 74873.A0A084WMK4; -.
DR EnsemblMetazoa; ASIC019929-RA; ASIC019929-PA; ASIC019929.
DR VEuPathDB; VectorBase:ASIC019929; -.
DR VEuPathDB; VectorBase:ASIS003329; -.
DR OMA; YSHERPY; -.
DR OrthoDB; 5983381at2759; -.
DR Proteomes; UP000030765; Unassembled WGS sequence.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR GO; GO:0030020; F:extracellular matrix structural constituent conferring tensile strength; IEA:TreeGrafter.
DR GO; GO:0030198; P:extracellular matrix organization; IEA:TreeGrafter.
DR CDD; cd00247; Endostatin-like; 1.
DR Gene3D; 3.40.1620.70; -; 1.
DR Gene3D; 1.20.5.320; 6-Phosphogluconate Dehydrogenase, domain 3; 1.
DR Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR InterPro; IPR016186; C-type_lectin-like/link_sf.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR050149; Collagen_superfamily.
DR InterPro; IPR010515; Collagenase_NC10/endostatin.
DR InterPro; IPR016187; CTDL_fold.
DR InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR PANTHER; PTHR24023:SF1112; COL_CUTICLE_N DOMAIN-CONTAINING PROTEIN-RELATED; 1.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR Pfam; PF01391; Collagen; 3.
DR Pfam; PF20010; Collagen_trimer; 1.
DR Pfam; PF06482; Endostatin; 1.
DR SUPFAM; SSF56436; C-type lectin-like; 1.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Reference proteome {ECO:0000313|Proteomes:UP000030765}.
FT DOMAIN 609..655
FT /note="Collagen type XV/XVIII trimerization"
FT /evidence="ECO:0000259|Pfam:PF20010"
FT DOMAIN 708..873
FT /note="Collagenase NC10/endostatin"
FT /evidence="ECO:0000259|Pfam:PF06482"
FT REGION 17..92
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 147..387
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 401..551
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 567..598
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 25..35
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 41..54
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 58..77
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 199..210
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 236..260
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 375..387
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 406..415
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 465..478
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 500..516
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 575..585
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 887 AA; 92095 MW; CD623079FF6E1169 CRC64;
MLYFGNNFLS YFSLHKASGE DPFEEPPPIS PPPPEYTGYR IKGDKGERGT KGESIRGPPG
PPGPQGPPGP PGPPGSPGPK GGGYFDSDGS GDXXXXXXXL ALLMFRTFFL GHLYDTLPNR
KHAGQCSCNA TLIIEELKMD STLREYLRGP QGMPGKEGKT GSPGLTGDKG DRGDQGAAGP
EGLQGSKGEP GLDGAAGVPG PPGPPGPPGL PENYDESMLG SPIQGMRGGS PGLKGEPGDK
GEIGFPGEKG EHGTKGDRGD PGLTGAKGER GHQGTHGQPG PKGPPGTPGI PGLPGQTGAS
GPKGDKGNTG DSGPPGPPGP PGMVVHSEGK NGTQDCQCQA GPPGPPGARG PPGIDGAPGL
TGETGLPGHP GLPGDKGERG LPGLKGDKGA EFIINENVAF NSSRANKGDK GDRGQRGRRG
KTGPPGPIGP PGKSNMGETW PGRPGPKGDQ GPKGEKGDSA AMRGLKGDKG DRGIDGRDGL
PGPPGLPAAS GDGGVQYIPM PGPPGPPGQP GPPGPPGLSI VGEKGEPGLD SRSPFYSDSQ
HGYYGRPGGR SSLDELKALR ELKHHKDYED STLGPPGPPG PPGPAGRPLH DSDEIPNNFG
ANVRIVPGAV TFQNSETMKK TTATIPVGTL AYIIDEEALL VRVNKGWQYI ALGTLMEIST
APPPTTTNLP PQRSNLQSSN LVNNLPQPVD GSVVSFVPTL SVHLFFSLRM AALNEPYTGD
MQGIRGADFA CYRQARRAGL LGTFRAFLSS RIQNLDSIVR LADRELPVVN TRGDVLFNSW
SSIFGGQGGF FSQAPRIYSF SGKNVLTDMA WPQKLVWHGS SALGERAIDT YCDAWHSPSP
DKVGLASSLL GNKLLDQERY SCDNRFVVLC VEAVPQDRRR KRRDTRR
//