GenomeNet

Database: UniProt
Entry: A0A084WMK4_ANOSI
LinkDB: A0A084WMK4_ANOSI
Original site: A0A084WMK4_ANOSI 
ID   A0A084WMK4_ANOSI        Unreviewed;       887 AA.
AC   A0A084WMK4;
DT   29-OCT-2014, integrated into UniProtKB/TrEMBL.
DT   29-OCT-2014, sequence version 1.
DT   28-JAN-2026, entry version 43.
DE   SubName: Full=AGAP006516-PA-like protein {ECO:0000313|EMBL:KFB51448.1};
GN   ORFNames=ZHAS_00019929 {ECO:0000313|EMBL:KFB51448.1};
OS   Anopheles sinensis (Mosquito).
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC   Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC   Anophelinae; Anopheles.
OX   NCBI_TaxID=74873 {ECO:0000313|EMBL:KFB51448.1};
RN   [1] {ECO:0000313|EMBL:KFB51448.1, ECO:0000313|Proteomes:UP000030765}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=24438588; DOI=10.1186/1471-2164-15-42;
RA   Zhou D., Zhang D., Ding G., Shi L., Hou Q., Ye Y., Xu Y., Zhou H.,
RA   Xiong C., Li S., Yu J., Hong S., Yu X., Zou P., Chen C., Chang X., Wang W.,
RA   Lv Y., Sun Y., Ma L., Shen B., Zhu C.;
RT   "Genome sequence of Anopheles sinensis provides insight into genetics basis
RT   of mosquito competence for malaria parasites.";
RL   BMC Genomics 15:42-42(2014).
RN   [2] {ECO:0000313|EnsemblMetazoa:ASIC019929-PA}
RP   IDENTIFICATION.
RG   EnsemblMetazoa;
RL   Submitted (MAY-2020) to UniProtKB.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; ATLV01024475; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; ATLV01024476; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; ATLV01024477; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; ATLV01024478; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; KE525352; KFB51448.1; -; Genomic_DNA.
DR   STRING; 74873.A0A084WMK4; -.
DR   EnsemblMetazoa; ASIC019929-RA; ASIC019929-PA; ASIC019929.
DR   VEuPathDB; VectorBase:ASIC019929; -.
DR   VEuPathDB; VectorBase:ASIS003329; -.
DR   OMA; YSHERPY; -.
DR   OrthoDB; 5983381at2759; -.
DR   Proteomes; UP000030765; Unassembled WGS sequence.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   GO; GO:0031012; C:extracellular matrix; IEA:TreeGrafter.
DR   GO; GO:0005615; C:extracellular space; IEA:TreeGrafter.
DR   GO; GO:0030020; F:extracellular matrix structural constituent conferring tensile strength; IEA:TreeGrafter.
DR   GO; GO:0030198; P:extracellular matrix organization; IEA:TreeGrafter.
DR   CDD; cd00247; Endostatin-like; 1.
DR   Gene3D; 3.40.1620.70; -; 1.
DR   Gene3D; 1.20.5.320; 6-Phosphogluconate Dehydrogenase, domain 3; 1.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR050149; Collagen_superfamily.
DR   InterPro; IPR010515; Collagenase_NC10/endostatin.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR24023:SF1112; COL_CUTICLE_N DOMAIN-CONTAINING PROTEIN-RELATED; 1.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   Pfam; PF01391; Collagen; 3.
DR   Pfam; PF20010; Collagen_trimer; 1.
DR   Pfam; PF06482; Endostatin; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 1.
PE   4: Predicted;
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Reference proteome {ECO:0000313|Proteomes:UP000030765}.
FT   DOMAIN          609..655
FT                   /note="Collagen type XV/XVIII trimerization"
FT                   /evidence="ECO:0000259|Pfam:PF20010"
FT   DOMAIN          708..873
FT                   /note="Collagenase NC10/endostatin"
FT                   /evidence="ECO:0000259|Pfam:PF06482"
FT   REGION          17..92
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          147..387
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          401..551
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          567..598
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        25..35
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        41..54
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        58..77
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        199..210
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        236..260
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        375..387
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        406..415
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        465..478
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        500..516
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        575..585
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   887 AA;  92095 MW;  CD623079FF6E1169 CRC64;
     MLYFGNNFLS YFSLHKASGE DPFEEPPPIS PPPPEYTGYR IKGDKGERGT KGESIRGPPG
     PPGPQGPPGP PGPPGSPGPK GGGYFDSDGS GDXXXXXXXL ALLMFRTFFL GHLYDTLPNR
     KHAGQCSCNA TLIIEELKMD STLREYLRGP QGMPGKEGKT GSPGLTGDKG DRGDQGAAGP
     EGLQGSKGEP GLDGAAGVPG PPGPPGPPGL PENYDESMLG SPIQGMRGGS PGLKGEPGDK
     GEIGFPGEKG EHGTKGDRGD PGLTGAKGER GHQGTHGQPG PKGPPGTPGI PGLPGQTGAS
     GPKGDKGNTG DSGPPGPPGP PGMVVHSEGK NGTQDCQCQA GPPGPPGARG PPGIDGAPGL
     TGETGLPGHP GLPGDKGERG LPGLKGDKGA EFIINENVAF NSSRANKGDK GDRGQRGRRG
     KTGPPGPIGP PGKSNMGETW PGRPGPKGDQ GPKGEKGDSA AMRGLKGDKG DRGIDGRDGL
     PGPPGLPAAS GDGGVQYIPM PGPPGPPGQP GPPGPPGLSI VGEKGEPGLD SRSPFYSDSQ
     HGYYGRPGGR SSLDELKALR ELKHHKDYED STLGPPGPPG PPGPAGRPLH DSDEIPNNFG
     ANVRIVPGAV TFQNSETMKK TTATIPVGTL AYIIDEEALL VRVNKGWQYI ALGTLMEIST
     APPPTTTNLP PQRSNLQSSN LVNNLPQPVD GSVVSFVPTL SVHLFFSLRM AALNEPYTGD
     MQGIRGADFA CYRQARRAGL LGTFRAFLSS RIQNLDSIVR LADRELPVVN TRGDVLFNSW
     SSIFGGQGGF FSQAPRIYSF SGKNVLTDMA WPQKLVWHGS SALGERAIDT YCDAWHSPSP
     DKVGLASSLL GNKLLDQERY SCDNRFVVLC VEAVPQDRRR KRRDTRR
//
DBGET integrated database retrieval system