Database: UniProt
Entry: A0A903XYZ8_ANOGA
ID   A0A903XYZ8_ANOGA        Unreviewed;      1142 AA.
AC   A0A903XYZ8;
DT   22-FEB-2023, integrated into UniProtKB/TrEMBL.
DT   22-FEB-2023, sequence version 1.
DT   28-JAN-2026, entry version 14.
DE   SubName: Full=Multiplexin {ECO:0000313|EnsemblMetazoa:AGAP029898-PA};
OS   Anopheles gambiae (African malaria mosquito).
OC   Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC   Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC   Anophelinae; Anopheles.
OX   NCBI_TaxID=7165 {ECO:0000313|EnsemblMetazoa:AGAP029898-PA, ECO:0000313|Proteomes:UP000007062};
RN   [1] {ECO:0000313|EnsemblMetazoa:AGAP029898-PA, ECO:0000313|Proteomes:UP000007062}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=PEST {ECO:0000313|EnsemblMetazoa:AGAP029898-PA,
RC   ECO:0000313|Proteomes:UP000007062};
RX   PubMed=12364791; DOI=10.1126/science.1076181;
RA   Holt R.A., Subramanian G.M., Halpern A., Sutton G.G., Charlab R.,
RA   Nusskern D.R., Wincker P., Clark A.G., Ribeiro J.M., Wides R.,
RA   Salzberg S.L., Loftus B., Yandell M., Majoros W.H., Rusch D.B., Lai Z.,
RA   Kraft C.L., Abril J.F., Anthouard V., Arensburger P., Atkinson P.W.,
RA   Baden H., de Berardinis V., Baldwin D., Benes V., Biedler J., Blass C.,
RA   Bolanos R., Boscus D., Barnstead M., Cai S., Center A., Chaturverdi K.,
RA   Christophides G.K., Chrystal M.A., Clamp M., Cravchik A., Curwen V.,
RA   Dana A., Delcher A., Dew I., Evans C.A., Flanigan M.,
RA   Grundschober-Freimoser A., Friedli L., Gu Z., Guan P., Guigo R.,
RA   Hillenmeyer M.E., Hladun S.L., Hogan J.R., Hong Y.S., Hoover J.,
RA   Jaillon O., Ke Z., Kodira C., Kokoza E., Koutsos A., Letunic I.,
RA   Levitsky A., Liang Y., Lin J.J., Lobo N.F., Lopez J.R., Malek J.A.,
RA   McIntosh T.C., Meister S., Miller J., Mobarry C., Mongin E., Murphy S.D.,
RA   O'Brochta D.A., Pfannkoch C., Qi R., Regier M.A., Remington K., Shao H.,
RA   Sharakhova M.V., Sitter C.D., Shetty J., Smith T.J., Strong R., Sun J.,
RA   Thomasova D., Ton L.Q., Topalis P., Tu Z., Unger M.F., Walenz B., Wang A.,
RA   Wang J., Wang M., Wang X., Woodford K.J., Wortman J.R., Wu M., Yao A.,
RA   Zdobnov E.M., Zhang H., Zhao Q., Zhao S., Zhu S.C., Zhimulev I.,
RA   Coluzzi M., della Torre A., Roth C.W., Louis C., Kalush F., Mural R.J.,
RA   Myers E.W., Adams M.D., Smith H.O., Broder S., Gardner M.J., Fraser C.M.,
RA   Birney E., Bork P., Brey P.T., Venter J.C., Weissenbach J., Kafatos F.C.,
RA   Collins F.H., Hoffman S.L.;
RT   "The genome sequence of the malaria mosquito Anopheles gambiae.";
RL   Science 298:129-149(2002).
RN   [2] {ECO:0000313|EnsemblMetazoa:AGAP029898-PA, ECO:0000313|Proteomes:UP000007062}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=PEST {ECO:0000313|EnsemblMetazoa:AGAP029898-PA,
RC   ECO:0000313|Proteomes:UP000007062};
RX   PubMed=14747013; DOI=10.1016/j.pt.2003.11.003;
RA   Mongin E., Louis C., Holt R.A., Birney E., Collins F.H.;
RT   "The Anopheles gambiae genome: an update.";
RL   Trends Parasitol. 20:49-52(2004).
RN   [3] {ECO:0000313|EnsemblMetazoa:AGAP029898-PA}
RP   IDENTIFICATION.
RC   STRAIN=PEST {ECO:0000313|EnsemblMetazoa:AGAP029898-PA};
RG   EnsemblMetazoa;
RL   Submitted (OCT-2022) to UniProtKB.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AAAB01008960; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   AlphaFoldDB; A0A903XYZ8; -.
DR   EnsemblMetazoa; AGAP029898-RA; AGAP029898-PA; AGAP029898.
DR   VEuPathDB; VectorBase:AGAMI1_000748; -.
DR   Proteomes; UP000007062; Chromosome 2L.
DR   GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR   CDD; cd00247; Endostatin-like; 1.
DR   FunFam; 3.40.1620.70:FF:000001; Multiplexin collagen isoform Ap3; 1.
DR   Gene3D; 2.60.120.200; -; 1.
DR   Gene3D; 3.40.1620.70; -; 1.
DR   Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR   InterPro; IPR016186; C-type_lectin-like/link_sf.
DR   InterPro; IPR008160; Collagen.
DR   InterPro; IPR050149; Collagen_superfamily.
DR   InterPro; IPR010515; Collagenase_NC10/endostatin.
DR   InterPro; IPR013320; ConA-like_dom_sf.
DR   InterPro; IPR016187; CTDL_fold.
DR   InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR   PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR   PANTHER; PTHR24023:SF1082; COLLAGEN TRIPLE HELIX REPEAT; 1.
DR   Pfam; PF01391; Collagen; 3.
DR   Pfam; PF20010; Collagen_trimer; 1.
DR   Pfam; PF06482; Endostatin; 1.
DR   SUPFAM; SSF56436; C-type lectin-like; 1.
DR   SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE   4: Predicted;
KW   Collagen {ECO:0000256|ARBA:ARBA00023119};
KW   Reference proteome {ECO:0000313|Proteomes:UP000007062};
KW   Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..20
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           21..1142
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5037080011"
FT   DOMAIN          858..906
FT                   /note="Collagen type XV/XVIII trimerization"
FT                   /evidence="ECO:0000259|Pfam:PF20010"
FT   DOMAIN          943..1108
FT                   /note="Collagenase NC10/endostatin"
FT                   /evidence="ECO:0000259|Pfam:PF06482"
FT   REGION          252..337
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          378..800
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          816..847
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        283..298
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        302..320
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        322..332
FT                   /note="Gly residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        445..456
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        482..506
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        623..638
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        654..663
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        672..681
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        714..724
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        749..765
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        824..834
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1142 AA;  120038 MW;  208BA5084F895F27 CRC64;
     MKWLYGTVVT LICIAHRVTS SELNIFGGQG IRDALAERDL MGAIEIPLQN GVKFVDGLDG
     FPAFGVTSEA DLKSPYKMIL SDHLQDFALI ATVRPQSSSG GWVFSVVNSL DTVVQLGLLL
     EPTAGGDQWN ATLYYTDAKK ERISQPLASF QVPYGKSWMK MIFKVLPDQV VFYYNCLEAG
     VVPVKKEPRK LVFDSASTVY IGQAGPVLKQ KFEGTFLFLK IYGYPEIVKT HCNRTSLPID
     EEIESTDEFG DDFNSEMVYD QSGDDDGFNE PPMISPPPPE YGYRLKGDKG ERGTKGESIR
     GPPGPPGPQG PPGPPGPPGV AGPKGGGAFD GDGSGDELKR LSQLYDTLPN RKHGGQCFCN
     ASLIIEELKM DSKLREYLRG PQGMPGKEGK TGAPGLTGVT GPQGERGASG PKGDKGDRGD
     QGAAGPEGLH GSKGEPGLDG APGVQGPPGP PGPPGLPENY DESMLGAPIQ GMRGGSPGLK
     GEPGEKGEIG FPGEKGEHGT KGDRGDPGLT GAKGERGHQG AHGQTGPKGP PGIPGIPGLP
     GQTGASGPKG EKGNTGESGP PGPPGPPGMV MHTEGGRNGT TDQCQCQPGP PGPPGARGPS
     GYDGAPGLPG ETGLPGHPGL PGDKGERGLP GPKGEKGPEF IINENAAFNS SRANKGEKGE
     RGQRGRRGKT GPPGPIGPPG KPGGMGETWP GRPGPKGDQG PKGEKGDSMA MRGLKGDKGE
     RGMNGRDGLP GPPGLPAASG DGGVQYIPMP GPPGPPGPPG QPGPPGLSIV GEKGEPGMDS
     RSPFYSDSQH GFYGRPGGRS SLDELKALRE LKHHKDYEDS TLGPPGPPGP PGPAGRPLHD
     SDEIPNSYGA NVRIVPGAVT FQNAETMSKM SSTTPVGTLA YIIDEEALLV RVNKGWQYIA
     LGTLVPIATV PPPTTTIVPP QRADLQASNL IHNLPQPGDG SVLRMAALNE PYSGDMQGIR
     GADFACYRQA RRAGLLGTFR AFLSSRIQNL DSIVRIADRE LPVVNTRGDV LFNSWNGIFN
     GQGGFFSQAP RIYSFSGKNV LTDMAWPQKL VWHGSSAHGE RAIDTYCDAW HSQTPDKVGL
     ASSLLGNKLL DQERYSCDNR FVVLCVEAVP QDRRRKRRDT TSQHEFANEK EYSQYLQSIS
     AL
//