ID A0A903XYZ8_ANOGA Unreviewed; 1142 AA.
AC A0A903XYZ8;
DT 22-FEB-2023, integrated into UniProtKB/TrEMBL.
DT 22-FEB-2023, sequence version 1.
DT 28-JAN-2026, entry version 14.
DE SubName: Full=Multiplexin {ECO:0000313|EnsemblMetazoa:AGAP029898-PA};
OS Anopheles gambiae (African malaria mosquito).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Anophelinae; Anopheles.
OX NCBI_TaxID=7165 {ECO:0000313|EnsemblMetazoa:AGAP029898-PA, ECO:0000313|Proteomes:UP000007062};
RN [1] {ECO:0000313|EnsemblMetazoa:AGAP029898-PA, ECO:0000313|Proteomes:UP000007062}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=PEST {ECO:0000313|EnsemblMetazoa:AGAP029898-PA,
RC ECO:0000313|Proteomes:UP000007062};
RX PubMed=12364791; DOI=10.1126/science.1076181;
RA Holt R.A., Subramanian G.M., Halpern A., Sutton G.G., Charlab R.,
RA Nusskern D.R., Wincker P., Clark A.G., Ribeiro J.M., Wides R.,
RA Salzberg S.L., Loftus B., Yandell M., Majoros W.H., Rusch D.B., Lai Z.,
RA Kraft C.L., Abril J.F., Anthouard V., Arensburger P., Atkinson P.W.,
RA Baden H., de Berardinis V., Baldwin D., Benes V., Biedler J., Blass C.,
RA Bolanos R., Boscus D., Barnstead M., Cai S., Center A., Chaturverdi K.,
RA Christophides G.K., Chrystal M.A., Clamp M., Cravchik A., Curwen V.,
RA Dana A., Delcher A., Dew I., Evans C.A., Flanigan M.,
RA Grundschober-Freimoser A., Friedli L., Gu Z., Guan P., Guigo R.,
RA Hillenmeyer M.E., Hladun S.L., Hogan J.R., Hong Y.S., Hoover J.,
RA Jaillon O., Ke Z., Kodira C., Kokoza E., Koutsos A., Letunic I.,
RA Levitsky A., Liang Y., Lin J.J., Lobo N.F., Lopez J.R., Malek J.A.,
RA McIntosh T.C., Meister S., Miller J., Mobarry C., Mongin E., Murphy S.D.,
RA O'Brochta D.A., Pfannkoch C., Qi R., Regier M.A., Remington K., Shao H.,
RA Sharakhova M.V., Sitter C.D., Shetty J., Smith T.J., Strong R., Sun J.,
RA Thomasova D., Ton L.Q., Topalis P., Tu Z., Unger M.F., Walenz B., Wang A.,
RA Wang J., Wang M., Wang X., Woodford K.J., Wortman J.R., Wu M., Yao A.,
RA Zdobnov E.M., Zhang H., Zhao Q., Zhao S., Zhu S.C., Zhimulev I.,
RA Coluzzi M., della Torre A., Roth C.W., Louis C., Kalush F., Mural R.J.,
RA Myers E.W., Adams M.D., Smith H.O., Broder S., Gardner M.J., Fraser C.M.,
RA Birney E., Bork P., Brey P.T., Venter J.C., Weissenbach J., Kafatos F.C.,
RA Collins F.H., Hoffman S.L.;
RT "The genome sequence of the malaria mosquito Anopheles gambiae.";
RL Science 298:129-149(2002).
RN [2] {ECO:0000313|EnsemblMetazoa:AGAP029898-PA, ECO:0000313|Proteomes:UP000007062}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=PEST {ECO:0000313|EnsemblMetazoa:AGAP029898-PA,
RC ECO:0000313|Proteomes:UP000007062};
RX PubMed=14747013; DOI=10.1016/j.pt.2003.11.003;
RA Mongin E., Louis C., Holt R.A., Birney E., Collins F.H.;
RT "The Anopheles gambiae genome: an update.";
RL Trends Parasitol. 20:49-52(2004).
RN [3] {ECO:0000313|EnsemblMetazoa:AGAP029898-PA}
RP IDENTIFICATION.
RC STRAIN=PEST {ECO:0000313|EnsemblMetazoa:AGAP029898-PA};
RG EnsemblMetazoa;
RL Submitted (OCT-2022) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AAAB01008960; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR AlphaFoldDB; A0A903XYZ8; -.
DR EnsemblMetazoa; AGAP029898-RA; AGAP029898-PA; AGAP029898.
DR VEuPathDB; VectorBase:AGAMI1_000748; -.
DR Proteomes; UP000007062; Chromosome 2L.
DR GO; GO:0005581; C:collagen trimer; IEA:UniProtKB-KW.
DR CDD; cd00247; Endostatin-like; 1.
DR FunFam; 3.40.1620.70:FF:000001; Multiplexin collagen isoform Ap3; 1.
DR Gene3D; 2.60.120.200; -; 1.
DR Gene3D; 3.40.1620.70; -; 1.
DR Gene3D; 3.10.100.10; Mannose-Binding Protein A, subunit A; 1.
DR InterPro; IPR016186; C-type_lectin-like/link_sf.
DR InterPro; IPR008160; Collagen.
DR InterPro; IPR050149; Collagen_superfamily.
DR InterPro; IPR010515; Collagenase_NC10/endostatin.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR016187; CTDL_fold.
DR InterPro; IPR045463; XV/XVIII_trimerization_dom.
DR PANTHER; PTHR24023; COLLAGEN ALPHA; 1.
DR PANTHER; PTHR24023:SF1082; COLLAGEN TRIPLE HELIX REPEAT; 1.
DR Pfam; PF01391; Collagen; 3.
DR Pfam; PF20010; Collagen_trimer; 1.
DR Pfam; PF06482; Endostatin; 1.
DR SUPFAM; SSF56436; C-type lectin-like; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE 4: Predicted;
KW Collagen {ECO:0000256|ARBA:ARBA00023119};
KW Reference proteome {ECO:0000313|Proteomes:UP000007062};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..20
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 21..1142
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5037080011"
FT DOMAIN 858..906
FT /note="Collagen type XV/XVIII trimerization"
FT /evidence="ECO:0000259|Pfam:PF20010"
FT DOMAIN 943..1108
FT /note="Collagenase NC10/endostatin"
FT /evidence="ECO:0000259|Pfam:PF06482"
FT REGION 252..337
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 378..800
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 816..847
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 283..298
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 302..320
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 322..332
FT /note="Gly residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 445..456
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 482..506
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 623..638
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 654..663
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 672..681
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 714..724
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 749..765
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 824..834
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1142 AA; 120038 MW; 208BA5084F895F27 CRC64;
MKWLYGTVVT LICIAHRVTS SELNIFGGQG IRDALAERDL MGAIEIPLQN GVKFVDGLDG
FPAFGVTSEA DLKSPYKMIL SDHLQDFALI ATVRPQSSSG GWVFSVVNSL DTVVQLGLLL
EPTAGGDQWN ATLYYTDAKK ERISQPLASF QVPYGKSWMK MIFKVLPDQV VFYYNCLEAG
VVPVKKEPRK LVFDSASTVY IGQAGPVLKQ KFEGTFLFLK IYGYPEIVKT HCNRTSLPID
EEIESTDEFG DDFNSEMVYD QSGDDDGFNE PPMISPPPPE YGYRLKGDKG ERGTKGESIR
GPPGPPGPQG PPGPPGPPGV AGPKGGGAFD GDGSGDELKR LSQLYDTLPN RKHGGQCFCN
ASLIIEELKM DSKLREYLRG PQGMPGKEGK TGAPGLTGVT GPQGERGASG PKGDKGDRGD
QGAAGPEGLH GSKGEPGLDG APGVQGPPGP PGPPGLPENY DESMLGAPIQ GMRGGSPGLK
GEPGEKGEIG FPGEKGEHGT KGDRGDPGLT GAKGERGHQG AHGQTGPKGP PGIPGIPGLP
GQTGASGPKG EKGNTGESGP PGPPGPPGMV MHTEGGRNGT TDQCQCQPGP PGPPGARGPS
GYDGAPGLPG ETGLPGHPGL PGDKGERGLP GPKGEKGPEF IINENAAFNS SRANKGEKGE
RGQRGRRGKT GPPGPIGPPG KPGGMGETWP GRPGPKGDQG PKGEKGDSMA MRGLKGDKGE
RGMNGRDGLP GPPGLPAASG DGGVQYIPMP GPPGPPGPPG QPGPPGLSIV GEKGEPGMDS
RSPFYSDSQH GFYGRPGGRS SLDELKALRE LKHHKDYEDS TLGPPGPPGP PGPAGRPLHD
SDEIPNSYGA NVRIVPGAVT FQNAETMSKM SSTTPVGTLA YIIDEEALLV RVNKGWQYIA
LGTLVPIATV PPPTTTIVPP QRADLQASNL IHNLPQPGDG SVLRMAALNE PYSGDMQGIR
GADFACYRQA RRAGLLGTFR AFLSSRIQNL DSIVRIADRE LPVVNTRGDV LFNSWNGIFN
GQGGFFSQAP RIYSFSGKNV LTDMAWPQKL VWHGSSAHGE RAIDTYCDAW HSQTPDKVGL
ASSLLGNKLL DQERYSCDNR FVVLCVEAVP QDRRRKRRDT TSQHEFANEK EYSQYLQSIS
AL
//