ID A0A182XFY9_ANOQN Unreviewed; 3253 AA.
AC A0A182XFY9;
DT 07-SEP-2016, integrated into UniProtKB/TrEMBL.
DT 07-SEP-2016, sequence version 1.
DT 27-MAR-2024, entry version 34.
DE SubName: Full=Uncharacterized protein {ECO:0000313|EnsemblMetazoa:AQUA008749-PA};
OS Anopheles quadriannulatus (Mosquito).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Anophelinae; Anopheles.
OX NCBI_TaxID=34691 {ECO:0000313|EnsemblMetazoa:AQUA008749-PA, ECO:0000313|Proteomes:UP000076407};
RN [1] {ECO:0000313|Proteomes:UP000076407}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=SANGQUA {ECO:0000313|Proteomes:UP000076407};
RG The Broad Institute Genomics Platform;
RA Neafsey D.E., Howell P., Walker B., Young S.K., Zeng Q., Gargeya S.,
RA Fitzgerald M., Haas B., Abouelleil A., Allen A.W., Alvarado L.,
RA Arachchi H.M., Berlin A.M., Chapman S.B., Gainer-Dewar J., Goldberg J.,
RA Griggs A., Gujja S., Hansen M., Howarth C., Imamovic A., Ireland A.,
RA Larimer J., McCowan C., Murphy C., Pearson M., Poon T.W., Priest M.,
RA Roberts A., Saif S., Shea T., Sisk P., Sykes S., Wortman J., Nusbaum C.,
RA Birren B.;
RT "The Genome Sequence of Anopheles quadriannulatus QUAD4_A.";
RL Submitted (MAR-2013) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EnsemblMetazoa:AQUA008749-PA}
RP IDENTIFICATION.
RC STRAIN=SANGQUA {ECO:0000313|EnsemblMetazoa:AQUA008749-PA};
RG EnsemblMetazoa;
RL Submitted (MAY-2020) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Secreted {ECO:0000256|ARBA:ARBA00004613}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 34691.A0A182XFY9; -.
DR EnsemblMetazoa; AQUA008749-RA; AQUA008749-PA; AQUA008749.
DR VEuPathDB; VectorBase:AQUA008749; -.
DR Proteomes; UP000076407; Unassembled WGS sequence.
DR GO; GO:0005615; C:extracellular space; IEA:InterPro.
DR GO; GO:0004866; F:endopeptidase inhibitor activity; IEA:InterPro.
DR CDD; cd02897; A2M_2; 2.
DR Gene3D; 1.50.10.20; -; 2.
DR Gene3D; 2.20.130.20; -; 3.
DR Gene3D; 2.60.120.1540; -; 2.
DR Gene3D; 2.60.40.1930; -; 5.
DR Gene3D; 2.60.40.1940; -; 3.
DR Gene3D; 2.60.40.2950; -; 2.
DR Gene3D; 6.20.50.160; -; 1.
DR Gene3D; 2.60.40.690; Alpha-macroglobulin, receptor-binding domain; 2.
DR Gene3D; 2.60.40.10; Immunoglobulins; 4.
DR InterPro; IPR009048; A-macroglobulin_rcpt-bd.
DR InterPro; IPR036595; A-macroglobulin_rcpt-bd_sf.
DR InterPro; IPR011625; A2M_N_BRD.
DR InterPro; IPR041813; A2M_TED.
DR InterPro; IPR047565; Alpha-macroglob_thiol-ester_cl.
DR InterPro; IPR011626; Alpha-macroglobulin_TED.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR001599; Macroglobln_a2.
DR InterPro; IPR019742; MacrogloblnA2_CS.
DR InterPro; IPR002890; MG2.
DR InterPro; IPR041555; MG3.
DR InterPro; IPR040839; MG4.
DR InterPro; IPR008930; Terpenoid_cyclase/PrenylTrfase.
DR PANTHER; PTHR11412:SF176; GH01829P-RELATED; 1.
DR PANTHER; PTHR11412; MACROGLOBULIN / COMPLEMENT; 1.
DR Pfam; PF00207; A2M; 2.
DR Pfam; PF07703; A2M_BRD; 2.
DR Pfam; PF07677; A2M_recep; 2.
DR Pfam; PF01835; MG2; 3.
DR Pfam; PF17791; MG3; 3.
DR Pfam; PF17789; MG4; 2.
DR Pfam; PF07678; TED_complement; 2.
DR SMART; SM01360; A2M; 2.
DR SMART; SM01359; A2M_N_2; 2.
DR SMART; SM01361; A2M_recep; 2.
DR SMART; SM01419; Thiol-ester_cl; 2.
DR SUPFAM; SSF49410; Alpha-macroglobulin receptor domain; 2.
DR SUPFAM; SSF48239; Terpenoid cyclases/Protein prenyltransferases; 2.
DR PROSITE; PS00477; ALPHA_2_MACROGLOBULIN; 2.
PE 4: Predicted;
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Secreted {ECO:0000256|ARBA:ARBA00022525};
KW Signal {ECO:0000256|ARBA:ARBA00022729};
KW Thioester bond {ECO:0000256|ARBA:ARBA00022966}.
FT DOMAIN 773..909
FT /note="Alpha-2-macroglobulin bait region"
FT /evidence="ECO:0000259|SMART:SM01359"
FT DOMAIN 1015..1105
FT /note="Alpha-2-macroglobulin"
FT /evidence="ECO:0000259|SMART:SM01360"
FT DOMAIN 1629..1721
FT /note="Alpha-macroglobulin receptor-binding"
FT /evidence="ECO:0000259|SMART:SM01361"
FT DOMAIN 2170..2305
FT /note="Alpha-2-macroglobulin bait region"
FT /evidence="ECO:0000259|SMART:SM01359"
FT DOMAIN 2478..2569
FT /note="Alpha-2-macroglobulin"
FT /evidence="ECO:0000259|SMART:SM01360"
FT DOMAIN 3096..3185
FT /note="Alpha-macroglobulin receptor-binding"
FT /evidence="ECO:0000259|SMART:SM01361"
FT REGION 3209..3229
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 3253 AA; 362807 MW; A573B5EAF1C93106 CRC64;
LKEGHYQLKV WDKQKQNLLN TTSLERIDRS YLVLFQTDKP AYKPGDRVQF RVLFLLPDTK
PVGQSVRPTI FIADPDRVHM KQWNGVSLSS GVFEGSFQLA EYTSFGRWII TASVNEQIYK
DTFSVEEYTL PLYRVQVQSI PKAYFQCDEP KMSLKLSASF VHGGSVRGNA TVVVRANYNN
YPSQTKEVAR KELFINGTAI VDFPTDVVAK SCDEERNVWF DVIVTESSTG VSYNTTCSYT
VHNAGGVTME VLDGNEAFYP GLAMRLMSVG VTLLTDGYLP DVGFIPHSFG ELARTTSTLD
EDDSVREDFP ETWLWESIRA KKYCSCLRLL IEMFSKGGGM RFGGEVKRTV PDPKDHKDGH
YSIIGARILR PNSVYRCVVS TFDTKSAIVF RISIVAKDKP IATEEITLNS NESRLISFTI
DSIPEEEYEL VAEGLSGLEF KTKSRLDFDN KFCSVLIQTD KSVYKPGDTV RYRVLVLDRS
MKPLPAGDSG MMVYIRDGKG NRIKQWSNAS LGECGVFQAE LTLSTEPVLG EWTINVEVVG
LKESKTFDVD EYVLPTYEVT VESPGYTFLD DELLKVVVNS KYTYGKPVAG ELTVSVKLAS
SMCFRREPTE TSICQKVLPI DGKTVVEFNL KEILSSKTYI RELTIEAEVC ETLTGRTQKG
STTVQLHDER YQVRMIEESS YFPGLPYNAW IQVTNLDGSP VQDGAKEVEI VLRNYNIDLH
KQSSTLDEKG MAQLNVKLDE LDFDYVSVEV KYRGKDYYVQ GITKPRDYEE ALMRVRLSEK
EPTPGKDLTF DVACTKPLQC VSYSLLARGE LLAGGAVKGS EASTTISITI PSTFAMVPRA
KLLVHYISSA GYIVSSYDTV EFKRVFENQI QLTLSKDELK PAETLDIDIR TEKDTFVGLL
AVDQSVLLLK SGNDISRDEV VQQLEMYESA LNCHEFYDKN STSDCKHAGA VLLSNRFIPE
DIWPQGRFFA CSANSMAFGA APMMEACVMK GAIMDSDMAT APVNEPTVRS KFPETWIWES
ISNCKEMESI RKIVPDTITS WIITGFSLSK RHGLGLVDNP SKVNVFMPFF LSIDLPYSVK
LGETIRIPVV VFNYMDKDQL ADVIFYNNDD EFEFVSDIKD QEEKHRQEQI TVPRGTGKTL
TFVLKPTKVG HVTLKITAKC ALAGDGIERQ LLVEPEGLPQ YINKALLVDL RSVKEIKQPF
EVDIPLDAVP DSTKVEVSVI GDVLGSSIEN LDSLIRMPFG CGEQNMLNFV PCIVVLDYLK
ACKRLTVEIE SKAKRCMEIG YQRELTYKHQ DGSFSAFGES DKSGSTWLTA FVAKSFQQAA
KHMTIEEDVI DSALGWLSKV QTADGAFPEV GTICHKDMQG GAGSGIALTA YTVIAFLEHP
KLGEKYKASV DKALTYVKDH ISELDDVYAH ALAAYALQIA DHPLKNEVYA SLLSKSNKQG
DIQWWSKDIP EKNDSDCCWW YRPCSVNVEM SAYGLLATLE ASSAGLEGLP IMKWLVSQRN
DKGGFESTQD TVVGLQALSK MAAQLSSSEA DMSLKEIIPG EQEKCLQVNG GNILVLQKHE
LASNTRKLEM IATGTGCALF QLSYKYNIKD VDNSPRFTLK PEAKQGSIKS CIDLSITTSF
IPKEDQAVSN MAVMEVDMPS GFIVESDTLK QLKQHEMVKK VETKRSDTTV VLYFDNIGEE
AVHLQMSAFQ KHEVENAKPA NVIIYDYYDN SNVMRRSTMF ASKKDPAIRI LSTSSSSLLA
VCLVLGVVLV PAVQCEGHYS IVGAKLLRPN SEYHVAVTNQ DVSEPIRFSL AITDASSVIA
KQEITLNTGE TRLVPFAIGD IPESSYKLVA EGLSGLTFKN ETDLEYQQKS FSVFVQTDKS
IYKPGDTVRF RVLVLDPNTK PLQKADNISV HINDAKANRI KQWKEGKLVK GVFESELTLS
TAPVLGAWTI NVEVLGSKHN KVFEVDEYVL PKFEVTVESP GITTFKDGKV KAIIRSKYTY
GKPVKGEATV SVSPEFQFHY VQPFAKDVIT RMVIPIDGKG SVEFDLREDI HLEGDYSRNI
VIEAVVEEEL TGRKQNASAK VMIYDRRYKM ELVKSDDNFK PGLPYTAWLK VSYQDGAPVQ
DQTNPVEVKQ SSFESTTSVQ NYTLDQNGMA KLEINTEVNS SYINVVGVYL GQEFYLHGIS
KAESDVDAYI RAQVLTEMPL VGKDVLVEVT STSPMKYFTY QLLGRGDVLL SNTIAVPESK
TQSFKFPATF AMVPRAKLVV YYIAPNGDMV SDSKVITFDS ELQNFMKVSL SKEQSKPGQD
VEISISTNPD SYVGLLGVDQ SVLLLKSGND ITKQQVFSEL EKYEERSYGF YRRKKRFAWN
PHAEHRDFST VGAFVMSNAN DPPRSAVLRC CDDTDDLLDI RVAEEDRIGS ALEFAPSTLS
SAPITNNTLP EVPIYKFHRA PNGTVLYTTI EKPRAQKQHH VLVTNTRPPL AGPFAFSRIP
RPHRDIPRLF LSQEIQNTWL FDNTYSGFSG EKTLQKKVPD TITSWIITGF SVNPIYGLGL
TQQPRKLNVF LPFFVSTNLP YSVKRGEVVA IPIVVFNYME DDQTAEVVLH NDEQEFEFAD
VENEVVESNK VELFRQKRLD IASNTGKSVS FMVKPKKLGH ITIKVTAKTK IAGDAVERQL
LVEPEGLPQF INKAAFIDLR AAPEVTKTFE VEIPKNAVPD STRIEVAVIG DVMGSTIQNL
DSLIRMPYGC GEQNMLNFVP NIVVLDYLKA TNKLTANIEA KAKKFMEAGY QRELSYKHRD
GSFSAFGEND KSGSTWLTAF VARSFKQAAN HITIDEGVID KSLEWLSDHQ APNGSFPEVG
VVSHKDMQGG SGSGVALTAY TLIAFLENIN LVDKYKNTIN KAIDYVYRNT ESLDDTYALA
LAAYALQLAD HSSKGLILSK LDTKATTDSD SKWWHKPIPE TEQKNPWYSR PNSVNVEMSA
YGMLAFLEAG LDTDALPIMK WLIGQRNDKG GFQSTQDTVV GLQALAKLAA KITSPNNDVT
LTAKINENQE KRMTVNAENG MILQKFELPS AARNIEIQAT GSGFAVVQLS YKYNMNVTGE
WPRFVLDPQV NANTNPDYLH LSVCASFVPS AGQNVSNMAV MEVGFPSGFT ADSDTLPSLE
NMPFIKKVET KDGDTTVVLY FDSLDQRELC PTISAFRTHK VAKQKPAPVV IYDYYDNSRI
ARQFYDGPKA SLCDICENED CGEACSIRSQ KQRSSDSPSR QPTVEGTMQS GSQTVRVSFF
TFLLATLLVR MFH
//