ID A0A0L0DV59_THETB Unreviewed; 2169 AA.
AC A0A0L0DV59;
DT 11-NOV-2015, integrated into UniProtKB/TrEMBL.
DT 11-NOV-2015, sequence version 1.
DT 27-MAR-2024, entry version 31.
DE RecName: Full=Alpha-2-macroglobulin domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=AMSG_02198 {ECO:0000313|EMBL:KNC56184.1};
OS Thecamonas trahens ATCC 50062.
OC Eukaryota; Apusozoa; Apusomonadida; Apusomonadidae; Thecamonas.
OX NCBI_TaxID=461836 {ECO:0000313|EMBL:KNC56184.1, ECO:0000313|Proteomes:UP000054408};
RN [1] {ECO:0000313|EMBL:KNC56184.1, ECO:0000313|Proteomes:UP000054408}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC 50062 {ECO:0000313|EMBL:KNC56184.1,
RC ECO:0000313|Proteomes:UP000054408};
RG The Broad Institute Genome Sequencing Platform;
RA Russ C., Cuomo C., Shea T., Young S.K., Zeng Q., Koehrsen M., Haas B.,
RA Borodovsky M., Guigo R., Alvarado L., Berlin A., Bochicchio J.,
RA Borenstein D., Chapman S., Chen Z., Freedman E., Gellesch M., Goldberg J.,
RA Griggs A., Gujja S., Heilman E., Heiman D., Hepburn T., Howarth C., Jen D.,
RA Larson L., Mehta T., Park D., Pearson M., Roberts A., Saif S., Shenoy N.,
RA Sisk P., Stolte C., Sykes S., Thomson T., Walk T., White J., Yandava C.,
RA Burger G., Gray M.W., Holland P.W.H., King N., Lang F.B.F., Roger A.J.,
RA Ruiz-Trillo I., Lander E., Nusbaum C.;
RT "The Genome Sequence of Thecamonas trahens ATCC 50062.";
RL Submitted (MAY-2010) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; GL349440; KNC56184.1; -; Genomic_DNA.
DR RefSeq; XP_013761218.1; XM_013905764.1.
DR STRING; 461836.A0A0L0DV59; -.
DR EnsemblProtists; KNC56184; KNC56184; AMSG_02198.
DR GeneID; 25561895; -.
DR eggNOG; ENOG502QS6U; Eukaryota.
DR OMA; LDRYPYG; -.
DR OrthoDB; 5839at2759; -.
DR Proteomes; UP000054408; Unassembled WGS sequence.
DR GO; GO:0004866; F:endopeptidase inhibitor activity; IEA:InterPro.
DR Gene3D; 1.50.10.20; -; 1.
DR Gene3D; 2.20.130.20; -; 1.
DR Gene3D; 2.60.40.1930; -; 1.
DR Gene3D; 2.60.40.3710; -; 1.
DR InterPro; IPR011625; A2M_N_BRD.
DR InterPro; IPR041246; Bact_MG10.
DR InterPro; IPR001599; Macroglobln_a2.
DR InterPro; IPR032812; SbsA_Ig.
DR InterPro; IPR008930; Terpenoid_cyclase/PrenylTrfase.
DR PANTHER; PTHR40094; ALPHA-2-MACROGLOBULIN HOMOLOG; 1.
DR PANTHER; PTHR40094:SF1; UBIQUITIN DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF00207; A2M; 1.
DR Pfam; PF07703; A2M_BRD; 1.
DR Pfam; PF13205; Big_5; 1.
DR Pfam; PF17973; bMG10; 1.
DR SMART; SM01360; A2M; 1.
DR SMART; SM01359; A2M_N_2; 1.
DR SUPFAM; SSF48239; Terpenoid cyclases/Protein prenyltransferases; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000054408};
KW Signal {ECO:0000256|ARBA:ARBA00022729}.
FT DOMAIN 1138..1296
FT /note="Alpha-2-macroglobulin bait region"
FT /evidence="ECO:0000259|SMART:SM01359"
FT DOMAIN 1435..1524
FT /note="Alpha-2-macroglobulin"
FT /evidence="ECO:0000259|SMART:SM01360"
FT REGION 90..120
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1388..1413
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 101..118
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1388..1408
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2169 AA; 232764 MW; 719F982EFADA2103 CRC64;
MATPITSSSR ASISRRLATF GESDDDIDIC SSYDDGSVVG DVPESSTFVV SLSNAPTGAT
TAGGTGWTPP PPLPATELSG EQVARLMARA EDDDASGAGP SNDNGDVQEF NQREKTLPTP
KTGDVIELDL FAPQPDGAPQ QEPVSEEEMA ARAKALAGDV TILRYYPEGS TAARGQSKTV
TNLAITFNQP MVELGSIDAQ DAVTPAVLDP PCTGAWRWTG VTTLQFVPDT RFAFSTTYTV
SIPAGTASAL GNMLPDDFSF SFTTNTLVLQ HSVPRSSTLR SNVVQSLRPR ILLQFDQAVD
PEAILASSSI TANNAPYSGT LTLCSDSPEI LVKNPDFEWA IRGLRSRNAL DHMVVFDLSD
DLPTDAAIAL VVAAGASSAE GPNTTPAPIT IAFHTYPPFS VIQPSTSPGS RPGATWMVRF
TNPIDYTTLR KDLITIEPQV EFSVQVHEHS ATSISIVNSS TKNTTYTLTI ARDLADIHGQ
ALTNNTVTIR VGKPYLCGSV SGPTGMVTLQ AASGDPTVFP VVVYNFTAIR VRIFHVEPNE
YLSSPLVSTY NYFHPERENL LDIGKKVEDY EVDLEDAEEK PLDYVVNLEP FLQHPSSNTG
QLYVVVEPTV AAWKTLKSRV RYGSAEYRYR PIAHAWVQIT NIALAALTYS ATTPKTLVWA
SQLATGIPLS DLAVSHLASR GSSSGTQLGC TDPAGVTIYD GSISNSGIVV ASHADDLAFV
TDVRVSRSAP SEPRWYVFDD RKLYKPNEVI TVKGFLRHVH IDVDGAIAPD YVRGSISWTA
YDSRGNKIAD SASQGPIAAD ADGPRLDPCL TLSSYGAFHF QVALPDNLNL GDGRIVISYS
GSSSLPATSY THSFSCQEFR RPEYDVSASF ESLGPHISSN AAAGSVTALV SAQYFAGGAL
ADAETRWEVT AKTGSYTPPH RSDYTFGRQN KWWLRPWWEP APSHSASLGT WNHTEATTDA
DGEHRLLIEF AGNDVPPAPI TLEAAATVID LNYQARFAKS TTLLHPSSLY VGYKVKAWGT
AGEPLAIELV VVDVDGELIA GVGIDLEVTH SVTTLVEDAE SGIASWATTE YSDVLHLVSP
DGAPLVTDYL PTHGGSYSFT ATVTDAAGSL NASSFAVNVR GKEPKRGATN SRVEAGELLL
IADKQCYSAG DTGELLVQAP FTDGELLLTF TADGLQATLR EPLIDGAAVV PFSVDASWLP
NFEIRVDAVG TEPRLTALGQ LNLAAPPKPA LASGKLNLQV DKRARSLDLV VCPASDSIAP
GSETSVSVAC NHADTGAPAE GVEVTLVVVD EAVLALTGYK LSDPVETFFT RRSTGESNST
FSHESVLLLS EEDLERFRTQ AQSELADNEA SASAIMPASK ACCARSCAVP AMMNCMPQAR
GMCAVDDSSD GDDFDDDDND DCLDSTDNAP TADDDAAELA EGAAKVRVRT KFDALAHFSP
TSVTDASGRV ELTFTVPDSL TRYRIWAIAA TDTRYGLGES KVTAQLPLMV RPSPPRFLNY
GDSAVVPVVL QNQTSDELKV YIGCSTSNLE LAPGFAGGYV ALLAPGQRGA LSFPVATAGA
GTARLQWVAA AGDFTDAIQK TVPVFTPATS EATATYGDTE NPEGLLAFDL RPPLDAIPHF
GGAQVSLTST AVAALTDAFI YLHNYPYECC EQIASKLLGI LPLLDVLEAF DADDIPSRAA
LQASVSKDMK LLARRQKSSG GFGYWNDRAD PYVTCHVAHA LAKCVERGLD VPDNLVSRTI
RYLQDIDSHL AHWPFVLYSL KTLNGFRAYA AFALTRLGVD SGRRAAEVFR AHPIHEFSLE
ALGWLLVALS VTSVGGIDDP KAAILKHLAG RVNETAQTAF FVSDYGDEGA YVMLHSNRRT
DAVLLEALLE AEPDSDLIVK VAKGLLAHRR AGKWRNTQEN AFCLVALHKY FTVREAEVPD
FAVHLWLASE FAGTAAFAGR TTDTLVGTVP MAKLLDSAAA SDRDGLTRLI VEKSGPGRLY
YRIAVDYAPA ALTVDALDRG FTVAREYVGV DSPDHAVRAD DGSWRIALGE KIRVRVYMRT
TQRRYHVALV DQLPAGFEAL NPALKGTPEV ELKNLPAPAG AASGNGATTA SRSIFAWRWV
DPLRWFEFQN LRDERAEAFR SLLWEGNYEF QYIVRATTAG SFVVPPAKAE CMYETEIFGR
SASGRVTIA
//