ID A0A1D2N0C7_ORCCI Unreviewed; 1234 AA.
AC A0A1D2N0C7;
DT 30-NOV-2016, integrated into UniProtKB/TrEMBL.
DT 30-NOV-2016, sequence version 1.
DT 27-MAR-2024, entry version 21.
DE RecName: Full=CD109 antigen {ECO:0008006|Google:ProtNLM};
GN ORFNames=Ocin01_07991 {ECO:0000313|EMBL:ODM98688.1};
OS Orchesella cincta (Springtail) (Podura cincta).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Collembola;
OC Entomobryomorpha; Entomobryoidea; Orchesellidae; Orchesellinae; Orchesella.
OX NCBI_TaxID=48709 {ECO:0000313|EMBL:ODM98688.1, ECO:0000313|Proteomes:UP000094527};
RN [1] {ECO:0000313|EMBL:ODM98688.1, ECO:0000313|Proteomes:UP000094527}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC TISSUE=Mixed pool {ECO:0000313|EMBL:ODM98688.1};
RX PubMed=27289101; DOI=10.1093/gbe/evw134;
RA Faddeeva-Vakhrusheva A., Derks M.F., Anvar S.Y., Agamennone V., Suring W.,
RA Smit S., van Straalen N.M., Roelofs D.;
RT "Gene Family Evolution Reflects Adaptation to Soil Environmental Stressors
RT in the Genome of the Collembolan Orchesella cincta.";
RL Genome Biol. Evol. 8:2106-2117(2016).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:ODM98688.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LJIJ01000335; ODM98688.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A1D2N0C7; -.
DR STRING; 48709.A0A1D2N0C7; -.
DR OMA; CETRIIN; -.
DR OrthoDB; 3972225at2759; -.
DR Proteomes; UP000094527; Unassembled WGS sequence.
DR GO; GO:0005615; C:extracellular space; IEA:InterPro.
DR GO; GO:0004866; F:endopeptidase inhibitor activity; IEA:InterPro.
DR Gene3D; 1.50.10.20; -; 1.
DR Gene3D; 2.60.40.1930; -; 1.
DR Gene3D; 2.60.40.690; Alpha-macroglobulin, receptor-binding domain; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 1.
DR InterPro; IPR009048; A-macroglobulin_rcpt-bd.
DR InterPro; IPR036595; A-macroglobulin_rcpt-bd_sf.
DR InterPro; IPR011625; A2M_N_BRD.
DR InterPro; IPR011626; Alpha-macroglobulin_TED.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR001599; Macroglobln_a2.
DR InterPro; IPR008930; Terpenoid_cyclase/PrenylTrfase.
DR PANTHER; PTHR11412:SF179; LD23292P; 1.
DR PANTHER; PTHR11412; MACROGLOBULIN / COMPLEMENT; 1.
DR Pfam; PF00207; A2M; 1.
DR Pfam; PF07703; A2M_BRD; 1.
DR Pfam; PF07677; A2M_recep; 1.
DR Pfam; PF07678; TED_complement; 1.
DR SMART; SM01360; A2M; 1.
DR SMART; SM01359; A2M_N_2; 1.
DR SMART; SM01361; A2M_recep; 1.
DR SUPFAM; SSF49410; Alpha-macroglobulin receptor domain; 1.
DR SUPFAM; SSF48239; Terpenoid cyclases/Protein prenyltransferases; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000094527};
KW Thioester bond {ECO:0000256|ARBA:ARBA00022966}.
FT DOMAIN 126..263
FT /note="Alpha-2-macroglobulin bait region"
FT /evidence="ECO:0000259|SMART:SM01359"
FT DOMAIN 408..499
FT /note="Alpha-2-macroglobulin"
FT /evidence="ECO:0000259|SMART:SM01360"
FT DOMAIN 1086..1177
FT /note="Alpha-macroglobulin receptor-binding"
FT /evidence="ECO:0000259|SMART:SM01361"
SQ SEQUENCE 1234 AA; 139510 MW; F0D8D00B78355852 CRC64;
MDLEKVSEER LSTSVFRIRS EAYLTSGQRV DLGALVLNPV ELERDEDDIY NEVANSRDYE
EKEIKDVILK HQVKRYLEEA MVEMDLHLPK TAARLIVRAE YTDESGAVAV GELSAVSAFS
PSQKFLKVVL KTADAIGVGQ FVVLHLYANF AINHFHFLVV AKGMIVSHGT ERLNSYWSLP
KSFTVVVGQE MSPGFRVFVY CGTRKGEIVT DSIFIPVQNL YRHQAQLEIN QAKDRSKDTL
ELRFVGAPAA YFGASVQRGV RHLMQAGNEL TPAYVLNTLH SLEPWNKSLS RLVRRQRSGE
TPDQVHYLQS SGYAIDSYSA LLQSSLIMFS DGFAPRSSFL TSGDECPEER CLTVQGCFRF
IEKCDLINHC TDASDELDCD FIDEEDIQHF RVTRISRFAD FYDPSDGDWA WTKENIAHGG
DYILTRDTPG VTDSWYLNGF AVSKQFGFAL IDSPIEYETL RPFLMVCDAP SSIRRGEHVG
IIVMLYNKTP KEILVMVSMA GSDDYEFVHV GKDGLMNFDD KHPRFSSGEH QHLVWMGPES
EVELRLPVKP TIDQGTISVE ITATTQIRSQ TVTMDIDVLA DGVQIGKHTS LLLDLRNRAF
MLKYMNIFVE ELPEIPYDII TKYVHGSPAG HVIVSGDVIG PSASLLPVSM ESLIGKEGKG
TADRIFDLAS NTWTLHYLRL TNQLESSVAK NTFKAMNKHF AQVMKRFNAN VGWFKTWNLS
KPCVWLTSWA LQVFAHANFQ DWENEFYVEQ KIFSKALRWL LKYQNEDGSF SETRWYILHP
LNKRMGYMRN MNMTSSNVTL TAHLVITMSQ VMGNVDGQLR VEASIGKAKA LHFLERQLAD
MVDPYHIAIT AYALTIGDST EKEFAFSKMH AVRRERAELV YWSPGEVPTN GIVYENQRPF
LEAKNNEYWD AIAVEGTSYA LLVYLTRDGI SPIADSIVTW LNTMRLTDGA FISTVDTVVA
FQALTEYANR ARMREITDLT VYIEISSQQE SEPPEPVKFA PDSVTATHVI DMEMVWGHVN
IEGRGAGQAV VQLDLTYGID WEPFKDTGPV DPFDLKIQEH FGGRNKSILE IEACFRWILL
EESEVSGAAV LEVEIPSGYG LLQSDALRLV ASGAHPTLKD SVVSPGKTSW FFEFIPPYWT
CLNHTVYRWY PVANLTTHRQ ATIYEAYAPE RFINTIYNDT SMRFLDICHV CGSFQCPYCP
FYNVASELKR LSVATAILNF AILRYLHNVM IGAG
//