ID G7Y523_CLOSI Unreviewed; 1756 AA.
AC G7Y523;
DT 25-JAN-2012, integrated into UniProtKB/TrEMBL.
DT 25-JAN-2012, sequence version 1.
DT 27-MAR-2024, entry version 34.
DE SubName: Full=CD109 antigen {ECO:0000313|EMBL:GAA48059.1};
GN ORFNames=CLF_101128 {ECO:0000313|EMBL:GAA48059.1};
OS Clonorchis sinensis (Chinese liver fluke).
OC Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Platyhelminthes; Trematoda;
OC Digenea; Opisthorchiida; Opisthorchiata; Opisthorchiidae; Clonorchis.
OX NCBI_TaxID=79923 {ECO:0000313|EMBL:GAA48059.1, ECO:0000313|Proteomes:UP000008909};
RN [1] {ECO:0000313|EMBL:GAA48059.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Henan {ECO:0000313|EMBL:GAA48059.1};
RX PubMed=22023798; DOI=10.1186/gb-2011-12-10-r107;
RA Wang X., Chen W., Huang Y., Sun J., Men J., Liu H., Luo F., Guo L., Lv X.,
RA Deng C., Zhou C., Fan Y., Li X., Huang L., Hu Y., Liang C., Hu X., Xu J.,
RA Yu X.;
RT "The draft genome of the carcinogenic human liver fluke Clonorchis
RT sinensis.";
RL Genome Biol. 12:R107-R107(2011).
RN [2]
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=Henan;
RA Wang X., Huang Y., Chen W., Liu H., Guo L., Chen Y., Luo F., Zhou W.,
RA Sun J., Mao Q., Liang P., Zhou C., Tian Y., Men J., Lv X., Huang L.,
RA Zhou J., Hu Y., Li R., Zhang F., Lei H., Li X., Hu X., Liang C., Xu J.,
RA Wu Z., Yu X.;
RT "The genome and transcriptome sequence of Clonorchis sinensis provide
RT insights into the carcinogenic liver fluke.";
RL Submitted (OCT-2011) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; DF142867; GAA48059.1; -; Genomic_DNA.
DR Proteomes; UP000008909; Unassembled WGS sequence.
DR GO; GO:0005615; C:extracellular space; IEA:InterPro.
DR GO; GO:0004866; F:endopeptidase inhibitor activity; IEA:InterPro.
DR Gene3D; 1.50.10.20; -; 1.
DR Gene3D; 2.20.130.20; -; 1.
DR Gene3D; 2.60.40.1930; -; 2.
DR Gene3D; 2.60.40.1940; -; 1.
DR Gene3D; 2.60.40.2950; -; 1.
DR Gene3D; 2.60.40.690; Alpha-macroglobulin, receptor-binding domain; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 1.
DR InterPro; IPR009048; A-macroglobulin_rcpt-bd.
DR InterPro; IPR036595; A-macroglobulin_rcpt-bd_sf.
DR InterPro; IPR011625; A2M_N_BRD.
DR InterPro; IPR011626; Alpha-macroglobulin_TED.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR001599; Macroglobln_a2.
DR InterPro; IPR002890; MG2.
DR InterPro; IPR041555; MG3.
DR InterPro; IPR008930; Terpenoid_cyclase/PrenylTrfase.
DR PANTHER; PTHR11412:SF179; LD23292P; 1.
DR PANTHER; PTHR11412; MACROGLOBULIN / COMPLEMENT; 1.
DR Pfam; PF00207; A2M; 1.
DR Pfam; PF07703; A2M_BRD; 1.
DR Pfam; PF07677; A2M_recep; 1.
DR Pfam; PF01835; MG2; 1.
DR Pfam; PF17791; MG3; 1.
DR Pfam; PF07678; TED_complement; 1.
DR SMART; SM01360; A2M; 1.
DR SMART; SM01359; A2M_N_2; 1.
DR SMART; SM01361; A2M_recep; 1.
DR SUPFAM; SSF49410; Alpha-macroglobulin receptor domain; 1.
DR SUPFAM; SSF48239; Terpenoid cyclases/Protein prenyltransferases; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000008909}.
FT DOMAIN 565..701
FT /note="Alpha-2-macroglobulin bait region"
FT /evidence="ECO:0000259|SMART:SM01359"
FT DOMAIN 854..945
FT /note="Alpha-2-macroglobulin"
FT /evidence="ECO:0000259|SMART:SM01360"
FT DOMAIN 1568..1659
FT /note="Alpha-macroglobulin receptor-binding"
FT /evidence="ECO:0000259|SMART:SM01361"
SQ SEQUENCE 1756 AA; 199043 MW; 8297BEE2123BFBC5 CRC64;
MTSAAEFAGD GVNASVHSSF PPFALLCEKR PSRKPIDWRL CNLAGSKPSC FILVAWQIGT
ERVLLLSDML LPLLNNAPEF CGLDYLTRPF TMEISIGTSV NTDLLVVCGE HPSTFTFPSR
GILPFKPRSY IILAPNRIRA NELVQVTVSI FRLFYPQLSI RISIKIDTDE IISALEVFRS
PGTRLMQLKV PDYTRNATYW FHVEGSIVAN SSLLFFNRTK LEFMPQSASF FIQVSKPVYH
QSQIVRFRVI PVMPDLTPLY GSLVSIEVLD ASGNLFRRWL NPMTNAGGII ELDFPLSDMV
HEGEWTIRAT HELFSASKTF RVVEYWRPLW DVNVTVPMRM LDNELAFYGL VSANYTSGKA
VRGNATVLIQ LREAGDSRWI MPARAQLTKQ LHAVDGIASF LITLDDIRRA ITGAGASTSL
ANTELWVNVS YYSWWETDVR TGWAYTQIFS STPMIRFLGG QVRPFKPNMY FTAYIVVFMP
DGSMVQYFGS RRIRLQFFCN ENTPSSPAVN LVVPDNGLVT YTYRPSGEGC VTYRLQADYM
DETDRILATR NQRIFQYHSY SDTFLQLSTS TLQPRVNEYF VVTVQTNYPT DTIHYVIVSN
GNILVADQLR LPNAITSRTF SVAVSRSMFP FAHLIAYFIK DSSEIVSDAL TFYTNYTNLN
NVHLEVNRGK DLNQDTVEVR GHATPGSYLA VNVIHSDLYK FAAASILREH NASAYIFIVD
ELAAYNGQSQ RPFVHTWYDN MMDIQRVYVP APSIGADANT TMNVSGLILF TDANFTKANF
YHTCNETLDP ARALPCFSTT GRDCYSRAEQ CNGIAECVTW VDEMNCPINE SDLPQPSRTI
DSYNLLYRLW NDGAWMWHST FVKPDGQIQF RVNLPKLNAD WIAGAFAVDE QLGMSLMQQP
YYFAGIRRFY MTVEVVEEAV WGEQLGVRLC LFNNWDYWIE ALVELKASPD IRVIQIGFGG
RTSAYSPKTS VNESVQTLVF LEAGSSKYIY MPVLPKEPGN SSFTICAYSF IGSNCETHEV
RVTMNGVPNY YHTATFLDLT SSSALFVNNF KIIVPQKYTI PERRMHRFVP GSQFASLSVV
GDCIGPALHS NFPFATTQNV LRMGYGSAES VFFELGYNLN LLLYLYGSPG LSQDVEREGL
LYCSVVLQRG FSFFNPALGA FANFRDELDR PSPLATAFAL WNLLLTRQPQ WNRLIYVDDP
TFIRIIDYLA STQQTSSSMG GTNSVDVRLA GSWEVGEVID RRFAPISNAT DPIDREAHRR
IPCAAMVVIA FRSTGRPMPS GIGAERAEVV VASAVQFIAR HLLQTNDLFS LMIGTYALSV
ATDSTSQQKI MLAMQIMDQY RRKGEYIYFA NYPIPPPKWE LDQAGRRIEN PRWEMPNDGY
GVASTSFYFL LKRKLGEWNV HMTEAMDVVH WLASQRNHIS GFSSTFDSLV ALHALREFAL
SDTNRALYRM AVDEKISSIS DWTNRIYVDR SNYSTVTRTT FPPWDVWGDV TMKVEGTGWL
LLQLDVRVNV EYPQMQKMPR SPQNLEEVMR SFDIECIPGF RGRNNSIMIM KACGRWVGTT
GVEPLSQSGM AVFSIGLPTG YVVLNDDLRR YVMSGQVPNL RFARTWTKTV DFFFDTITTN
QTCVQFQAER YYPVANTTQQ QVCSAYEYYE PGRYNHSMYN VVSLYTNHIC NVCGAYACPY
CPDYNEAKRK FSKKTTVSLL VILTLWLTQQ YLARSEGLWQ LKCIVGIPHL LRIMSADRIE
GEAIDRQASD RNRTQN
//