ID G7YH68_CLOSI Unreviewed; 776 AA.
AC G7YH68;
DT 25-JAN-2012, integrated into UniProtKB/TrEMBL.
DT 25-JAN-2012, sequence version 1.
DT 27-MAR-2024, entry version 46.
DE SubName: Full=Polycomb protein SCMH1 {ECO:0000313|EMBL:GAA52301.1};
GN ORFNames=CLF_107761 {ECO:0000313|EMBL:GAA52301.1};
OS Clonorchis sinensis (Chinese liver fluke).
OC Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Platyhelminthes; Trematoda;
OC Digenea; Opisthorchiida; Opisthorchiata; Opisthorchiidae; Clonorchis.
OX NCBI_TaxID=79923 {ECO:0000313|EMBL:GAA52301.1, ECO:0000313|Proteomes:UP000008909};
RN [1] {ECO:0000313|EMBL:GAA52301.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Henan {ECO:0000313|EMBL:GAA52301.1};
RX PubMed=22023798; DOI=10.1186/gb-2011-12-10-r107;
RA Wang X., Chen W., Huang Y., Sun J., Men J., Liu H., Luo F., Guo L., Lv X.,
RA Deng C., Zhou C., Fan Y., Li X., Huang L., Hu Y., Liang C., Hu X., Xu J.,
RA Yu X.;
RT "The draft genome of the carcinogenic human liver fluke Clonorchis
RT sinensis.";
RL Genome Biol. 12:R107-R107(2011).
RN [2]
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=Henan;
RA Wang X., Huang Y., Chen W., Liu H., Guo L., Chen Y., Luo F., Zhou W.,
RA Sun J., Mao Q., Liang P., Zhou C., Tian Y., Men J., Lv X., Huang L.,
RA Zhou J., Hu Y., Li R., Zhang F., Lei H., Li X., Hu X., Liang C., Xu J.,
RA Wu Z., Yu X.;
RT "The genome and transcriptome sequence of Clonorchis sinensis provide
RT insights into the carcinogenic liver fluke.";
RL Submitted (OCT-2011) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; DF143270; GAA52301.1; -; Genomic_DNA.
DR AlphaFoldDB; G7YH68; -.
DR Proteomes; UP000008909; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:InterPro.
DR GO; GO:0006355; P:regulation of DNA-templated transcription; IEA:InterPro.
DR CDD; cd20092; MBT_dScm-like_rpt2; 1.
DR CDD; cd09509; SAM_Polycomb; 1.
DR Gene3D; 2.30.30.140; -; 2.
DR Gene3D; 1.10.150.50; Transcription Factor, Ets-1; 1.
DR InterPro; IPR004092; Mbt.
DR InterPro; IPR001660; SAM.
DR InterPro; IPR013761; SAM/pointed_sf.
DR PANTHER; PTHR12247; POLYCOMB GROUP PROTEIN; 1.
DR PANTHER; PTHR12247:SF132; POLYCOMB PROTEIN SCM; 1.
DR Pfam; PF02820; MBT; 2.
DR SMART; SM00561; MBT; 2.
DR SUPFAM; SSF47769; SAM/Pointed domain; 1.
DR SUPFAM; SSF63748; Tudor/PWWP/MBT; 3.
DR PROSITE; PS51079; MBT; 1.
DR PROSITE; PS50105; SAM_DOMAIN; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000008909};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT REPEAT 161..262
FT /note="MBT"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00459"
FT DOMAIN 709..759
FT /note="SAM"
FT /evidence="ECO:0000259|PROSITE:PS50105"
FT REGION 59..90
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 263..340
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 374..405
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 452..588
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 614..637
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 671..698
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 264..301
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 312..340
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 486..501
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 519..554
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 556..576
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 673..696
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 776 AA; 85536 MW; FB9D6FA5F4083383 CRC64;
MSTHNSSDSS SFFEWDDYLS RTGARPADPK CFKQSRIPPQ NLFEVNALLE AEDQRSAAHD
VQTTSPFTPD GFKTSTTGST GVGSGVNRRP SLATNPVGTL RRFRAAAFSL AQVVEVWGPR
LRIRLIGTDD RNDCWFLVDS DQIRPIRRVN PPPFGYMYNH LNWSRTLKSA TEGAKFADPS
WFVESPSDPT DNFFQVGDKL EAVDRHNSQL ICPATIGAVN GQHIFVSFDG WSGAFDYWTR
FDSRELFPVG WCKLADYPLQ SPGPNALRSP TNQSTPAPAR PLTSFRSGPL NTSSATETPS
LPGSAKRPIG TKRLEKRNRS HAAKHGGRPN SRRRVKSSVL RRQRLSAIVH KPIGVAKPEV
PASDTVWPSE TVKRLTSVSS PHPELTDSAN SSMRSMSSPP TIHPMNEDAS ESALNLSVSS
SDQPPAIEVA LPIQSDPPRV RRISNSSDCV VRLQAKSPPV PSDRADYTKP EHKSWQVVPI
SPQSQRKAPK RHRHSVDKAS RLRKKFKLVG TSERPTFATL KGELDKHSTS KETISTLRQD
EHSNKTKPEV SAREDATPQR PTSPSSLPLP SLQHSHATHF RSDRSGQTPL HLYCGDYNTA
ALASYETKPV AYLPPQSESV SSGEHLDDDV ASSVTRSNGV AQSYTSGLWY PSSRMEYNSS
KMLDAHAASS HATDDYSDGQ SLAPHVSSST VSPSSLDLGP GPLPNPALWT IDEVYHYLTT
RDPSLLEVAQ KFKHHEIDGQ ALLLLNMESL RNYLQVKLGP ALKVDHLISR IKRGLL
//