ID A0A2K5W3U6_MACFA Unreviewed; 613 AA.
AC A0A2K5W3U6;
DT 28-MAR-2018, integrated into UniProtKB/TrEMBL.
DT 02-JUN-2021, sequence version 2.
DT 27-MAR-2024, entry version 27.
DE SubName: Full=Scm polycomb group protein homolog 1 {ECO:0000313|Ensembl:ENSMFAP00000031707.2};
GN Name=SCMH1 {ECO:0000313|Ensembl:ENSMFAP00000031707.2};
OS Macaca fascicularis (Crab-eating macaque) (Cynomolgus monkey).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini;
OC Cercopithecidae; Cercopithecinae; Macaca.
OX NCBI_TaxID=9541 {ECO:0000313|Ensembl:ENSMFAP00000031707.2, ECO:0000313|Proteomes:UP000233100};
RN [1] {ECO:0000313|Ensembl:ENSMFAP00000031707.2, ECO:0000313|Proteomes:UP000233100}
RP NUCLEOTIDE SEQUENCE.
RA Warren W., Wilson R.K.;
RL Submitted (MAR-2013) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSMFAP00000031707.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC -!- SIMILARITY: Belongs to the SCM family. {ECO:0000256|ARBA:ARBA00008469}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_015307111.1; XM_015451625.1.
DR AlphaFoldDB; A0A2K5W3U6; -.
DR Ensembl; ENSMFAT00000005927.2; ENSMFAP00000031707.2; ENSMFAG00000036779.2.
DR GeneID; 101925446; -.
DR CTD; 22955; -.
DR VEuPathDB; HostDB:ENSMFAG00000036779; -.
DR GeneTree; ENSGT00940000157999; -.
DR OrthoDB; 2908161at2759; -.
DR Proteomes; UP000233100; Chromosome 1.
DR Bgee; ENSMFAG00000036779; Expressed in adult mammalian kidney and 13 other cell types or tissues.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-KW.
DR GO; GO:0006355; P:regulation of DNA-templated transcription; IEA:InterPro.
DR CDD; cd20108; MBT_SCMH1_rpt2; 1.
DR CDD; cd09578; SAM_Scm; 1.
DR Gene3D; 2.30.30.140; -; 2.
DR Gene3D; 3.90.1150.190; SLED domain; 1.
DR Gene3D; 1.10.150.50; Transcription Factor, Ets-1; 1.
DR InterPro; IPR004092; Mbt.
DR InterPro; IPR047280; MBT_SCMH1_rpt2.
DR InterPro; IPR001660; SAM.
DR InterPro; IPR013761; SAM/pointed_sf.
DR InterPro; IPR047531; SAM_Scm-like.
DR InterPro; IPR033763; SCML2_RBR.
DR InterPro; IPR021987; SLED.
DR InterPro; IPR038348; SLED_sf.
DR PANTHER; PTHR12247; POLYCOMB GROUP PROTEIN; 1.
DR PANTHER; PTHR12247:SF68; POLYCOMB PROTEIN SCMH1; 1.
DR Pfam; PF02820; MBT; 2.
DR Pfam; PF17208; RBR; 1.
DR Pfam; PF00536; SAM_1; 1.
DR Pfam; PF12140; SLED; 1.
DR SMART; SM00561; MBT; 2.
DR SMART; SM00454; SAM; 1.
DR SUPFAM; SSF47769; SAM/Pointed domain; 1.
DR SUPFAM; SSF63748; Tudor/PWWP/MBT; 2.
DR PROSITE; PS51079; MBT; 2.
DR PROSITE; PS50105; SAM_DOMAIN; 1.
PE 3: Inferred from homology;
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000233100};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Repressor {ECO:0000256|ARBA:ARBA00022491}.
FT REPEAT 1..79
FT /note="MBT"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00459"
FT REPEAT 87..188
FT /note="MBT"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00459"
FT DOMAIN 546..611
FT /note="SAM"
FT /evidence="ECO:0000259|PROSITE:PS50105"
FT REGION 186..298
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 203..221
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 613 AA; 68072 MW; 682A6D6CB0F967E7 CRC64;
MQSYTPPSNE FKISMKLEAQ DPRNTTSTCI ATVVGLTGAR LRLRLDGSDN KNDFWRLVDS
AEIQPVGNCE KNGGMLQPPL GFRLNASSWP MFLLKTLNGA EMAPIRIFHK EPPSPSHNFF
KMGMKLEAVD RKNPHFICPA TIGEVRGSEV LVTFDGWRGA FDYWCRFDSR DIFPVGWCSL
TGDNLQPPGT KVVIPKNPYP ASDVNTEKPS IHSSTKTVLE HQPGQRGRKP GKKRGRTPKT
LISHPISAPS KTAEPLKFPK KRGPKPGSKR KPRTLLNPPP ASPTTSTPEP DTSTVPQDAA
TIPSSAMQAP TVCIYLNKNG STGPHLDKKK VQQLPDHFGP ARASVVLQQA VQACIDCAYH
QKTVFSFLKQ GHGGEVISAV FDREQHTLNL PAVNSITYVL RFLEKLCHNL RSDNLFGNQP
FIQTHLSLTA TEYSHSHDRY LPGETFVLGN SLARSLEPHS DSMDSASNPT NFVSTSQRHR
PLLPSCGLPP STASAVRRLC SRGVLKGSNE RRDMESFWKL NRSPGSDRYL ESRDASRLSG
RDPSSWTVED VMQFVQEADP QLGPHADLFR KHEIDGKALL LLRSDMMMKY MGLKLGPALK
LSYHIDRLKQ GKF
//