ID F7CSA8_HORSE Unreviewed; 890 AA.
AC F7CSA8;
DT 27-JUL-2011, integrated into UniProtKB/TrEMBL.
DT 13-SEP-2023, sequence version 3.
DT 27-MAR-2024, entry version 62.
DE SubName: Full=Scm like with four mbt domains 1 {ECO:0000313|Ensembl:ENSECAP00000005215.3};
GN Name=SFMBT1 {ECO:0000313|Ensembl:ENSECAP00000005215.3,
GN ECO:0000313|VGNC:VGNC:22895};
OS Equus caballus (Horse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Perissodactyla; Equidae; Equus.
OX NCBI_TaxID=9796 {ECO:0000313|Ensembl:ENSECAP00000005215.3, ECO:0000313|Proteomes:UP000002281};
RN [1] {ECO:0000313|Ensembl:ENSECAP00000005215.3, ECO:0000313|Proteomes:UP000002281}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000005215.3,
RC ECO:0000313|Proteomes:UP000002281};
RX PubMed=19892987; DOI=10.1126/science.1178158;
RG Broad Institute Genome Sequencing Platform;
RG Broad Institute Whole Genome Assembly Team;
RA Wade C.M., Giulotto E., Sigurdsson S., Zoli M., Gnerre S., Imsland F.,
RA Lear T.L., Adelson D.L., Bailey E., Bellone R.R., Bloecker H., Distl O.,
RA Edgar R.C., Garber M., Leeb T., Mauceli E., MacLeod J.N., Penedo M.C.T.,
RA Raison J.M., Sharpe T., Vogel J., Andersson L., Antczak D.F., Biagi T.,
RA Binns M.M., Chowdhary B.P., Coleman S.J., Della Valle G., Fryc S.,
RA Guerin G., Hasegawa T., Hill E.W., Jurka J., Kiialainen A., Lindgren G.,
RA Liu J., Magnani E., Mickelson J.R., Murray J., Nergadze S.G., Onofrio R.,
RA Pedroni S., Piras M.F., Raudsepp T., Rocchi M., Roeed K.H., Ryder O.A.,
RA Searle S., Skow L., Swinburne J.E., Syvaenen A.C., Tozaki T., Valberg S.J.,
RA Vaudin M., White J.R., Zody M.C., Lander E.S., Lindblad-Toh K.;
RT "Genome sequence, comparative analysis, and population genetics of the
RT domestic horse.";
RL Science 326:865-867(2009).
RN [2] {ECO:0000313|Ensembl:ENSECAP00000005215.3}
RP IDENTIFICATION.
RC STRAIN=Thoroughbred {ECO:0000313|Ensembl:ENSECAP00000005215.3};
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; F7CSA8; -.
DR STRING; 9796.ENSECAP00000005215; -.
DR PaxDb; 9796-ENSECAP00000005215; -.
DR Ensembl; ENSECAT00000007196.4; ENSECAP00000005215.3; ENSECAG00000006486.4.
DR CTD; 51460; -.
DR VGNC; VGNC:22895; SFMBT1.
DR GeneTree; ENSGT00940000157363; -.
DR HOGENOM; CLU_005352_0_0_1; -.
DR InParanoid; F7CSA8; -.
DR OMA; HWSLKNG; -.
DR OrthoDB; 5405166at2759; -.
DR TreeFam; TF316498; -.
DR Proteomes; UP000002281; Chromosome 16.
DR Bgee; ENSECAG00000006486; Expressed in testis and 23 other cell types or tissues.
DR GO; GO:0005634; C:nucleus; IBA:GO_Central.
DR GO; GO:0003682; F:chromatin binding; IBA:GO_Central.
DR GO; GO:0042393; F:histone binding; IBA:GO_Central.
DR GO; GO:0003714; F:transcription corepressor activity; IEA:InterPro.
DR GO; GO:0045892; P:negative regulation of DNA-templated transcription; IBA:GO_Central.
DR CDD; cd20111; MBT_SFMBT1_rpt1; 1.
DR CDD; cd20113; MBT_SFMBT1_rpt2; 1.
DR CDD; cd20115; MBT_SFMBT1_rpt3; 1.
DR CDD; cd20117; MBT_SFMBT1_rpt4; 1.
DR CDD; cd09581; SAM_Scm-like-4MBT1_2; 1.
DR Gene3D; 2.30.30.140; -; 4.
DR Gene3D; 3.90.1150.190; SLED domain; 1.
DR Gene3D; 1.10.150.50; Transcription Factor, Ets-1; 1.
DR InterPro; IPR004092; Mbt.
DR InterPro; IPR047352; MBT_SFMBT1_rpt2.
DR InterPro; IPR047351; MBT_SFMBT1_rpt3.
DR InterPro; IPR001660; SAM.
DR InterPro; IPR013761; SAM/pointed_sf.
DR InterPro; IPR037604; Scm-like-4MBT1/2_SAM.
DR InterPro; IPR021987; SLED.
DR InterPro; IPR038348; SLED_sf.
DR PANTHER; PTHR12247; POLYCOMB GROUP PROTEIN; 1.
DR PANTHER; PTHR12247:SF77; SCM-LIKE WITH FOUR MBT DOMAINS PROTEIN 1; 1.
DR Pfam; PF02820; MBT; 4.
DR Pfam; PF00536; SAM_1; 1.
DR Pfam; PF12140; SLED; 1.
DR SMART; SM00561; MBT; 4.
DR SMART; SM00454; SAM; 1.
DR SUPFAM; SSF47769; SAM/Pointed domain; 1.
DR SUPFAM; SSF63748; Tudor/PWWP/MBT; 4.
DR PROSITE; PS51079; MBT; 4.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000002281};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT REPEAT 20..120
FT /note="MBT"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00459"
FT REPEAT 128..232
FT /note="MBT"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00459"
FT REPEAT 242..348
FT /note="MBT"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00459"
FT REPEAT 356..453
FT /note="MBT"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00459"
FT DOMAIN 817..883
FT /note="SAM"
FT /evidence="ECO:0000259|SMART:SM00454"
FT REGION 665..801
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 685..703
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 723..751
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 759..774
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 775..801
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 890 AA; 100633 MW; 6D72BDB99EAFBD47 CRC64;
MNGEQQLDAD AGSGMEEVEL SWEDYLEETG STAVPYGSFK HVDTRLQNGF APGMKLEVVV
KTDPETYWVA TIITTCEQLL LLRYDGYGED RRADFWCDIR KAGLYPIGWC EQNKKTLEAP
EGIRDKMSNW DEFLRQTLMG ACSPPVSLLE GLRNGRNPLD LIAPGSRLEC QAFRDSLSTW
IVTVVENIGG RLKLRYEGLE SSDNFDHWLY YLDPFLHHVG WAAQQGYELQ PPLVIRHLKN
EAEWQEILAK VKEEEEEPLP SYLFKDKQVI GTHSFSVNMK LEAVDPWSPF GISPATVVKV
FDEKYFLVEM DDLRPENQAQ RCFVCHVDSP GIFPVQWSLK NGLHISPPPG YPSQDFDWAD
YLKQCGAEAA PQRCFPPSIC EHEFKENMKL EAVNPLLPEE VCVATITAVR GSYLWLQLEG
SKKPIPECIV SAESMDIFPL GWCETNGHPL SAPRRARVFK QRKIAVVQPE KQIPSSRTVH
EGLKNQELNS TDSVVINGKY CCPKIYFNHR CFSGPYLNKG RIAELPQCVG PGNCVLVLRE
VLTLLINAAY KPSRVLRELQ LDKDSVWHGC GEVLKANLSW MGNSCPVYWS VDHTLRITAL
GYKGKSYRAT VEIVKTADRV TEFCRQTCIK LECCPNLFGP RMVLDKCSEN CSVLTKTKYT
HYYGKKKNKR IGRPPGGHSN LACALKKASK RRKRRKNVFV HKKKRSSASV DNTPAGSPQG
SGGEDEDDPD EGDDDSLSEG STSEQQDELQ EESEMSEKKS CSSSPTQSEI STSLPPERQR
RKRELRTFSF SDDENKPPSP KAVKMEVAER LHLDSNPLKW SVADVVRFIR STDCAPLARI
FLDQEIDGQA LLLLTLPTVQ ECMDLKLGPA IKLCHHIERI KFAFYEQFAN
//