ID W5PVD3_SHEEP Unreviewed; 883 AA.
AC W5PVD3;
DT 16-APR-2014, integrated into UniProtKB/TrEMBL.
DT 16-APR-2014, sequence version 1.
DT 27-MAR-2024, entry version 54.
DE RecName: Full=SAM domain-containing protein {ECO:0000259|SMART:SM00454};
GN Name=SFMBT2 {ECO:0000313|Ensembl:ENSOARP00000014416.1};
OS Ovis aries (Sheep).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Artiodactyla; Ruminantia; Pecora; Bovidae;
OC Caprinae; Ovis.
OX NCBI_TaxID=9940 {ECO:0000313|Ensembl:ENSOARP00000014416.1, ECO:0000313|Proteomes:UP000002356};
RN [1] {ECO:0000313|Ensembl:ENSOARP00000014416.1, ECO:0000313|Proteomes:UP000002356}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Texel {ECO:0000313|Ensembl:ENSOARP00000014416.1,
RC ECO:0000313|Proteomes:UP000002356};
RX PubMed=20809919; DOI=10.1111/j.1365-2052.2010.02100.x;
RA Archibald A.L., Cockett N.E., Dalrymple B.P., Faraut T., Kijas J.W.,
RA Maddox J.F., McEwan J.C., Hutton Oddy V., Raadsma H.W., Wade C., Wang J.,
RA Wang W., Xun X.;
RT "The sheep genome reference sequence: a work in progress.";
RL Anim. Genet. 41:449-453(2010).
RN [2] {ECO:0000313|Ensembl:ENSOARP00000014416.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AMGL01023940; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AMGL01023941; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AMGL01023942; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AMGL01023943; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AMGL01023944; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AMGL01023945; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AMGL01023946; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR AlphaFoldDB; W5PVD3; -.
DR SMR; W5PVD3; -.
DR STRING; 9940.ENSOARP00000014416; -.
DR PaxDb; 9940-ENSOARP00000014416; -.
DR Ensembl; ENSOART00000014628.1; ENSOARP00000014416.1; ENSOARG00000013446.1.
DR eggNOG; KOG3766; Eukaryota.
DR HOGENOM; CLU_005352_0_0_1; -.
DR OMA; NMSEPFH; -.
DR Proteomes; UP000002356; Chromosome 13.
DR Bgee; ENSOARG00000013446; Expressed in gastric lymph node and 54 other cell types or tissues.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0042393; F:histone binding; IEA:InterPro.
DR GO; GO:0003714; F:transcription corepressor activity; IEA:InterPro.
DR CDD; cd20112; MBT_SFMBT2_rpt1; 1.
DR CDD; cd20114; MBT_SFMBT2_rpt2; 1.
DR CDD; cd20116; MBT_SFMBT2_rpt3; 1.
DR CDD; cd20118; MBT_SFMBT2_rpt4; 1.
DR CDD; cd09581; SAM_Scm-like-4MBT1_2; 1.
DR Gene3D; 2.30.30.140; -; 4.
DR Gene3D; 3.90.1150.190; SLED domain; 1.
DR Gene3D; 1.10.150.50; Transcription Factor, Ets-1; 1.
DR InterPro; IPR004092; Mbt.
DR InterPro; IPR047353; MBT_SFMBT2_rpt1.
DR InterPro; IPR047354; MBT_SFMBT2_rpt3.
DR InterPro; IPR047355; MBT_SFMBT2_rpt4.
DR InterPro; IPR001660; SAM.
DR InterPro; IPR013761; SAM/pointed_sf.
DR InterPro; IPR037604; Scm-like-4MBT1/2_SAM.
DR InterPro; IPR021987; SLED.
DR InterPro; IPR038348; SLED_sf.
DR PANTHER; PTHR12247; POLYCOMB GROUP PROTEIN; 1.
DR PANTHER; PTHR12247:SF62; SCM-LIKE WITH FOUR MBT DOMAINS PROTEIN 2; 1.
DR Pfam; PF02820; MBT; 4.
DR Pfam; PF00536; SAM_1; 1.
DR Pfam; PF12140; SLED; 1.
DR SMART; SM00561; MBT; 4.
DR SMART; SM00454; SAM; 1.
DR SUPFAM; SSF47769; SAM/Pointed domain; 1.
DR SUPFAM; SSF63748; Tudor/PWWP/MBT; 4.
DR PROSITE; PS51079; MBT; 4.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000002356};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT REPEAT 44..144
FT /note="MBT"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00459"
FT REPEAT 152..256
FT /note="MBT"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00459"
FT REPEAT 266..369
FT /note="MBT"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00459"
FT REPEAT 377..474
FT /note="MBT"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00459"
FT DOMAIN 809..876
FT /note="SAM"
FT /evidence="ECO:0000259|SMART:SM00454"
FT REGION 1..39
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 671..802
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..22
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 683..699
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 712..733
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 758..772
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 883 AA; 99518 MW; F19219CC7EBFFA0E CRC64;
MEGSVSASSM RDPSSSPLEK HQSSADRNGD LDSEEGSSLE ETGFNWGEYL EETGASAAPH
TSFKHVEISI QSNFQPGMKL EVANKKNPDT YWVATVITTC GQLLLLRYCG YGEDRRADFW
CDVVIADLHP VGWCTQNNKT LMPPDAIKEK YTDWTEFLIR DLTGSRTAPA SLLEGPLRGK
GPIDLITVDS LIELQDSQDP FQFWIVSVLE NAGGRLRLRY VGLEDTESYD QWLFYLDYRL
RPVGWCQENE YRMDPPSEIY PLKTDSEWKR ALEKSLIDAA KFPLPMEVFK DHADLRSHFF
TVGMKLETVN MSHICPASVT KVFNNHFFQV TIDDLRPEPS KLSMLCHADS LGILPVQWCL
KNGVNLTPPK GYAGQDFDWA DYHKQHGTEE APPFCFRNTS FSRGFTKNMK LEAVNPRNPG
ELCVASVIAV KGRLMWLRLE GLQSPAPEFI VDVESMDIFP VGWCEANSYP LTTPHKTASQ
KKRKIAVVQP ENQSPSTVPV EKIPHDLCLF PHLDATGTVN GKYCCPQLFI NHRCFSGPYL
NKGRIAELPQ SVGPGKCVLV LKEILSMLIN AAYKPGRVLR ELQLVEDPHW NFQEETLKAK
YRGKTYRAVA KIVRTSDQVA DFCRRVCAKL ECCPNLFSPV LVSENCPENC SIHTKTKYTY
YYGKRRKISK PPIGESRVES GHPKPARRRK RRKSMFVQKK RRSSAVDLAA GSGEESEEED
ADAMDDDSGS EETGSEIRDD QTDTSSAEVP STRPQRAWRR APVERSRRAR RGSGAPQAEG
APRSPATSQE AAEDVKQEEE ERLVLESNPL EWSVTDVVRF IKLTDCAPLA KIFQEQQDID
GQALLLLTLP TVQECMELKL GPAIKLCHQI ERVKVAFYAQ YAN
//