ID G1P2P9_MYOLU Unreviewed; 999 AA.
AC G1P2P9;
DT 19-OCT-2011, integrated into UniProtKB/TrEMBL.
DT 19-OCT-2011, sequence version 1.
DT 27-MAR-2024, entry version 66.
DE SubName: Full=Scm like with four mbt domains 2 {ECO:0000313|Ensembl:ENSMLUP00000004148.2};
GN Name=SFMBT2 {ECO:0000313|Ensembl:ENSMLUP00000004148.2};
OS Myotis lucifugus (Little brown bat).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Chiroptera; Microchiroptera; Vespertilionidae;
OC Myotis.
OX NCBI_TaxID=59463 {ECO:0000313|Ensembl:ENSMLUP00000004148.2, ECO:0000313|Proteomes:UP000001074};
RN [1] {ECO:0000313|Ensembl:ENSMLUP00000004148.2, ECO:0000313|Proteomes:UP000001074}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=21993624; DOI=10.1038/nature10530;
RA Lindblad-Toh K., Garber M., Zuk O., Lin M.F., Parker B.J., Washietl S.,
RA Kheradpour P., Ernst J., Jordan G., Mauceli E., Ward L.D., Lowe C.B.,
RA Holloway A.K., Clamp M., Gnerre S., Alfoldi J., Beal K., Chang J.,
RA Clawson H., Cuff J., Di Palma F., Fitzgerald S., Flicek P., Guttman M.,
RA Hubisz M.J., Jaffe D.B., Jungreis I., Kent W.J., Kostka D., Lara M.,
RA Martins A.L., Massingham T., Moltke I., Raney B.J., Rasmussen M.D.,
RA Robinson J., Stark A., Vilella A.J., Wen J., Xie X., Zody M.C., Baldwin J.,
RA Bloom T., Chin C.W., Heiman D., Nicol R., Nusbaum C., Young S.,
RA Wilkinson J., Worley K.C., Kovar C.L., Muzny D.M., Gibbs R.A., Cree A.,
RA Dihn H.H., Fowler G., Jhangiani S., Joshi V., Lee S., Lewis L.R.,
RA Nazareth L.V., Okwuonu G., Santibanez J., Warren W.C., Mardis E.R.,
RA Weinstock G.M., Wilson R.K., Delehaunty K., Dooling D., Fronik C.,
RA Fulton L., Fulton B., Graves T., Minx P., Sodergren E., Birney E.,
RA Margulies E.H., Herrero J., Green E.D., Haussler D., Siepel A., Goldman N.,
RA Pollard K.S., Pedersen J.S., Lander E.S., Kellis M.;
RT "A high-resolution map of human evolutionary constraint using 29 mammals.";
RL Nature 478:476-482(2011).
RN [2] {ECO:0000313|Ensembl:ENSMLUP00000004148.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AAPE02017005; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR AlphaFoldDB; G1P2P9; -.
DR STRING; 59463.ENSMLUP00000004148; -.
DR Ensembl; ENSMLUT00000004559.2; ENSMLUP00000004148.2; ENSMLUG00000004547.2.
DR eggNOG; KOG3766; Eukaryota.
DR GeneTree; ENSGT00940000158123; -.
DR HOGENOM; CLU_005352_0_0_1; -.
DR InParanoid; G1P2P9; -.
DR OMA; NMSEPFH; -.
DR TreeFam; TF316498; -.
DR Proteomes; UP000001074; Unassembled WGS sequence.
DR GO; GO:0016235; C:aggresome; IEA:Ensembl.
DR GO; GO:0005829; C:cytosol; IEA:Ensembl.
DR GO; GO:0016607; C:nuclear speck; IEA:Ensembl.
DR GO; GO:0042393; F:histone binding; IEA:Ensembl.
DR GO; GO:0043565; F:sequence-specific DNA binding; IEA:InterPro.
DR GO; GO:0003714; F:transcription corepressor activity; IEA:InterPro.
DR GO; GO:0010629; P:negative regulation of gene expression; IEA:Ensembl.
DR CDD; cd20112; MBT_SFMBT2_rpt1; 1.
DR CDD; cd20114; MBT_SFMBT2_rpt2; 1.
DR CDD; cd20116; MBT_SFMBT2_rpt3; 1.
DR CDD; cd20118; MBT_SFMBT2_rpt4; 1.
DR CDD; cd09581; SAM_Scm-like-4MBT1_2; 1.
DR Gene3D; 2.30.30.140; -; 4.
DR Gene3D; 3.90.1150.190; SLED domain; 1.
DR Gene3D; 1.10.150.50; Transcription Factor, Ets-1; 1.
DR InterPro; IPR004092; Mbt.
DR InterPro; IPR047353; MBT_SFMBT2_rpt1.
DR InterPro; IPR047354; MBT_SFMBT2_rpt3.
DR InterPro; IPR047355; MBT_SFMBT2_rpt4.
DR InterPro; IPR003118; Pointed_dom.
DR InterPro; IPR001660; SAM.
DR InterPro; IPR013761; SAM/pointed_sf.
DR InterPro; IPR037604; Scm-like-4MBT1/2_SAM.
DR InterPro; IPR021987; SLED.
DR InterPro; IPR038348; SLED_sf.
DR PANTHER; PTHR12247; POLYCOMB GROUP PROTEIN; 1.
DR PANTHER; PTHR12247:SF62; SCM-LIKE WITH FOUR MBT DOMAINS PROTEIN 2; 1.
DR Pfam; PF02820; MBT; 4.
DR Pfam; PF00536; SAM_1; 1.
DR Pfam; PF12140; SLED; 1.
DR SMART; SM00561; MBT; 4.
DR SMART; SM00454; SAM; 1.
DR SMART; SM00251; SAM_PNT; 1.
DR SUPFAM; SSF47769; SAM/Pointed domain; 1.
DR SUPFAM; SSF63748; Tudor/PWWP/MBT; 4.
DR PROSITE; PS51079; MBT; 4.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000001074};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT REPEAT 41..141
FT /note="MBT"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00459"
FT REPEAT 149..253
FT /note="MBT"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00459"
FT REPEAT 263..369
FT /note="MBT"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00459"
FT REPEAT 377..474
FT /note="MBT"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00459"
FT DOMAIN 910..993
FT /note="PNT"
FT /evidence="ECO:0000259|SMART:SM00251"
FT DOMAIN 926..992
FT /note="SAM"
FT /evidence="ECO:0000259|SMART:SM00454"
FT REGION 722..912
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 733..748
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 797..812
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 823..839
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 866..887
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 999 AA; 112135 MW; E0A1876EFDB02BD2 CRC64;
RLLPLLIRRN SSNRIDEVIG MCQSSFRLSI EEGSSLEEIG FNWGEYLEET GASAAPHTSF
KHVEISIQSN FQPGMKLEVA NKNNPDTYWV ATVITTCGQL LLLRYCGYGE DRRADFWCDV
VIADLHPVGW CTQNNKALMP PDAIKEKYTD WTEFLIRDLT GSRTAPANLL EGPLRGKGPI
DLITVDSLIE LQDSQNPFQY WIVSVIENVG GRLRLRYVGL EDTESYDQWL FYLDYRLRPV
GWCQENKYRM DPPSEIYPLK MASEWKYALE KSLIDAAKFP LPMEVFKDHA DLRSHFFTVG
MKLETVNMSE PFHICPASVT KVFNNHFFQV TIDDLRPEPS KLSMLCHADS LGILPVQWCL
KNGVNLTPPK GYSGQDFDWA DYHKQHGTEE APPFCFKNTS FSRGFTKNMK LEAVNPRNPG
ELCVASVVRV KGRLLWLHLE GLQTPAPEFI VDVESMDIFP VGWCEANSYP LTTPHKIVPQ
QKRKIAVVQP EKQLTSTVPV EKIPHDHCLF PHLETTALPT GPVNGKYCCP QLFINHRCFS
GPYLNKGRIA ELPQSVGPGK CVLVLKEVLS MIINAAYKPG RVLRELQLVE DPHWNFQEET
LKAKYRGKTY RAVAKIVRTS DQVADFCRRV CAKLECCPNL FSPVLVSENC PENCSIHTKT
KYIGDLCLLE LLPGPCPASA LVSLCTADSA SSENSLSLAP APSVSPSAYY YGKRKKIIKP
PIGESSIESG HPKPARRRKR RKSIFVQKKR RSSTVDFAAG SGEVCLPGAP PSPAQGRRAW
RASRWPQWKS DPNGWKQEIP APSSSFAENR SSHVHEGSSH ESEEEDADAV DDDTGSEETG
SELRDDQTDT SSAEVPSARP RRAVTLRSSS EPERRLPVER TRRGRKVQAT SCAEGGDKGP
AAGQDKDTAQ ETKQEEEERL ILESNPLEWT VTDVVRFIKL TDCAPLAKIF QEQDIDGQAL
LLLTLPTVQE CMELKLGPAI KLCHQIERVK VAFYAQYAN
//