ID U3I741_ANAPP Unreviewed; 871 AA.
AC U3I741;
DT 13-NOV-2013, integrated into UniProtKB/TrEMBL.
DT 05-JUN-2019, sequence version 2.
DT 27-MAR-2024, entry version 58.
DE SubName: Full=Scm like with four mbt domains 1 {ECO:0000313|Ensembl:ENSAPLP00000003062.2};
GN Name=SFMBT1 {ECO:0000313|Ensembl:ENSAPLP00000003062.2};
OS Anas platyrhynchos platyrhynchos (Northern mallard).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC Coelurosauria; Aves; Neognathae; Galloanserae; Anseriformes; Anatidae;
OC Anatinae; Anas.
OX NCBI_TaxID=8840 {ECO:0000313|Ensembl:ENSAPLP00000003062.2, ECO:0000313|Proteomes:UP000016666};
RN [1] {ECO:0000313|Ensembl:ENSAPLP00000003062.2, ECO:0000313|Proteomes:UP000016666}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Hou Z.-C., Zhou Z.-K., Zhu F., Hou S.-S.;
RT "A new Pekin duck reference genome.";
RL Submitted (OCT-2017) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSAPLP00000003062.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; U3I741; -.
DR STRING; 8840.ENSAPLP00000003062; -.
DR Ensembl; ENSAPLT00000003657.2; ENSAPLP00000003062.2; ENSAPLG00000003543.2.
DR GeneTree; ENSGT00940000157363; -.
DR HOGENOM; CLU_005352_0_0_1; -.
DR OMA; HWSLKNG; -.
DR Proteomes; UP000016666; Chromosome 13.
DR GO; GO:0005654; C:nucleoplasm; IEA:Ensembl.
DR GO; GO:0042393; F:histone binding; IEA:Ensembl.
DR GO; GO:0003714; F:transcription corepressor activity; IEA:InterPro.
DR GO; GO:0048635; P:negative regulation of muscle organ development; IEA:Ensembl.
DR CDD; cd20111; MBT_SFMBT1_rpt1; 1.
DR CDD; cd20113; MBT_SFMBT1_rpt2; 1.
DR CDD; cd20115; MBT_SFMBT1_rpt3; 1.
DR CDD; cd09581; SAM_Scm-like-4MBT1_2; 1.
DR Gene3D; 2.30.30.140; -; 4.
DR Gene3D; 3.90.1150.190; SLED domain; 1.
DR Gene3D; 1.10.150.50; Transcription Factor, Ets-1; 1.
DR InterPro; IPR004092; Mbt.
DR InterPro; IPR047352; MBT_SFMBT1_rpt2.
DR InterPro; IPR047351; MBT_SFMBT1_rpt3.
DR InterPro; IPR001660; SAM.
DR InterPro; IPR013761; SAM/pointed_sf.
DR InterPro; IPR037604; Scm-like-4MBT1/2_SAM.
DR InterPro; IPR021987; SLED.
DR InterPro; IPR038348; SLED_sf.
DR PANTHER; PTHR12247; POLYCOMB GROUP PROTEIN; 1.
DR PANTHER; PTHR12247:SF77; SCM-LIKE WITH FOUR MBT DOMAINS PROTEIN 1; 1.
DR Pfam; PF02820; MBT; 4.
DR Pfam; PF00536; SAM_1; 1.
DR Pfam; PF12140; SLED; 1.
DR SMART; SM00561; MBT; 4.
DR SMART; SM00454; SAM; 1.
DR SUPFAM; SSF47769; SAM/Pointed domain; 1.
DR SUPFAM; SSF63748; Tudor/PWWP/MBT; 4.
DR PROSITE; PS51079; MBT; 4.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000016666};
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT REPEAT 22..122
FT /note="MBT"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00459"
FT REPEAT 130..234
FT /note="MBT"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00459"
FT REPEAT 244..353
FT /note="MBT"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00459"
FT REPEAT 361..458
FT /note="MBT"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00459"
FT DOMAIN 798..864
FT /note="SAM"
FT /evidence="ECO:0000259|SMART:SM00454"
FT REGION 646..762
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 668..684
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 704..732
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 871 AA; 98066 MW; 4FD1041A1D4DA636 CRC64;
VKAEKCLASL PDAGSGVEEM EFNWDEYLED TGAIAAPHGS FKHVDTSLQN GFAPGMKLEV
AVKSDQNTYW VATIITTCGQ LLLLRYDGYG EDRKADFWCD IMTADLHPIG WCEQNKKILK
VPEGIKDKIP DQEEFLQRVL KGACSAPANL LEGLHRGKNP LDLIAPGSRL ELQNFRDSLE
AWIVNVVENV GGRLKVRYEG LEDSDKLDQW IFYLDPFLHQ VGWATQNGYN LQPPLAIRSL
KSEADWQEIL KKVKEEEEEK AGDPLYLIFQ DKPVIGVHSF SEGMKLEAVD PMAPFVISPA
TVLKVYNEKY FKIEIDDLRP ERTTSRSYIC HVNSAGIFPV QWSLKNGLHL SPPPGYPGQD
FDWADYLKQC GAEAAPQSCF PSLTSDHGFK ENMKLEAVNP VDPEEVCIAT VTKLKDSYLW
LQLEGSKKPV PDCIVSVESM DIFPVGWCET NGYQLRPPRK AIGISSVLFA VVSWFDRVLS
SRTVHDGLKN QELNSSDSVI INGKYCCPKI YFNHRCFSGP YLNKGRIAEL PQSVGPGNCV
LVLKEVLTLL INAAYKPSRV LRELQLDEEA AWHGHGETLK AKYKGKSYRA TVEVVRTADR
VADFCRKTCI KLECCPNLFG PQMVLDKCSE NCSVLTKTKY THYYGKRKNK RIGRPPGGHS
NLEVAMKKPN KRRKRRKHFF VHKKKRSSTS VDNTPAGSPQ GSGGEEEDDQ DEVDEESLTE
DSTSEQQDEL LEESEVSEKK SLSSSPSQSE LSHSLAQDRD KRKRKLRTFS FSDDENKPPS
PKEIKIEVAE KLQLDSNPLE WSVADVVRFL KSTDCAPLAR IFLDQEIDGQ ALLLLTLPTV
QECMDLKLGP AIKLCHHIER VKLAFYQQFA N
//