ID I3M082_ICTTR Unreviewed; 552 AA.
AC I3M082;
DT 11-JUL-2012, integrated into UniProtKB/TrEMBL.
DT 22-NOV-2017, sequence version 2.
DT 08-NOV-2023, entry version 53.
DE SubName: Full=MLLT1 super elongation complex subunit {ECO:0000313|Ensembl:ENSSTOP00000002092.3};
GN Name=MLLT1 {ECO:0000313|Ensembl:ENSSTOP00000002092.3};
OS Ictidomys tridecemlineatus (Thirteen-lined ground squirrel) (Spermophilus
OS tridecemlineatus).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Sciuromorpha; Sciuridae;
OC Xerinae; Marmotini; Ictidomys.
OX NCBI_TaxID=43179 {ECO:0000313|Ensembl:ENSSTOP00000002092.3, ECO:0000313|Proteomes:UP000005215};
RN [1] {ECO:0000313|Proteomes:UP000005215}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RG The Broad Institute Genome Assembly & Analysis Group;
RG Computational R&D Group;
RG and Sequencing Platform;
RA Di Palma F., Alfoldi J., Johnson J., Berlin A., Gnerre S., Jaffe D.,
RA MacCallum I., Young S., Walker B.J., Lindblad-Toh K.;
RT "The Draft Genome of Spermophilus tridecemlineatus.";
RL Submitted (NOV-2011) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSSTOP00000002092.3}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (JUL-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|PROSITE-ProRule:PRU00376}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AGTP01068930; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AGTP01068931; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR AlphaFoldDB; I3M082; -.
DR STRING; 43179.ENSSTOP00000002092; -.
DR Ensembl; ENSSTOT00000002341.3; ENSSTOP00000002092.3; ENSSTOG00000002340.3.
DR eggNOG; KOG3149; Eukaryota.
DR GeneTree; ENSGT00940000158800; -.
DR HOGENOM; CLU_036086_0_0_1; -.
DR InParanoid; I3M082; -.
DR TreeFam; TF314586; -.
DR Proteomes; UP000005215; Unassembled WGS sequence.
DR GO; GO:0005829; C:cytosol; IEA:Ensembl.
DR GO; GO:0001650; C:fibrillar center; IEA:Ensembl.
DR GO; GO:0008023; C:transcription elongation factor complex; IEA:Ensembl.
DR GO; GO:0006355; P:regulation of DNA-templated transcription; IEA:InterPro.
DR CDD; cd16906; YEATS_AF-9_like; 1.
DR Gene3D; 1.20.1270.290; -; 1.
DR Gene3D; 2.60.40.1970; YEATS domain; 1.
DR InterPro; IPR040930; AF-9_AHD.
DR InterPro; IPR038704; YEAST_sf.
DR InterPro; IPR005033; YEATS.
DR PANTHER; PTHR47827; AHD DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR47827:SF4; PROTEIN ENL; 1.
DR Pfam; PF17793; AHD; 1.
DR Pfam; PF03366; YEATS; 1.
DR PROSITE; PS51037; YEATS; 1.
PE 4: Predicted;
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00376}; Reference proteome {ECO:0000313|Proteomes:UP000005215}.
FT DOMAIN 488..548
FT /note="AF-9 ANC1 homology"
FT /evidence="ECO:0000259|Pfam:PF17793"
FT REGION 144..481
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 171..258
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 300..317
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 318..358
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 359..398
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 426..442
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 446..464
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 552 AA; 61345 MW; 3D7CD6FDB397FA8B CRC64;
CTVQVKLELG HRAQLRKKPT TEGFTHDWMV FVRGPEQCEI QHFVEKVIFR LHDSFPKPKR
VCKEPPYKVE ESGYAGFIMP IEVYFKNKEE PRKVCFTYDL FLNLEGNPPV NHLRCEKLTF
NNPTTEFRYK LLMAGGVMVM PEGAETVSRP SPDYPMLPTI PLSAFSDPKK TKPSHGSKDA
NKESSKAPKT HKVTKEHRER PRKDSESKSS SKELEREQAK GAKEAARKLG EGRLPKEEKA
PPPKAAFKEP KMALKETKLE SMSPKGAPQP PVLPKASSKR PAAADSPKPS AKKQKKSSSK
GSRSAPSTSP RTSSSSFPDK KPPKDKSSTK GEKMKAESES RETKKALEVE ESNSEDEASF
KSESAQSSPS NSSSSSDSSS DSDFEPSQNH SQGPLRSMVE DLQSEESDED DSSSGEEATV
KANPGRDSRL SFSDSESDNS ADSCLPGREP PPPQKPPPPN SKASGRRSPE PCSKPEKILK
KGTYDKAYTD ELVELHRRLM ALRERNVLQQ IVNLIEETGH FNVTNTTFDF DLFSLDETTV
RKLQSYLEAV AT
//