ID I3MMS9_ICTTR Unreviewed; 944 AA.
AC I3MMS9;
DT 11-JUL-2012, integrated into UniProtKB/TrEMBL.
DT 22-NOV-2017, sequence version 2.
DT 27-MAR-2024, entry version 64.
DE SubName: Full=GTF2I repeat domain containing 1 {ECO:0000313|Ensembl:ENSSTOP00000012871.3};
GN Name=GTF2IRD1 {ECO:0000313|Ensembl:ENSSTOP00000012871.3};
OS Ictidomys tridecemlineatus (Thirteen-lined ground squirrel) (Spermophilus
OS tridecemlineatus).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Sciuromorpha; Sciuridae;
OC Xerinae; Marmotini; Ictidomys.
OX NCBI_TaxID=43179 {ECO:0000313|Ensembl:ENSSTOP00000012871.3, ECO:0000313|Proteomes:UP000005215};
RN [1] {ECO:0000313|Proteomes:UP000005215}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RG The Broad Institute Genome Assembly & Analysis Group;
RG Computational R&D Group;
RG and Sequencing Platform;
RA Di Palma F., Alfoldi J., Johnson J., Berlin A., Gnerre S., Jaffe D.,
RA MacCallum I., Young S., Walker B.J., Lindblad-Toh K.;
RT "The Draft Genome of Spermophilus tridecemlineatus.";
RL Submitted (NOV-2011) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSSTOP00000012871.3}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AGTP01053389; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR RefSeq; XP_005328594.1; XM_005328537.1.
DR AlphaFoldDB; I3MMS9; -.
DR Ensembl; ENSSTOT00000014370.3; ENSSTOP00000012871.3; ENSSTOG00000014343.3.
DR GeneID; 101972011; -.
DR CTD; 9569; -.
DR GeneTree; ENSGT00940000159414; -.
DR HOGENOM; CLU_014412_0_0_1; -.
DR OrthoDB; 5308582at2759; -.
DR TreeFam; TF352524; -.
DR Proteomes; UP000005215; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0006366; P:transcription by RNA polymerase II; IEA:InterPro.
DR Gene3D; 3.90.1460.10; GTF2I-like; 5.
DR InterPro; IPR004212; GTF2I.
DR InterPro; IPR036647; GTF2I-like_rpt_sf.
DR InterPro; IPR016659; TF_II-I.
DR PANTHER; PTHR46304; GENERAL TRANSCRIPTION FACTOR II-I REPEAT DOMAIN-CONTAINING PROTEIN 1; 1.
DR PANTHER; PTHR46304:SF1; GENERAL TRANSCRIPTION FACTOR II-I REPEAT DOMAIN-CONTAINING PROTEIN 1; 1.
DR Pfam; PF02946; GTF2I; 5.
DR PIRSF; PIRSF016441; TF_II-I; 2.
DR SUPFAM; SSF117773; GTF2I-like repeat; 5.
DR PROSITE; PS51139; GTF2I; 5.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000005215};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT REGION 98..119
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 438..489
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 509..560
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 876..914
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 541..560
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 893..914
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 944 AA; 104560 MW; 4BCC58472C93AAEC CRC64;
MALLGKRCDL PANGCGPDRW SSAFARKDEI ITSLVSALDS MCSALSKLNA EVACVAVHDE
SAFVVGTEKG RVFLSARKEL QADFLRFCRG PLWKDPEGEH PKKVQRGEGG CRGTPRSSLE
RGSDVYLLRK MVEEVFDVLY SEALGRASVV PLPYERLLRE PGLLAVQGLP EGLAFRRPAE
YDPEALMAIL EHSHRIRFKL KRPLEDGGRD SKALVELNGI SLLPRGSRDC SLHGQVPKAA
PQDLPPTATS SSVASFLYTT ALPVHTLREL KQEPPSCPLA PGDLGLGRPG PEPKAPGAQD
FSDCCGQKPP GPGGPLIQNV HASKRILFSI VHDKSEKWDA FIRETEDINT LRECVQILFN
SRYAEALGLD HMVPVPYRKI ACDPEAVEIV GIPDKIPFKR PCTYGVPKLK RILEERHSIH
FIIKRMFDER IFTGNKFTKD PTKLEPASPP EDTSAEVPRA TMLDLAATAR SDKGSVSEDC
GPGTSGELGI LRPIKIEPED LDIIQVTVPD PSPTSEEMTD SMPGHLPSED SGYGMEMLTE
KGPSEDPRPE ERPVEDSPGD VIRPLRKQVE LLFNSRYAKA IGISEPVKVP YSKFLMHPEE
LFVVGLPEGI SLRRPNCFGI AKLRKILEAS NSIQFVIKRP ELLTEGVKEP VVDSQERDPG
DPLVDESLKR QGFQENFDAR LSRIDIANTL REQVQDLFNK KYGEALGIKY PVQVPYKRIK
SNPGSVIIEG LPPGIPFRKP CTFGSQNLER ILAVADKIKF TVTRPFQGLI PKPDEDDANR
LGEKVILREQ VKELFNEKYG EALGLNRPVL VPYKLIRDSP DAVEVTGLPD DIPFRNPNTY
DIHRLEKILK AREHVRMVII NQLQPFAELC NDAKVPAKDS NVPKRKRKRV SEGNSVSSSS
SSSSSSSSNP ESVAPTNQIS LVQWPMYMVD YAGLNTQLPG PLNY
//