ID A0A4W2DR87_BOBOX Unreviewed; 934 AA.
AC A0A4W2DR87;
DT 18-SEP-2019, integrated into UniProtKB/TrEMBL.
DT 18-SEP-2019, sequence version 1.
DT 27-MAR-2024, entry version 21.
DE RecName: Full=General transcription factor IIi {ECO:0008006|Google:ProtNLM};
GN Name=GTF2I {ECO:0000313|Ensembl:ENSBIXP00000027004.1};
OS Bos indicus x Bos taurus (Hybrid cattle).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Artiodactyla; Ruminantia; Pecora; Bovidae;
OC Bovinae; Bos.
OX NCBI_TaxID=30522 {ECO:0000313|Ensembl:ENSBIXP00000027004.1, ECO:0000313|Proteomes:UP000314981};
RN [1] {ECO:0000313|Ensembl:ENSBIXP00000027004.1, ECO:0000313|Proteomes:UP000314981}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Low W.Y., Tearle R., Bickhart D.M., Rosen B.D., Koren S., Rhie A.,
RA Hiendleder S., Phillippy A.M., Smith T.P.L., Williams J.L.;
RT "Haplotype-resolved cattle genomes.";
RL Submitted (NOV-2018) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSBIXP00000027004.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; A0A4W2DR87; -.
DR Ensembl; ENSBIXT00000014816.1; ENSBIXP00000027004.1; ENSBIXG00000028937.1.
DR Proteomes; UP000314981; Chromosome 25.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0006366; P:transcription by RNA polymerase II; IEA:InterPro.
DR Gene3D; 3.90.1460.10; GTF2I-like; 6.
DR InterPro; IPR004212; GTF2I.
DR InterPro; IPR036647; GTF2I-like_rpt_sf.
DR InterPro; IPR016659; TF_II-I.
DR PANTHER; PTHR46304:SF2; GENERAL TRANSCRIPTION FACTOR II-I; 1.
DR PANTHER; PTHR46304; GENERAL TRANSCRIPTION FACTOR II-I REPEAT DOMAIN-CONTAINING PROTEIN 1; 1.
DR Pfam; PF02946; GTF2I; 6.
DR PIRSF; PIRSF016441; TF_II-I; 1.
DR SUPFAM; SSF117773; GTF2I-like repeat; 6.
DR PROSITE; PS51139; GTF2I; 6.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000314981};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT REGION 208..231
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 250..324
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 642..661
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 667..694
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 796..817
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 668..693
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 934 AA; 105365 MW; 0B75622B66B6F332 CRC64;
MAQVAVSTLP VEDEESSESR MVVTFLMSAL ESMCKELAKS KAEVACIAVY ETDVFVVGTE
RGRAFVNTRK DFQKDFVKYC VEEEEKAAEM HKMKSTAQAN RMSVDAVEIE TLRKTVEDYF
CFCYGKALGK STVVPVPYEK MLRDQSAVVV QGLPEGVAFK HPENYDLATL KWILENKAGI
SFIIKRPFLE PKKHLGGRVM VTDAERSMIS PSGSCGPVKV KTEPSEDSGI SLEMAAVTVK
EESEDPDYYQ YNIQGSHHSS EGNEGTEMEV PAEDSTQHVP SETSEDPEVE VTIEDDDYPP
PAKRPKSSEP PQPPVTEPAN AGKRKVREFN FEKWNARITD LRKQVEELFE RKYAQAIKAK
GPVTIPYPLF QSHVEDLYVE GLPEGIPFRR PSTYGIPRLE RILLAKERIR FVIKKHELLN
STREDLQLDK PASGVKEEWY ARITKLRKMV DQLFCKKFAE ALGSTEAKAV PYQKFEAHPN
DLYVEGLPEN IPFRSPSWYG IPRLEKIIQV GNRIKFVIKR PELLTHSTTE VTQPRTNTPV
KEDWNVRITK LRKQVEEIFN LKFAQALGLT EAVKVPYPVF ESNPEFLYVE GLPEGIPFRS
PTWFGIPRLE RIVRGSNKIK FVVKKPELVI SYLPPGMASK INTKALQSPK RPRSPGSNSK
VPEIEVTVEG PNNSNPQTSA VRTPTQTNGS NVPFKPRGRE FSFEAWNAKI TDLKQKVENL
FNEKCGEALG LKQAVKVPFA LFESFPEDFY VEGLPEGVPF RRPSTFGIPR LEKILRNKAK
IKFIIKKPEM FETAIKESTS SSKSPPRKIN SSPSVNTTAS GVEDLNIIQV TIPDDDNERL
SKVEKARQLR EQVNDLFSRK FGEAIGMGFP VKVPYRKITI NPGCVVVDGM PPGVSFKAPS
YLEISSMRRI LDSAEFIKFT VIRPFPGLVI NNPF
//