ID A0A4W2BQE6_BOBOX Unreviewed; 915 AA.
AC A0A4W2BQE6;
DT 18-SEP-2019, integrated into UniProtKB/TrEMBL.
DT 18-SEP-2019, sequence version 1.
DT 27-MAR-2024, entry version 21.
DE SubName: Full=GTF2I repeat domain containing 1 {ECO:0000313|Ensembl:ENSBIXP00005028980.1};
GN Name=GTF2IRD1 {ECO:0000313|Ensembl:ENSBIXP00005028980.1};
OS Bos indicus x Bos taurus (Hybrid cattle).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Artiodactyla; Ruminantia; Pecora; Bovidae;
OC Bovinae; Bos.
OX NCBI_TaxID=30522 {ECO:0000313|Ensembl:ENSBIXP00000001050.1, ECO:0000313|Proteomes:UP000314981};
RN [1] {ECO:0000313|Proteomes:UP000314981, ECO:0000313|Proteomes:UP000429181}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Low W.Y., Tearle R., Bickhart D.M., Rosen B.D., Koren S., Rhie A.,
RA Hiendleder S., Phillippy A.M., Smith T.P.L., Williams J.L.;
RT "Haplotype-resolved cattle genomes.";
RL Submitted (NOV-2018) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSBIXP00000001050.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR Ensembl; ENSBIXT00000014866.1; ENSBIXP00000001050.1; ENSBIXG00000028999.1.
DR Ensembl; ENSBIXT00005012192.1; ENSBIXP00005028980.1; ENSBIXG00005008846.1.
DR GeneTree; ENSGT00940000159414; -.
DR Proteomes; UP000314981; Chromosome 25.
DR Proteomes; UP000429181; Chromosome 25.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0006366; P:transcription by RNA polymerase II; IEA:InterPro.
DR Gene3D; 3.90.1460.10; GTF2I-like; 5.
DR InterPro; IPR004212; GTF2I.
DR InterPro; IPR036647; GTF2I-like_rpt_sf.
DR InterPro; IPR016659; TF_II-I.
DR PANTHER; PTHR46304; GENERAL TRANSCRIPTION FACTOR II-I REPEAT DOMAIN-CONTAINING PROTEIN 1; 1.
DR PANTHER; PTHR46304:SF1; GENERAL TRANSCRIPTION FACTOR II-I REPEAT DOMAIN-CONTAINING PROTEIN 1; 1.
DR Pfam; PF02946; GTF2I; 5.
DR PIRSF; PIRSF016441; TF_II-I; 1.
DR SUPFAM; SSF117773; GTF2I-like repeat; 5.
DR PROSITE; PS51139; GTF2I; 5.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000314981};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT REGION 95..116
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 231..251
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 278..309
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 468..561
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 892..915
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 95..109
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 541..561
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 915 AA; 101151 MW; D430AEE49ED99A46 CRC64;
MALLGKRCDI PANGCGPDRW TSAFTRKDEI ITSLVSALDS MCSALSKLNA EVACVAVHDE
SAFVVGTEKG RMFLNARKEL QSDFLRFCRG APWKEPEAEH PKKVPRGEGG GRNVPRSALE
HGSDVYLLRK MVEEVFDVLY SEALGRASVV PLPYERLLRE PGLLAVQGLP EGLAFRRPAE
YDPKALMAIL EHSHRIRFKL KRPLEDGGRD SKALVELNGV SLLAKGARDC GLHGQTPKGP
PQDLPPPATS SSVASFLYST ALPNHAVREL KQEAPACPLG PSDLGLGRPG PEPKAPAAQD
FPDCCGQKPT GPGGPLIQNV HASKRILFSI VHDKSEKWDA FIKETEDINT LRECVQILFN
SRYAEALGLD HMVPVPYRKI ACDPEAVEIV GIPDKIPFKR PCTYGVPKLK RILEERHSIH
FVIKRMFDER IFTGNKFTKD PTKLEPASPP EDASTEVARA AVLDLAGTTR SDKSSLSEDC
GPGTSGELGG LRPIKIEPED PDIIQVTVPD PSPASEEMTD SMPGHLPSED SGYGMEMLTD
KGAGEDPRPE ERPVEDSHGD VIRPLRKQVE LLFNTRYAKA IGISEPVKVP YSKFLMYPEE
LFVVGLPEGI SLRRPNCFGI AKLRKILEAS NSIQFVIKRP ELLTEGVKEP LSDSQERDSG
DPLVDESLKR QGFQENYDAR LSRIDIANTL REQVQDLFNK KYGEALGIKY PVQVPYKRIK
SNPGSVIIEG LPPGIPFRKP CTFGSQNLER ILAVADKIKF TVTRPFQGLI PKPDEDDANR
LGEKVILREQ VKELFNEKYG EALGLNRPVL VPYKLIRDSP DAVEVTGLPD DIPFRNPNTY
DIHRLEKILK AREHVRMVII NQLQPFAEIC NDAKVPAMAN VHGGLCRPER ATPGTSELLD
LSAESGPHSE RLKEM
//