ID A0A452QXI8_URSAM Unreviewed; 825 AA.
AC A0A452QXI8;
DT 08-MAY-2019, integrated into UniProtKB/TrEMBL.
DT 08-MAY-2019, sequence version 1.
DT 27-MAR-2024, entry version 21.
DE SubName: Full=GTF2I repeat domain containing 1 {ECO:0000313|Ensembl:ENSUAMP00000010544.1};
GN Name=GTF2IRD1 {ECO:0000313|Ensembl:ENSUAMP00000010544.1};
OS Ursus americanus (American black bear) (Euarctos americanus).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Carnivora; Caniformia; Ursidae; Ursus.
OX NCBI_TaxID=9643 {ECO:0000313|Ensembl:ENSUAMP00000010544.1, ECO:0000313|Proteomes:UP000291022};
RN [1] {ECO:0000313|Proteomes:UP000291022}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Korstanje R., Srivastava A., Sarsani V.K., Sheehan S.M., Seger R.L.,
RA Barter M.E., Lindqvist C., Brody L.C., Mullikin J.C.;
RT "De novo assembly and RNA-Seq shows season-dependent expression and editing
RT in black bear kidneys.";
RL Submitted (JUN-2016) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSUAMP00000010544.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; A0A452QXI8; -.
DR Ensembl; ENSUAMT00000011846.1; ENSUAMP00000010544.1; ENSUAMG00000008608.1.
DR GeneTree; ENSGT00940000159414; -.
DR Proteomes; UP000291022; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR Gene3D; 3.90.1460.10; GTF2I-like; 4.
DR InterPro; IPR004212; GTF2I.
DR InterPro; IPR036647; GTF2I-like_rpt_sf.
DR PANTHER; PTHR46304; GENERAL TRANSCRIPTION FACTOR II-I REPEAT DOMAIN-CONTAINING PROTEIN 1; 1.
DR PANTHER; PTHR46304:SF1; GENERAL TRANSCRIPTION FACTOR II-I REPEAT DOMAIN-CONTAINING PROTEIN 1; 1.
DR Pfam; PF02946; GTF2I; 4.
DR SUPFAM; SSF117773; GTF2I-like repeat; 4.
DR PROSITE; PS51139; GTF2I; 4.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000291022};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT REGION 70..90
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 203..223
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 241..289
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 440..466
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 481..531
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 766..825
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 514..531
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 825 AA; 90858 MW; 1F74B8EAAE4B444D CRC64;
SHPQNCHWRC PQRGGTSLSK LNAEVACVAV HDESAFVVGT EKGRMFLNAR KELQSDFLRF
CRGAPWKEPE AEHPKKVSRG EGGGRSVPRS SLEHGSDVYL LRKMVDELFD VLYSEALGRA
SVVPLPYERL LREPGLLAVQ GLPEGLAFRR PVDYDPKALM AILEHSHRIR FKLKRPLEDG
GRDSKALVEL NGVSLIAKGS RDCGLHGQAP KGPPQDLPPT ATSSSVASFL YSTALPNHTT
RELKQEAPAC PLAPSDLGLG RPGPEPKASG AQDFPDCCGQ KPPGPGGPLI QNVHASKRIL
FSIVHDKSEK WDAFIKETED INTLRECVQI LFNSRYAEAL GLDHMVPVPY RKIACDPEAV
EIVGIPDKIP FKRPCTYGVP KLKRILEERH SIHFIIKRMF DERIFTGNKF TKDPTKLEPA
SPPEDTSAEI SRAAVLDLPG TARSDKNGVS EDCGPGTSGE LSGLRPIKME PEDLDIIQVT
VPDPSPTSEE MTDSMPGHLP SEDSGYGMEM LTEKGPSEDP RPEERPVEDS HGDVIRPLRK
QVELLFNTRY AKAIGISEPV KVPYSKFLMY PEELFVVGLP EGISLRRPNC FGIAKLRKIL
EASNSIQFVI KRPELLTEGV KEPITDSQEN YDARLSRIDI ANTLREQVQD LFNKKYGEAL
GIKYPVQVPY KRIKSNPGSV IIEGLPPGIP FRKPCTFGSQ NLERILAVAD KIKFTVTRPF
QGLIPKPGKR HWLWGGRLAW EAGCSAGGRR PEPSLLWDAR PCPTVLPPPL GTGPRWEEVT
VEPRAEKKGW GGGTGDSRAQ PGQGSGRRGR AGSLPSTPDE GAWLT
//