ID A0A091SNV4_PELCR Unreviewed; 878 AA.
AC A0A091SNV4;
DT 26-NOV-2014, integrated into UniProtKB/TrEMBL.
DT 26-NOV-2014, sequence version 1.
DT 27-MAR-2024, entry version 35.
DE SubName: Full=General transcription factor II-I {ECO:0000313|EMBL:KFQ60013.1};
DE Flags: Fragment;
GN ORFNames=N334_02283 {ECO:0000313|EMBL:KFQ60013.1};
OS Pelecanus crispus (Dalmatian pelican).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC Coelurosauria; Aves; Neognathae; Pelecaniformes; Pelecanidae; Pelecanus.
OX NCBI_TaxID=36300 {ECO:0000313|EMBL:KFQ60013.1, ECO:0000313|Proteomes:UP000054150};
RN [1] {ECO:0000313|EMBL:KFQ60013.1, ECO:0000313|Proteomes:UP000054150}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=BGI_N334 {ECO:0000313|EMBL:KFQ60013.1};
RA Zhang G., Li C.;
RT "Genome evolution of avian class.";
RL Submitted (APR-2014) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KK479302; KFQ60013.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A091SNV4; -.
DR Proteomes; UP000054150; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR Gene3D; 3.90.1460.10; GTF2I-like; 6.
DR InterPro; IPR004212; GTF2I.
DR InterPro; IPR036647; GTF2I-like_rpt_sf.
DR PANTHER; PTHR46304:SF2; GENERAL TRANSCRIPTION FACTOR II-I; 1.
DR PANTHER; PTHR46304; GENERAL TRANSCRIPTION FACTOR II-I REPEAT DOMAIN-CONTAINING PROTEIN 1; 1.
DR Pfam; PF02946; GTF2I; 6.
DR SUPFAM; SSF117773; GTF2I-like repeat; 6.
DR PROSITE; PS51139; GTF2I; 6.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000054150};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT REGION 159..237
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 555..612
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 712..731
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 558..573
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 587..611
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 716..731
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:KFQ60013.1"
FT NON_TER 878
FT /evidence="ECO:0000313|EMBL:KFQ60013.1"
SQ SEQUENCE 878 AA; 98761 MW; 64D36B1F6F660535 CRC64;
VTEEERAAEL QKTKTIPPVN RLTVDIVELE ALRKSVEDFF CFCYGKALGK STVVPVPYEK
IQRDQSAVVV QGLPEGLAFK HPANYDVSTL KWILENKSGI SFVIKRPFLE PKKHLGAHMT
DPSHSIIPPG GSCPPIQVKT EPNEDSGISL EMATVAVKEE SDDPDYYQYN IPGPSETSEM
DEKIALAKSY TESSQHAPSE TSEDPEVEVT IEDDDDYLPP NKRSKNTESA NEAANTGRRK
VREFNFEKWN ARITDLRKQV EELFEKKYAQ AIKAKGPVSI PYPLFQSHVE DLYVEGLPEG
IPFRRPSTYG IPRLERILLA KERIRFVIKK HELLNSTRED VPLDKPVAGV KEEWYARITK
LRKMVDQLFC KKFAEALGST EPKAVPYQKF EAHPTDLCVE GLPENIPFRS PSWYGIPRLE
KIIQVSNRIK FVIKRPELLT LTSTEVIQPR SNTPVKEDWN VRITKLRKQV EEIFNMKFAQ
ALGLSEAVKV PYPVFESNPE YLYVEGLPEG IPFRSPTWFG IPRLERIVRG SGKIKFVVKK
PELVTSYLPP GLASKINTKA MSPKSSKRSR SPAGNSKVPE IEVTVEEGPN KQQTTEVRTN
SQTNGSNMSF KPRGREFSFE AWNAKITDLK QKVENLFNEK CGEALGLTEP VKVPFALFES
FPEVFYVEGL PEGVPFRRPS TFGIPRLEKI LRNKSKIKFI IKKPEMFEAA VKESSARSPQ
RKNNSSNIAN TASTTTNTVV NTAAGVEDLN IIQVTIPDDE SERLSKVEKA RQLREQVNDL
FSRKFGEAIG MNFPVKVPYR KITINPGCVV VDGMPPGVAF KAPSYLEISS MRKILESAEF
IKFTVIRPFP GLVINNQLME KAEAEAPPTA PAAAPVLP
//