GenomeNet

Database: UniProt
Entry: A0A087QWJ8_APTFO
ID   A0A087QWJ8_APTFO        Unreviewed;       962 AA.
AC   A0A087QWJ8;
DT   29-OCT-2014, integrated into UniProtKB/TrEMBL.
DT   29-OCT-2014, sequence version 1.
DT   27-MAR-2024, entry version 33.
DE   SubName: Full=General transcription factor II-I {ECO:0000313|EMBL:KFM05602.1};
DE   Flags: Fragment;
GN   ORFNames=AS27_05276 {ECO:0000313|EMBL:KFM05602.1};
OS   Aptenodytes forsteri (Emperor penguin).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Archelosauria; Archosauria; Dinosauria; Saurischia; Theropoda;
OC   Coelurosauria; Aves; Neognathae; Sphenisciformes; Spheniscidae;
OC   Aptenodytes.
OX   NCBI_TaxID=9233 {ECO:0000313|EMBL:KFM05602.1, ECO:0000313|Proteomes:UP000053286};
RN   [1] {ECO:0000313|EMBL:KFM05602.1, ECO:0000313|Proteomes:UP000053286}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=BGI_AS27 {ECO:0000313|EMBL:KFM05602.1};
RA   Zhang G., Li C.;
RT   "Genome evolution of avian class.";
RL   Submitted (APR-2014) to the EMBL/GenBank/DDBJ databases.
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; KL225959; KFM05602.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A087QWJ8; -.
DR   STRING; 9233.A0A087QWJ8; -.
DR   Proteomes; UP000053286; Unassembled WGS sequence.
DR   GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR   GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR   Gene3D; 3.90.1460.10; GTF2I-like; 6.
DR   InterPro; IPR004212; GTF2I.
DR   InterPro; IPR036647; GTF2I-like_rpt_sf.
DR   PANTHER; PTHR46304:SF2; GENERAL TRANSCRIPTION FACTOR II-I; 1.
DR   PANTHER; PTHR46304; GENERAL TRANSCRIPTION FACTOR II-I REPEAT DOMAIN-CONTAINING PROTEIN 1; 1.
DR   Pfam; PF02946; GTF2I; 6.
DR   SUPFAM; SSF117773; GTF2I-like repeat; 6.
DR   PROSITE; PS51139; GTF2I; 6.
PE   4: Predicted;
KW   DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW   Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW   Reference proteome {ECO:0000313|Proteomes:UP000053286};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   Transcription {ECO:0000256|ARBA:ARBA00023163};
KW   Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT   REGION          239..258
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          269..317
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          638..694
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          796..817
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        638..662
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        671..694
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        800..817
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   NON_TER         962
FT                   /evidence="ECO:0000313|EMBL:KFM05602.1"
SQ   SEQUENCE   962 AA;  108081 MW;  8D313105E9184CEC CRC64;
     MAQTTAPATS IHDEDSSESR MVVTFLMSAL ESMCKELAKS KAEVACIAVY ETDVFVVGTE
     RGRAFVNSRK DFQKDFVKYC VTEEERAAEL QKTKTVPPVN RLTVDIVELE ALRKSVEDFF
     CFCYGKALGK STVVPVPYEK IQRDQSAVVV QGLPEGLAFK HPANYDVSTL KWILENKSGI
     SFIIKRPFLE PKKHLGAHMT ESSHSVIPPG GSCPPIQVKT EPNEDSGISL EMATVTVKEE
     SDDPDYYQYN IPGPSETSEM DEKIALAKSY TDSSQHAPSE TSEDPEVEVT IEDDDDYLPP
     NKRPKNTESA NEAANTGRRK VREFNFEKWN ARITDLRKQV EELFEKKYAQ AIKAKGPVSI
     PYPLFQSHVE DLYVEGLPEG IPFRRPSTYG IPRLERILLA KERIRFVIKK HELLNSTQDD
     LPLDKPVAGV KEEWYARITK LRKMVDQLFC KKFAEALGST EPKAVPYQKF QAHPTDLCVE
     GLPENIPFRS PSWYGIPRLE KIVQAGNRIK FVIKRPELLT LTSTEVIQPR SNTPAKEDWN
     VRITKLRKQV EEIFNTKFAQ ALGLSEAVKV PYPVFESNPE YLYVEGLPEG IPFRSPTWFG
     IPRLERIVRG SGKIKFVVKK PELVTSYLPP GLASKINTKN KQLTMSPKSS KRSRSPADNS
     NVPEIEVTVE EGPNKQQTTD VRTNSQTNGS NMSFKPRGRE FSFEAWNAKI TDLKQKVENL
     FNEKCGEALG LTEPVKVPFA LFESFPEVFY VEGLPEGVPF RRPSTFGIPR LEKILRNKSK
     IKFIIKKPEM FEAAVKESSA RSPQRKNNSS NITNTASTTT NTVVNTAAGV EDLNIIQVTI
     PDDESERLSK VEKARQLREQ VNDLFSRKFG EAIGMNFPVK VPYRKITINP GCVVVDGMPP
     GVAFKAPSYL EISSMRKILE SAEFIKFTVI RPFPGLVINN QLMEKAEAEA PPTAPAAAPV
     LP
//