ID L5K659_PTEAL Unreviewed; 1070 AA.
AC L5K659;
DT 06-MAR-2013, integrated into UniProtKB/TrEMBL.
DT 06-MAR-2013, sequence version 1.
DT 24-JAN-2024, entry version 35.
DE SubName: Full=General transcription factor II-I repeat domain-containing protein 1 {ECO:0000313|EMBL:ELK06842.1};
GN ORFNames=PAL_GLEAN10011980 {ECO:0000313|EMBL:ELK06842.1};
OS Pteropus alecto (Black flying fox).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Chiroptera; Megachiroptera; Pteropodidae;
OC Pteropodinae; Pteropus.
OX NCBI_TaxID=9402 {ECO:0000313|EMBL:ELK06842.1, ECO:0000313|Proteomes:UP000010552};
RN [1] {ECO:0000313|Proteomes:UP000010552}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=23258410; DOI=10.1126/science.1230835;
RA Zhang G., Cowled C., Shi Z., Huang Z., Bishop-Lilly K.A., Fang X.,
RA Wynne J.W., Xiong Z., Baker M.L., Zhao W., Tachedjian M., Zhu Y., Zhou P.,
RA Jiang X., Ng J., Yang L., Wu L., Xiao J., Feng Y., Chen Y., Sun X.,
RA Zhang Y., Marsh G.A., Crameri G., Broder C.C., Frey K.G., Wang L.F.,
RA Wang J.;
RT "Comparative analysis of bat genomes provides insight into the evolution of
RT flight and immunity.";
RL Science 339:456-460(2013).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KB030994; ELK06842.1; -; Genomic_DNA.
DR AlphaFoldDB; L5K659; -.
DR STRING; 9402.L5K659; -.
DR eggNOG; ENOG502QPVX; Eukaryota.
DR InParanoid; L5K659; -.
DR Proteomes; UP000010552; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0006366; P:transcription by RNA polymerase II; IEA:InterPro.
DR Gene3D; 3.90.1460.10; GTF2I-like; 5.
DR InterPro; IPR004212; GTF2I.
DR InterPro; IPR036647; GTF2I-like_rpt_sf.
DR InterPro; IPR016659; TF_II-I.
DR PANTHER; PTHR46304; GENERAL TRANSCRIPTION FACTOR II-I REPEAT DOMAIN-CONTAINING PROTEIN 1; 1.
DR PANTHER; PTHR46304:SF1; GENERAL TRANSCRIPTION FACTOR II-I REPEAT DOMAIN-CONTAINING PROTEIN 1; 1.
DR Pfam; PF02946; GTF2I; 5.
DR PIRSF; PIRSF016441; TF_II-I; 1.
DR SUPFAM; SSF117773; GTF2I-like repeat; 5.
DR PROSITE; PS51139; GTF2I; 5.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000010552};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT REGION 230..250
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 511..531
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 539..558
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 879..913
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 541..558
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 892..913
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1070 AA; 118334 MW; 7FE48F9F47493092 CRC64;
MALLGKRCDV PANGCGPDRW NSAFTRKDEI ITSLVSALDS MCSALSKLNA EVACVAVHDE
SAFVVGTEKG RMFLNARKEL QSDFLRFCRG PPWKEPEAEH HKKVLRGEAG GRNIPRSTLE
HGSDVYLLRK MVEEVFDVLY SEALGRASVV PLPYERLLRE PGLLAVQGLP EGLAFRRPAE
YDPKALMAIL EHSHRIRFKL KRPPEDGGRD SKALVELNGV SLIAKGSRDC SLHGQASKGP
PQDLPPTATS SSMASFLYST AVPNHAVREL KQEMPACPLA PSDLGLGRPG PEPKTPAAQD
FSDCCGQKPT GPGGPLIQNV HASKRILFSI VHDKSEKWDA FIKETEDINT LRECVQILFN
SRYAEALGLD HMVPVPYRKI ACDPEAVEIV GIPDKIPFKR PCTYGVPKLK RILEERHSIH
FIIKRMFDER IFTGNKFTKD PTKLEPASPP EDTSAEISRP AILDLAGTVR SDKSGISEDC
GPGTSGELGG LRPIKIEPED LDIIQVTVPD PSPTSEEMTD SMPGHLPSED SGYGMEMLTD
KGPSEDLRPE ERPVEDSHGD VIRPLRKQVE LLFNTRYAKA IGISEPVKVP YSKFLMYPEE
LFVVGLPEGI SLRRPNCFGI AKLRKILEAS NSIQFVIKRP ELLTEGVKEP ITDSQERDSG
DPFVDESLKR QGFQENYDAR LSRIDIANTL REQVQDLFNK KYGEALGIKY PVQVPYKRIK
SNPGSVIIEG LPPGIPFRKP CTFGSQNLER ILAVADKIKF TVTRPFQGLI PKPDEDDANR
LGEKVILREQ VKELFNEKYG EALGLNRPVL VPYKLIRDSP DAVEVTGLPD DIPFRNPNTY
DIHRLEKILK AREHVRMVII NQLQPFAEIC NDAKVPAKDS IPKRKRKRVS EGNSVSSSSS
SSSSSSSNPE SVASTNQISL VTLGPLFLMD KKQQVNSEHI ITKEDQSSEE NEQELQNNYE
AGGVGSNCTA LMNLPSLSGF PRTLPSTYPA NGAQFALSLS WMNDNRFAFE DLFLGLPIIC
TCGANVAQNQ AYSSFPEWVA QVRLEMETGM TGILINVSST NYLGGTEGPL
//