ID A0A3P9PFR5_POERE Unreviewed; 1013 AA.
AC A0A3P9PFR5;
DT 13-FEB-2019, integrated into UniProtKB/TrEMBL.
DT 13-FEB-2019, sequence version 1.
DT 27-MAR-2024, entry version 25.
DE SubName: Full=GTF2I repeat domain containing 1 {ECO:0000313|Ensembl:ENSPREP00000020523.1};
GN Name=GTF2IRD1 {ECO:0000313|Ensembl:ENSPREP00000020523.1};
OS Poecilia reticulata (Guppy) (Acanthophacelus reticulatus).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Ovalentaria; Atherinomorphae; Cyprinodontiformes; Poeciliidae; Poeciliinae;
OC Poecilia.
OX NCBI_TaxID=8081 {ECO:0000313|Ensembl:ENSPREP00000020523.1, ECO:0000313|Proteomes:UP000242638};
RN [1] {ECO:0000313|Proteomes:UP000242638}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=Guanapo {ECO:0000313|Proteomes:UP000242638};
RA Kuenstner A., Dreyer C.;
RT "The genomic landscape of the Guanapo guppy.";
RL Submitted (NOV-2013) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSPREP00000020523.1}
RP IDENTIFICATION.
RC STRAIN=Guanapo {ECO:0000313|Ensembl:ENSPREP00000020523.1};
RG Ensembl;
RL Submitted (SEP-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR AlphaFoldDB; A0A3P9PFR5; -.
DR STRING; 8081.ENSPREP00000020523; -.
DR Ensembl; ENSPRET00000020737.1; ENSPREP00000020523.1; ENSPREG00000013881.1.
DR GeneTree; ENSGT00940000159414; -.
DR OMA; VFDVLYX; -.
DR Proteomes; UP000242638; Unassembled WGS sequence.
DR Bgee; ENSPREG00000013881; Expressed in head and 1 other cell type or tissue.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR Gene3D; 3.90.1460.10; GTF2I-like; 5.
DR InterPro; IPR004212; GTF2I.
DR InterPro; IPR036647; GTF2I-like_rpt_sf.
DR PANTHER; PTHR46304; GENERAL TRANSCRIPTION FACTOR II-I REPEAT DOMAIN-CONTAINING PROTEIN 1; 1.
DR PANTHER; PTHR46304:SF1; GENERAL TRANSCRIPTION FACTOR II-I REPEAT DOMAIN-CONTAINING PROTEIN 1; 1.
DR Pfam; PF02946; GTF2I; 5.
DR SUPFAM; SSF117773; GTF2I-like repeat; 5.
DR PROSITE; PS51139; GTF2I; 5.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000242638};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT REGION 102..129
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 298..323
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 485..533
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 548..592
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 955..976
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 106..124
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 552..569
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 573..589
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1013 AA; 110994 MW; E025844AEB9BA564 CRC64;
MAQMRKLACD GLRASSRGEP QVQLLSARQE ILTSLVSALD SVCMAMSKLN AEVACVTVHE
DSVIAVGTEK GRMFLNSRRE IQTDFYKFCR APCLQSLTAV NTHTKDQEGD PIKLSKDGEH
AKQRAPSDPQ SNVFVLRKMV EEVFTVLYSE ALGKSSLVPV PYEWIQKDPG YVVAHGLPEG
VVLKKPSEYD TKTLMKILEH SHRIHFTAKR QAEDLSREAK PSTEVNNNPS AAAAVVTPSS
HGAAKAVTSP SPATANNSVL SNFLYGMPMS SKPHPDSKLD FKPATLLNLG KDRLANWTEK
GSTGKDIGNT DEPTRLPAEQ GQSPPGIHIS KRLLFSIVHE KSEKWDLFIR ETEDINTLRE
CVQILFNSRY AEALGLDHMV PVPYRKIACD PGAVEIIGIP DQIPFKRPCT YGVPKLKRIL
DERHGIRFII KRMFDERIFT AAGKIAREEG KLDPGCAPED GFSDCLGFPP TAAELLSNAH
SSRSTSACVS PQADSEAGPS GDGAPLRRIK TEPPDGDIIQ VTVPDAGTSG EDPVECLGEA
VSAALCPTTA SPSAPPAAPA PPVPPAPLHP KENQSAEASS PSGAPQSIRR PTEAGSLVED
IGEMILQLRR QVENLFSIKY AEALGLPEPA KVPYSKFQMY PEDLYVTGLP EGISLRRPNC
FGAAKLRKIL AVSSQIQFFI KRPELLAEQV KQEMPSMPVC DAGMFQLELQ ILSEPDAKDS
TPAEDTAALS KRPGFSECME SKLSRIDLAN TLREQVQDLF NRKYGEALGI KYPVQVPYKR
IKNNPDSVII EGLPPGIPFR KPCTFGSQNL ERILAVADKI SFTITRPFQG LIPKPAPRRV
TLLKKAYASI SDDDDINRMG EKVVLREQVK ELFNKKYGEA LGLDRSVVVP YKLIRGSPES
VEVGGLPDDV PFRNPNTYDI VCLEKILQAA DKVTFNIKSQ LQPFAEICSQ ACNTAGTDAS
TNRRKRKRVQ ESSRIPASSD LGISANQIPV MQWPMYMVDY SGVNVQVPGK VNY
//