ID W5PPH8_SHEEP Unreviewed; 1003 AA.
AC W5PPH8;
DT 16-APR-2014, integrated into UniProtKB/TrEMBL.
DT 16-APR-2014, sequence version 1.
DT 27-MAR-2024, entry version 50.
DE RecName: Full=General transcription factor II-I repeat domain-containing protein 1 {ECO:0008006|Google:ProtNLM};
GN Name=GTF2IRD1 {ECO:0000313|Ensembl:ENSOARP00000012354.1};
OS Ovis aries (Sheep).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Laurasiatheria; Artiodactyla; Ruminantia; Pecora; Bovidae;
OC Caprinae; Ovis.
OX NCBI_TaxID=9940 {ECO:0000313|Ensembl:ENSOARP00000012354.1, ECO:0000313|Proteomes:UP000002356};
RN [1] {ECO:0000313|Ensembl:ENSOARP00000012354.1, ECO:0000313|Proteomes:UP000002356}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Texel {ECO:0000313|Ensembl:ENSOARP00000012354.1,
RC ECO:0000313|Proteomes:UP000002356};
RX PubMed=20809919; DOI=10.1111/j.1365-2052.2010.02100.x;
RA Archibald A.L., Cockett N.E., Dalrymple B.P., Faraut T., Kijas J.W.,
RA Maddox J.F., McEwan J.C., Hutton Oddy V., Raadsma H.W., Wade C., Wang J.,
RA Wang W., Xun X.;
RT "The sheep genome reference sequence: a work in progress.";
RL Anim. Genet. 41:449-453(2010).
RN [2] {ECO:0000313|Ensembl:ENSOARP00000012354.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AMGL01069259; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AMGL01069260; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AMGL01069261; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AMGL01069262; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AMGL01069263; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AMGL01069264; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AMGL01069265; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; AMGL01069266; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR AlphaFoldDB; W5PPH8; -.
DR SMR; W5PPH8; -.
DR STRING; 9940.ENSOARP00000012354; -.
DR PaxDb; 9940-ENSOARP00000012354; -.
DR Ensembl; ENSOART00000012533.1; ENSOARP00000012354.1; ENSOARG00000011517.1.
DR eggNOG; ENOG502QPVX; Eukaryota.
DR HOGENOM; CLU_014412_0_0_1; -.
DR OMA; VFDVLYX; -.
DR Proteomes; UP000002356; Chromosome 24.
DR Bgee; ENSOARG00000011517; Expressed in pituitary gland and 51 other cell types or tissues.
DR ExpressionAtlas; W5PPH8; baseline and differential.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0006366; P:transcription by RNA polymerase II; IEA:InterPro.
DR Gene3D; 3.90.1460.10; GTF2I-like; 5.
DR InterPro; IPR004212; GTF2I.
DR InterPro; IPR036647; GTF2I-like_rpt_sf.
DR InterPro; IPR016659; TF_II-I.
DR PANTHER; PTHR46304; GENERAL TRANSCRIPTION FACTOR II-I REPEAT DOMAIN-CONTAINING PROTEIN 1; 1.
DR PANTHER; PTHR46304:SF1; GENERAL TRANSCRIPTION FACTOR II-I REPEAT DOMAIN-CONTAINING PROTEIN 1; 1.
DR Pfam; PF02946; GTF2I; 5.
DR PIRSF; PIRSF016441; TF_II-I; 1.
DR SUPFAM; SSF117773; GTF2I-like repeat; 5.
DR PROSITE; PS51139; GTF2I; 5.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000002356};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT REGION 1..24
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 153..175
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 289..308
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 337..366
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 526..618
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 935..971
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 153..167
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 599..618
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 951..971
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1003 AA; 110280 MW; AD572AA5AF6C1473 CRC64;
MEDPWGRRPV LDSSHLGGGA GLGGSAGYGG DRRYVCPAPP TSACSPPPSL HPSRQPSTME
LLGKRCDIPA NGCGPDRWTS AFARKDEIIT SLVSALDSMC SALSKLNAEV ACVAVHDESA
FVVGTEKGRM FLNARKELQS DFLRFCRGPP WKEPEAEHPK KVPRGEGGGR NVPRSALEHG
SDVYLLRKMV EEVFDVLYSE ALGRASVVPL PYERLLREPG LLAVQGLPEG LAFRRPAEYD
PKALMAILEH SHRIRFKLKR PLEDGGRDSK ALVELNGVSL LAKGARDCGL HGQTPKGPPQ
DLPPTATSSS VASFLYSTAL PNHTVRELKQ EAPACPLGPS DLGLGRPGPE PKAPAAQDFP
DCCGQKPTGP GGPLIQNVHA SKRILFSIVH DKSEKWDAFI KETEDINTLR ECVQILFNSR
YAEALGLDHM VPVPYRKIAC DPEAVEIVGI PDKIPFKRPC TYGVPKLKRI LEERHSIHFV
IKRMFDERIF TGNKFTKDPT KLEPASPPED ASTEVARAAV LDLAGTARSD KSGLSEDCGP
GTSGELGGLR PIKIEPEDPD IIQVTVPDPS PASEEMTDSM PGHLPSEDSG YGMEMLTDKG
PGEDPRPEER PVEDSHGDVI RPLRKQVELL FNTRYAKAIG ISEPVKVPYS KFLMYPEELF
VVGLPEGISL RRPNCFGIAK LRKILEASNS IQFVIKRPEL LTEGVKEPLS DSQERDSGDP
LVDESLKRQG FQENYDARLS RIDIANTLRE QVQDLFNKKY GEALGIKYPV QVPYKRIKSN
PGSVIIEGLP PGIPFRKPCT FGSQNLERIL AVADKIKFTV TRPFQGLIPK PDEDDANRLG
EKVILREQVK ELFNEKYGEA LGLNRPVLVP YKLIRDSPDA VEVTGLPDDI PFRNPNTYDI
HRLEKILKAR EHVRMVIINQ LQPFAEICSD AKVPAKDSSI PKRKRKRVSE GNSVSSSSSS
SSSSSSSNPE SVASTNQISL VQWPMYMVDY AGLNVQLPGP LNY
//