GenomeNet

Database: UniProt
Entry: W5PPH8_SHEEP
LinkDB: W5PPH8_SHEEP
Original site: W5PPH8_SHEEP 
ID   W5PPH8_SHEEP            Unreviewed;      1003 AA.
AC   W5PPH8;
DT   16-APR-2014, integrated into UniProtKB/TrEMBL.
DT   16-APR-2014, sequence version 1.
DT   27-MAR-2024, entry version 50.
DE   RecName: Full=General transcription factor II-I repeat domain-containing protein 1 {ECO:0008006|Google:ProtNLM};
GN   Name=GTF2IRD1 {ECO:0000313|Ensembl:ENSOARP00000012354.1};
OS   Ovis aries (Sheep).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Eutheria; Laurasiatheria; Artiodactyla; Ruminantia; Pecora; Bovidae;
OC   Caprinae; Ovis.
OX   NCBI_TaxID=9940 {ECO:0000313|Ensembl:ENSOARP00000012354.1, ECO:0000313|Proteomes:UP000002356};
RN   [1] {ECO:0000313|Ensembl:ENSOARP00000012354.1, ECO:0000313|Proteomes:UP000002356}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=Texel {ECO:0000313|Ensembl:ENSOARP00000012354.1,
RC   ECO:0000313|Proteomes:UP000002356};
RX   PubMed=20809919; DOI=10.1111/j.1365-2052.2010.02100.x;
RA   Archibald A.L., Cockett N.E., Dalrymple B.P., Faraut T., Kijas J.W.,
RA   Maddox J.F., McEwan J.C., Hutton Oddy V., Raadsma H.W., Wade C., Wang J.,
RA   Wang W., Xun X.;
RT   "The sheep genome reference sequence: a work in progress.";
RL   Anim. Genet. 41:449-453(2010).
RN   [2] {ECO:0000313|Ensembl:ENSOARP00000012354.1}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AMGL01069259; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AMGL01069260; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AMGL01069261; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AMGL01069262; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AMGL01069263; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AMGL01069264; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AMGL01069265; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   EMBL; AMGL01069266; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR   AlphaFoldDB; W5PPH8; -.
DR   SMR; W5PPH8; -.
DR   STRING; 9940.ENSOARP00000012354; -.
DR   PaxDb; 9940-ENSOARP00000012354; -.
DR   Ensembl; ENSOART00000012533.1; ENSOARP00000012354.1; ENSOARG00000011517.1.
DR   eggNOG; ENOG502QPVX; Eukaryota.
DR   HOGENOM; CLU_014412_0_0_1; -.
DR   OMA; VFDVLYX; -.
DR   Proteomes; UP000002356; Chromosome 24.
DR   Bgee; ENSOARG00000011517; Expressed in pituitary gland and 51 other cell types or tissues.
DR   ExpressionAtlas; W5PPH8; baseline and differential.
DR   GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR   GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR   GO; GO:0006366; P:transcription by RNA polymerase II; IEA:InterPro.
DR   Gene3D; 3.90.1460.10; GTF2I-like; 5.
DR   InterPro; IPR004212; GTF2I.
DR   InterPro; IPR036647; GTF2I-like_rpt_sf.
DR   InterPro; IPR016659; TF_II-I.
DR   PANTHER; PTHR46304; GENERAL TRANSCRIPTION FACTOR II-I REPEAT DOMAIN-CONTAINING PROTEIN 1; 1.
DR   PANTHER; PTHR46304:SF1; GENERAL TRANSCRIPTION FACTOR II-I REPEAT DOMAIN-CONTAINING PROTEIN 1; 1.
DR   Pfam; PF02946; GTF2I; 5.
DR   PIRSF; PIRSF016441; TF_II-I; 1.
DR   SUPFAM; SSF117773; GTF2I-like repeat; 5.
DR   PROSITE; PS51139; GTF2I; 5.
PE   4: Predicted;
KW   DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW   Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW   Reference proteome {ECO:0000313|Proteomes:UP000002356};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   Transcription {ECO:0000256|ARBA:ARBA00023163};
KW   Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT   REGION          1..24
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          153..175
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          289..308
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          337..366
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          526..618
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          935..971
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        153..167
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        599..618
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        951..971
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1003 AA;  110280 MW;  AD572AA5AF6C1473 CRC64;
     MEDPWGRRPV LDSSHLGGGA GLGGSAGYGG DRRYVCPAPP TSACSPPPSL HPSRQPSTME
     LLGKRCDIPA NGCGPDRWTS AFARKDEIIT SLVSALDSMC SALSKLNAEV ACVAVHDESA
     FVVGTEKGRM FLNARKELQS DFLRFCRGPP WKEPEAEHPK KVPRGEGGGR NVPRSALEHG
     SDVYLLRKMV EEVFDVLYSE ALGRASVVPL PYERLLREPG LLAVQGLPEG LAFRRPAEYD
     PKALMAILEH SHRIRFKLKR PLEDGGRDSK ALVELNGVSL LAKGARDCGL HGQTPKGPPQ
     DLPPTATSSS VASFLYSTAL PNHTVRELKQ EAPACPLGPS DLGLGRPGPE PKAPAAQDFP
     DCCGQKPTGP GGPLIQNVHA SKRILFSIVH DKSEKWDAFI KETEDINTLR ECVQILFNSR
     YAEALGLDHM VPVPYRKIAC DPEAVEIVGI PDKIPFKRPC TYGVPKLKRI LEERHSIHFV
     IKRMFDERIF TGNKFTKDPT KLEPASPPED ASTEVARAAV LDLAGTARSD KSGLSEDCGP
     GTSGELGGLR PIKIEPEDPD IIQVTVPDPS PASEEMTDSM PGHLPSEDSG YGMEMLTDKG
     PGEDPRPEER PVEDSHGDVI RPLRKQVELL FNTRYAKAIG ISEPVKVPYS KFLMYPEELF
     VVGLPEGISL RRPNCFGIAK LRKILEASNS IQFVIKRPEL LTEGVKEPLS DSQERDSGDP
     LVDESLKRQG FQENYDARLS RIDIANTLRE QVQDLFNKKY GEALGIKYPV QVPYKRIKSN
     PGSVIIEGLP PGIPFRKPCT FGSQNLERIL AVADKIKFTV TRPFQGLIPK PDEDDANRLG
     EKVILREQVK ELFNEKYGEA LGLNRPVLVP YKLIRDSPDA VEVTGLPDDI PFRNPNTYDI
     HRLEKILKAR EHVRMVIINQ LQPFAEICSD AKVPAKDSSI PKRKRKRVSE GNSVSSSSSS
     SSSSSSSNPE SVASTNQISL VQWPMYMVDY AGLNVQLPGP LNY
//
DBGET integrated database retrieval system