Database: UniProt
Entry: A0A2P6U3U1_CHLSO
ID   A0A2P6U3U1_CHLSO        Unreviewed;       323 AA.
AC   A0A2P6U3U1;
DT   23-MAY-2018, integrated into UniProtKB/TrEMBL.
DT   23-MAY-2018, sequence version 1.
DT   27-MAR-2024, entry version 20.
DE   RecName: Full=General transcription and DNA repair factor IIH subunit TFB4 {ECO:0000256|RuleBase:RU368090};
DE   AltName: Full=RNA polymerase II transcription factor B subunit 4 {ECO:0000256|RuleBase:RU368090};
GN   ORFNames=C2E21_0659 {ECO:0000313|EMBL:PRW60988.1};
OS   Chlorella sorokiniana (Freshwater green alga).
OC   Eukaryota; Viridiplantae; Chlorophyta; core chlorophytes; Trebouxiophyceae;
OC   Chlorellales; Chlorellaceae; Chlorella clade; Chlorella.
OX   NCBI_TaxID=3076 {ECO:0000313|EMBL:PRW60988.1, ECO:0000313|Proteomes:UP000239899};
RN   [1] {ECO:0000313|EMBL:PRW60988.1, ECO:0000313|Proteomes:UP000239899}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=UTEX 1602 {ECO:0000313|Proteomes:UP000239899};
RX   PubMed=29178410; DOI=10.1111/tpj.13789;
RA   Arriola M.B., Velmurugan N., Zhang Y., Plunkett M.H., Hondzo H.,
RA   Barney B.M.;
RT   "Genome sequences of Chlorella sorokiniana UTEX 1602 and Micractinium
RT   conductrix SAG 241.80: implications to maltose excretion by a green alga.";
RL   Plant J. 93:566-586(2018).
CC   -!- FUNCTION: Component of the general transcription and DNA repair factor
CC       IIH (TFIIH) core complex, which is involved in general and
CC       transcription-coupled nucleotide excision repair (NER) of damaged DNA
CC       and, when complexed to CAK, in RNA transcription by RNA polymerase II.
CC       In NER, TFIIH acts by opening DNA around the lesion to allow the
CC       excision of the damaged oligonucleotide and its replacement by a new
CC       DNA fragment. In transcription, TFIIH has an essential role in
CC       transcription initiation. When the pre-initiation complex (PIC) has
CC       been established, TFIIH is required for promoter opening and promoter
CC       escape. Phosphorylation of the C-terminal tail (CTD) of the largest
CC       subunit of RNA polymerase II by the kinase module CAK controls the
CC       initiation of transcription. {ECO:0000256|RuleBase:RU368090}.
CC   -!- SUBUNIT: Component of the 7-subunit TFIIH core complex composed of XPB,
CC       XPD, TFB1/GTF2H1, GTF2H2/P44, TFB4/GTF2H3, TFB2/GTF2H4 and TFB5/GTF2H5,
CC       which is active in NER. The core complex associates with the 3-subunit
CC       CDK-activating kinase (CAK) module composed of CYCH1/cyclin H1, CDKD
CC       and MAT1/At4g30820 to form the 10-subunit holoenzyme (holo-TFIIH)
CC       active in transcription. {ECO:0000256|RuleBase:RU368090}.
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC       ECO:0000256|RuleBase:RU368090}.
CC   -!- SIMILARITY: Belongs to the TFB4 family. {ECO:0000256|ARBA:ARBA00005273,
CC       ECO:0000256|RuleBase:RU368090}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:PRW60988.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; LHPG02000001; PRW60988.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A2P6U3U1; -.
DR   STRING; 3076.A0A2P6U3U1; -.
DR   OrthoDB; 45434at2759; -.
DR   Proteomes; UP000239899; Unassembled WGS sequence.
DR   GO; GO:0000439; C:transcription factor TFIIH core complex; IEA:UniProtKB-UniRule.
DR   GO; GO:0005675; C:transcription factor TFIIH holo complex; IEA:UniProtKB-UniRule.
DR   GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-UniRule.
DR   GO; GO:0006289; P:nucleotide-excision repair; IEA:UniProtKB-UniRule.
DR   GO; GO:0006355; P:regulation of DNA-templated transcription; IEA:InterPro.
DR   Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 1.
DR   InterPro; IPR004600; TFIIH_Tfb4/GTF2H3.
DR   InterPro; IPR036465; vWFA_dom_sf.
DR   PANTHER; PTHR12831:SF0; GENERAL TRANSCRIPTION FACTOR IIH SUBUNIT 3; 1.
DR   PANTHER; PTHR12831; TRANSCRIPTION INITIATION FACTOR IIH TFIIH , POLYPEPTIDE 3-RELATED; 1.
DR   Pfam; PF03850; Tfb4; 1.
PE   3: Inferred from homology;
KW   DNA damage {ECO:0000256|RuleBase:RU368090};
KW   DNA repair {ECO:0000256|RuleBase:RU368090};
KW   Metal-binding {ECO:0000256|RuleBase:RU368090};
KW   Nucleus {ECO:0000256|RuleBase:RU368090};
KW   Reference proteome {ECO:0000313|Proteomes:UP000239899};
KW   Transcription {ECO:0000256|ARBA:ARBA00023163,
KW   ECO:0000256|RuleBase:RU368090};
KW   Transcription regulation {ECO:0000256|ARBA:ARBA00023015,
KW   ECO:0000256|RuleBase:RU368090}; Zinc {ECO:0000256|RuleBase:RU368090};
KW   Zinc-finger {ECO:0000256|ARBA:ARBA00022771, ECO:0000256|RuleBase:RU368090}.
SQ   SEQUENCE   323 AA;  32854 MW;  19B60B1807745E98 CRC64;
     MAAEWEDDSS LLAVLLDVSP TGLAHLAAIP GLGLQSLLEQ LLTFLNAFTL LSDANRLAVF
     TAGAGGCQLA FCSPSCQPAP LMAAAGGHVA ADPWPPSAAI LSGLQRVVQQ AAAAAEVAGV
     QPGAAAAAAA AAAAHGGGGS CQLSSALSRV LCFIHSMQRR AAAAQGSLLG GSGGGAPPAA
     ERLQPARLLC LTAAPDVPSQ YIGVMNAIFS AQRSGILIDA CQLGAGHSSF LQQAAYLTGG
     TYLKPARPQA LTQYLNSVFA ADARSRRYLR LPGTARVDFR ASCFCHKRPI DLGFVCSACL
     SIFCQQLPAC TTCGTEFGAG DKK
//