ID U5HCM6_USTV1 Unreviewed; 523 AA.
AC U5HCM6;
DT 11-DEC-2013, integrated into UniProtKB/TrEMBL.
DT 11-DEC-2013, sequence version 1.
DT 24-JAN-2024, entry version 53.
DE RecName: Full=General transcription and DNA repair factor IIH {ECO:0000256|PIRNR:PIRNR015919};
GN ORFNames=MVLG_04907 {ECO:0000313|EMBL:KDE04683.1};
OS Microbotryum lychnidis-dioicae (strain p1A1 Lamole / MvSl-1064) (Anther
OS smut fungus).
OC Eukaryota; Fungi; Dikarya; Basidiomycota; Pucciniomycotina;
OC Microbotryomycetes; Microbotryales; Microbotryaceae; Microbotryum.
OX NCBI_TaxID=683840 {ECO:0000313|EMBL:KDE04683.1};
RN [1] {ECO:0000313|Proteomes:UP000017200}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=p1A1 Lamole {ECO:0000313|Proteomes:UP000017200};
RA Cuomo C., Perlin M., Young S.K., Zeng Q., Gargeya S., Alvarado L.,
RA Berlin A., Chapman S.B., Chen Z., Freedman E., Gellesch M., Goldberg J.,
RA Griggs A., Gujja S., Heilman E., Heiman D., Howarth C., Mehta T.,
RA Neiman D., Pearson M., Roberts A., Saif S., Shea T., Shenoy N., Sisk P.,
RA Stolte C., Sykes S., White J., Yandava C., Haas B., Nusbaum C., Birren B.;
RT "The genome sequence of Microbotryum violaceum strain p1A1 Lamole.";
RL Submitted (NOV-2010) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:KDE04683.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=P1A1 Lamole {ECO:0000313|EMBL:KDE04683.1};
RG The Broad Institute Genome Sequencing Platform;
RA Earl A., Ward D., Feldgarden M., Gevers D., Butler R., Young S.K., Zeng Q.,
RA Gargeya S., Fitzgerald M., Haas B., Abouelleil A., Alvarado L.,
RA Arachchi H.M., Berlin A., Brown A., Chapman S.B., Chen Z., Dunbar C.,
RA Freedman E., Gearin G., Gellesch M., Goldberg J., Griggs A., Gujja S.,
RA Heilman E., Heiman D., Howarth C., Larson L., Lui A., MacDonald P.J.P.,
RA Mehta T., Montmayeur A., Murphy C., Neiman D., Pearson M., Priest M.,
RA Roberts A., Saif S., Shea T., Shenoy N., Sisk P., Stolte C., Sykes S.,
RA White J., Yandava C., Wortman J., Nusbaum C., Birren B.;
RL Submitted (NOV-2010) to the EMBL/GenBank/DDBJ databases.
RN [3] {ECO:0000313|EMBL:KDE04683.1, ECO:0000313|Proteomes:UP000017200}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=p1A1 Lamole {ECO:0000313|Proteomes:UP000017200}, and P1A1
RC Lamole {ECO:0000313|EMBL:KDE04683.1};
RX PubMed=26076695; DOI=10.1186/s12864-015-1660-8;
RA Perlin M.H., Amselem J., Fontanillas E., Toh S.S., Chen Z., Goldberg J.,
RA Duplessis S., Henrissat B., Young S., Zeng Q., Aguileta G., Petit E.,
RA Badouin H., Andrews J., Razeeq D., Gabaldon T., Quesneville H., Giraud T.,
RA Hood M.E., Schultz D.J., Cuomo C.A.;
RT "Sex and parasites: genomic and transcriptomic analysis of Microbotryum
RT lychnidis-dioicae, the biotrophic and plant-castrating anther smut
RT fungus.";
RL BMC Genomics 16:461-461(2015).
RN [4] {ECO:0000313|EnsemblFungi:MVLG_04907T0}
RP IDENTIFICATION.
RG EnsemblFungi;
RL Submitted (JUN-2015) to UniProtKB.
CC -!- FUNCTION: Component of the general transcription and DNA repair factor
CC IIH (TFIIH) core complex, which is involved in general and
CC transcription-coupled nucleotide excision repair (NER) of damaged DNA
CC and, when complexed to TFIIK, in RNA transcription by RNA polymerase
CC II. {ECO:0000256|PIRNR:PIRNR015919}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PIRNR:PIRNR015919}.
CC -!- SIMILARITY: Belongs to the GTF2H2 family.
CC {ECO:0000256|ARBA:ARBA00006092, ECO:0000256|PIRNR:PIRNR015919}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AEIJ01000485; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; GL541701; KDE04683.1; -; Genomic_DNA.
DR AlphaFoldDB; U5HCM6; -.
DR STRING; 683840.U5HCM6; -.
DR EnsemblFungi; MVLG_04907T0; MVLG_04907T0; MVLG_04907.
DR HOGENOM; CLU_028556_1_1_1; -.
DR InParanoid; U5HCM6; -.
DR OMA; INWVEVP; -.
DR OrthoDB; 276422at2759; -.
DR Proteomes; UP000017200; Unassembled WGS sequence.
DR GO; GO:0000112; C:nucleotide-excision repair factor 3 complex; IEA:EnsemblFungi.
DR GO; GO:0000439; C:transcription factor TFIIH core complex; IEA:UniProtKB-UniRule.
DR GO; GO:0005675; C:transcription factor TFIIH holo complex; IEA:UniProtKB-UniRule.
DR GO; GO:0016251; F:RNA polymerase II general transcription initiation factor activity; IEA:EnsemblFungi.
DR GO; GO:0061630; F:ubiquitin protein ligase activity; IEA:EnsemblFungi.
DR GO; GO:0008270; F:zinc ion binding; IEA:UniProtKB-UniRule.
DR GO; GO:0006289; P:nucleotide-excision repair; IEA:UniProtKB-UniRule.
DR GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IEA:UniProtKB-UniRule.
DR GO; GO:0006367; P:transcription initiation at RNA polymerase II promoter; IEA:EnsemblFungi.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 1.
DR Gene3D; 3.30.40.10; Zinc/RING finger domain, C3HC4 (zinc finger); 1.
DR InterPro; IPR046349; C1-like_sf.
DR InterPro; IPR007198; Ssl1-like.
DR InterPro; IPR004595; TFIIH_C1-like_dom.
DR InterPro; IPR012170; TFIIH_SSL1/p44.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR InterPro; IPR013083; Znf_RING/FYVE/PHD.
DR InterPro; IPR000433; Znf_ZZ.
DR NCBIfam; TIGR00622; ssl1; 1.
DR PANTHER; PTHR12695; GENERAL TRANSCRIPTION FACTOR IIH SUBUNIT 2; 1.
DR PANTHER; PTHR12695:SF2; GENERAL TRANSCRIPTION FACTOR IIH SUBUNIT 2-RELATED; 1.
DR Pfam; PF07975; C1_4; 1.
DR Pfam; PF04056; Ssl1; 1.
DR PIRSF; PIRSF015919; TFIIH_SSL1; 2.
DR SMART; SM01047; C1_4; 1.
DR SMART; SM00327; VWA; 1.
DR SUPFAM; SSF57889; Cysteine-rich domain; 1.
DR SUPFAM; SSF53300; vWA-like; 1.
DR PROSITE; PS50234; VWFA; 1.
DR PROSITE; PS01357; ZF_ZZ_1; 1.
PE 3: Inferred from homology;
KW DNA damage {ECO:0000256|ARBA:ARBA00022763};
KW DNA repair {ECO:0000256|ARBA:ARBA00023204};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723,
KW ECO:0000256|PIRNR:PIRNR015919};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PIRNR:PIRNR015919};
KW Reference proteome {ECO:0000313|Proteomes:UP000017200};
KW Transcription {ECO:0000256|ARBA:ARBA00023163,
KW ECO:0000256|PIRNR:PIRNR015919};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015,
KW ECO:0000256|PIRNR:PIRNR015919};
KW Zinc {ECO:0000256|ARBA:ARBA00022833, ECO:0000256|PIRNR:PIRNR015919};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771}.
FT DOMAIN 180..356
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT ZN_FING 414..431
FT /note="C4-type"
FT /evidence="ECO:0000256|PIRSR:PIRSR015919-1"
FT REGION 1..85
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 15..30
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 31..65
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 523 AA; 56613 MW; AA6159A579764CEC CRC64;
MSTTTTAAAA ASGKRKLSKG KDRAFGPDDD SDDDALLLDD EDEDEDEDMI AAADDDDDDD
SELDPGSGPG WDGTGKRKRK LVSKTDRSLV EASKGGIKIV VGGIGDPLRA AGAGGGNSGK
RKNKKGKGRV WEGEFEHTWD NVQEDERGTL EGAVSGALLG TKNRRILRDT TSIQRGIIRH
VYLVIDLSAV MLEREFKSSW LDLALQYARE FISEFFDQNP ISQMAVLVTR DGAAERLSPL
GGNPVDHLKA LQNEKKLEAR GDPSLQNVLK MAQSGLSHLP PHGSREVIII LGSLTTCDPG
NIHTTIKDVE KDRIRVNIIG LAAEMKICRD IATRTKGTYN VARDDLYLRE LLFEFVSPPA
TLAPSKSHVL GGPSASAPSS SADLMQMGFP QLIQAPYPGL CSCHLKLKNS GYNCPRCKSR
ICDVPTECRV CGLTVVNAPQ LARSYRHLFP VANYEIVSEP SSSYPLSCKA CSHPFSTTAI
KSSLTSVDIS PLGRYSCATC EKHFCLDCDK LVHDALGFCP GCC
//