ID C1HE13_PARBA Unreviewed; 514 AA.
AC C1HE13;
DT 26-MAY-2009, integrated into UniProtKB/TrEMBL.
DT 04-FEB-2015, sequence version 2.
DT 24-JAN-2024, entry version 81.
DE RecName: Full=General transcription and DNA repair factor IIH {ECO:0000256|PIRNR:PIRNR015919};
GN ORFNames=PAAG_09006 {ECO:0000313|EMBL:EEH40553.2};
OS Paracoccidioides lutzii (strain ATCC MYA-826 / Pb01) (Paracoccidioides
OS brasiliensis).
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes;
OC Eurotiomycetidae; Onygenales; Onygenales incertae sedis; Paracoccidioides.
OX NCBI_TaxID=502779 {ECO:0000313|EMBL:EEH40553.2, ECO:0000313|Proteomes:UP000002059};
RN [1] {ECO:0000313|EMBL:EEH40553.2, ECO:0000313|Proteomes:UP000002059}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC MYA-826 / Pb01 {ECO:0000313|Proteomes:UP000002059};
RX PubMed=22046142; DOI=10.1371/journal.pgen.1002345;
RA Desjardins C.A., Champion M.D., Holder J.W., Muszewska A., Goldberg J.,
RA Bailao A.M., Brigido M.M., Ferreira M.E., Garcia A.M., Grynberg M.,
RA Gujja S., Heiman D.I., Henn M.R., Kodira C.D., Leon-Narvaez H.,
RA Longo L.V.G., Ma L.-J., Malavazi I., Matsuo A.L., Morais F.V., Pereira M.,
RA Rodriguez-Brito S., Sakthikumar S., Salem-Izacc S.M., Sykes S.M.,
RA Teixeira M.M., Vallejo M.C., Walter M.E., Yandava C., Young S., Zeng Q.,
RA Zucker J., Felipe M.S., Goldman G.H., Haas B.J., McEwen J.G., Nino-Vega G.,
RA Puccia R., San-Blas G., Soares C.M., Birren B.W., Cuomo C.A.;
RT "Comparative genomic analysis of human fungal pathogens causing
RT paracoccidioidomycosis.";
RL PLoS Genet. 7:E1002345-E1002345(2011).
CC -!- FUNCTION: Component of the general transcription and DNA repair factor
CC IIH (TFIIH) core complex, which is involved in general and
CC transcription-coupled nucleotide excision repair (NER) of damaged DNA
CC and, when complexed to TFIIK, in RNA transcription by RNA polymerase
CC II. {ECO:0000256|PIRNR:PIRNR015919}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PIRNR:PIRNR015919}.
CC -!- SIMILARITY: Belongs to the GTF2H2 family.
CC {ECO:0000256|ARBA:ARBA00006092, ECO:0000256|PIRNR:PIRNR015919}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KN294051; EEH40553.2; -; Genomic_DNA.
DR RefSeq; XP_015701744.1; XM_015846702.1.
DR AlphaFoldDB; C1HE13; -.
DR STRING; 502779.C1HE13; -.
DR GeneID; 9092296; -.
DR KEGG; pbl:PAAG_09006; -.
DR VEuPathDB; FungiDB:PAAG_09006; -.
DR eggNOG; KOG2807; Eukaryota.
DR HOGENOM; CLU_028556_2_0_1; -.
DR OMA; INWVEVP; -.
DR OrthoDB; 276422at2759; -.
DR Proteomes; UP000002059; Partially assembled WGS sequence.
DR GO; GO:0000439; C:transcription factor TFIIH core complex; IEA:UniProtKB-UniRule.
DR GO; GO:0005675; C:transcription factor TFIIH holo complex; IEA:UniProtKB-UniRule.
DR GO; GO:0008270; F:zinc ion binding; IEA:UniProtKB-UniRule.
DR GO; GO:0006351; P:DNA-templated transcription; IEA:InterPro.
DR GO; GO:0006289; P:nucleotide-excision repair; IEA:UniProtKB-UniRule.
DR GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IEA:UniProtKB-UniRule.
DR CDD; cd01453; vWA_transcription_factor_IIH_type; 1.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 1.
DR Gene3D; 3.30.40.10; Zinc/RING finger domain, C3HC4 (zinc finger); 1.
DR InterPro; IPR046349; C1-like_sf.
DR InterPro; IPR007198; Ssl1-like.
DR InterPro; IPR004595; TFIIH_C1-like_dom.
DR InterPro; IPR012170; TFIIH_SSL1/p44.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR InterPro; IPR013087; Znf_C2H2_type.
DR InterPro; IPR013083; Znf_RING/FYVE/PHD.
DR NCBIfam; TIGR00622; ssl1; 1.
DR PANTHER; PTHR12695; GENERAL TRANSCRIPTION FACTOR IIH SUBUNIT 2; 1.
DR PANTHER; PTHR12695:SF2; GENERAL TRANSCRIPTION FACTOR IIH SUBUNIT 2-RELATED; 1.
DR Pfam; PF07975; C1_4; 1.
DR Pfam; PF04056; Ssl1; 1.
DR PIRSF; PIRSF015919; TFIIH_SSL1; 1.
DR SMART; SM01047; C1_4; 1.
DR SMART; SM00327; VWA; 1.
DR SUPFAM; SSF57889; Cysteine-rich domain; 1.
DR SUPFAM; SSF53300; vWA-like; 1.
DR PROSITE; PS50234; VWFA; 1.
DR PROSITE; PS00028; ZINC_FINGER_C2H2_1; 1.
PE 3: Inferred from homology;
KW DNA damage {ECO:0000256|ARBA:ARBA00022763};
KW DNA repair {ECO:0000256|ARBA:ARBA00023204};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723,
KW ECO:0000256|PIRNR:PIRNR015919};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PIRNR:PIRNR015919};
KW Reference proteome {ECO:0000313|Proteomes:UP000002059};
KW Transcription {ECO:0000256|ARBA:ARBA00023163,
KW ECO:0000256|PIRNR:PIRNR015919};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015,
KW ECO:0000256|PIRNR:PIRNR015919};
KW Zinc {ECO:0000256|ARBA:ARBA00022833, ECO:0000256|PIRNR:PIRNR015919};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771}.
FT DOMAIN 106..285
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT ZN_FING 347..364
FT /note="C4-type"
FT /evidence="ECO:0000256|PIRSR:PIRSR015919-1"
FT REGION 1..56
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 292..315
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 422..443
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 493..514
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 27..44
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 422..437
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 494..508
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 514 AA; 55606 MW; B26340D7733CC8F4 CRC64;
MPTSDPEYVA QPSDNDLDEL IASASDDDDE LDTRPSASGR DARSKDARRR KKGRGGAEWE
VSRTWETLVE GADGTIRATV EGLLEAGKRK RVLRDTTPLQ RGIIRHLILI LDLSSAMSEK
DLRPTRYLLT LRYAQDFVRE FFDQNPISQL GVLGMRDGLA VRISDMSGNP TEHILAIQGL
RAKDPKGMPS LQNALEMARG ALFHTPSHGT REVLIIYGAL LSSDPGDIHK TITSLITDKI
HVYVLGLAAQ VSICQELVTR TNNGDDSGYN VAMNEQHFRE LVLNVTTPPA TTLASHTTTT
TGAANGTSTN TSTDGTLLPM GFPNRHLTPH PTLCACHSTP SRSGYLCPRC CTKVCTLPAS
CPSCKLTLIL STHLARSYHH LFPLMNWVEV SWRKAARAEA EGRVGCFACG VGFAGVPGEF
VGAEGDEDRE GEGKGEGKGA SRGISVSGRY ECLVCRCHFC IDCDVFAHEV VHNCPGCQSG
VVQMREQEGL NLGGDANGSG NWNGNGVVDV MDTE
//