ID A0A251VIC9_HELAN Unreviewed; 418 AA.
AC A0A251VIC9;
DT 22-NOV-2017, integrated into UniProtKB/TrEMBL.
DT 22-NOV-2017, sequence version 1.
DT 24-JAN-2024, entry version 28.
DE RecName: Full=General transcription factor IIH subunit {ECO:0000256|PIRNR:PIRNR015919};
GN Name=ATGTF2H2 {ECO:0000313|EMBL:OTG35355.1};
GN ORFNames=HannXRQ_Chr02g0055911 {ECO:0000313|EMBL:OTG35355.1},
GN HanXRQr2_Chr02g0082361 {ECO:0000313|EMBL:KAF5819860.1};
OS Helianthus annuus (Common sunflower).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC asterids; campanulids; Asterales; Asteraceae; Asteroideae;
OC Heliantheae alliance; Heliantheae; Helianthus.
OX NCBI_TaxID=4232 {ECO:0000313|EMBL:OTG35355.1, ECO:0000313|Proteomes:UP000215914};
RN [1] {ECO:0000313|EMBL:KAF5819860.1, ECO:0000313|Proteomes:UP000215914}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. SF193 {ECO:0000313|Proteomes:UP000215914};
RC TISSUE=Leaves {ECO:0000313|EMBL:KAF5819860.1};
RX PubMed=28538728; DOI=10.1038/nature22380;
RA Badouin H., Gouzy J., Grassa C.J., Murat F., Staton S.E., Cottret L.,
RA Lelandais-Briere C., Owens G.L., Carrere S., Mayjonade B., Legrand L.,
RA Gill N., Kane N.C., Bowers J.E., Hubner S., Bellec A., Berard A.,
RA Berges H., Blanchet N., Boniface M.C., Brunel D., Catrice O., Chaidir N.,
RA Claudel C., Donnadieu C., Faraut T., Fievet G., Helmstetter N., King M.,
RA Knapp S.J., Lai Z., Le Paslier M.C., Lippi Y., Lorenzon L., Mandel J.R.,
RA Marage G., Marchand G., Marquand E., Bret-Mestries E., Morien E.,
RA Nambeesan S., Nguyen T., Pegot-Espagnet P., Pouilly N., Raftis F.,
RA Sallet E., Schiex T., Thomas J., Vandecasteele C., Vares D., Vear F.,
RA Vautrin S., Crespi M., Mangin B., Burke J.M., Salse J., Munos S.,
RA Vincourt P., Rieseberg L.H., Langlade N.B.;
RT "The sunflower genome provides insights into oil metabolism, flowering and
RT Asterid evolution.";
RL Nature 546:148-152(2017).
RN [2] {ECO:0000313|EMBL:OTG35355.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC TISSUE=Leaves {ECO:0000313|EMBL:OTG35355.1};
RA Langlade N., Munos S.;
RT "Sunflower complete genome.";
RL Submitted (FEB-2017) to the EMBL/GenBank/DDBJ databases.
RN [3] {ECO:0000313|EMBL:KAF5819860.1}
RP NUCLEOTIDE SEQUENCE.
RC TISSUE=Leaves {ECO:0000313|EMBL:KAF5819860.1};
RA Gouzy J., Langlade N., Munos S.;
RT "Helianthus annuus Genome sequencing and assembly Release 2.";
RL Submitted (JUN-2020) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PIRNR:PIRNR015919}.
CC -!- SIMILARITY: Belongs to the GTF2H2 family.
CC {ECO:0000256|ARBA:ARBA00006092, ECO:0000256|PIRNR:PIRNR015919}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; MNCJ02000317; KAF5819860.1; -; Genomic_DNA.
DR EMBL; CM007891; OTG35355.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A251VIC9; -.
DR STRING; 4232.A0A251VIC9; -.
DR EnsemblPlants; mRNA:HanXRQr2_Chr02g0082361; mRNA:HanXRQr2_Chr02g0082361; HanXRQr2_Chr02g0082361.
DR Gramene; mRNA:HanXRQr2_Chr02g0082361; mRNA:HanXRQr2_Chr02g0082361; HanXRQr2_Chr02g0082361.
DR InParanoid; A0A251VIC9; -.
DR OMA; INWVEVP; -.
DR OrthoDB; 276422at2759; -.
DR Proteomes; UP000215914; Chromosome 2.
DR GO; GO:0000439; C:transcription factor TFIIH core complex; IEA:InterPro.
DR GO; GO:0005675; C:transcription factor TFIIH holo complex; IBA:GO_Central.
DR GO; GO:0008270; F:zinc ion binding; IEA:UniProtKB-UniRule.
DR GO; GO:0006351; P:DNA-templated transcription; IEA:InterPro.
DR GO; GO:0006289; P:nucleotide-excision repair; IBA:GO_Central.
DR GO; GO:0006357; P:regulation of transcription by RNA polymerase II; IBA:GO_Central.
DR CDD; cd01453; vWA_transcription_factor_IIH_type; 1.
DR Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 1.
DR Gene3D; 3.30.40.10; Zinc/RING finger domain, C3HC4 (zinc finger); 1.
DR InterPro; IPR046349; C1-like_sf.
DR InterPro; IPR007198; Ssl1-like.
DR InterPro; IPR004595; TFIIH_C1-like_dom.
DR InterPro; IPR012170; TFIIH_SSL1/p44.
DR InterPro; IPR002035; VWF_A.
DR InterPro; IPR036465; vWFA_dom_sf.
DR InterPro; IPR013087; Znf_C2H2_type.
DR InterPro; IPR013083; Znf_RING/FYVE/PHD.
DR NCBIfam; TIGR00622; ssl1; 1.
DR PANTHER; PTHR12695; GENERAL TRANSCRIPTION FACTOR IIH SUBUNIT 2; 1.
DR PANTHER; PTHR12695:SF2; GENERAL TRANSCRIPTION FACTOR IIH SUBUNIT 2-RELATED; 1.
DR Pfam; PF07975; C1_4; 1.
DR Pfam; PF04056; Ssl1; 1.
DR PIRSF; PIRSF015919; TFIIH_SSL1; 1.
DR SMART; SM01047; C1_4; 1.
DR SMART; SM00327; VWA; 1.
DR SUPFAM; SSF57889; Cysteine-rich domain; 1.
DR SUPFAM; SSF53300; vWA-like; 1.
DR PROSITE; PS50234; VWFA; 1.
DR PROSITE; PS00028; ZINC_FINGER_C2H2_1; 1.
PE 3: Inferred from homology;
KW DNA damage {ECO:0000256|ARBA:ARBA00022763};
KW DNA repair {ECO:0000256|ARBA:ARBA00023204};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723,
KW ECO:0000256|PIRNR:PIRNR015919};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PIRNR:PIRNR015919};
KW Reference proteome {ECO:0000313|Proteomes:UP000215914};
KW Transcription {ECO:0000256|ARBA:ARBA00023163,
KW ECO:0000256|PIRNR:PIRNR015919};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015,
KW ECO:0000256|PIRNR:PIRNR015919};
KW Zinc {ECO:0000256|ARBA:ARBA00022833, ECO:0000256|PIRNR:PIRNR015919};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771}.
FT DOMAIN 86..275
FT /note="VWFA"
FT /evidence="ECO:0000259|PROSITE:PS50234"
FT ZN_FING 305..322
FT /note="C4-type"
FT /evidence="ECO:0000256|PIRSR:PIRSR015919-1"
FT REGION 1..27
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 418 AA; 46448 MW; 793EC84EB32AC6E8 CRC64;
MSNNGKRKPV NEEEDDDEED GNERDLEAWE KAYADERSWE SLQEDESGLL RPIDNQALHH
AQYRRRLRSL SSASAASRIQ KGLIRYLYLV IDLSKAAGEM DLRPSRMVVV AKQVEAFIRE
FFDQNPLSQI GLVVIKDGVA QCLTDLGGSP ESHIKALMGK LGCSGEASLQ NALELVHEQL
NQIPSYGHRE VIILYSALST CDPGDVMETI QKCKKSKIRC SVIGLSAEIY ICKYLCQETG
GLYSVALDEA HLKDLILEHA PPPPAIAEFA IANLIKMGFP QRAAEGVISI CSCHKEAKFG
GGYICPRCKA RVCELPTECR ICGLTLVSSP HLARSYHHLF PVTPFDDVAP MLVPNQHRRP
KSCFGCQQSL LNPGNMPVRC VTCPKCKQFF CLDCDIYIHE SLHNCPGCES LRDSKSVN
//