GenomeNet

Database: UniProt
Entry: A0A0D2S1Q4_GOSRA
ID   A0A0D2S1Q4_GOSRA        Unreviewed;       413 AA.
AC   A0A0D2S1Q4;
DT   29-APR-2015, integrated into UniProtKB/TrEMBL.
DT   29-APR-2015, sequence version 1.
DT   24-JAN-2024, entry version 39.
DE   RecName: Full=General transcription factor IIH subunit {ECO:0000256|PIRNR:PIRNR015919};
GN   ORFNames=B456_006G120700 {ECO:0000313|EMBL:KJB35571.1}, Gorai_000799
GN   {ECO:0000313|EMBL:MBA0587676.1};
OS   Gossypium raimondii (New World cotton).
OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC   Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC   rosids; malvids; Malvales; Malvaceae; Malvoideae; Gossypium.
OX   NCBI_TaxID=29730 {ECO:0000313|EMBL:KJB35571.1, ECO:0000313|Proteomes:UP000032304};
RN   [1] {ECO:0000313|EMBL:KJB35571.1, ECO:0000313|Proteomes:UP000032304}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=23257886; DOI=10.1038/nature11798;
RA   Paterson A.H., Wendel J.F., Gundlach H., Guo H., Jenkins J., Jin D.,
RA   Llewellyn D., Showmaker K.C., Shu S., Udall J., Yoo M.J., Byers R.,
RA   Chen W., Doron-Faigenboim A., Duke M.V., Gong L., Grimwood J., Grover C.,
RA   Grupp K., Hu G., Lee T.H., Li J., Lin L., Liu T., Marler B.S., Page J.T.,
RA   Roberts A.W., Romanel E., Sanders W.S., Szadkowski E., Tan X., Tang H.,
RA   Xu C., Wang J., Wang Z., Zhang D., Zhang L., Ashrafi H., Bedon F.,
RA   Bowers J.E., Brubaker C.L., Chee P.W., Das S., Gingle A.R., Haigler C.H.,
RA   Harker D., Hoffmann L.V., Hovav R., Jones D.C., Lemke C., Mansoor S.,
RA   ur Rahman M., Rainville L.N., Rambani A., Reddy U.K., Rong J.K.,
RA   Saranga Y., Scheffler B.E., Scheffler J.A., Stelly D.M., Triplett B.A.,
RA   Van Deynze A., Vaslin M.F., Waghmare V.N., Walford S.A., Wright R.J.,
RA   Zaki E.A., Zhang T., Dennis E.S., Mayer K.F., Peterson D.G., Rokhsar D.S.,
RA   Wang X., Schmutz J.;
RT   "Repeated polyploidization of Gossypium genomes and the evolution of
RT   spinnable cotton fibres.";
RL   Nature 492:423-427(2012).
RN   [2] {ECO:0000313|EMBL:MBA0587676.1, ECO:0000313|Proteomes:UP000593578}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=8 {ECO:0000313|EMBL:MBA0587676.1};
RC   TISSUE=Leaf {ECO:0000313|EMBL:MBA0587676.1};
RX   PubMed=30476109;
RA   Grover C.E., Arick M.A. 2nd, Thrash A., Conover J.L., Sanders W.S.,
RA   Peterson D.G., Frelichowski J.E., Scheffler J.A., Scheffler B.E.,
RA   Wendel J.F.;
RT   "Insights into the Evolution of the New World Diploid Cottons (Gossypium,
RT   Subgenus Houzingenia) Based on Genome Sequencing.";
RL   Genome Biol. Evol. 11:53-71(2019).
RN   [3] {ECO:0000313|EMBL:MBA0587676.1}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=8 {ECO:0000313|EMBL:MBA0587676.1};
RC   TISSUE=Leaf {ECO:0000313|EMBL:MBA0587676.1};
RA   Grover C.E., Arick M.A. II, Thrash A., Conover J.L., Sanders W.S.,
RA   Peterson D.G., Scheffler J.A., Scheffler B.E., Wendel J.F.;
RL   Submitted (APR-2020) to the EMBL/GenBank/DDBJ databases.
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC       ECO:0000256|PIRNR:PIRNR015919}.
CC   -!- SIMILARITY: Belongs to the GTF2H2 family.
CC       {ECO:0000256|ARBA:ARBA00006092, ECO:0000256|PIRNR:PIRNR015919}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; CM001745; KJB35568.1; -; Genomic_DNA.
DR   EMBL; CM001745; KJB35571.1; -; Genomic_DNA.
DR   EMBL; CM001745; KJB35572.1; -; Genomic_DNA.
DR   EMBL; CM001745; KJB35573.1; -; Genomic_DNA.
DR   EMBL; JABEZZ010000006; MBA0587676.1; -; Genomic_DNA.
DR   RefSeq; XP_012485244.1; XM_012629790.1.
DR   RefSeq; XP_012485245.1; XM_012629791.1.
DR   RefSeq; XP_012485246.1; XM_012629792.1.
DR   AlphaFoldDB; A0A0D2S1Q4; -.
DR   STRING; 29730.A0A0D2S1Q4; -.
DR   EnsemblPlants; KJB35568; KJB35568; B456_006G120700.
DR   EnsemblPlants; KJB35571; KJB35571; B456_006G120700.
DR   EnsemblPlants; KJB35572; KJB35572; B456_006G120700.
DR   EnsemblPlants; KJB35573; KJB35573; B456_006G120700.
DR   GeneID; 105799309; -.
DR   Gramene; KJB35568; KJB35568; B456_006G120700.
DR   Gramene; KJB35571; KJB35571; B456_006G120700.
DR   Gramene; KJB35572; KJB35572; B456_006G120700.
DR   Gramene; KJB35573; KJB35573; B456_006G120700.
DR   KEGG; gra:105799309; -.
DR   OMA; INWVEVP; -.
DR   OrthoDB; 276422at2759; -.
DR   Proteomes; UP000032304; Chromosome 6.
DR   Proteomes; UP000593578; Unassembled WGS sequence.
DR   GO; GO:0000439; C:transcription factor TFIIH core complex; IEA:InterPro.
DR   GO; GO:0005675; C:transcription factor TFIIH holo complex; IEA:UniProtKB-UniRule.
DR   GO; GO:0008270; F:zinc ion binding; IEA:UniProtKB-UniRule.
DR   GO; GO:0006351; P:DNA-templated transcription; IEA:InterPro.
DR   GO; GO:0006289; P:nucleotide-excision repair; IEA:UniProtKB-UniRule.
DR   CDD; cd01453; vWA_transcription_factor_IIH_type; 1.
DR   Gene3D; 3.40.50.410; von Willebrand factor, type A domain; 1.
DR   Gene3D; 3.30.40.10; Zinc/RING finger domain, C3HC4 (zinc finger); 1.
DR   InterPro; IPR046349; C1-like_sf.
DR   InterPro; IPR007198; Ssl1-like.
DR   InterPro; IPR004595; TFIIH_C1-like_dom.
DR   InterPro; IPR012170; TFIIH_SSL1/p44.
DR   InterPro; IPR002035; VWF_A.
DR   InterPro; IPR036465; vWFA_dom_sf.
DR   InterPro; IPR013087; Znf_C2H2_type.
DR   InterPro; IPR013083; Znf_RING/FYVE/PHD.
DR   NCBIfam; TIGR00622; ssl1; 1.
DR   PANTHER; PTHR12695; GENERAL TRANSCRIPTION FACTOR IIH SUBUNIT 2; 1.
DR   PANTHER; PTHR12695:SF2; GENERAL TRANSCRIPTION FACTOR IIH SUBUNIT 2-RELATED; 1.
DR   Pfam; PF07975; C1_4; 1.
DR   Pfam; PF04056; Ssl1; 1.
DR   PIRSF; PIRSF015919; TFIIH_SSL1; 1.
DR   SMART; SM01047; C1_4; 1.
DR   SMART; SM00327; VWA; 1.
DR   SUPFAM; SSF57889; Cysteine-rich domain; 1.
DR   SUPFAM; SSF53300; vWA-like; 1.
DR   PROSITE; PS50234; VWFA; 1.
DR   PROSITE; PS00028; ZINC_FINGER_C2H2_1; 1.
PE   3: Inferred from homology;
KW   DNA damage {ECO:0000256|ARBA:ARBA00022763};
KW   DNA repair {ECO:0000256|ARBA:ARBA00023204};
KW   Metal-binding {ECO:0000256|ARBA:ARBA00022723,
KW   ECO:0000256|PIRNR:PIRNR015919};
KW   Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PIRNR:PIRNR015919};
KW   Reference proteome {ECO:0000313|Proteomes:UP000032304};
KW   Transcription {ECO:0000256|ARBA:ARBA00023163,
KW   ECO:0000256|PIRNR:PIRNR015919};
KW   Transcription regulation {ECO:0000256|ARBA:ARBA00023015,
KW   ECO:0000256|PIRNR:PIRNR015919};
KW   Zinc {ECO:0000256|ARBA:ARBA00022833, ECO:0000256|PIRNR:PIRNR015919};
KW   Zinc-finger {ECO:0000256|ARBA:ARBA00022771}.
FT   DOMAIN          84..223
FT                   /note="VWFA"
FT                   /evidence="ECO:0000259|PROSITE:PS50234"
FT   ZN_FING         303..320
FT                   /note="C4-type"
FT                   /evidence="ECO:0000256|PIRSR:PIRSR015919-1"
FT   REGION          1..27
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   413 AA;  45806 MW;  3F8680ECC236D4D6 CRC64;
     MNNGAGRRIN GGAEEDDDED DVNGDLDAWE RTYTDERSWE SLQEDESGML RPIDNQALYH
     SQYRRRLRSL SSTAARIQKG LIRYLYIVID LSRAASEMDF RPSRIAVIAK HVETFIREFF
     YQNPLSQVGI VTIKDGVAHC LTDIGGSPES HINALMKKLE CSGDSSLQNA LDLVDGYLNQ
     IPSYGHREVL ILYSALSTCD PGDIMDTIQK CRKSKIRCSV IGLAAEMFIC KHLCQETGGT
     YSVALDESHF KELILEHAPP PPAIAEFAVA NLIKMGFPQR AAEGSISICS CHKEAKVGAG
     YTCPRCKARA CELPTECCVC GLTLVSSPHL ARSYHHLFPI APFDEVTSSH LNNPNCKLQR
     NCFGCQQSLL DPGNKPGPAV VCPKCKRYFC LDCDIYIHES LHNCPGCDSL RHS
//
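
The record above follows the UniProtKB flat-file layout: a two-letter line code, three spaces, then the data, with the sequence block after the SQ line and a terminating "//". The following is a minimal sketch, not an official parser, of how such a record could be read and sanity-checked in Python. The function name parse_flatfile and the abbreviated SAMPLE text (a subset of the lines above) are assumptions made for illustration only.

```python
from collections import defaultdict

# Abbreviated copy of the record above (illustrative subset of its lines).
SAMPLE = """\
ID   A0A0D2S1Q4_GOSRA        Unreviewed;       413 AA.
AC   A0A0D2S1Q4;
OS   Gossypium raimondii (New World cotton).
FT   ZN_FING         303..320
FT                   /note="C4-type"
SQ   SEQUENCE   413 AA;  45806 MW;  3F8680ECC236D4D6 CRC64;
     MNNGAGRRIN GGAEEDDDED DVNGDLDAWE RTYTDERSWE SLQEDESGML RPIDNQALYH
     SQYRRRLRSL SSTAARIQKG LIRYLYIVID LSRAASEMDF RPSRIAVIAK HVETFIREFF
     YQNPLSQVGI VTIKDGVAHC LTDIGGSPES HINALMKKLE CSGDSSLQNA LDLVDGYLNQ
     IPSYGHREVL ILYSALSTCD PGDIMDTIQK CRKSKIRCSV IGLAAEMFIC KHLCQETGGT
     YSVALDESHF KELILEHAPP PPAIAEFAVA NLIKMGFPQR AAEGSISICS CHKEAKVGAG
     YTCPRCKARA CELPTECCVC GLTLVSSPHL ARSYHHLFPI APFDEVTSSH LNNPNCKLQR
     NCFGCQQSLL DPGNKPGPAV VCPKCKRYFC LDCDIYIHES LHNCPGCDSL RHS
//
"""


def parse_flatfile(text):
    """Group lines by two-letter line code and collect the sequence.

    Hypothetical helper: returns (fields, sequence), where fields maps a
    line code such as "AC" to the list of its data strings.
    """
    fields = defaultdict(list)
    chunks = []
    in_sequence = False
    for line in text.splitlines():
        if line.startswith("//"):
            break  # end-of-record marker
        if in_sequence:
            # Sequence lines are blocks of 10 residues separated by spaces.
            chunks.append("".join(line.split()))
            continue
        code = line[:2]
        if len(line) > 5 and line[2:5] == "   " and code.isalpha() and code.isupper():
            fields[code].append(line[5:].strip())
            if code == "SQ":
                in_sequence = True
    return dict(fields), "".join(chunks)


fields, seq = parse_flatfile(SAMPLE)

# Length declared on the SQ line ("SEQUENCE   413 AA; ...").
declared_length = int(fields["SQ"][0].split()[1])
print(declared_length, len(seq))

# FT coordinates are 1-based and inclusive, so 303..320 maps to seq[302:320].
zinc_finger = seq[302:320]
print(zinc_finger, zinc_finger.count("C"))
```

Comparing the declared length with the parsed length is a cheap way to catch a truncated or mis-copied sequence block. For real work, an established parser (for example the SwissProt module in Biopython) is likely a better choice than this sketch.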