GenomeNet

Database: UniProt
Entry: A0A061FQF4_THECC
LinkDB: A0A061FQF4_THECC
Original site: A0A061FQF4_THECC 
ID   A0A061FQF4_THECC        Unreviewed;       302 AA.
AC   A0A061FQF4;
DT   03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT   03-SEP-2014, sequence version 1.
DT   27-MAR-2024, entry version 41.
DE   SubName: Full=Global transcription factor group, putative isoform 1 {ECO:0000313|EMBL:EOY19550.1};
GN   ORFNames=TCM_044683 {ECO:0000313|EMBL:EOY19550.1};
OS   Theobroma cacao (Cacao) (Cocoa).
OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC   Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC   rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma.
OX   NCBI_TaxID=3641 {ECO:0000313|EMBL:EOY19550.1, ECO:0000313|Proteomes:UP000026915};
RN   [1] {ECO:0000313|EMBL:EOY19550.1, ECO:0000313|Proteomes:UP000026915}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=cv. Matina 1-6 {ECO:0000313|Proteomes:UP000026915};
RX   PubMed=23731509; DOI=10.1186/gb-2013-14-6-r53;
RA   Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Iii D.L.,
RA   Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C.,
RA   Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A.,
RA   Mustiga G.M., Amores F., Phillips W., Marelli J.P., May G.D., Shapiro H.,
RA   Ma J., Bustamante C.D., Schnell R.J., Main D., Gilbert D., Parida L.,
RA   Kuhn D.N.;
RT   "The genome sequence of the most widely cultivated cacao type and its use
RT   to identify candidate genes regulating pod color.";
RL   Genome Biol. 14:R53.1-R53.24(2013).
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; CM001888; EOY19550.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A061FQF4; -.
DR   STRING; 3641.A0A061FQF4; -.
DR   EnsemblPlants; EOY19550; EOY19550; TCM_044683.
DR   Gramene; EOY19550; EOY19550; TCM_044683.
DR   eggNOG; KOG1856; Eukaryota.
DR   InParanoid; A0A061FQF4; -.
DR   Proteomes; UP000026915; Chromosome 10.
DR   GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR   GO; GO:0140673; P:transcription elongation-coupled chromatin remodeling; IEA:InterPro.
DR   CDD; cd09918; SH2_Nterm_SPT6_like; 1.
DR   Gene3D; 2.40.50.140; Nucleic acid-binding proteins; 1.
DR   Gene3D; 3.30.505.10; SH2 domain; 2.
DR   InterPro; IPR012340; NA-bd_OB-fold.
DR   InterPro; IPR003029; S1_domain.
DR   InterPro; IPR036860; SH2_dom_sf.
DR   InterPro; IPR049540; Spt6-like_S1.
DR   InterPro; IPR035420; Spt6_SH2.
DR   InterPro; IPR035019; Spt6_SH2_N.
DR   InterPro; IPR017072; TF_Spt6.
DR   PANTHER; PTHR10145; TRANSCRIPTION ELONGATION FACTOR SPT6; 1.
DR   PANTHER; PTHR10145:SF6; TRANSCRIPTION ELONGATION FACTOR SPT6; 1.
DR   Pfam; PF14633; SH2_2; 1.
DR   Pfam; PF21710; Spt6_S1; 1.
DR   SMART; SM00316; S1; 1.
DR   SUPFAM; SSF50249; Nucleic acid-binding proteins; 1.
DR   SUPFAM; SSF55550; SH2 domain; 1.
DR   PROSITE; PS50126; S1; 1.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000026915}.
FT   DOMAIN          13..83
FT                   /note="S1 motif"
FT                   /evidence="ECO:0000259|PROSITE:PS50126"
SQ   SEQUENCE   302 AA;  34925 MW;  3C6FBA08DA0903A2 CRC64;
     MICGKNGTAL AEGSLVLAIV RHVESQRAFC VLDSGLTGII MKDDFSDEDG DFALEDKLHE
     GDKVSCKVKQ IDKSTFQAFL TCKESEVKRS RYEDILEVDP YYHESGNILL NQQEKACMDE
     KLDKKHFKPR TISHPFFRNM TLDQAMEFLS DKDAGESIFR PSSRGPSYLT LTLKVFDELY
     LSKDIVESGK DHKDMTSLLH LGKVLKIGND KFRDLDEVRD RYVIPLVKHL KEMLGFQKFK
     RGAKSEVDEV LRAEKLEYPM RVVYCFGISY EHPGTFILSY IKSRNLHHES MYNWWFARPV
     EL
//
DBGET integrated database retrieval system