GenomeNet

Database: UniProt
Entry: A0A061DQ32_THECC
ID   A0A061DQ32_THECC        Unreviewed;       356 AA.
AC   A0A061DQ32;
DT   03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT   03-SEP-2014, sequence version 1.
DT   27-MAR-2024, entry version 39.
DE   RecName: Full=Transcription factor {ECO:0000256|RuleBase:RU369104};
DE            Short=bHLH transcription factor {ECO:0000256|RuleBase:RU369104};
DE   AltName: Full=Basic helix-loop-helix protein {ECO:0000256|RuleBase:RU369104};
GN   ORFNames=TCM_004140 {ECO:0000313|EMBL:EOX94532.1};
OS   Theobroma cacao (Cacao) (Cocoa).
OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC   Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC   rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma.
OX   NCBI_TaxID=3641 {ECO:0000313|EMBL:EOX94532.1, ECO:0000313|Proteomes:UP000026915};
RN   [1] {ECO:0000313|EMBL:EOX94532.1, ECO:0000313|Proteomes:UP000026915}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=cv. Matina 1-6 {ECO:0000313|Proteomes:UP000026915};
RX   PubMed=23731509; DOI=10.1186/gb-2013-14-6-r53;
RA   Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Iii D.L.,
RA   Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C.,
RA   Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A.,
RA   Mustiga G.M., Amores F., Phillips W., Marelli J.P., May G.D., Shapiro H.,
RA   Ma J., Bustamante C.D., Schnell R.J., Main D., Gilbert D., Parida L.,
RA   Kuhn D.N.;
RT   "The genome sequence of the most widely cultivated cacao type and its use
RT   to identify candidate genes regulating pod color.";
RL   Genome Biol. 14:R53.1-R53.24(2013).
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC       ECO:0000256|RuleBase:RU369104}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; CM001879; EOX94531.1; -; Genomic_DNA.
DR   EMBL; CM001879; EOX94532.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A061DQ32; -.
DR   STRING; 3641.A0A061DQ32; -.
DR   EnsemblPlants; EOX94531; EOX94531; TCM_004140.
DR   EnsemblPlants; EOX94532; EOX94532; TCM_004140.
DR   Gramene; EOX94531; EOX94531; TCM_004140.
DR   Gramene; EOX94532; EOX94532; TCM_004140.
DR   eggNOG; ENOG502QUEW; Eukaryota.
DR   HOGENOM; CLU_021132_0_1_1; -.
DR   InParanoid; A0A061DQ32; -.
DR   OMA; AVVRISC; -.
DR   Proteomes; UP000026915; Chromosome 1.
DR   GO; GO:0005634; C:nucleus; IBA:GO_Central.
DR   GO; GO:0003700; F:DNA-binding transcription factor activity; IBA:GO_Central.
DR   GO; GO:0000976; F:transcription cis-regulatory region binding; IBA:GO_Central.
DR   GO; GO:0010629; P:negative regulation of gene expression; IEA:EnsemblPlants.
DR   GO; GO:0006355; P:regulation of DNA-templated transcription; IBA:GO_Central.
DR   InterPro; IPR045084; AIB/MYC-like.
DR   InterPro; IPR025610; MYC/MYB_N.
DR   PANTHER; PTHR11514; MYC; 1.
DR   PANTHER; PTHR11514:SF53; TRANSCRIPTION FACTOR BHLH3; 1.
DR   Pfam; PF14215; bHLH-MYC_N; 1.
PE   4: Predicted;
KW   DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW   Nucleus {ECO:0000256|RuleBase:RU369104};
KW   Reference proteome {ECO:0000313|Proteomes:UP000026915};
KW   Transcription {ECO:0000256|RuleBase:RU369104};
KW   Transcription regulation {ECO:0000256|RuleBase:RU369104}.
FT   DOMAIN          50..234
FT                   /note="Transcription factor MYC/MYB N-terminal"
FT                   /evidence="ECO:0000259|Pfam:PF14215"
FT   REGION          321..356
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        333..356
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   356 AA;  38733 MW;  3B654A1770F4C6EA CRC64;
     MGEKFWVNEE DKAKVESVLG AEACEFLISL ASKQVLSELV TRPPSDLGVQ QRLCQIVDGS
     NWNYAIFWQV SSLKSGGSIL IWGDGHCRDP KLGGVGDAST SGDGKLEGVE DKNEVKKLVL
     QKLHACFGGS EEDNYAAKLD GVSDMEMFYL TSMHFTFHCD SSYGPGESYK SSRSIWTSDV
     NNCSDHYQSR SFLARSAGLQ TVVFIPVKSG VVELGSINLI PEEQNSVEMV NNVFGGSSSV
     QTKTIPKIFG RELSLGGSKS RSISINFSPK VEDESGFTLE TYDVQALGSN QIYGNSSNGC
     RSDDGEAKLF PQLLVGGFNA QARISGLEQP KDDSPSLPDE RKPRKRGSQP MGEKNH
//