GenomeNet

Database: UniProt
Entry: A0A061DZ79_THECC
LinkDB: A0A061DZ79_THECC
Original site: A0A061DZ79_THECC 
ID   A0A061DZ79_THECC        Unreviewed;       266 AA.
AC   A0A061DZ79;
DT   03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT   03-SEP-2014, sequence version 1.
DT   08-NOV-2023, entry version 33.
DE   SubName: Full=Basic-leucine zipper transcription factor family protein isoform 1 {ECO:0000313|EMBL:EOX97657.1};
GN   ORFNames=TCM_006625 {ECO:0000313|EMBL:EOX97657.1};
OS   Theobroma cacao (Cacao) (Cocoa).
OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC   Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC   rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma.
OX   NCBI_TaxID=3641 {ECO:0000313|EMBL:EOX97657.1, ECO:0000313|Proteomes:UP000026915};
RN   [1] {ECO:0000313|EMBL:EOX97657.1, ECO:0000313|Proteomes:UP000026915}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=cv. Matina 1-6 {ECO:0000313|Proteomes:UP000026915};
RX   PubMed=23731509; DOI=10.1186/gb-2013-14-6-r53;
RA   Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Iii D.L.,
RA   Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C.,
RA   Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A.,
RA   Mustiga G.M., Amores F., Phillips W., Marelli J.P., May G.D., Shapiro H.,
RA   Ma J., Bustamante C.D., Schnell R.J., Main D., Gilbert D., Parida L.,
RA   Kuhn D.N.;
RT   "The genome sequence of the most widely cultivated cacao type and its use
RT   to identify candidate genes regulating pod color.";
RL   Genome Biol. 14:R53.1-R53.24(2013).
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; CM001880; EOX97657.1; -; Genomic_DNA.
DR   EMBL; CM001880; EOX97660.1; -; Genomic_DNA.
DR   EMBL; CM001880; EOX97661.1; -; Genomic_DNA.
DR   EMBL; CM001880; EOX97662.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A061DZ79; -.
DR   EnsemblPlants; EOX97657; EOX97657; TCM_006625.
DR   EnsemblPlants; EOX97660; EOX97660; TCM_006625.
DR   EnsemblPlants; EOX97661; EOX97661; TCM_006625.
DR   EnsemblPlants; EOX97662; EOX97662; TCM_006625.
DR   Gramene; EOX97657; EOX97657; TCM_006625.
DR   Gramene; EOX97660; EOX97660; TCM_006625.
DR   Gramene; EOX97661; EOX97661; TCM_006625.
DR   Gramene; EOX97662; EOX97662; TCM_006625.
DR   Proteomes; UP000026915; Chromosome 2.
DR   GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:InterPro.
DR   GO; GO:0006351; P:DNA-templated transcription; IEA:InterPro.
DR   CDD; cd14686; bZIP; 1.
DR   Gene3D; 1.20.5.170; -; 1.
DR   InterPro; IPR004827; bZIP.
DR   InterPro; IPR046347; bZIP_sf.
DR   InterPro; IPR031106; C/EBP.
DR   PANTHER; PTHR23334:SF49; BASIC LEUCINE ZIPPER 23; 1.
DR   PANTHER; PTHR23334; CCAAT/ENHANCER BINDING PROTEIN; 1.
DR   Pfam; PF07716; bZIP_2; 1.
DR   SMART; SM00338; BRLZ; 1.
DR   SUPFAM; SSF57959; Leucine zipper domain; 1.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000026915}.
FT   DOMAIN          84..151
FT                   /note="BZIP"
FT                   /evidence="ECO:0000259|SMART:SM00338"
FT   REGION          72..99
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          239..266
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        76..99
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        239..257
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   266 AA;  28956 MW;  7DC43F05987A7AE5 CRC64;
     MDDGELDFLN QEVFSGNMAD IPSSCSMDSF FDELLNDSHA CTHTHTCNPP GPDNSHTHTC
     FHVHTKIVPA PTEDKAAIDD TAESREKKSK KRPLGNREAV RKYREKVKAR AASLEDEVVR
     LRALNQQLLK RLQGQAALEA EIARLKCLLV DIRGRIEGEI GSFPYQKSTT NVNMMNLPGA
     YVMNPCNVQC NDQMYCLHPG ADGKTGEVAE LNGQGFNVCE FDNLPCLANQ NSGEKELSTY
     GVGSAGSNGN SSGTKRRKGA HAATAG
//
DBGET integrated database retrieval system