GenomeNet

Database: UniProt
Entry: A0A061GZP4_THECC
LinkDB: A0A061GZP4_THECC
Original site: A0A061GZP4_THECC 
ID   A0A061GZP4_THECC        Unreviewed;       212 AA.
AC   A0A061GZP4;
DT   03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT   03-SEP-2014, sequence version 1.
DT   27-MAR-2024, entry version 50.
DE   SubName: Full=Duplicated homeodomain-like superfamily protein, putative {ECO:0000313|EMBL:EOY33389.1};
GN   ORFNames=TCM_041362 {ECO:0000313|EMBL:EOY33389.1};
OS   Theobroma cacao (Cacao) (Cocoa).
OC   Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC   Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC   rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma.
OX   NCBI_TaxID=3641 {ECO:0000313|EMBL:EOY33389.1, ECO:0000313|Proteomes:UP000026915};
RN   [1] {ECO:0000313|EMBL:EOY33389.1, ECO:0000313|Proteomes:UP000026915}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=cv. Matina 1-6 {ECO:0000313|Proteomes:UP000026915};
RX   PubMed=23731509; DOI=10.1186/gb-2013-14-6-r53;
RA   Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Iii D.L.,
RA   Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C.,
RA   Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A.,
RA   Mustiga G.M., Amores F., Phillips W., Marelli J.P., May G.D., Shapiro H.,
RA   Ma J., Bustamante C.D., Schnell R.J., Main D., Gilbert D., Parida L.,
RA   Kuhn D.N.;
RT   "The genome sequence of the most widely cultivated cacao type and its use
RT   to identify candidate genes regulating pod color.";
RL   Genome Biol. 14:R53.1-R53.24(2013).
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; CM001887; EOY33389.1; -; Genomic_DNA.
DR   AlphaFoldDB; A0A061GZP4; -.
DR   STRING; 3641.A0A061GZP4; -.
DR   EnsemblPlants; EOY33389; EOY33389; TCM_041362.
DR   Gramene; EOY33389; EOY33389; TCM_041362.
DR   eggNOG; KOG0724; Eukaryota.
DR   HOGENOM; CLU_060837_4_0_1; -.
DR   InParanoid; A0A061GZP4; -.
DR   OMA; KSAMEVR; -.
DR   Proteomes; UP000026915; Chromosome 9.
DR   GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR   CDD; cd00167; SANT; 2.
DR   Gene3D; 1.10.10.60; Homeodomain-like; 2.
DR   InterPro; IPR009057; Homeobox-like_sf.
DR   InterPro; IPR017930; Myb_dom.
DR   InterPro; IPR006447; Myb_dom_plants.
DR   InterPro; IPR001005; SANT/Myb.
DR   InterPro; IPR017884; SANT_dom.
DR   NCBIfam; TIGR01557; myb_SHAQKYF; 1.
DR   PANTHER; PTHR44042; DUPLICATED HOMEODOMAIN-LIKE SUPERFAMILY PROTEIN-RELATED; 1.
DR   PANTHER; PTHR44042:SF65; MYB-LIKE PROTEIN I; 1.
DR   Pfam; PF00249; Myb_DNA-binding; 2.
DR   SMART; SM00717; SANT; 2.
DR   SUPFAM; SSF46689; Homeodomain-like; 2.
DR   PROSITE; PS51294; HTH_MYB; 1.
DR   PROSITE; PS50090; MYB_LIKE; 2.
DR   PROSITE; PS51293; SANT; 1.
PE   4: Predicted;
KW   DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000313|EMBL:EOY33389.1};
KW   Homeobox {ECO:0000313|EMBL:EOY33389.1};
KW   Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW   Reference proteome {ECO:0000313|Proteomes:UP000026915};
KW   Transcription {ECO:0000256|ARBA:ARBA00023163};
KW   Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT   DOMAIN          4..58
FT                   /note="Myb-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50090"
FT   DOMAIN          102..158
FT                   /note="HTH myb-type"
FT                   /evidence="ECO:0000259|PROSITE:PS51294"
FT   DOMAIN          102..154
FT                   /note="Myb-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50090"
FT   DOMAIN          110..158
FT                   /note="SANT"
FT                   /evidence="ECO:0000259|PROSITE:PS51293"
SQ   SEQUENCE   212 AA;  24522 MW;  32F7203DA12596D6 CRC64;
     MIRSNNDSPS SWSWHQDKLF ERALIMFPED SPDRWEKIAA QLPGKSAMEV RKHYADLEHD
     VMEIESGRVQ MPSYEGELES ASWVNESGGS QGWVGWKKDR ESERRKGVPW TEEEHRLFLI
     GLQKYGKGDW RSISRNAVVS RTPTQVASHA QKYFLRLNSI TKKDKKRSSI HDITMADDIT
     NQNGVGTMGD HSSIELMDQS KFFNEQGRSF WE
//
DBGET integrated database retrieval system