ID A0A061GZP4_THECC Unreviewed; 212 AA.
AC A0A061GZP4;
DT 03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT 03-SEP-2014, sequence version 1.
DT 27-MAR-2024, entry version 50.
DE SubName: Full=Duplicated homeodomain-like superfamily protein, putative {ECO:0000313|EMBL:EOY33389.1};
GN ORFNames=TCM_041362 {ECO:0000313|EMBL:EOY33389.1};
OS Theobroma cacao (Cacao) (Cocoa).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma.
OX NCBI_TaxID=3641 {ECO:0000313|EMBL:EOY33389.1, ECO:0000313|Proteomes:UP000026915};
RN [1] {ECO:0000313|EMBL:EOY33389.1, ECO:0000313|Proteomes:UP000026915}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Matina 1-6 {ECO:0000313|Proteomes:UP000026915};
RX PubMed=23731509; DOI=10.1186/gb-2013-14-6-r53;
RA Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Iii D.L.,
RA Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C.,
RA Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A.,
RA Mustiga G.M., Amores F., Phillips W., Marelli J.P., May G.D., Shapiro H.,
RA Ma J., Bustamante C.D., Schnell R.J., Main D., Gilbert D., Parida L.,
RA Kuhn D.N.;
RT "The genome sequence of the most widely cultivated cacao type and its use
RT to identify candidate genes regulating pod color.";
RL Genome Biol. 14:R53.1-R53.24(2013).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM001887; EOY33389.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A061GZP4; -.
DR STRING; 3641.A0A061GZP4; -.
DR EnsemblPlants; EOY33389; EOY33389; TCM_041362.
DR Gramene; EOY33389; EOY33389; TCM_041362.
DR eggNOG; KOG0724; Eukaryota.
DR HOGENOM; CLU_060837_4_0_1; -.
DR InParanoid; A0A061GZP4; -.
DR OMA; KSAMEVR; -.
DR Proteomes; UP000026915; Chromosome 9.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR CDD; cd00167; SANT; 2.
DR Gene3D; 1.10.10.60; Homeodomain-like; 2.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017930; Myb_dom.
DR InterPro; IPR006447; Myb_dom_plants.
DR InterPro; IPR001005; SANT/Myb.
DR InterPro; IPR017884; SANT_dom.
DR NCBIfam; TIGR01557; myb_SHAQKYF; 1.
DR PANTHER; PTHR44042; DUPLICATED HOMEODOMAIN-LIKE SUPERFAMILY PROTEIN-RELATED; 1.
DR PANTHER; PTHR44042:SF65; MYB-LIKE PROTEIN I; 1.
DR Pfam; PF00249; Myb_DNA-binding; 2.
DR SMART; SM00717; SANT; 2.
DR SUPFAM; SSF46689; Homeodomain-like; 2.
DR PROSITE; PS51294; HTH_MYB; 1.
DR PROSITE; PS50090; MYB_LIKE; 2.
DR PROSITE; PS51293; SANT; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000313|EMBL:EOY33389.1};
KW Homeobox {ECO:0000313|EMBL:EOY33389.1};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000026915};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT DOMAIN 4..58
FT /note="Myb-like"
FT /evidence="ECO:0000259|PROSITE:PS50090"
FT DOMAIN 102..158
FT /note="HTH myb-type"
FT /evidence="ECO:0000259|PROSITE:PS51294"
FT DOMAIN 102..154
FT /note="Myb-like"
FT /evidence="ECO:0000259|PROSITE:PS50090"
FT DOMAIN 110..158
FT /note="SANT"
FT /evidence="ECO:0000259|PROSITE:PS51293"
SQ SEQUENCE 212 AA; 24522 MW; 32F7203DA12596D6 CRC64;
MIRSNNDSPS SWSWHQDKLF ERALIMFPED SPDRWEKIAA QLPGKSAMEV RKHYADLEHD
VMEIESGRVQ MPSYEGELES ASWVNESGGS QGWVGWKKDR ESERRKGVPW TEEEHRLFLI
GLQKYGKGDW RSISRNAVVS RTPTQVASHA QKYFLRLNSI TKKDKKRSSI HDITMADDIT
NQNGVGTMGD HSSIELMDQS KFFNEQGRSF WE
//