ID A0A061FIQ0_THECC Unreviewed; 333 AA.
AC A0A061FIQ0;
DT 03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT 03-SEP-2014, sequence version 1.
DT 27-MAR-2024, entry version 42.
DE SubName: Full=Myb domain protein 52 {ECO:0000313|EMBL:EOY16562.1};
GN ORFNames=TCM_035359 {ECO:0000313|EMBL:EOY16562.1};
OS Theobroma cacao (Cacao) (Cocoa).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma.
OX NCBI_TaxID=3641 {ECO:0000313|EMBL:EOY16562.1, ECO:0000313|Proteomes:UP000026915};
RN [1] {ECO:0000313|EMBL:EOY16562.1, ECO:0000313|Proteomes:UP000026915}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Matina 1-6 {ECO:0000313|Proteomes:UP000026915};
RX PubMed=23731509; DOI=10.1186/gb-2013-14-6-r53;
RA Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Iii D.L.,
RA Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C.,
RA Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A.,
RA Mustiga G.M., Amores F., Phillips W., Marelli J.P., May G.D., Shapiro H.,
RA Ma J., Bustamante C.D., Schnell R.J., Main D., Gilbert D., Parida L.,
RA Kuhn D.N.;
RT "The genome sequence of the most widely cultivated cacao type and its use
RT to identify candidate genes regulating pod color.";
RL Genome Biol. 14:R53.1-R53.24(2013).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM001886; EOY16562.1; -; Genomic_DNA.
DR RefSeq; XP_007019337.1; XM_007019275.2.
DR AlphaFoldDB; A0A061FIQ0; -.
DR SMR; A0A061FIQ0; -.
DR EnsemblPlants; EOY16562; EOY16562; TCM_035359.
DR EnsemblPlants; Tc08v2_t011990.1; Tc08v2_p011990.1; Tc08v2_g011990.
DR GeneID; 18592495; -.
DR Gramene; EOY16562; EOY16562; TCM_035359.
DR Gramene; Tc08v2_t011990.1; Tc08v2_p011990.1; Tc08v2_g011990.
DR KEGG; tcc:18592495; -.
DR eggNOG; KOG0048; Eukaryota.
DR HOGENOM; CLU_028567_18_1_1; -.
DR InParanoid; A0A061FIQ0; -.
DR OMA; WNFASVP; -.
DR OrthoDB; 1220073at2759; -.
DR Proteomes; UP000026915; Chromosome 8.
DR GO; GO:0005634; C:nucleus; IBA:GO_Central.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IBA:GO_Central.
DR GO; GO:0000978; F:RNA polymerase II cis-regulatory region sequence-specific DNA binding; IBA:GO_Central.
DR GO; GO:0006355; P:regulation of DNA-templated transcription; IBA:GO_Central.
DR CDD; cd00167; SANT; 2.
DR Gene3D; 1.10.10.60; Homeodomain-like; 2.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017930; Myb_dom.
DR InterPro; IPR001005; SANT/Myb.
DR PANTHER; PTHR45614; MYB PROTEIN-RELATED; 1.
DR PANTHER; PTHR45614:SF221; MYB-LIKE DNA-BINDING DOMAIN CONTAINING PROTEIN, EXPRESSED; 1.
DR Pfam; PF13921; Myb_DNA-bind_6; 1.
DR SMART; SM00717; SANT; 2.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS51294; HTH_MYB; 2.
DR PROSITE; PS50090; MYB_LIKE; 2.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW Reference proteome {ECO:0000313|Proteomes:UP000026915};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT DOMAIN 13..63
FT /note="HTH myb-type"
FT /evidence="ECO:0000259|PROSITE:PS51294"
FT DOMAIN 17..63
FT /note="Myb-like"
FT /evidence="ECO:0000259|PROSITE:PS50090"
FT DOMAIN 64..118
FT /note="HTH myb-type"
FT /evidence="ECO:0000259|PROSITE:PS51294"
FT DOMAIN 64..114
FT /note="Myb-like"
FT /evidence="ECO:0000259|PROSITE:PS50090"
FT REGION 1..21
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 191..212
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 191..207
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 333 AA; 37627 MW; 5D2485E04FBEB624 CRC64;
MEDSGAASSD DVKTCPRGHW RPAEDEKLRQ LVEQYGAQNW NSIAEKLQGR SGKSCRLRWF
NQLDPRINRR PFTEEEEERL LAAHRIHGNK WALIARLFPG RTDNAVKNHW HVIMARKQRE
QSKLCGKRSF QDGLSDSKLS STGFTPRKAR CQEAFSSRIG FGDSRFLEFQ NPSKERIFSV
SYSSTSSPSW TFASSTMMPS NNSSSAELSR RDGKDYLSGS GSSYYSMENS KILDQSLYKY
HSNASAYCSS LKNSSAFGLP NYRRVVPSPF GYLKLGDNYE SNNGVMRKEL MSVIDNAPKL
ANIRVSSQQE NDDDSIKQKD TPFIDFLGVG ISS
//