ID A0A061F8T9_THECC Unreviewed; 967 AA.
AC A0A061F8T9;
DT 03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT 03-SEP-2014, sequence version 1.
DT 27-MAR-2024, entry version 56.
DE SubName: Full=Cell division cycle 5 isoform 1 {ECO:0000313|EMBL:EOY10929.1};
GN ORFNames=TCM_026195 {ECO:0000313|EMBL:EOY10929.1};
OS Theobroma cacao (Cacao) (Cocoa).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma.
OX NCBI_TaxID=3641 {ECO:0000313|EMBL:EOY10929.1, ECO:0000313|Proteomes:UP000026915};
RN [1] {ECO:0000313|EMBL:EOY10929.1, ECO:0000313|Proteomes:UP000026915}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Matina 1-6 {ECO:0000313|Proteomes:UP000026915};
RX PubMed=23731509; DOI=10.1186/gb-2013-14-6-r53;
RA Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Iii D.L.,
RA Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C.,
RA Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A.,
RA Mustiga G.M., Amores F., Phillips W., Marelli J.P., May G.D., Shapiro H.,
RA Ma J., Bustamante C.D., Schnell R.J., Main D., Gilbert D., Parida L.,
RA Kuhn D.N.;
RT "The genome sequence of the most widely cultivated cacao type and its use
RT to identify candidate genes regulating pod color.";
RL Genome Biol. 14:R53.1-R53.24(2013).
CC -!- SIMILARITY: Belongs to the CEF1 family.
CC {ECO:0000256|ARBA:ARBA00010506}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM001883; EOY10929.1; -; Genomic_DNA.
DR EMBL; CM001883; EOY10930.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A061F8T9; -.
DR STRING; 3641.A0A061F8T9; -.
DR EnsemblPlants; EOY10929; EOY10929; TCM_026195.
DR EnsemblPlants; EOY10930; EOY10930; TCM_026195.
DR Gramene; EOY10929; EOY10929; TCM_026195.
DR Gramene; EOY10930; EOY10930; TCM_026195.
DR eggNOG; KOG0050; Eukaryota.
DR HOGENOM; CLU_009082_1_0_1; -.
DR InParanoid; A0A061F8T9; -.
DR OMA; KMGMAGE; -.
DR Proteomes; UP000026915; Chromosome 5.
DR GO; GO:0000974; C:Prp19 complex; IBA:GO_Central.
DR GO; GO:0005681; C:spliceosomal complex; IBA:GO_Central.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0051301; P:cell division; IEA:UniProtKB-KW.
DR GO; GO:0042742; P:defense response to bacterium; IEA:EnsemblPlants.
DR GO; GO:0050832; P:defense response to fungus; IEA:EnsemblPlants.
DR GO; GO:0000398; P:mRNA splicing, via spliceosome; IBA:GO_Central.
DR CDD; cd00167; SANT; 1.
DR CDD; cd11659; SANT_CDC5_II; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 2.
DR InterPro; IPR047242; CDC5L/Cef1.
DR InterPro; IPR021786; Cdc5p/Cef1_C.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017930; Myb_dom.
DR InterPro; IPR001005; SANT/Myb.
DR InterPro; IPR047240; SANT_CDC5L_II.
DR PANTHER; PTHR45885; CELL DIVISION CYCLE 5-LIKE PROTEIN; 1.
DR PANTHER; PTHR45885:SF1; CELL DIVISION CYCLE 5-LIKE PROTEIN; 1.
DR Pfam; PF11831; Myb_Cef; 1.
DR Pfam; PF13921; Myb_DNA-bind_6; 1.
DR SMART; SM00717; SANT; 2.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS51294; HTH_MYB; 2.
DR PROSITE; PS50090; MYB_LIKE; 2.
PE 3: Inferred from homology;
KW Cell cycle {ECO:0000313|EMBL:EOY10929.1};
KW Cell division {ECO:0000313|EMBL:EOY10929.1};
KW Coiled coil {ECO:0000256|SAM:Coils};
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW mRNA processing {ECO:0000256|ARBA:ARBA00022664};
KW mRNA splicing {ECO:0000256|ARBA:ARBA00023187};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000026915};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Spliceosome {ECO:0000256|ARBA:ARBA00022728}.
FT DOMAIN 2..57
FT /note="HTH myb-type"
FT /evidence="ECO:0000259|PROSITE:PS51294"
FT DOMAIN 2..53
FT /note="Myb-like"
FT /evidence="ECO:0000259|PROSITE:PS50090"
FT DOMAIN 54..103
FT /note="Myb-like"
FT /evidence="ECO:0000259|PROSITE:PS50090"
FT DOMAIN 58..107
FT /note="HTH myb-type"
FT /evidence="ECO:0000259|PROSITE:PS51294"
FT REGION 110..149
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 187..223
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 341..360
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 381..420
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 597..619
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 919..967
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 514..541
FT /evidence="ECO:0000256|SAM:Coils"
FT COILED 689..716
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 187..212
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 399..419
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 919..935
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 967 AA; 108904 MW; 8A885F988A7CAA7A CRC64;
MRIMIKGGVW KNTEDEILKA AVMKYGKNQW ARISSLLVRK SAKQCKARWY EWLDPSIKKT
EWTREEDEKL LHLAKLMPTQ WRTIAPIVGR TPSQCLERYE KLLDAACARD ENYEPGDDPR
KLRPGEIDPN PESKPARPDP VDMDEDEKEM LSEARARLAN TRGKKAKRKA REKQLEEARR
LASLQKRREL KAAGIDTRQR KRKRKGIDYN SEIPFEKRPP PGFYDVADED RLVEQPKFPT
TIEELEGKRR VDIESQLRKQ DIAKNKIAQR QDAPSAILQA NKLNDPETVR KRSKLMLPAP
QISDHELEEI AKMGYASDLL AGNDELAEGS GATRALLANY SQTPRQGMTP LRTPQRTPAG
KGDAIMMEAE NLARLRESQT PLLGGENPEL HPSDFSGVTP KKRENQTPNP MSTPSMTPGG
AGLTPRIGMT PSRDGYSFGV TPKGTPIRDE LHINEDMDLN DSAKLEQRRQ PDLRRNLRSG
LGSLPQPKNE YQIVIQPLPE ENEEPEEKIE EDMSDRIARE RAEEEARLQA LLKKRSKVLQ
RELPRPPSAS LELIRDSLLR TDGDKSSFVP PTSIEQADEM IRKELLSLLE HDNAKYPLDE
KANKGKKKGT KRPANGSIPS IEDFEEDEMK EADSLIKEEA EFLRVAMGHE NESLDDFVEA
HNTCLNDLMY FPTRNAYGLS SVAGNMEKLA ALQTEFDNVK KKLDNDKSKA ESMEKKFNVL
TQGYERRAAT LWRQIESTFK QMDTAGTELE CFQALQKQEQ FAASHRINGL WEEVQKQKEL
EQTLQRRYGN LIAELERIQI LMNIYRVQAQ KQEEAAGKDH ALELSEAAVA ANPAVVPSTV
LSEPVPSSEH VDSSLDEQSS LKADMNVDSR KEHAIMDVET DGIMSGNVPL VVEDKEDNIS
KTLDGMTGNI VTSSEVAAES INPDAVSTKQ DSIQETLEGE GVADHTKVDN SSVLGGDTAE
KQTGMEE
//