ID A0A061EQF6_THECC Unreviewed; 449 AA.
AC A0A061EQF6;
DT 03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT 03-SEP-2014, sequence version 1.
DT 24-JAN-2024, entry version 40.
DE SubName: Full=Carbohydrate-binding-like fold, putative isoform 1 {ECO:0000313|EMBL:EOY07250.1};
GN ORFNames=TCM_021717 {ECO:0000313|EMBL:EOY07250.1};
OS Theobroma cacao (Cacao) (Cocoa).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma.
OX NCBI_TaxID=3641 {ECO:0000313|EMBL:EOY07250.1, ECO:0000313|Proteomes:UP000026915};
RN [1] {ECO:0000313|EMBL:EOY07250.1, ECO:0000313|Proteomes:UP000026915}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Matina 1-6 {ECO:0000313|Proteomes:UP000026915};
RX PubMed=23731509; DOI=10.1186/gb-2013-14-6-r53;
RA Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Iii D.L.,
RA Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C.,
RA Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A.,
RA Mustiga G.M., Amores F., Phillips W., Marelli J.P., May G.D., Shapiro H.,
RA Ma J., Bustamante C.D., Schnell R.J., Main D., Gilbert D., Parida L.,
RA Kuhn D.N.;
RT "The genome sequence of the most widely cultivated cacao type and its use
RT to identify candidate genes regulating pod color.";
RL Genome Biol. 14:R53.1-R53.24(2013).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM001883; EOY07250.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A061EQF6; -.
DR STRING; 3641.A0A061EQF6; -.
DR EnsemblPlants; EOY07250; EOY07250; TCM_021717.
DR Gramene; EOY07250; EOY07250; TCM_021717.
DR eggNOG; ENOG502QU99; Eukaryota.
DR HOGENOM; CLU_047483_0_0_1; -.
DR InParanoid; A0A061EQF6; -.
DR OMA; ENDIQWG; -.
DR Proteomes; UP000026915; Chromosome 5.
DR GO; GO:2001070; F:starch binding; IEA:InterPro.
DR CDD; cd05467; CBM20; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 1.
DR InterPro; IPR013784; Carb-bd-like_fold.
DR InterPro; IPR002044; CBM_fam20.
DR InterPro; IPR013783; Ig-like_fold.
DR PANTHER; PTHR15048; STARCH-BINDING DOMAIN-CONTAINING PROTEIN 1; 1.
DR PANTHER; PTHR15048:SF0; STARCH-BINDING DOMAIN-CONTAINING PROTEIN 1; 1.
DR Pfam; PF00686; CBM_20; 1.
DR SMART; SM01065; CBM_2; 1.
DR SUPFAM; SSF49452; Starch-binding domain-like; 1.
DR PROSITE; PS51166; CBM20; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000026915}.
FT DOMAIN 85..187
FT /note="CBM20"
FT /evidence="ECO:0000259|PROSITE:PS51166"
FT REGION 63..89
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 386..449
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 386..425
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 449 AA; 50156 MW; AC29F0D954BE8EB4 CRC64;
MKTLTSSCSK AIIDKHRDKG LSCFNDLSLN RGEVCLFPSK KLVRIRLLRL LSVQHRRLQP
VLSSSSLSPD SQVDFETAET QPAEENPSKT VHVKFQLQKE CSFGEHFFIV GDHPMLGIWD
PESAIPLNWL KGHVWTVELD IPVGKSIQFK FVLKTSTGNL LWQPGPDRIF KSWETENTII
VSEDWEEAEY QKLIEEEPSA NQDGPVLDSE MAIVAENLTP PKEELVSDME LVSETDSITN
LEKEPLQAFS EELATSSGAP SLEEPLAIVA ENISYPTENF VANVDNVVLG VKRTDYPNDE
ALATSNKNHL VAEDLGNIGR VETVQNPATA DVEGNLVVHE GSPVLVPGLT PLDTVSTEEA
NLDEYEKNSI TEASIEVNEA NYQKMPELDE KQEPEGEPQE EKPTAVSKDE EEQLDNRHIQ
SRQLAREQPD PDPFQIGNLE TRIEEGVRQ
//