ID A0A061F2F9_THECC Unreviewed; 619 AA.
AC A0A061F2F9;
DT 03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT 03-SEP-2014, sequence version 1.
DT 24-JAN-2024, entry version 38.
DE RecName: Full=cellulase {ECO:0000256|ARBA:ARBA00012601};
DE EC=3.2.1.4 {ECO:0000256|ARBA:ARBA00012601};
GN ORFNames=TCM_023653 {ECO:0000313|EMBL:EOY08664.1};
OS Theobroma cacao (Cacao) (Cocoa).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma.
OX NCBI_TaxID=3641 {ECO:0000313|EMBL:EOY08664.1, ECO:0000313|Proteomes:UP000026915};
RN [1] {ECO:0000313|EMBL:EOY08664.1, ECO:0000313|Proteomes:UP000026915}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Matina 1-6 {ECO:0000313|Proteomes:UP000026915};
RX PubMed=23731509; DOI=10.1186/gb-2013-14-6-r53;
RA Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Livingstone D. III,
RA Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C.,
RA Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A.,
RA Mustiga G.M., Amores F., Phillips W., Marelli J.P., May G.D., Shapiro H.,
RA Ma J., Bustamante C.D., Schnell R.J., Main D., Gilbert D., Parida L.,
RA Kuhn D.N.;
RT "The genome sequence of the most widely cultivated cacao type and its use
RT to identify candidate genes regulating pod color.";
RL Genome Biol. 14:R53.1-R53.24(2013).
CC -!- CATALYTIC ACTIVITY:
CC Reaction=Endohydrolysis of (1->4)-beta-D-glucosidic linkages in
CC cellulose, lichenin and cereal beta-D-glucans.; EC=3.2.1.4;
CC Evidence={ECO:0000256|ARBA:ARBA00000966};
CC -!- SIMILARITY: Belongs to the glycosyl hydrolase 9 (cellulase E) family.
CC {ECO:0000256|ARBA:ARBA00007072}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM001883; EOY08664.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A061F2F9; -.
DR STRING; 3641.A0A061F2F9; -.
DR EnsemblPlants; EOY08664; EOY08664; TCM_023653.
DR Gramene; EOY08664; EOY08664; TCM_023653.
DR eggNOG; ENOG502QSIM; Eukaryota.
DR HOGENOM; CLU_008926_1_3_1; -.
DR InParanoid; A0A061F2F9; -.
DR OMA; TMCSYLH; -.
DR Proteomes; UP000026915; Chromosome 5.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR GO; GO:0008810; F:cellulase activity; IEA:UniProtKB-EC.
DR GO; GO:0030245; P:cellulose catabolic process; IEA:UniProtKB-KW.
DR Gene3D; 1.50.10.10; -; 1.
DR InterPro; IPR008928; 6-hairpin_glycosidase_sf.
DR InterPro; IPR012341; 6hp_glycosidase-like_sf.
DR InterPro; IPR001701; Glyco_hydro_9.
DR PANTHER; PTHR22298; ENDO-1,4-BETA-GLUCANASE; 1.
DR PANTHER; PTHR22298:SF64; ENDOGLUCANASE 7; 1.
DR Pfam; PF00759; Glyco_hydro_9; 1.
DR SUPFAM; SSF48208; Six-hairpin glycosidases; 1.
PE 3: Inferred from homology;
KW Carbohydrate metabolism {ECO:0000256|ARBA:ARBA00023277};
KW Cellulose degradation {ECO:0000256|ARBA:ARBA00023001};
KW Glycosidase {ECO:0000256|ARBA:ARBA00023295, ECO:0000313|EMBL:EOY08664.1};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Membrane {ECO:0000256|SAM:Phobius};
KW Polysaccharide degradation {ECO:0000256|ARBA:ARBA00023326};
KW Reference proteome {ECO:0000313|Proteomes:UP000026915};
KW Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT TRANSMEM 75..94
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 110..583
FT /note="Glycoside hydrolase family 9"
FT /evidence="ECO:0000259|Pfam:PF00759"
SQ SEQUENCE 619 AA; 69736 MW; D3A06E598994D4DD CRC64;
MHARNHWGGS FDVNHGEEEK SWNTEWDRAA LQSQQQDRSL DETQQGWLLG PPQTKKKDKY
VDLGCIVCSR KAFKWTLISI LSAFIVIAVP IIIAKSLPKH TRRPPPPDNY TVALRKALLF
FNAQKSGNLP KNNGISWRGN SGLNDGKEEM DLKGWLVGGY YDAGDNTKFH FPMAFSMTML
SWSLIEYSHK YQSIGEYDHI RDLIKWGTDY LLLTFNSSAT KIDKIYCQVG GSLNGSIATP
DDHYCWMRPE DMDYKRPVQT AYAGPDLAGE MAAALAAASI VFRDNGAYSR KLIKGAQTVF
AFARDGSKRR SYSRGNPYIQ PYYNSSGYYD EYMWGAAWLY YATGNVSYIS LATNPGLSKN
SKALYDIPTN RALSWDNKLP AAMLLLTRYR IFLSPGYPYE DMLHMYHNVT ALTMCSYLKE
FHYFNWTQGG MIQLNLGKPN PLQYVANAAF LANLFADYLN ATGVPGWNCN SRFFSSEYLR
NFATSQVDYI LGKNPMNMSY VVGYDKKFPR HVHHRGASIP HNNIKYSCTG GWKWRDSQNP
NPNNITGAMV GGPDRFDHFR DVRTNSNYTE PTLAGNAGLI AALASLTRSG GHGIDKNTIF
SAVPPLYPKS PPPAAPWRP
//