ID A0A061G0J4_THECC Unreviewed; 507 AA.
AC A0A061G0J4;
DT 03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT 03-SEP-2014, sequence version 1.
DT 27-MAR-2024, entry version 38.
DE RecName: Full=Endoglucanase {ECO:0000256|RuleBase:RU361166};
DE EC=3.2.1.4 {ECO:0000256|RuleBase:RU361166};
GN ORFNames=TCM_014945 {ECO:0000313|EMBL:EOY22918.1};
OS Theobroma cacao (Cacao) (Cocoa).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma.
OX NCBI_TaxID=3641 {ECO:0000313|EMBL:EOY22918.1, ECO:0000313|Proteomes:UP000026915};
RN [1] {ECO:0000313|EMBL:EOY22918.1, ECO:0000313|Proteomes:UP000026915}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Matina 1-6 {ECO:0000313|Proteomes:UP000026915};
RX PubMed=23731509; DOI=10.1186/gb-2013-14-6-r53;
RA Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Iii D.L.,
RA Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C.,
RA Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A.,
RA Mustiga G.M., Amores F., Phillips W., Marelli J.P., May G.D., Shapiro H.,
RA Ma J., Bustamante C.D., Schnell R.J., Main D., Gilbert D., Parida L.,
RA Kuhn D.N.;
RT "The genome sequence of the most widely cultivated cacao type and its use
RT to identify candidate genes regulating pod color.";
RL Genome Biol. 14:R53.1-R53.24(2013).
CC -!- CATALYTIC ACTIVITY:
CC Reaction=Endohydrolysis of (1->4)-beta-D-glucosidic linkages in
CC cellulose, lichenin and cereal beta-D-glucans.; EC=3.2.1.4;
CC Evidence={ECO:0000256|ARBA:ARBA00000966,
CC ECO:0000256|RuleBase:RU361166};
CC -!- SIMILARITY: Belongs to the glycosyl hydrolase 9 (cellulase E) family.
CC {ECO:0000256|ARBA:ARBA00007072, ECO:0000256|PROSITE-ProRule:PRU10059,
CC ECO:0000256|RuleBase:RU361166}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM001881; EOY22918.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A061G0J4; -.
DR STRING; 3641.A0A061G0J4; -.
DR EnsemblPlants; EOY22918; EOY22918; TCM_014945.
DR Gramene; EOY22918; EOY22918; TCM_014945.
DR eggNOG; ENOG502QQPR; Eukaryota.
DR HOGENOM; CLU_008926_1_2_1; -.
DR InParanoid; A0A061G0J4; -.
DR OMA; QKFPQRI; -.
DR Proteomes; UP000026915; Chromosome 3.
DR GO; GO:0008810; F:cellulase activity; IEA:UniProtKB-EC.
DR GO; GO:0030245; P:cellulose catabolic process; IEA:UniProtKB-KW.
DR Gene3D; 1.50.10.10; -; 1.
DR InterPro; IPR008928; 6-hairpin_glycosidase_sf.
DR InterPro; IPR012341; 6hp_glycosidase-like_sf.
DR InterPro; IPR001701; Glyco_hydro_9.
DR InterPro; IPR018221; Glyco_hydro_9_His_AS.
DR PANTHER; PTHR22298; ENDO-1,4-BETA-GLUCANASE; 1.
DR PANTHER; PTHR22298:SF115; ENDOGLUCANASE 4; 1.
DR Pfam; PF00759; Glyco_hydro_9; 1.
DR SUPFAM; SSF48208; Six-hairpin glycosidases; 1.
DR PROSITE; PS00592; GH9_2; 1.
PE 3: Inferred from homology;
KW Carbohydrate metabolism {ECO:0000256|ARBA:ARBA00023277,
KW ECO:0000256|PROSITE-ProRule:PRU10059};
KW Cellulose degradation {ECO:0000256|ARBA:ARBA00023001,
KW ECO:0000256|RuleBase:RU361166};
KW Glycosidase {ECO:0000256|ARBA:ARBA00023295, ECO:0000256|PROSITE-
KW ProRule:PRU10059};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801, ECO:0000256|PROSITE-
KW ProRule:PRU10059};
KW Polysaccharide degradation {ECO:0000256|ARBA:ARBA00023326,
KW ECO:0000256|PROSITE-ProRule:PRU10059};
KW Reference proteome {ECO:0000313|Proteomes:UP000026915};
KW Signal {ECO:0000256|RuleBase:RU361166}.
FT SIGNAL 1..29
FT /evidence="ECO:0000256|RuleBase:RU361166"
FT CHAIN 30..507
FT /note="Endoglucanase"
FT /evidence="ECO:0000256|RuleBase:RU361166"
FT /id="PRO_5005102728"
FT DOMAIN 32..495
FT /note="Glycoside hydrolase family 9"
FT /evidence="ECO:0000259|Pfam:PF00759"
FT ACT_SITE 422
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU10059"
SQ SEQUENCE 507 AA; 55861 MW; 208A5CFCF9134B04 CRC64;
MASGESTSMI TLLTLLLLLG MAMLDRVASQ NYGDALSKCI LFFEGQRSGK LPPNQRMNWR
KDSALRDGFD IGMNLVGGYY DAGDNVKFNF PMAFSTTMLA WSVIEFGQFM GSDLQYALDS
IRWGTDYLLE ATKFPGKVTA AVGDPNADHN CWQRPEDMDT PRPSYVVNQT HPGSEVSAET
AAALAASSLV FESADKKYSI VLVNRAKQVF DFADMFRGNY KDSVGQGVCP FYCDYDGYVD
ELIWGAAWLF KVTNEQKYWD YLQRNIPRLP KKVTTLVDGQ LQASASFAEF GWDAKYAGIN
VLVSGLVMNS NNSNPYVLQA GKFVCTLLPE SPSKSVSYSP GGLLFKPGGS NMQHATSLSF
LLVVYAQYLK KHNGAVPCGN VNAEPSRLEQ IAKGQVDYIL GKNPLSMSYM VGFGQKFPQR
IHHRAASLPS IGNHPEHIKC KDGTPYFQTK NPNPNLLTGA IVGGPDEKDQ FPDNRMNAPQ
SEPTTYINAP FVGVLAYFKA FPNPQAI
//