ID A0A061G743_THECC Unreviewed; 626 AA.
AC A0A061G743;
DT 03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT 03-SEP-2014, sequence version 1.
DT 27-MAR-2024, entry version 39.
DE RecName: Full=Endoglucanase {ECO:0000256|RuleBase:RU361166};
DE EC=3.2.1.4 {ECO:0000256|RuleBase:RU361166};
GN ORFNames=TCM_016456 {ECO:0000313|EMBL:EOY25007.1};
OS Theobroma cacao (Cacao) (Cocoa).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma.
OX NCBI_TaxID=3641 {ECO:0000313|EMBL:EOY25007.1, ECO:0000313|Proteomes:UP000026915};
RN [1] {ECO:0000313|EMBL:EOY25007.1, ECO:0000313|Proteomes:UP000026915}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Matina 1-6 {ECO:0000313|Proteomes:UP000026915};
RX PubMed=23731509; DOI=10.1186/gb-2013-14-6-r53;
RA Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Iii D.L.,
RA Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C.,
RA Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A.,
RA Mustiga G.M., Amores F., Phillips W., Marelli J.P., May G.D., Shapiro H.,
RA Ma J., Bustamante C.D., Schnell R.J., Main D., Gilbert D., Parida L.,
RA Kuhn D.N.;
RT "The genome sequence of the most widely cultivated cacao type and its use
RT to identify candidate genes regulating pod color.";
RL Genome Biol. 14:R53.1-R53.24(2013).
CC -!- CATALYTIC ACTIVITY:
CC Reaction=Endohydrolysis of (1->4)-beta-D-glucosidic linkages in
CC cellulose, lichenin and cereal beta-D-glucans.; EC=3.2.1.4;
CC Evidence={ECO:0000256|ARBA:ARBA00000966,
CC ECO:0000256|RuleBase:RU361166};
CC -!- SUBCELLULAR LOCATION: Secreted {ECO:0000256|ARBA:ARBA00004613}.
CC -!- SIMILARITY: Belongs to the glycosyl hydrolase 9 (cellulase E) family.
CC {ECO:0000256|ARBA:ARBA00007072, ECO:0000256|PROSITE-ProRule:PRU10059,
CC ECO:0000256|RuleBase:RU361166}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM001881; EOY25007.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A061G743; -.
DR STRING; 3641.A0A061G743; -.
DR EnsemblPlants; EOY25007; EOY25007; TCM_016456.
DR Gramene; EOY25007; EOY25007; TCM_016456.
DR eggNOG; ENOG502QRF6; Eukaryota.
DR HOGENOM; CLU_008926_1_4_1; -.
DR InParanoid; A0A061G743; -.
DR OMA; RCSQGNI; -.
DR Proteomes; UP000026915; Chromosome 3.
DR GO; GO:0005576; C:extracellular region; IEA:UniProtKB-SubCell.
DR GO; GO:0030246; F:carbohydrate binding; IEA:InterPro.
DR GO; GO:0008810; F:cellulase activity; IEA:UniProtKB-EC.
DR GO; GO:0030245; P:cellulose catabolic process; IEA:UniProtKB-KW.
DR Gene3D; 1.50.10.10; -; 1.
DR InterPro; IPR008928; 6-hairpin_glycosidase_sf.
DR InterPro; IPR012341; 6hp_glycosidase-like_sf.
DR InterPro; IPR019028; CBM_49.
DR InterPro; IPR001701; Glyco_hydro_9.
DR InterPro; IPR033126; Glyco_hydro_9_Asp/Glu_AS.
DR InterPro; IPR018221; Glyco_hydro_9_His_AS.
DR PANTHER; PTHR22298; ENDO-1,4-BETA-GLUCANASE; 1.
DR PANTHER; PTHR22298:SF19; ENDOGLUCANASE 19-RELATED; 1.
DR Pfam; PF09478; CBM49; 1.
DR Pfam; PF00759; Glyco_hydro_9; 1.
DR SMART; SM01063; CBM49; 1.
DR SUPFAM; SSF48208; Six-hairpin glycosidases; 1.
DR PROSITE; PS00592; GH9_2; 1.
DR PROSITE; PS00698; GH9_3; 1.
PE 3: Inferred from homology;
KW Carbohydrate metabolism {ECO:0000256|ARBA:ARBA00023277,
KW ECO:0000256|PROSITE-ProRule:PRU10059};
KW Cellulose degradation {ECO:0000256|ARBA:ARBA00023001,
KW ECO:0000256|RuleBase:RU361166};
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Glycosidase {ECO:0000256|ARBA:ARBA00023295, ECO:0000256|PROSITE-
KW ProRule:PRU10059};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801, ECO:0000256|PROSITE-
KW ProRule:PRU10059};
KW Polysaccharide degradation {ECO:0000256|ARBA:ARBA00023326,
KW ECO:0000256|PROSITE-ProRule:PRU10059};
KW Reference proteome {ECO:0000313|Proteomes:UP000026915};
KW Secreted {ECO:0000256|ARBA:ARBA00022525};
KW Signal {ECO:0000256|RuleBase:RU361166}.
FT SIGNAL 1..24
FT /evidence="ECO:0000256|RuleBase:RU361166"
FT CHAIN 25..626
FT /note="Endoglucanase"
FT /evidence="ECO:0000256|RuleBase:RU361166"
FT /id="PRO_5005102735"
FT DOMAIN 534..614
FT /note="Carbohydrate binding"
FT /evidence="ECO:0000259|SMART:SM01063"
FT ACT_SITE 413
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU10059"
FT ACT_SITE 465
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU10060"
FT ACT_SITE 474
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU10060"
SQ SEQUENCE 626 AA; 69045 MW; D4A42156572172CA CRC64;
MEKLMRLISM APLLLLLCLP FAFAGHDYNQ ALSKSILFFE AQRSGYLPHS QRVTWRASSG
LNDGKASGVD LVGGYYDAGD NVKFGLPMAF TITMMSWSII EYGKQMAASG ELGNALEAVK
WGTDYLIKAH PEPYVLYGEV GDGNSDHYCW QRPEDMTTDR HAYKIDPSNP GSDLAGETAA
AMAAASIVFR RSNPAYSSEL LRHAYQLFEF ADKYRGKYDS SITVAQKYYR SVSGYNDELL
WAAAWLYQAS NNQYYLNYLG KNGDSMGGTG WAMTEFSWDV KYAGVQTLVA KFLMQGKAGL
HAPVFERYHQ RAEYFMCSLI GKGSRNIQKT PGGLIFRQRW NNMQFVTSAS FLATVYSDYL
TSSRGSLNCA AGNVAPSELL SFAKSQVDYL LGDNPRATSY MVGYGNNYPR QVHHRGSSIV
SIKVDPTFVA CRQGYANWYT RKASDPNVLT GALVGGPDAY DNFADERDNY EQTEPATYNN
APLLGILARL GGGHGGYNQL LPVVVPAPNP VVAKPKPAPK PKLTPTPASS SSPITIKQKM
TTSWNHKGKT YYRYSTVVTN NSYKTVKDLK LSISKLYGPL WGLTKSGNSY GFPSWLNSLP
AAKSLEFVYI HSASPADVSV STYNLA
//