ID A0A061FIN7_THECC Unreviewed; 515 AA.
AC A0A061FIN7;
DT 03-SEP-2014, integrated into UniProtKB/TrEMBL.
DT 03-SEP-2014, sequence version 1.
DT 24-JAN-2024, entry version 40.
DE RecName: Full=Endoglucanase {ECO:0000256|RuleBase:RU361166};
DE EC=3.2.1.4 {ECO:0000256|RuleBase:RU361166};
GN ORFNames=TCM_036320 {ECO:0000313|EMBL:EOY17155.1};
OS Theobroma cacao (Cacao) (Cocoa).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; malvids; Malvales; Malvaceae; Byttnerioideae; Theobroma.
OX NCBI_TaxID=3641 {ECO:0000313|EMBL:EOY17155.1, ECO:0000313|Proteomes:UP000026915};
RN [1] {ECO:0000313|EMBL:EOY17155.1, ECO:0000313|Proteomes:UP000026915}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Matina 1-6 {ECO:0000313|Proteomes:UP000026915};
RX PubMed=23731509; DOI=10.1186/gb-2013-14-6-r53;
RA Motamayor J.C., Mockaitis K., Schmutz J., Haiminen N., Iii D.L.,
RA Cornejo O., Findley S.D., Zheng P., Utro F., Royaert S., Saski C.,
RA Jenkins J., Podicheti R., Zhao M., Scheffler B.E., Stack J.C., Feltus F.A.,
RA Mustiga G.M., Amores F., Phillips W., Marelli J.P., May G.D., Shapiro H.,
RA Ma J., Bustamante C.D., Schnell R.J., Main D., Gilbert D., Parida L.,
RA Kuhn D.N.;
RT "The genome sequence of the most widely cultivated cacao type and its use
RT to identify candidate genes regulating pod color.";
RL Genome Biol. 14:R53.1-R53.24(2013).
CC -!- CATALYTIC ACTIVITY:
CC Reaction=Endohydrolysis of (1->4)-beta-D-glucosidic linkages in
CC cellulose, lichenin and cereal beta-D-glucans.; EC=3.2.1.4;
CC Evidence={ECO:0000256|ARBA:ARBA00000966,
CC ECO:0000256|RuleBase:RU361166};
CC -!- SIMILARITY: Belongs to the glycosyl hydrolase 9 (cellulase E) family.
CC {ECO:0000256|ARBA:ARBA00007072, ECO:0000256|PROSITE-ProRule:PRU10059,
CC ECO:0000256|RuleBase:RU361166}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM001886; EOY17155.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A061FIN7; -.
DR STRING; 3641.A0A061FIN7; -.
DR EnsemblPlants; EOY17155; EOY17155; TCM_036320.
DR Gramene; EOY17155; EOY17155; TCM_036320.
DR eggNOG; ENOG502QRXS; Eukaryota.
DR HOGENOM; CLU_008926_1_2_1; -.
DR InParanoid; A0A061FIN7; -.
DR OMA; GGFQPFF; -.
DR Proteomes; UP000026915; Chromosome 8.
DR GO; GO:0008810; F:cellulase activity; IEA:UniProtKB-EC.
DR GO; GO:0030245; P:cellulose catabolic process; IEA:UniProtKB-KW.
DR Gene3D; 1.50.10.10; -; 1.
DR InterPro; IPR008928; 6-hairpin_glycosidase_sf.
DR InterPro; IPR012341; 6hp_glycosidase-like_sf.
DR InterPro; IPR001701; Glyco_hydro_9.
DR InterPro; IPR033126; Glyco_hydro_9_Asp/Glu_AS.
DR InterPro; IPR018221; Glyco_hydro_9_His_AS.
DR PANTHER; PTHR22298; ENDO-1,4-BETA-GLUCANASE; 1.
DR PANTHER; PTHR22298:SF54; ENDOGLUCANASE 9; 1.
DR Pfam; PF00759; Glyco_hydro_9; 1.
DR SUPFAM; SSF48208; Six-hairpin glycosidases; 1.
DR PROSITE; PS00592; GH9_2; 1.
DR PROSITE; PS00698; GH9_3; 1.
PE 3: Inferred from homology;
KW Carbohydrate metabolism {ECO:0000256|ARBA:ARBA00023277,
KW ECO:0000256|PROSITE-ProRule:PRU10059};
KW Cellulose degradation {ECO:0000256|ARBA:ARBA00023001,
KW ECO:0000256|RuleBase:RU361166};
KW Glycosidase {ECO:0000256|ARBA:ARBA00023295, ECO:0000256|PROSITE-
KW ProRule:PRU10059};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801, ECO:0000256|PROSITE-
KW ProRule:PRU10059};
KW Polysaccharide degradation {ECO:0000256|ARBA:ARBA00023326,
KW ECO:0000256|PROSITE-ProRule:PRU10059};
KW Reference proteome {ECO:0000313|Proteomes:UP000026915}.
FT DOMAIN 55..504
FT /note="Glycoside hydrolase family 9"
FT /evidence="ECO:0000259|Pfam:PF00759"
FT ACT_SITE 432
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU10059"
FT ACT_SITE 484
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU10060"
FT ACT_SITE 493
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU10060"
SQ SEQUENCE 515 AA; 57133 MW; F9109D6DF19CDD8C CRC64;
MHMLKQGTKT NQSKVGNCFH LSKSPMATSS SVSFLCLLLF LSPLLLNTVH GNPNYKEALL
KSILFFQGQR SGRLPANQQI TWRSNSGLSD GLLEHVDLTG GYYDAGDNVK FNFPMAFTTT
MLSWSTLEYG KRMGPQLQEA RAAIRWATDY LLKCANAKPG KLYVGVGDPN ADHKCWERPE
DMDTVRTSYS VSPSNPGSDV AAETAAALAA ASMVFRKIDP KYSSLLRETA RKVMAFAIQY
RGAYSDSLGS AVCPFYCSYS GYKDELLWGA SWLLRATNDA YYYNFLKTLG ADDQPDLFSW
DNKYAGAHVL LARRALVEND KNFEQYKQEA ESFMCRILPN SPYSTTQYTQ GGLMYKLPQS
NLQYVTSITF LLTTYGKYMK ARRQTFNCGN LMVSPNSLIG LAKRQVDYIL GENPIKMSYM
VGFGPNFPKR IHHRGSSLPS LASHPQSIGC DGGFQPFFYS SNPNPNILVG AIVGGPNQND
GYPDDRSDYS HSEPATYINA AMVGPLAYFA GLKAH
//