ID A0A1Q3CT97_CEPFO Unreviewed; 489 AA.
AC A0A1Q3CT97;
DT 12-APR-2017, integrated into UniProtKB/TrEMBL.
DT 12-APR-2017, sequence version 1.
DT 24-JAN-2024, entry version 20.
DE RecName: Full=Endoglucanase {ECO:0000256|RuleBase:RU361166};
DE EC=3.2.1.4 {ECO:0000256|RuleBase:RU361166};
GN ORFNames=CFOL_v3_26799 {ECO:0000313|EMBL:GAV83351.1};
OS Cephalotus follicularis (Albany pitcher plant).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; fabids; Oxalidales; Cephalotaceae; Cephalotus.
OX NCBI_TaxID=3775 {ECO:0000313|EMBL:GAV83351.1, ECO:0000313|Proteomes:UP000187406};
RN [1] {ECO:0000313|Proteomes:UP000187406}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. St1 {ECO:0000313|Proteomes:UP000187406};
RA Fukushima K., Hasebe M., Fang X.;
RT "Cephalotus genome sequencing.";
RL Submitted (APR-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- CATALYTIC ACTIVITY:
CC Reaction=Endohydrolysis of (1->4)-beta-D-glucosidic linkages in
CC cellulose, lichenin and cereal beta-D-glucans.; EC=3.2.1.4;
CC Evidence={ECO:0000256|ARBA:ARBA00000966,
CC ECO:0000256|RuleBase:RU361166};
CC -!- SIMILARITY: Belongs to the glycosyl hydrolase 9 (cellulase E) family.
CC {ECO:0000256|ARBA:ARBA00007072, ECO:0000256|PROSITE-ProRule:PRU10059,
CC ECO:0000256|RuleBase:RU361166}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:GAV83351.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; BDDD01002860; GAV83351.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A1Q3CT97; -.
DR STRING; 3775.A0A1Q3CT97; -.
DR InParanoid; A0A1Q3CT97; -.
DR OrthoDB; 1347382at2759; -.
DR Proteomes; UP000187406; Unassembled WGS sequence.
DR GO; GO:0008810; F:cellulase activity; IEA:UniProtKB-EC.
DR GO; GO:0030245; P:cellulose catabolic process; IEA:UniProtKB-KW.
DR Gene3D; 1.50.10.10; -; 1.
DR InterPro; IPR008928; 6-hairpin_glycosidase_sf.
DR InterPro; IPR012341; 6hp_glycosidase-like_sf.
DR InterPro; IPR001701; Glyco_hydro_9.
DR InterPro; IPR033126; Glyco_hydro_9_Asp/Glu_AS.
DR InterPro; IPR018221; Glyco_hydro_9_His_AS.
DR PANTHER; PTHR22298; ENDO-1,4-BETA-GLUCANASE; 1.
DR PANTHER; PTHR22298:SF54; ENDOGLUCANASE 9; 1.
DR Pfam; PF00759; Glyco_hydro_9; 1.
DR SUPFAM; SSF48208; Six-hairpin glycosidases; 1.
DR PROSITE; PS00592; GH9_2; 1.
DR PROSITE; PS00698; GH9_3; 1.
PE 3: Inferred from homology;
KW Carbohydrate metabolism {ECO:0000256|ARBA:ARBA00023277,
KW ECO:0000256|PROSITE-ProRule:PRU10059};
KW Cellulose degradation {ECO:0000256|ARBA:ARBA00023001,
KW ECO:0000256|RuleBase:RU361166};
KW Glycosidase {ECO:0000256|ARBA:ARBA00023295, ECO:0000256|PROSITE-
KW ProRule:PRU10059};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801, ECO:0000256|PROSITE-
KW ProRule:PRU10059};
KW Polysaccharide degradation {ECO:0000256|ARBA:ARBA00023326,
KW ECO:0000256|PROSITE-ProRule:PRU10059};
KW Reference proteome {ECO:0000313|Proteomes:UP000187406};
KW Signal {ECO:0000256|RuleBase:RU361166}.
FT SIGNAL 1..25
FT /evidence="ECO:0000256|RuleBase:RU361166"
FT CHAIN 26..489
FT /note="Endoglucanase"
FT /evidence="ECO:0000256|RuleBase:RU361166"
FT /id="PRO_5011821685"
FT DOMAIN 29..478
FT /note="Glycoside hydrolase family 9"
FT /evidence="ECO:0000259|Pfam:PF00759"
FT ACT_SITE 406
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU10059"
FT ACT_SITE 458
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU10060"
FT ACT_SITE 467
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU10060"
SQ SEQUENCE 489 AA; 53893 MW; 6D95A7E1567ACE48 CRC64;
MAMRREVRVA LLFLLCLFPL NTVHGNPNYK EALAMSILFF QGQRSGRLPK NNQIVWRSNS
GLSDGLLAHV DLTGGYYDAG DNVKFNFPMA FATTMLSWGT LEYGKRMGSQ LQDSRAAIRW
ATDYLLKCAT ATPSKLYVGV GDPNVDHKCW ERPEDMDTDR TVYSVSPSNP GSDVAGETAA
ALAAASLVFR TVDPKYSKLL ISTAKKVMQF AMQYRGAYSD SLGSSVCPFY CSYSGYKDEL
VWGAAWLLRA TNDASYFNFL KSFGNDDGVD IFSWDNKYAG ASVLLARRAL VNSDRNFEPF
VHQAENFMCR ILPNSPSSST LYTQGGLLFK LPQSNLQYVT SITFLLTTYA KYMKVAKHTF
NCGNVLVTQG SLMSLAKRQV DYILGVNPIK LSYMVGFGPN FPKRIHHRGS SLPSLASHPQ
SIGCDGGFQQ FFYSSNPNPN ILTGAIVGGP NQNDGYQDDR TDYSHSEPAT YINAAIVGPL
AYFAGSFSS
//