ID R9MD54_9FIRM Unreviewed; 336 AA.
AC R9MD54;
DT 24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT 24-JUL-2013, sequence version 1.
DT 24-JAN-2024, entry version 32.
DE RecName: Full=cellulase {ECO:0000256|ARBA:ARBA00012601};
DE EC=3.2.1.4 {ECO:0000256|ARBA:ARBA00012601};
GN ORFNames=C818_02822 {ECO:0000313|EMBL:EOS68945.1};
OS Lachnospiraceae bacterium MD308.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Lachnospiraceae.
OX NCBI_TaxID=1235799 {ECO:0000313|EMBL:EOS68945.1, ECO:0000313|Proteomes:UP000014117};
RN [1] {ECO:0000313|EMBL:EOS68945.1, ECO:0000313|Proteomes:UP000014117}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=3-2 {ECO:0000313|EMBL:EOS68945.1,
RC ECO:0000313|Proteomes:UP000014117};
RG The Broad Institute Genomics Platform;
RG The Broad Institute Genome Sequencing Center for Infectious Disease;
RA Earl A., Xavier R., Elson C., Duck W., Walker B., Young S., Zeng Q.,
RA Gargeya S., Fitzgerald M., Haas B., Abouelleil A., Allen A.W., Alvarado L.,
RA Arachchi H.M., Berlin A.M., Chapman S.B., Gainer-Dewar J., Goldberg J.,
RA Griggs A., Gujja S., Hansen M., Howarth C., Imamovic A., Ireland A.,
RA Larimer J., McCowan C., Murphy C., Pearson M., Poon T.W., Priest M.,
RA Roberts A., Saif S., Shea T., Sisk P., Sykes S., Wortman J., Nusbaum C.,
RA Birren B.;
RT "The Genome Sequence of Lachnospiraceae bacterium 3-2.";
RL Submitted (APR-2013) to the EMBL/GenBank/DDBJ databases.
CC -!- CATALYTIC ACTIVITY:
CC Reaction=Endohydrolysis of (1->4)-beta-D-glucosidic linkages in
CC cellulose, lichenin and cereal beta-D-glucans.; EC=3.2.1.4;
CC Evidence={ECO:0000256|ARBA:ARBA00000966};
CC -!- SIMILARITY: Belongs to the glycosyl hydrolase 5 (cellulase A) family.
CC {ECO:0000256|RuleBase:RU361153}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EOS68945.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ASTE01000037; EOS68945.1; -; Genomic_DNA.
DR AlphaFoldDB; R9MD54; -.
DR STRING; 1235799.C818_02822; -.
DR PATRIC; fig|1235799.3.peg.2964; -.
DR eggNOG; COG2730; Bacteria.
DR HOGENOM; CLU_012932_3_0_9; -.
DR OrthoDB; 154460at2; -.
DR Proteomes; UP000014117; Unassembled WGS sequence.
DR GO; GO:0004553; F:hydrolase activity, hydrolyzing O-glycosyl compounds; IEA:InterPro.
DR GO; GO:0030245; P:cellulose catabolic process; IEA:UniProtKB-KW.
DR Gene3D; 3.20.20.80; Glycosidases; 1.
DR InterPro; IPR001547; Glyco_hydro_5.
DR InterPro; IPR018087; Glyco_hydro_5_CS.
DR InterPro; IPR017853; Glycoside_hydrolase_SF.
DR PANTHER; PTHR34142:SF1; CELLULASE DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR34142; ENDO-BETA-1,4-GLUCANASE A; 1.
DR Pfam; PF00150; Cellulase; 1.
DR SUPFAM; SSF51445; (Trans)glycosidases; 1.
DR PROSITE; PS00659; GLYCOSYL_HYDROL_F5; 1.
PE 3: Inferred from homology;
KW Carbohydrate metabolism {ECO:0000256|ARBA:ARBA00023326};
KW Cellulose degradation {ECO:0000256|ARBA:ARBA00023001};
KW Glycosidase {ECO:0000256|ARBA:ARBA00023295, ECO:0000256|RuleBase:RU361153};
KW Hydrolase {ECO:0000256|RuleBase:RU361153};
KW Polysaccharide degradation {ECO:0000256|ARBA:ARBA00023326};
KW Reference proteome {ECO:0000313|Proteomes:UP000014117};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..23
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 24..336
FT /note="cellulase"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5038410421"
FT DOMAIN 54..304
FT /note="Glycoside hydrolase family 5"
FT /evidence="ECO:0000259|Pfam:PF00150"
SQ SEQUENCE 336 AA; 38063 MW; 5CB2C812EB3917BC CRC64;
MKKHTRKYLC LLLILVLAFS IPAEITAAPA KSTKLKTPLA KNGRLKVKKT SLLNSKGKPV
VLKGVSTHGI NWFPEYVNPD SFKTLRDNWR VNCIRLAMYT EEYNGYCSGG SKKDLKSLID
KGVTYATELG MYVIIDWHIL SDSNPNKNKK QAISFFKEMA KKYRKNKNVL YEICNEPNGG
TSWKDIKSYA KSVIKTIRTY DKKNIILVGT PTWSQDVDTA ADSPIKGYSN LMYTFHFYAA
THKDNYRSKV EKAIKKGLPV FVSEFGISES SGNGKINKKE ANKWIKFLKK RKISFVCWNL
SNKNESSALL KSSCQKTKGF KSSDLSAAGK WFKKVL
//