ID E9SF24_RUMAL Unreviewed; 840 AA.
AC E9SF24;
DT 03-MAY-2011, integrated into UniProtKB/TrEMBL.
DT 03-MAY-2011, sequence version 1.
DT 24-JAN-2024, entry version 45.
DE RecName: Full=cellulase {ECO:0000256|ARBA:ARBA00012601};
DE EC=3.2.1.4 {ECO:0000256|ARBA:ARBA00012601};
GN ORFNames=CUS_5680 {ECO:0000313|EMBL:EGC02117.1};
OS Ruminococcus albus 8.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Oscillospiraceae;
OC Ruminococcus.
OX NCBI_TaxID=246199 {ECO:0000313|EMBL:EGC02117.1, ECO:0000313|Proteomes:UP000004259};
RN [1] {ECO:0000313|EMBL:EGC02117.1, ECO:0000313|Proteomes:UP000004259}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=8 {ECO:0000313|EMBL:EGC02117.1,
RC ECO:0000313|Proteomes:UP000004259};
RA Nelson K.E., Sutton G., Torralba M., Durkin S., Harkins D., Montgomery R.,
RA Ziemer C., Klaassens E., Ocuiv P., Morrison M.;
RL Submitted (FEB-2011) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EGC02117.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ADKM02000110; EGC02117.1; -; Genomic_DNA.
DR RefSeq; WP_002851471.1; NZ_ADKM02000110.1.
DR AlphaFoldDB; E9SF24; -.
DR STRING; 246199.CUS_5680; -.
DR eggNOG; COG4124; Bacteria.
DR eggNOG; COG5297; Bacteria.
DR OrthoDB; 2078139at2; -.
DR Proteomes; UP000004259; Unassembled WGS sequence.
DR GO; GO:0030248; F:cellulose binding; IEA:InterPro.
DR GO; GO:0004553; F:hydrolase activity, hydrolyzing O-glycosyl compounds; IEA:InterPro.
DR GO; GO:0000272; P:polysaccharide catabolic process; IEA:UniProtKB-KW.
DR CDD; cd14256; Dockerin_I; 1.
DR Gene3D; 1.50.10.10; -; 1.
DR Gene3D; 1.10.1330.10; Dockerin domain; 1.
DR Gene3D; 2.60.40.710; Endoglucanase-like; 1.
DR InterPro; IPR008928; 6-hairpin_glycosidase_sf.
DR InterPro; IPR012341; 6hp_glycosidase-like_sf.
DR InterPro; IPR008965; CBM2/CBM3_carb-bd_dom_sf.
DR InterPro; IPR001956; CBM3.
DR InterPro; IPR036966; CBM3_sf.
DR InterPro; IPR002105; Dockerin_1_rpt.
DR InterPro; IPR016134; Dockerin_dom.
DR InterPro; IPR036439; Dockerin_dom_sf.
DR InterPro; IPR001701; Glyco_hydro_9.
DR PANTHER; PTHR22298; ENDO-1,4-BETA-GLUCANASE; 1.
DR PANTHER; PTHR22298:SF29; ENDOGLUCANASE; 1.
DR Pfam; PF00404; Dockerin_1; 1.
DR Pfam; PF00759; Glyco_hydro_9; 2.
DR SUPFAM; SSF49384; Carbohydrate-binding domain; 1.
DR SUPFAM; SSF48208; Six-hairpin glycosidases; 1.
DR SUPFAM; SSF63446; Type I dockerin domain; 1.
DR PROSITE; PS51172; CBM3; 1.
DR PROSITE; PS51766; DOCKERIN; 1.
PE 4: Predicted;
KW Carbohydrate metabolism {ECO:0000256|ARBA:ARBA00023277};
KW Glycosidase {ECO:0000256|ARBA:ARBA00023295};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Polysaccharide degradation {ECO:0000256|ARBA:ARBA00023326};
KW Reference proteome {ECO:0000313|Proteomes:UP000004259};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..30
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 31..840
FT /note="cellulase"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5003247121"
FT DOMAIN 587..753
FT /note="CBM3"
FT /evidence="ECO:0000259|PROSITE:PS51172"
FT DOMAIN 766..838
FT /note="Dockerin"
FT /evidence="ECO:0000259|PROSITE:PS51766"
SQ SEQUENCE 840 AA; 93969 MW; ED3CF05317F889B1 CRC64;
MKLKHRLVSA AASVAIALSQ TSTLVPTVNA ADEINDAIAN SSGVDYDFAR ALQYSMYFYD
ANMCGDDVEG NNRFEWRGNC HAYDAKVPLH PIEDWVGVNL TEEQIEKYRD ILDPDGDGCV
DVAGGFHDAG DHVEFGMPEN YAASTVGWGY YEFRDSYVKI GEQEHIETIL RHFNDYLMKC
TFLDKNGDVV LHCYQVGDGD LDHKFWNSPE MDEMGRPAFF LTEDKPQTDY VASAAASLAI
NYLNFKDTDK AYAEKNLKYA KALFKFAMDH PKELSDNADG PKGYYRSSKW EDDYCWAAAW
LYKITGDHMY LEEIYPNYDY YAAPCYVHCW NDVWGGVQCI LGEISEDKPF NKGEYVYPNF
IDEFKVAANK SPYEQMNCWA SVEKAINTYM TGGIGEITPQ GYWWMDTWGS ARYNTAGQLM
ALVYTKYNGD APSKYSDWAK GQMEYILGKN DITYLERIDE NTEAEKNGEP AVWSEDELHG
SRCFLVGYNE NSVKYPHHRA SSGLTKCEDP DPQRYVLFGA LAGGPDNADR HRDVTKDWIY
NEVTIDYNAA MVGACAALYK YYGTPDMAPT KNFPPKPTFT GSDGEAGSNG CWVTACGIDD
LHADGAGVTK VSLYVMTAST KKVEDISVRY YFSTKGMSDP NHVQVSELYD QAAVEAAPAN
GTISGPYKYD KMDDVYYVEI KWSDYNIANS GKKYQFTVGL YYGDMWQPAD DWSYQNLKIY
EQDDAFFATG TEVRNDNICV YSGDKLVGGV EPDGTVPDFG TDTDKTAVNY GDADCDGAVT
MNDAVLMSQY ISSPKKYPLT EAGLKNADCV EDGAITSADV KAVQMLLAGI YTQSDLPAKK
//