ID R7H2I7_9FIRM Unreviewed; 617 AA.
AC R7H2I7;
DT 24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT 24-JUL-2013, sequence version 1.
DT 24-JAN-2024, entry version 31.
DE SubName: Full=Cellulase (Glycosyl hydrolase family 5) {ECO:0000313|EMBL:CDE33874.1};
GN ORFNames=BN645_00339 {ECO:0000313|EMBL:CDE33874.1};
OS Ruminococcus sp. CAG:403.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Oscillospiraceae;
OC Ruminococcus.
OX NCBI_TaxID=1262958 {ECO:0000313|EMBL:CDE33874.1, ECO:0000313|Proteomes:UP000017925};
RN [1] {ECO:0000313|EMBL:CDE33874.1, ECO:0000313|Proteomes:UP000017925}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=MGS:403 {ECO:0000313|Proteomes:UP000017925};
RA Nielsen H.B., Almeida M., Juncker A.S., Rasmussen S., Li J., Sunagawa S.,
RA Plichta D., Gautier L., Le Chatelier E., Peletier E., Bonde I., Nielsen T.,
RA Manichanh C., Arumugam M., Batto J., Santos M.B.Q.D., Blom N., Borruel N.,
RA Burgdorf K.S., Boumezbeur F., Casellas F., Dore J., Guarner F., Hansen T.,
RA Hildebrand F., Kaas R.S., Kennedy S., Kristiansen K., Kultima J.R.,
RA Leonard P., Levenez F., Lund O., Moumen B., Le Paslier D., Pons N.,
RA Pedersen O., Prifti E., Qin J., Raes J., Tap J., Tims S., Ussery D.W.,
RA Yamada T., MetaHit consortium, Renault P., Sicheritz-Ponten T., Bork P.,
RA Wang J., Brunak S., Ehrlich S.D.;
RT "Dependencies among metagenomic species, viruses, plasmids and units of
RT genetic variation.";
RL Submitted (NOV-2012) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the glycosyl hydrolase 5 (cellulase A) family.
CC {ECO:0000256|RuleBase:RU361153}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:CDE33874.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CBIS010000200; CDE33874.1; -; Genomic_DNA.
DR AlphaFoldDB; R7H2I7; -.
DR STRING; 1262958.BN645_00339; -.
DR Proteomes; UP000017925; Unassembled WGS sequence.
DR GO; GO:0004553; F:hydrolase activity, hydrolyzing O-glycosyl compounds; IEA:InterPro.
DR GO; GO:0030245; P:cellulose catabolic process; IEA:UniProtKB-KW.
DR CDD; cd14256; Dockerin_I; 1.
DR Gene3D; 1.10.1330.10; Dockerin domain; 1.
DR Gene3D; 3.20.20.80; Glycosidases; 1.
DR InterPro; IPR002105; Dockerin_1_rpt.
DR InterPro; IPR016134; Dockerin_dom.
DR InterPro; IPR036439; Dockerin_dom_sf.
DR InterPro; IPR001547; Glyco_hydro_5.
DR InterPro; IPR018087; Glyco_hydro_5_CS.
DR InterPro; IPR017853; Glycoside_hydrolase_SF.
DR PANTHER; PTHR31297:SF41; ENDOGLUCANASE, PUTATIVE (AFU_ORTHOLOGUE AFUA_5G01830)-RELATED; 1.
DR PANTHER; PTHR31297; GLUCAN ENDO-1,6-BETA-GLUCOSIDASE B; 1.
DR Pfam; PF00150; Cellulase; 1.
DR Pfam; PF00404; Dockerin_1; 1.
DR SUPFAM; SSF51445; (Trans)glycosidases; 1.
DR SUPFAM; SSF63446; Type I dockerin domain; 1.
DR PROSITE; PS51766; DOCKERIN; 1.
DR PROSITE; PS00659; GLYCOSYL_HYDROL_F5; 1.
PE 3: Inferred from homology;
KW Carbohydrate metabolism {ECO:0000256|ARBA:ARBA00023326};
KW Cellulose degradation {ECO:0000256|ARBA:ARBA00023001};
KW Glycosidase {ECO:0000256|ARBA:ARBA00023295, ECO:0000256|RuleBase:RU361153};
KW Hydrolase {ECO:0000256|RuleBase:RU361153, ECO:0000313|EMBL:CDE33874.1};
KW Polysaccharide degradation {ECO:0000256|ARBA:ARBA00023326};
KW Reference proteome {ECO:0000313|Proteomes:UP000017925}.
FT DOMAIN 549..617
FT /note="Dockerin"
FT /evidence="ECO:0000259|PROSITE:PS51766"
SQ SEQUENCE 617 AA; 68364 MW; 26B94EC69700E568 CRC64;
MKQSHTSHPY HRIQACLVAM LCLLTLLGAS ISQTISASAK ETTLTGQSAK DLTGQMTIGW
NLGNTLDAYG GSSTAAPEKY VTTWGNPAPT QATFDAIKKA GFNTVRIPTT WYPHVTFDTT
SQTYVVDETW MNYVKQVVDF AYQQDMFVIL NVHHEETWIN VSQFTDETLE TAKKMLSDIW
TQIATTFQDY DQHLVFEGMN EPRQKANSAV GEWGNGSGDN GYTWSYINTL NAVFVQTVRS
QGSSANQERL LMLPGYCASS DPVAINQITV PENAGNVALS VHAYAPYFFT MDTSDYANHS
FPGKSGWGED YETNLTNLFQ SLKSISDSKQ VPIIIGEFSA SDFENTADRE RWATSYLTHA
KEAGIPCVLW DNNVSANGTG EAHGYLYRLT NTWYPNSISV IRAMMAVYDI TPELPEYEEY
VAPTFSWDDM QIGEDWVELY KKEAGKSLAA WKNAMVLTDW KTYLKPGAKF VMFADTTSDP
ELVLQGGWYR LASSGSTGTF VYEFSYESIV EALEAEGANL DDMQNLYVSA TAQAAKIYGL
YAVPAKSSSE TVLGDLNGDG ELTILDAVLM QRFLLGDLTL TDEQAAVMDC NEDGVWNAFD
GAYLKRKLLL PMAEGVQ
//