ID R7GYB9_9FIRM Unreviewed; 702 AA.
AC R7GYB9;
DT 24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT 24-JUL-2013, sequence version 1.
DT 24-JAN-2024, entry version 29.
DE SubName: Full=Endoglucanase {ECO:0000313|EMBL:CDE32105.1};
GN ORFNames=BN645_01588 {ECO:0000313|EMBL:CDE32105.1};
OS Ruminococcus sp. CAG:403.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Oscillospiraceae;
OC Ruminococcus.
OX NCBI_TaxID=1262958 {ECO:0000313|EMBL:CDE32105.1, ECO:0000313|Proteomes:UP000017925};
RN [1] {ECO:0000313|EMBL:CDE32105.1, ECO:0000313|Proteomes:UP000017925}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=MGS:403 {ECO:0000313|Proteomes:UP000017925};
RA Nielsen H.B., Almeida M., Juncker A.S., Rasmussen S., Li J., Sunagawa S.,
RA Plichta D., Gautier L., Le Chatelier E., Peletier E., Bonde I., Nielsen T.,
RA Manichanh C., Arumugam M., Batto J., Santos M.B.Q.D., Blom N., Borruel N.,
RA Burgdorf K.S., Boumezbeur F., Casellas F., Dore J., Guarner F., Hansen T.,
RA Hildebrand F., Kaas R.S., Kennedy S., Kristiansen K., Kultima J.R.,
RA Leonard P., Levenez F., Lund O., Moumen B., Le Paslier D., Pons N.,
RA Pedersen O., Prifti E., Qin J., Raes J., Tap J., Tims S., Ussery D.W.,
RA Yamada T., MetaHit consortium, Renault P., Sicheritz-Ponten T., Bork P.,
RA Wang J., Brunak S., Ehrlich S.D.;
RT "Dependencies among metagenomic species, viruses, plasmids and units of
RT genetic variation.";
RL Submitted (NOV-2012) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:CDE32105.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CBIS010000118; CDE32105.1; -; Genomic_DNA.
DR AlphaFoldDB; R7GYB9; -.
DR STRING; 1262958.BN645_01588; -.
DR Proteomes; UP000017925; Unassembled WGS sequence.
DR GO; GO:0004553; F:hydrolase activity, hydrolyzing O-glycosyl compounds; IEA:InterPro.
DR GO; GO:0030245; P:cellulose catabolic process; IEA:UniProtKB-KW.
DR Gene3D; 3.20.20.80; Glycosidases; 1.
DR InterPro; IPR040753; DUF5620.
DR InterPro; IPR001547; Glyco_hydro_5.
DR InterPro; IPR018087; Glyco_hydro_5_CS.
DR InterPro; IPR017853; Glycoside_hydrolase_SF.
DR PANTHER; PTHR31297:SF41; ENDOGLUCANASE, PUTATIVE (AFU_ORTHOLOGUE AFUA_5G01830)-RELATED; 1.
DR PANTHER; PTHR31297; GLUCAN ENDO-1,6-BETA-GLUCOSIDASE B; 1.
DR Pfam; PF00150; Cellulase; 1.
DR Pfam; PF18522; DUF5620; 2.
DR SUPFAM; SSF51445; (Trans)glycosidases; 1.
DR PROSITE; PS00659; GLYCOSYL_HYDROL_F5; 1.
PE 4: Predicted;
KW Carbohydrate metabolism {ECO:0000256|ARBA:ARBA00023326};
KW Cellulose degradation {ECO:0000256|ARBA:ARBA00023001};
KW Glycosidase {ECO:0000256|ARBA:ARBA00023295};
KW Hydrolase {ECO:0000256|ARBA:ARBA00023295};
KW Polysaccharide degradation {ECO:0000256|ARBA:ARBA00023326};
KW Reference proteome {ECO:0000313|Proteomes:UP000017925};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..23
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 24..702
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5038506892"
FT DOMAIN 87..200
FT /note="DUF5620"
FT /evidence="ECO:0000259|Pfam:PF18522"
FT DOMAIN 226..336
FT /note="DUF5620"
FT /evidence="ECO:0000259|Pfam:PF18522"
FT DOMAIN 393..674
FT /note="Glycoside hydrolase family 5"
FT /evidence="ECO:0000259|Pfam:PF00150"
FT REGION 25..68
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 26..68
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 702 AA; 76493 MW; 42F537B6E4AD6E10 CRC64;
MMNWKKGLAV VLAGLSLAIP LSGCKKDTAD SSSEASTSAA ESSGTDAPEA TTTKEETTVT
EHVSPTETVS STLGIEKNGI NMTLDRDKTN TYKANLDQFI QDGDQVQSFT FIFYAADGSS
NIGTYKGACG VSVKDGSSAA TDKNWYQSDD FSQEANGSYI ELNWTVPAEV QKDIDPSGQV
LVGYWWGDVQ QVRLDSIVCT FTRTATVPVD GTKSISPNTT LDYADESAKE AKISLGDLIT
EGDTLQTVTF DVSGGGSLGK FTGAFGVSVA EDSKAATDKG WYQSSNVVLF TDSNAVSLTW
IVPDEVKADI QAGGEVMLGY WWSDQNPVTL TNVSVRYSND GITGSTATTA TPADSEKKEA
VDNTVEAGDM TASEIVDNIR IGWNLGNTFD SYNTSSSDTE TGWGNPKTTK AMIDTVKDAG
FNAIRIPVTW GEHLSADYTI DANWMARIKE VVDYAYDQNM FVILNVHHDD ALWLVPTNAK
LEEDKTILSK IWTQIGTTFQ DYDSHLIFEG MNEPRVIGSS TEWSGGTPEE RQVINQLFVT
FTDTVRGLGG NNAKRALIVT SHAQSIVKDA VQAIEIPNND PNIIVSIHSY APWDFAGTDS
ERSTWGTDAD KQELDQNFQF LKETFVDKGI PVILDEFGAV NKNNEADRDA YYEYYVKSAK
AHGGIKCFVW DNGTQKEFGL LNRSQNSWYF PSIVDAMMRG AA
//