ID R6DF64_9CLOT Unreviewed; 644 AA.
AC R6DF64;
DT 24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT 24-JUL-2013, sequence version 1.
DT 24-JAN-2024, entry version 30.
DE SubName: Full=Endoglucanase {ECO:0000313|EMBL:CDA86415.1};
GN ORFNames=BN547_01303 {ECO:0000313|EMBL:CDA86415.1};
OS Clostridium sp. CAG:230.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Clostridiaceae;
OC Clostridium.
OX NCBI_TaxID=1262782 {ECO:0000313|EMBL:CDA86415.1, ECO:0000313|Proteomes:UP000017961};
RN [1] {ECO:0000313|EMBL:CDA86415.1, ECO:0000313|Proteomes:UP000017961}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=MGS:230 {ECO:0000313|Proteomes:UP000017961};
RA Nielsen H.B., Almeida M., Juncker A.S., Rasmussen S., Li J., Sunagawa S.,
RA Plichta D., Gautier L., Le Chatelier E., Peletier E., Bonde I., Nielsen T.,
RA Manichanh C., Arumugam M., Batto J., Santos M.B.Q.D., Blom N., Borruel N.,
RA Burgdorf K.S., Boumezbeur F., Casellas F., Dore J., Guarner F., Hansen T.,
RA Hildebrand F., Kaas R.S., Kennedy S., Kristiansen K., Kultima J.R.,
RA Leonard P., Levenez F., Lund O., Moumen B., Le Paslier D., Pons N.,
RA Pedersen O., Prifti E., Qin J., Raes J., Tap J., Tims S., Ussery D.W.,
RA Yamada T., MetaHit consortium, Renault P., Sicheritz-Ponten T., Bork P.,
RA Wang J., Brunak S., Ehrlich S.D.;
RT "Dependencies among metagenomic species, viruses, plasmids and units of
RT genetic variation.";
RL Submitted (NOV-2012) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the glycosyl hydrolase 5 (cellulase A) family.
CC {ECO:0000256|RuleBase:RU361153}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:CDA86415.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CBCQ010000091; CDA86415.1; -; Genomic_DNA.
DR STRING; 1262782.BN547_01303; -.
DR Proteomes; UP000017961; Unassembled WGS sequence.
DR GO; GO:0004553; F:hydrolase activity, hydrolyzing O-glycosyl compounds; IEA:InterPro.
DR GO; GO:0030245; P:cellulose catabolic process; IEA:UniProtKB-KW.
DR Gene3D; 2.60.40.1080; -; 2.
DR Gene3D; 3.20.20.80; Glycosidases; 1.
DR InterPro; IPR003343; Big_2.
DR InterPro; IPR001547; Glyco_hydro_5.
DR InterPro; IPR018087; Glyco_hydro_5_CS.
DR InterPro; IPR017853; Glycoside_hydrolase_SF.
DR InterPro; IPR008964; Invasin/intimin_cell_adhesion.
DR PANTHER; PTHR31297:SF41; ENDOGLUCANASE, PUTATIVE (AFU_ORTHOLOGUE AFUA_5G01830)-RELATED; 1.
DR PANTHER; PTHR31297; GLUCAN ENDO-1,6-BETA-GLUCOSIDASE B; 1.
DR Pfam; PF02368; Big_2; 2.
DR Pfam; PF00150; Cellulase; 1.
DR SMART; SM00635; BID_2; 2.
DR SUPFAM; SSF51445; (Trans)glycosidases; 1.
DR SUPFAM; SSF49373; Invasin/intimin cell-adhesion fragments; 2.
DR PROSITE; PS00659; GLYCOSYL_HYDROL_F5; 1.
PE 3: Inferred from homology;
KW Carbohydrate metabolism {ECO:0000256|ARBA:ARBA00023326};
KW Cellulose degradation {ECO:0000256|ARBA:ARBA00023001};
KW Glycosidase {ECO:0000256|ARBA:ARBA00023295, ECO:0000256|RuleBase:RU361153};
KW Hydrolase {ECO:0000256|RuleBase:RU361153};
KW Polysaccharide degradation {ECO:0000256|ARBA:ARBA00023326};
KW Reference proteome {ECO:0000313|Proteomes:UP000017961};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..17
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 18..644
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5039491490"
FT DOMAIN 420..495
FT /note="BIG2"
FT /evidence="ECO:0000259|SMART:SM00635"
FT DOMAIN 513..593
FT /note="BIG2"
FT /evidence="ECO:0000259|SMART:SM00635"
FT REGION 595..644
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 601..644
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 644 AA; 70505 MW; 33F3665DD5FB0DD9 CRC64;
MRKWSKAIAF MMTAALAVGS LSVGPAVQKA DAADRIGNYM SWDDDQTTKD IKIPVNPQTF
RDLSGTEIIE EMGIGWILGN TFDSHTNQTP GETTWGAPVT TKKMIKAVHD LGFNTIRVPV
TWGTMVKDDG SIDAAWISRV EDVINYCMDE DMYVILNAHH DGADNVGTDK EGKTVHGWLD
IGGTDEEFAA VEAKYQKMWA SIANYFKNYD EHLIFESMNE VYSGSGDTNL QKDMERINKL
NKTFGAAVRS TGSNNAKRWL LLASRNTNIK SLYKNADKFE IPNDGTDRYM VSVHDYDDFK
IGGYTDSMNE SKSDSYANQF KKLKAAFVDK GIPVVVGECG FRGGSDRTYK FEGVSYMLKK
YGLAGCIWDN HGTQGTTDNY EIFDREQCAP YNKNYTDGVM RGFYTDSDDS QLNEKTTVSA
MTSLDLDKDS VSIAVGSMEK VTATTAPADN NDVVLWKSDN SRVASVSNGR IHARRIGTAT
ITAFAQSGSV EKKITVTVTK KTLEKETTDI QTDYDAFKFE KFDYEINDQG LLVSPVAYLN
ASAVPASNGA VTFESSDENV VSVSSTGKLL GYGYGKAVIT LTAADGFTKE IPVSIIDPNA
TPEPDPTSTT APSAQPSVQP GSXXXCSAGQ PANSRYKCRT NSKP
//