ID A0A3N2MIW6_9BACT Unreviewed; 456 AA.
AC A0A3N2MIW6;
DT 13-FEB-2019, integrated into UniProtKB/TrEMBL.
DT 13-FEB-2019, sequence version 1.
DT 27-MAR-2024, entry version 23.
DE SubName: Full=Carbohydrate-binding protein {ECO:0000313|EMBL:ROT10102.1};
GN ORFNames=EEL33_01550 {ECO:0000313|EMBL:ROT10102.1};
OS Muribaculaceae bacterium Isolate-037 (Harlan).
OC Bacteria; Bacteroidota; Bacteroidia; Bacteroidales; Muribaculaceae.
OX NCBI_TaxID=2486461 {ECO:0000313|EMBL:ROT10102.1, ECO:0000313|Proteomes:UP000275399};
RN [1] {ECO:0000313|EMBL:ROT10102.1, ECO:0000313|Proteomes:UP000275399}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Isolate-037 (Harlan) {ECO:0000313|Proteomes:UP000275399};
RA Clavel T., Strowig T.;
RT "Sequence and cultivation study of Muribaculaceae reveals novel species,
RT host preference, and functional potential of this yet undescribed family.";
RL Submitted (NOV-2018) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the glycosyl hydrolase 43 family.
CC {ECO:0000256|ARBA:ARBA00009865, ECO:0000256|RuleBase:RU361187}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:ROT10102.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; RIAZ01000001; ROT10102.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A3N2MIW6; -.
DR Proteomes; UP000275399; Unassembled WGS sequence.
DR GO; GO:0030246; F:carbohydrate binding; IEA:InterPro.
DR GO; GO:0004553; F:hydrolase activity, hydrolyzing O-glycosyl compounds; IEA:InterPro.
DR GO; GO:0005975; P:carbohydrate metabolic process; IEA:UniProtKB-KW.
DR CDD; cd04084; CBM6_xylanase-like; 1.
DR CDD; cd18618; GH43_Xsa43E-like; 1.
DR Gene3D; 2.60.120.260; Galactose-binding domain-like; 1.
DR InterPro; IPR006584; Cellulose-bd_IV.
DR InterPro; IPR005084; CMB_fam6.
DR InterPro; IPR008979; Galactose-bd-like_sf.
DR InterPro; IPR006710; Glyco_hydro_43.
DR InterPro; IPR023296; Glyco_hydro_beta-prop_sf.
DR PANTHER; PTHR43772:SF6; -; 1.
DR PANTHER; PTHR43772; ENDO-1,4-BETA-XYLANASE; 1.
DR Pfam; PF03422; CBM_6; 1.
DR Pfam; PF04616; Glyco_hydro_43; 1.
DR SMART; SM00606; CBD_IV; 1.
DR SUPFAM; SSF75005; Arabinanase/levansucrase/invertase; 1.
DR SUPFAM; SSF49785; Galactose-binding domain-like; 1.
DR PROSITE; PS51175; CBM6; 1.
PE 3: Inferred from homology;
KW Carbohydrate metabolism {ECO:0000256|ARBA:ARBA00022651};
KW Glycosidase {ECO:0000256|ARBA:ARBA00023295, ECO:0000256|RuleBase:RU361187};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801, ECO:0000256|RuleBase:RU361187};
KW Polysaccharide degradation {ECO:0000256|ARBA:ARBA00022651};
KW Reference proteome {ECO:0000313|Proteomes:UP000275399};
KW Signal {ECO:0000256|SAM:SignalP};
KW Xylan degradation {ECO:0000256|ARBA:ARBA00022651}.
FT SIGNAL 1..20
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 21..456
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5018056874"
FT DOMAIN 330..456
FT /note="CBM6"
FT /evidence="ECO:0000259|PROSITE:PS51175"
FT ACT_SITE 32
FT /note="Proton acceptor"
FT /evidence="ECO:0000256|PIRSR:PIRSR606710-1"
FT ACT_SITE 214
FT /note="Proton donor"
FT /evidence="ECO:0000256|PIRSR:PIRSR606710-1"
FT SITE 148
FT /note="Important for catalytic activity, responsible for
FT pKa modulation of the active site Glu and correct
FT orientation of both the proton donor and substrate"
FT /evidence="ECO:0000256|PIRSR:PIRSR606710-2"
SQ SEQUENCE 456 AA; 50752 MW; 28FFD6CF9C49F25B CRC64;
MRKSIIISVG MLSCILSASA DNPFVQTCFT TDPAPMVHDG TLYVYTGHDE DGADFFWMQE
WRVYSTKDMV NWTDHGSPLA IEDFSWGDDR AWAPQCVERN GKFYFYVPLH SKLTGAMAIG
VAVGDSPTGP FKDAIGKPLV DGDWAYIDPT VYIDDDGQAY LYWGNPEIFY VKLNDDMVSY
SGDVNVVEQT VEGFGAPSPK LREKDKKYKD NYTEGPWFYK RNGKYYLLYA AGGVPEHIAY
SMSDSPTGPW KYMGEIMPLC DTGSFTNHCG VAEFKGQPYF FYHTGKLPKG GGFARSIAVE
PFEYNADGTF PTIMPTEEGA KAVDTLNPYD RVEGETMAWS KGLKTKQNAQ TGVYVTDVHD
GDFIKVANVN FMSGGPKMFE ARAASGLRGG SIEVRMDSIG GPVVAELPVT GTGGWETWQT
FKTSCKSEVP GVHDLYLLFK GRKGPELFNL DWWRFE
//