ID R5A6E6_9CLOT Unreviewed; 510 AA.
AC R5A6E6;
DT 24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT 24-JUL-2013, sequence version 1.
DT 24-JAN-2024, entry version 25.
DE SubName: Full=Glycosyl hydrolase family 43 {ECO:0000313|EMBL:CCX38675.1};
GN ORFNames=BN452_00290 {ECO:0000313|EMBL:CCX38675.1};
OS Clostridium sp. CAG:1013.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Clostridiaceae;
OC Clostridium.
OX NCBI_TaxID=1262769 {ECO:0000313|EMBL:CCX38675.1, ECO:0000313|Proteomes:UP000018382};
RN [1] {ECO:0000313|EMBL:CCX38675.1, ECO:0000313|Proteomes:UP000018382}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=MGS:1013 {ECO:0000313|Proteomes:UP000018382};
RA Nielsen H.B., Almeida M., Juncker A.S., Rasmussen S., Li J., Sunagawa S.,
RA Plichta D., Gautier L., Le Chatelier E., Peletier E., Bonde I., Nielsen T.,
RA Manichanh C., Arumugam M., Batto J., Santos M.B.Q.D., Blom N., Borruel N.,
RA Burgdorf K.S., Boumezbeur F., Casellas F., Dore J., Guarner F., Hansen T.,
RA Hildebrand F., Kaas R.S., Kennedy S., Kristiansen K., Kultima J.R.,
RA Leonard P., Levenez F., Lund O., Moumen B., Le Paslier D., Pons N.,
RA Pedersen O., Prifti E., Qin J., Raes J., Tap J., Tims S., Ussery D.W.,
RA Yamada T., MetaHit consortium, Renault P., Sicheritz-Ponten T., Bork P.,
RA Wang J., Brunak S., Ehrlich S.D.;
RT "Dependencies among metagenomic species, viruses, plasmids and units of
RT genetic variation.";
RL Submitted (NOV-2012) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the glycosyl hydrolase 43 family.
CC {ECO:0000256|ARBA:ARBA00009865, ECO:0000256|RuleBase:RU361187}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:CCX38675.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CAWF010000282; CCX38675.1; -; Genomic_DNA.
DR AlphaFoldDB; R5A6E6; -.
DR Proteomes; UP000018382; Unassembled WGS sequence.
DR GO; GO:0004553; F:hydrolase activity, hydrolyzing O-glycosyl compounds; IEA:InterPro.
DR GO; GO:0005975; P:carbohydrate metabolic process; IEA:InterPro.
DR CDD; cd18617; GH43_XynB-like; 1.
DR Gene3D; 2.60.120.200; -; 1.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR041542; GH43_C2.
DR InterPro; IPR006710; Glyco_hydro_43.
DR InterPro; IPR023296; Glyco_hydro_beta-prop_sf.
DR PANTHER; PTHR42812; BETA-XYLOSIDASE; 1.
DR PANTHER; PTHR42812:SF12; BETA-XYLOSIDASE-RELATED; 1.
DR Pfam; PF17851; GH43_C2; 1.
DR Pfam; PF04616; Glyco_hydro_43; 1.
DR SUPFAM; SSF75005; Arabinanase/levansucrase/invertase; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE 3: Inferred from homology;
KW Glycosidase {ECO:0000256|ARBA:ARBA00023295, ECO:0000256|RuleBase:RU361187};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801, ECO:0000256|RuleBase:RU361187};
KW Reference proteome {ECO:0000313|Proteomes:UP000018382}.
FT DOMAIN 320..509
FT /note="Beta-xylosidase C-terminal Concanavalin A-like"
FT /evidence="ECO:0000259|Pfam:PF17851"
FT ACT_SITE 14
FT /note="Proton acceptor"
FT /evidence="ECO:0000256|PIRSR:PIRSR606710-1"
FT ACT_SITE 175
FT /note="Proton donor"
FT /evidence="ECO:0000256|PIRSR:PIRSR606710-1"
FT SITE 120
FT /note="Important for catalytic activity, responsible for
FT pKa modulation of the active site Glu and correct
FT orientation of both the proton donor and substrate"
FT /evidence="ECO:0000256|PIRSR:PIRSR606710-2"
SQ SEQUENCE 510 AA; 57581 MW; 464C1876D205092D CRC64;
MKYRNPLLPG FYPDPSVCRV GEDYYLVNSS FEFFPGVPLW HSRDMVNWES LGYVLTRRSQ
LELDGARVSG GIFAPTIRFH NGKYYMITTN VTHGGNFVVT ADDPKGPWSD PVWIDQGGID
PSLFWDEDGK VYMQRTHTDD QGVQCIGQFE VDLETGKVLS EVRPIWYGTG GKCPEGPHIY
KIFGKYYLMI AEGGTEYGHM ETIARSDSIW GPFESCPRNP ILTHRNLNPR NGEAQALGHA
DLVEDGKSNW WMVFHGIRPS QFMLHHMGRE TMLAPVTWDE EGWPVVGEGK PIQWEMEGPG
TPEQTLPDGY ALQNTFAQEE DFTTAKELSP LWSYLRNPYE ENYQLGQGLT LTAGKDDLES
LGSPTFLGRR QQHFDATVKT VMEFDPQSEQ AEAGLTVFHT KDHHYDLVVT LREGKRAAFL
RRRSADMLVE SQPVFLPEEG KLTLTIQASR LAFEFFVQAE GGEPQSLGTA STQLISTECM
ICTFTGCFFG MYCQGEPGAQ ATFEQFSYRP
//