ID R5NKS1_9BACT Unreviewed; 525 AA.
AC R5NKS1;
DT 24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT 24-JUL-2013, sequence version 1.
DT 27-MAR-2024, entry version 32.
DE SubName: Full=Glycosyl hydrolase family 43 {ECO:0000313|EMBL:CCZ02684.1};
GN ORFNames=BN471_01317 {ECO:0000313|EMBL:CCZ02684.1};
OS Paraprevotella clara CAG:116.
OC Bacteria; Bacteroidota; Bacteroidia; Bacteroidales; Prevotellaceae;
OC Paraprevotella.
OX NCBI_TaxID=1263095 {ECO:0000313|EMBL:CCZ02684.1, ECO:0000313|Proteomes:UP000017958};
RN [1] {ECO:0000313|EMBL:CCZ02684.1, ECO:0000313|Proteomes:UP000017958}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=MGS:116 {ECO:0000313|Proteomes:UP000017958};
RA Nielsen H.B., Almeida M., Juncker A.S., Rasmussen S., Li J., Sunagawa S.,
RA Plichta D., Gautier L., Le Chatelier E., Peletier E., Bonde I., Nielsen T.,
RA Manichanh C., Arumugam M., Batto J., Santos M.B.Q.D., Blom N., Borruel N.,
RA Burgdorf K.S., Boumezbeur F., Casellas F., Dore J., Guarner F., Hansen T.,
RA Hildebrand F., Kaas R.S., Kennedy S., Kristiansen K., Kultima J.R.,
RA Leonard P., Levenez F., Lund O., Moumen B., Le Paslier D., Pons N.,
RA Pedersen O., Prifti E., Qin J., Raes J., Tap J., Tims S., Ussery D.W.,
RA Yamada T., MetaHit consortium, Renault P., Sicheritz-Ponten T., Bork P.,
RA Wang J., Brunak S., Ehrlich S.D.;
RT "Dependencies among metagenomic species, viruses, plasmids and units of
RT genetic variation.";
RL Submitted (NOV-2012) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the glycosyl hydrolase 43 family.
CC {ECO:0000256|ARBA:ARBA00009865, ECO:0000256|RuleBase:RU361187}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:CCZ02684.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CAZH010000124; CCZ02684.1; -; Genomic_DNA.
DR RefSeq; WP_008621762.1; NZ_HF993690.1.
DR AlphaFoldDB; R5NKS1; -.
DR GeneID; 78583760; -.
DR Proteomes; UP000017958; Unassembled WGS sequence.
DR GO; GO:0004553; F:hydrolase activity, hydrolyzing O-glycosyl compounds; IEA:InterPro.
DR GO; GO:0005975; P:carbohydrate metabolic process; IEA:UniProtKB-KW.
DR CDD; cd09001; GH43_FsAxh1-like; 1.
DR Gene3D; 2.60.120.200; -; 1.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR041542; GH43_C2.
DR InterPro; IPR006710; Glyco_hydro_43.
DR InterPro; IPR023296; Glyco_hydro_beta-prop_sf.
DR PANTHER; PTHR42812; BETA-XYLOSIDASE; 1.
DR PANTHER; PTHR42812:SF13; HYDROLASE, PUTATIVE (AFU_ORTHOLOGUE AFUA_2G00930)-RELATED; 1.
DR Pfam; PF17851; GH43_C2; 1.
DR Pfam; PF04616; Glyco_hydro_43; 1.
DR SUPFAM; SSF75005; Arabinanase/levansucrase/invertase; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE 3: Inferred from homology;
KW Glycosidase {ECO:0000256|ARBA:ARBA00023295, ECO:0000256|RuleBase:RU361187};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801, ECO:0000256|RuleBase:RU361187};
KW Reference proteome {ECO:0000313|Proteomes:UP000017958}.
FT DOMAIN 340..525
FT /note="Beta-xylosidase C-terminal Concanavalin A-like"
FT /evidence="ECO:0000259|Pfam:PF17851"
FT SITE 162
FT /note="Important for catalytic activity, responsible for
FT pKa modulation of the active site Glu and correct
FT orientation of both the proton donor and substrate"
FT /evidence="ECO:0000256|PIRSR:PIRSR606710-2"
SQ SEQUENCE 525 AA; 60131 MW; 790DC4813455FDA9 CRC64;
MRYLWSIVFS LCLCTGLKGQ QQREWGKWES WGDQGDGTYM NPVIPSDYSD IDCIRVGDDY
YAISSTFQFS PGMTILHSKD LVNWEICGNA VEDLTQIGEE LDWTRMNRYN RGIWAGTLRY
HDGRFYLFFG TPDEGYFMTS AVRPEGPWEP LTSLLSEPGW DDCTAIWDDK GKAYFAGTHF
ADGYKTYLFK MSEDGKSIDR KSAVLINEGS GREASKLIKV NGWYYLVFSE HKPGVGRYVL
AKRSKKVTGP YKEERQLALP SVEAMEPNQG GIVQGRDNNW YFLTHHGTGD WSGRIVSLLP
VTWIDDWPIL GEVLDSNIGT MKWTGAMPFQ ADEKLDIQRS DDFDESRLPP QWQWNHQPRK
GFFSLTERPG WLRLKAYRPL EPNQLLKAGN TLTQRTFRKQ DNVVVVKMDI SGMENGQKAG
LCHFSSPHSA IGVTKEGGIC YLEFRENGKI TKGMQVPSKH IWFSSQWGLD GKSRYAYSLD
GDNFLPFGTP YQMAWGNYKG DRIGMYCFND NSESGFVDVD YFHYR
//