ID R5PFD1_9BACT Unreviewed; 614 AA.
AC R5PFD1;
DT 24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT 24-JUL-2013, sequence version 1.
DT 27-MAR-2024, entry version 31.
DE SubName: Full=Glycoside hydrolase family 43 {ECO:0000313|EMBL:CCZ14803.1};
GN ORFNames=BN679_01671 {ECO:0000313|EMBL:CCZ14803.1};
OS Prevotella sp. CAG:487.
OC Bacteria; Bacteroidota; Bacteroidia; Bacteroidales; Prevotellaceae;
OC Prevotella.
OX NCBI_TaxID=1262928 {ECO:0000313|EMBL:CCZ14803.1, ECO:0000313|Proteomes:UP000018275};
RN [1] {ECO:0000313|EMBL:CCZ14803.1, ECO:0000313|Proteomes:UP000018275}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=MGS:487 {ECO:0000313|Proteomes:UP000018275};
RA Nielsen H.B., Almeida M., Juncker A.S., Rasmussen S., Li J., Sunagawa S.,
RA Plichta D., Gautier L., Le Chatelier E., Peletier E., Bonde I., Nielsen T.,
RA Manichanh C., Arumugam M., Batto J., Santos M.B.Q.D., Blom N., Borruel N.,
RA Burgdorf K.S., Boumezbeur F., Casellas F., Dore J., Guarner F., Hansen T.,
RA Hildebrand F., Kaas R.S., Kennedy S., Kristiansen K., Kultima J.R.,
RA Leonard P., Levenez F., Lund O., Moumen B., Le Paslier D., Pons N.,
RA Pedersen O., Prifti E., Qin J., Raes J., Tap J., Tims S., Ussery D.W.,
RA Yamada T., MetaHit consortium, Renault P., Sicheritz-Ponten T., Bork P.,
RA Wang J., Brunak S., Ehrlich S.D.;
RT "Dependencies among metagenomic species, viruses, plasmids and units of
RT genetic variation.";
RL Submitted (NOV-2012) to the EMBL/GenBank/DDBJ databases.
CC -!- PATHWAY: Glycan metabolism; L-arabinan degradation.
CC {ECO:0000256|ARBA:ARBA00004834}.
CC -!- SIMILARITY: Belongs to the glycosyl hydrolase 43 family.
CC {ECO:0000256|ARBA:ARBA00009865}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:CCZ14803.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CAZM010000234; CCZ14803.1; -; Genomic_DNA.
DR AlphaFoldDB; R5PFD1; -.
DR Proteomes; UP000018275; Unassembled WGS sequence.
DR GO; GO:0004553; F:hydrolase activity, hydrolyzing O-glycosyl compounds; IEA:InterPro.
DR GO; GO:0005975; P:carbohydrate metabolic process; IEA:UniProtKB-KW.
DR CDD; cd18616; GH43_ABN-like; 1.
DR CDD; cd08991; GH43_HoAraf43-like; 1.
DR InterPro; IPR006710; Glyco_hydro_43.
DR InterPro; IPR023296; Glyco_hydro_beta-prop_sf.
DR PANTHER; PTHR43301; ARABINAN ENDO-1,5-ALPHA-L-ARABINOSIDASE; 1.
DR PANTHER; PTHR43301:SF3; ARABINOSIDASE-RELATED; 1.
DR Pfam; PF04616; Glyco_hydro_43; 2.
DR SUPFAM; SSF75005; Arabinanase/levansucrase/invertase; 2.
PE 3: Inferred from homology;
KW Hydrolase {ECO:0000313|EMBL:CCZ14803.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000018275};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..21
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 22..614
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5004400034"
FT ACT_SITE 36
FT /note="Proton acceptor"
FT /evidence="ECO:0000256|PIRSR:PIRSR606710-1"
FT ACT_SITE 197
FT /note="Proton donor"
FT /evidence="ECO:0000256|PIRSR:PIRSR606710-1"
FT SITE 150
FT /note="Important for catalytic activity, responsible for
FT pKa modulation of the active site Glu and correct
FT orientation of both the proton donor and substrate"
FT /evidence="ECO:0000256|PIRSR:PIRSR606710-2"
SQ SEQUENCE 614 AA; 69112 MW; F4BC0EF94C5E622D CRC64;
MRKLRILAAS LLLSASVCTF AGDTYTNPVI NTSLPDPTVI RADDGYFYLY ATEDIRNLPI
YRSRDLTDWQ FVGTAFTDDT RPQWNKKGNM WAPDINKIGD KYVLYYSKSE WGGEWTCGIG
AATADRPEGP FTDHGPLFIS SEIGVRNSID QFYIEDNGHK YLFWGSFHGI YGIELSDDGL
SVKPGAVKKQ VSGTFMEGTY IHKRGKYYYL FGSAGTCCDG ARSTYRVTYG RSENLFGPYV
DKKGQRLLDN HYEVMLHGDD TFVGTGHNAE FVTDDLGQDW ILYHGYKKAE ADDGRVVFLS
RVDWKDGWPE VAGSVPEKEN VKPSFGQIHL ADPTVFCDNG TYYLYGTSPV SDNGFWVYTS
TDLQHWSGPA GVVDGYALRG NTYGTQGFWA PQVFKKDGRY AMAYTANEQI AIAWADSPLG
PFVQDEPAMI PAKTKEIDPF VFHDDDGKTY MYHVRLIGGN RIYVAEMNDD LRSMKEETAR
ECIAVNDKGW ENTAEGKWGV SEGPTVVKLD GTYYMFYSCN DFRSIDYAMG YATAKSPLGP
WKKHKKPIVS RHLTGENGTG HGDLFRDGDG RWMYVLHTHN SNSKVSPRRT AMVELVKKGK
KFEMVPGSLR YVTR
//