ID R6JQQ5_9CLOT Unreviewed; 787 AA.
AC R6JQQ5;
DT 24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT 24-JUL-2013, sequence version 1.
DT 27-MAR-2024, entry version 25.
DE SubName: Full=Glycoside hydrolase family 31 {ECO:0000313|EMBL:CDB52437.1};
GN ORFNames=BN539_00642 {ECO:0000313|EMBL:CDB52437.1};
OS Clostridium sp. CAG:217.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Clostridiaceae;
OC Clostridium.
OX NCBI_TaxID=1262779 {ECO:0000313|EMBL:CDB52437.1, ECO:0000313|Proteomes:UP000018364};
RN [1] {ECO:0000313|EMBL:CDB52437.1, ECO:0000313|Proteomes:UP000018364}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=MGS:217 {ECO:0000313|Proteomes:UP000018364};
RA Nielsen H.B., Almeida M., Juncker A.S., Rasmussen S., Li J., Sunagawa S.,
RA Plichta D., Gautier L., Le Chatelier E., Peletier E., Bonde I., Nielsen T.,
RA Manichanh C., Arumugam M., Batto J., Santos M.B.Q.D., Blom N., Borruel N.,
RA Burgdorf K.S., Boumezbeur F., Casellas F., Dore J., Guarner F., Hansen T.,
RA Hildebrand F., Kaas R.S., Kennedy S., Kristiansen K., Kultima J.R.,
RA Leonard P., Levenez F., Lund O., Moumen B., Le Paslier D., Pons N.,
RA Pedersen O., Prifti E., Qin J., Raes J., Tap J., Tims S., Ussery D.W.,
RA Yamada T., MetaHit consortium, Renault P., Sicheritz-Ponten T., Bork P.,
RA Wang J., Brunak S., Ehrlich S.D.;
RT "Dependencies among metagenomic species, viruses, plasmids and units of
RT genetic variation.";
RL Submitted (NOV-2012) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the glycosyl hydrolase 31 family.
CC {ECO:0000256|ARBA:ARBA00007806, ECO:0000256|RuleBase:RU361185}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:CDB52437.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CBDU010000148; CDB52437.1; -; Genomic_DNA.
DR AlphaFoldDB; R6JQQ5; -.
DR STRING; 1262779.BN539_00642; -.
DR Proteomes; UP000018364; Unassembled WGS sequence.
DR GO; GO:0004553; F:hydrolase activity, hydrolyzing O-glycosyl compounds; IEA:InterPro.
DR GO; GO:0005975; P:carbohydrate metabolic process; IEA:InterPro.
DR CDD; cd06595; GH31_u1; 1.
DR Gene3D; 3.20.20.80; Glycosidases; 1.
DR Gene3D; 2.60.40.1180; Golgi alpha-mannosidase II; 2.
DR InterPro; IPR033403; DUF5110.
DR InterPro; IPR048395; Glyco_hydro_31_C.
DR InterPro; IPR000322; Glyco_hydro_31_TIM.
DR InterPro; IPR013780; Glyco_hydro_b.
DR InterPro; IPR017853; Glycoside_hydrolase_SF.
DR PANTHER; PTHR22762; ALPHA-GLUCOSIDASE; 1.
DR PANTHER; PTHR22762:SF89; ALPHA-XYLOSIDASE; 1.
DR Pfam; PF17137; DUF5110; 1.
DR Pfam; PF01055; Glyco_hydro_31_2nd; 1.
DR Pfam; PF21365; Glyco_hydro_31_3rd; 1.
DR SUPFAM; SSF51445; (Trans)glycosidases; 1.
DR SUPFAM; SSF51011; Glycosyl hydrolase domain; 1.
PE 3: Inferred from homology;
KW Glycosidase {ECO:0000256|RuleBase:RU361185};
KW Hydrolase {ECO:0000256|RuleBase:RU361185, ECO:0000313|EMBL:CDB52437.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000018364}.
FT DOMAIN 186..495
FT /note="Glycoside hydrolase family 31 TIM barrel"
FT /evidence="ECO:0000259|Pfam:PF01055"
FT DOMAIN 504..596
FT /note="Glycosyl hydrolase family 31 C-terminal"
FT /evidence="ECO:0000259|Pfam:PF21365"
FT DOMAIN 612..677
FT /note="DUF5110"
FT /evidence="ECO:0000259|Pfam:PF17137"
SQ SEQUENCE 787 AA; 88575 MW; A6AB8E88B0AE5CCD CRC64;
MLKEALRART AGTAKEKQIF VFGHCRLTVL TPRLIRVEHC PDGAFEDRAS TAVWFRAFDV
PPCKATLHRG TITLTTPEIT LTVYARTGTP QTVTFSNGKT ADCLNFGNLH GTCRTLDATF
GPVPLQDGLV SRDGVSLLDD SRSFLLDADG RFCPRKRKGK DVYVFAYGHD YRAAIQDFYK
ISGSVPLIPR FALGVWWSRY HAYTQEEYLG LMKRFAAENV PLTVATVDMD WHWTDPNKRF
GTHYKGKDGW TGYSWNTELF PDYKAFLQDL HQMGLHTTVN LHPADGVRAF EDPYPAMAKA
MGLDPKAKQD IPFRCGSDAF WNAYFDVLHK PYEKEGVDFW WLDWQQGKKS DVPGLDPLVA
LNHYHYLDNA ENGRLPLILS RYSGPGAHRY PLGFSGDTAQ SWRVLHFQPY FTATAANVGY
TWWSHDIGGH YLGERNEELY LRWLQLGVFS PILRLHSTSN DLMGKEPWRY RPDVCAAAKD
WLRLRHRLIP YLYTMDARTH REGLALCEPL YYAYPGAEEA YDKAYGNGYL FGSQLLVYPI
TSPQKKQLGM GAVDAWIPPG RWTDLFTGAV YTGPLCLTLH RELTEMPVLA KAGAILPLSD
DPGNACGNPA ALTLWLYAGD GDFTLYEDNG QTDFDTHKAE TQITQQLQGS TLTVTVAPTA
GDCTVLPKKR QLTLVFKDLE PSVLTCAEAE VTQNAAGEAT VILTDYDPTV GTQLHLQGAT
YRKAVPVNAR VLNIFCRWQG SNAHKTECYR LFKIAKTKEE LRRALKCTRL PGTVRRAVEE
TLLQTDA
//