ID R7CUH3_9BACE Unreviewed; 901 AA.
AC R7CUH3;
DT 24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT 24-JUL-2013, sequence version 1.
DT 27-MAR-2024, entry version 34.
DE SubName: Full=Beta-galactosidase {ECO:0000313|EMBL:CDD81979.1};
GN ORFNames=BN666_01407 {ECO:0000313|EMBL:CDD81979.1};
OS Bacteroides sp. CAG:462.
OC Bacteria; Bacteroidota; Bacteroidia; Bacteroidales; Bacteroidaceae;
OC Bacteroides.
OX NCBI_TaxID=1262740 {ECO:0000313|EMBL:CDD81979.1, ECO:0000313|Proteomes:UP000018063};
RN [1] {ECO:0000313|EMBL:CDD81979.1, ECO:0000313|Proteomes:UP000018063}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=MGS:462 {ECO:0000313|Proteomes:UP000018063};
RA Nielsen H.B., Almeida M., Juncker A.S., Rasmussen S., Li J., Sunagawa S.,
RA Plichta D., Gautier L., Le Chatelier E., Peletier E., Bonde I., Nielsen T.,
RA Manichanh C., Arumugam M., Batto J., Santos M.B.Q.D., Blom N., Borruel N.,
RA Burgdorf K.S., Boumezbeur F., Casellas F., Dore J., Guarner F., Hansen T.,
RA Hildebrand F., Kaas R.S., Kennedy S., Kristiansen K., Kultima J.R.,
RA Leonard P., Levenez F., Lund O., Moumen B., Le Paslier D., Pons N.,
RA Pedersen O., Prifti E., Qin J., Raes J., Tap J., Tims S., Ussery D.W.,
RA Yamada T., MetaHit consortium, Renault P., Sicheritz-Ponten T., Bork P.,
RA Wang J., Brunak S., Ehrlich S.D.;
RT "Dependencies among metagenomic species, viruses, plasmids and units of
RT genetic variation.";
RL Submitted (NOV-2012) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the glycosyl hydrolase 2 family.
CC {ECO:0000256|ARBA:ARBA00007401}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:CDD81979.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CBHU010000043; CDD81979.1; -; Genomic_DNA.
DR AlphaFoldDB; R7CUH3; -.
DR STRING; 1262740.BN666_01407; -.
DR Proteomes; UP000018063; Unassembled WGS sequence.
DR GO; GO:0004553; F:hydrolase activity, hydrolyzing O-glycosyl compounds; IEA:InterPro.
DR GO; GO:0005975; P:carbohydrate metabolic process; IEA:InterPro.
DR Gene3D; 2.60.120.260; Galactose-binding domain-like; 1.
DR Gene3D; 3.20.20.80; Glycosidases; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 2.
DR InterPro; IPR036156; Beta-gal/glucu_dom_sf.
DR InterPro; IPR008979; Galactose-bd-like_sf.
DR InterPro; IPR040605; Glyco_hydro2_dom5.
DR InterPro; IPR006103; Glyco_hydro_2_cat.
DR InterPro; IPR006102; Glyco_hydro_2_Ig-like.
DR InterPro; IPR006104; Glyco_hydro_2_N.
DR InterPro; IPR017853; Glycoside_hydrolase_SF.
DR InterPro; IPR013783; Ig-like_fold.
DR PANTHER; PTHR42732; BETA-GALACTOSIDASE; 1.
DR PANTHER; PTHR42732:SF1; BETA-MANNOSIDASE; 1.
DR Pfam; PF18565; Glyco_hydro2_C5; 1.
DR Pfam; PF00703; Glyco_hydro_2; 1.
DR Pfam; PF02836; Glyco_hydro_2_C; 1.
DR Pfam; PF02837; Glyco_hydro_2_N; 1.
DR SUPFAM; SSF51445; (Trans)glycosidases; 1.
DR SUPFAM; SSF49303; beta-Galactosidase/glucuronidase domain; 1.
DR SUPFAM; SSF49785; Galactose-binding domain-like; 1.
PE 3: Inferred from homology;
KW Glycosidase {ECO:0000256|ARBA:ARBA00023295};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Reference proteome {ECO:0000313|Proteomes:UP000018063}.
FT DOMAIN 110..224
FT /note="Glycosyl hydrolases family 2 sugar binding"
FT /evidence="ECO:0000259|Pfam:PF02837"
FT DOMAIN 252..349
FT /note="Glycoside hydrolase family 2 immunoglobulin-like
FT beta-sandwich"
FT /evidence="ECO:0000259|Pfam:PF00703"
FT DOMAIN 357..525
FT /note="Glycoside hydrolase family 2 catalytic"
FT /evidence="ECO:0000259|Pfam:PF02836"
FT DOMAIN 744..828
FT /note="Glycoside hydrolase family 2"
FT /evidence="ECO:0000259|Pfam:PF18565"
FT REGION 862..901
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 876..895
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 901 AA; 101335 MW; 92F7902922EBA336 CRC64;
MTQLPQAPVA IGRLRADKLG AAEIKQNIMK NIALTLILLL LSLPGLAQRV SSPQEISVAG
FFPLEGSGRQ VYNFNPGWRF YRGDVKGAEA ADFNDRDWEV VSTPHSVQLM PAEASGCRNY
QGIAWYRKHF TVPASLKGMD VTIHFEAIMG KQKFYVNGQL VKEHLGGYLP VNVSLTEAGV
QAGDSCVIAV MADNSNDKTY PPGKTQYTLD FAYHGGIYRD VWMIGKAPVA ITDAIEAGKV
AGGGVFVHFD NLSEESAEVY VNTHVKNSAA ASRRVTVETT LTDADGRLVR RVSQKLTLAA
GKDETAMQKL VVKHPRLWSP DSPYLYRVES RIKDGSRTID GGMTRIGIRW FEFNGEKGFF
LNGKPYGQLV GANRHQDFAY VGNALPNSQQ WRDARLLRDA GCTIIRVAHY PQDPSFMDAC
DELGLFVIVA TPGWQYWNKN PEFAARVHEN TRNIIRRDRN HPSVLMWEPI LNETSYPLDF
ALQALQITKD EYPYPGRPVA AADVHSAGVK DNYDVVYGWP GDDEKANAPK QCIFTREWGE
NVDDWYAHNN NNRASRSWGE RPQKVQALSL AQTYDGLFRT TGKFIGGAQW HPFDHQRGYH
PDPYWGGIFD CFRQPKYAYY MFRSQSPADL KHPTAESGPM VYIMHEMGPF SDPDVVVFSN
CDSVRLSMYD GTKEWVLPVK HEKGHLPNAP VVFKNVWDFW EARSHSYTER NWQMVNMYAE
GIIDGRVVCS TKKMPSRRST KLRLRLAEKQ HDLIADGSDF VVVIAEVTDD LGHVRRLAKD
NIVFRVEGEG EIVGDASINA NPRAVEWGSA PLLVRSTRKA GKIKVYAHVQ FEGTQAPTPA
ELEFESVPAR MPFCYSEEEA AAKVQASQPT TGTRKQQFTE EEKRKMLEEV DRQQSEFGIQ
K
//