ID S2KS45_9FIRM Unreviewed; 1491 AA.
AC S2KS45;
DT 18-SEP-2013, integrated into UniProtKB/TrEMBL.
DT 18-SEP-2013, sequence version 1.
DT 24-JAN-2024, entry version 35.
DE RecName: Full=Beta-galactosidase {ECO:0000256|RuleBase:RU000675};
DE EC=3.2.1.23 {ECO:0000256|RuleBase:RU000675};
GN ORFNames=HMPREF0994_07103 {ECO:0000313|EMBL:EPC05254.1};
OS Lachnospiraceae bacterium 3_1_57FAA_CT1.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Lachnospiraceae.
OX NCBI_TaxID=658086 {ECO:0000313|EMBL:EPC05254.1, ECO:0000313|Proteomes:UP000003336};
RN [1] {ECO:0000313|Proteomes:UP000003336}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=3_1_57FAA_CT1 {ECO:0000313|Proteomes:UP000003336};
RG The Broad Institute Genome Sequencing Platform;
RA Earl A., Ward D., Feldgarden M., Gevers D., Daigneault M., Strauss J.,
RA Allen-Vercoe E., Young S.K., Zeng Q., Gargeya S., Fitzgerald M., Haas B.,
RA Abouelleil A., Alvarado L., Arachchi H.M., Berlin A., Brown A.,
RA Chapman S.B., Chen Z., Dunbar C., Freedman E., Gearin G., Gellesch M.,
RA Goldberg J., Griggs A., Gujja S., Heiman D., Howarth C., Larson L., Lui A.,
RA MacDonald P.J.P., Mehta T., Montmayeur A., Murphy C., Neiman D.,
RA Pearson M., Priest M., Roberts A., Saif S., Shea T., Shenoy N., Sisk P.,
RA Stolte C., Sykes S., Wortman J., Nusbaum C., Birren B.;
RT "The Genome Sequence of Lachnospiraceae bacterium 2_1_58FAA.";
RL Submitted (MAY-2011) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:EPC05254.1, ECO:0000313|Proteomes:UP000003336}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=3_1_57FAA_CT1 {ECO:0000313|EMBL:EPC05254.1,
RC ECO:0000313|Proteomes:UP000003336};
RG The Broad Institute Genomics Platform;
RA Earl A., Ward D., Feldgarden M., Gevers D., Daigneault M., Strauss J.,
RA Allen-Vercoe E., Walker B., Young S., Zeng Q., Gargeya S., Fitzgerald M.,
RA Haas B., Abouelleil A., Allen A.W., Alvarado L., Arachchi H.M.,
RA Berlin A.M., Chapman S.B., Gainer-Dewar J., Goldberg J., Griggs A.,
RA Gujja S., Hansen M., Howarth C., Imamovic A., Ireland A., Larimer J.,
RA McCowan C., Murphy C., Pearson M., Poon T.W., Priest M., Roberts A.,
RA Saif S., Shea T., Sisk P., Sykes S., Wortman J., Nusbaum C., Birren B.;
RT "The Genome Sequence of Lachnospiraceae bacterium 3-1-57FAA CT1.";
RL Submitted (APR-2013) to the EMBL/GenBank/DDBJ databases.
CC -!- CATALYTIC ACTIVITY:
CC Reaction=Hydrolysis of terminal non-reducing beta-D-galactose residues
CC in beta-D-galactosides.; EC=3.2.1.23;
CC Evidence={ECO:0000256|RuleBase:RU000675};
CC -!- SIMILARITY: Belongs to the glycosyl hydrolase 35 family.
CC {ECO:0000256|ARBA:ARBA00009809, ECO:0000256|RuleBase:RU003679}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EPC05254.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ACTP02000004; EPC05254.1; -; Genomic_DNA.
DR STRING; 658086.HMPREF0994_07103; -.
DR eggNOG; COG1874; Bacteria.
DR HOGENOM; CLU_249335_0_0_9; -.
DR Proteomes; UP000003336; Unassembled WGS sequence.
DR GO; GO:0004565; F:beta-galactosidase activity; IEA:UniProtKB-EC.
DR GO; GO:0005975; P:carbohydrate metabolic process; IEA:InterPro.
DR Gene3D; 3.20.20.80; Glycosidases; 2.
DR Gene3D; 2.60.120.1060; NPCBM/NEW2 domain; 1.
DR InterPro; IPR008979; Galactose-bd-like_sf.
DR InterPro; IPR013191; GH98_central.
DR InterPro; IPR031330; Gly_Hdrlase_35_cat.
DR InterPro; IPR013222; Glyco_hyd_98_carb-bd.
DR InterPro; IPR019801; Glyco_hydro_35_CS.
DR InterPro; IPR001944; Glycoside_Hdrlase_35.
DR InterPro; IPR017853; Glycoside_hydrolase_SF.
DR InterPro; IPR038637; NPCBM_sf.
DR PANTHER; PTHR23421:SF165; BETA-GALACTOSIDASE; 1.
DR PANTHER; PTHR23421; BETA-GALACTOSIDASE RELATED; 1.
DR Pfam; PF01301; Glyco_hydro_35; 1.
DR Pfam; PF08306; Glyco_hydro_98M; 1.
DR Pfam; PF08305; NPCBM; 1.
DR PRINTS; PR00742; GLHYDRLASE35.
DR SUPFAM; SSF51445; (Trans)glycosidases; 1.
DR SUPFAM; SSF49785; Galactose-binding domain-like; 1.
DR PROSITE; PS01182; GLYCOSYL_HYDROL_F35; 1.
PE 3: Inferred from homology;
KW Glycosidase {ECO:0000256|ARBA:ARBA00023295, ECO:0000256|RuleBase:RU000675};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801, ECO:0000256|RuleBase:RU000675};
KW Reference proteome {ECO:0000313|Proteomes:UP000003336}.
FT DOMAIN 15..358
FT /note="Glycoside hydrolase 35 catalytic"
FT /evidence="ECO:0000259|Pfam:PF01301"
FT DOMAIN 913..1036
FT /note="Glycosyl hydrolase family 98 central"
FT /evidence="ECO:0000259|Pfam:PF08306"
FT DOMAIN 1331..1470
FT /note="Glycosyl hydrolase family 98 putative carbohydrate-
FT binding module"
FT /evidence="ECO:0000259|Pfam:PF08305"
SQ SEQUENCE 1491 AA; 169995 MW; 211D37F11BD5615D CRC64;
MNEEKSSYSY NNDFLLKNGK PWFPVMGELH YSRYPKKLWP EALAKMKAGG VTVVSSYIFW
IHHEEKEGEY DFTGNRDLRG FLHAVQAAGL PVFLRIGPWC HGEVRNGGLP DWIMEKDYEP
RTNDKGYLEE VKTYFAVLAE QIRGLLWENG GPVIGIQIEN EYGCCGGLDG EEGEAHMRVL
TKLALDAGLK APYMTATGWG GAHTAGLLPV MGGYCDAPWD RSLEKLPPNG NYVFTPERDD
SNIGSDFGKK ETARTADGNY PYLTAELGGG LQPTRHRRPV PSASDTGAMS LVKLGCGANL
LGYYMYHGGT NPKGKYSSLQ ETKAAGSWND LPEYNYDFNA PVGEYGQVRE SFREIKLLAL
FLQDFGEELC AMKPRFPEPL MDNPEDLQTL RTCVREKDGR GYLFVNNHQR LYPMKNHAQV
SLKVKAGEEE LCFPAENVRD GDYFFYPFHM PVGDHAEIVT AMASPLCVLR RRKGNIYVFY
SDTDPQYRIK GDLGDNRIVT LTRREALDAW KTELQGEDYL LFTSGVIREQ EGKVLFSRTF
TEQEETAAFY AFPELPLVPE GFQKVTGEGL TRYEIRMNRE RKAGISWKET GKGEGFREYE
LFLDYPENTG GKECFLEITY DGESACMERG GEVCADHFYT GQPWCVGMGH LGFPVRTKIR
VNALEENAPL YLEQWPDMVQ GKADTLREVK ILEEYEFPLW EKEEESRFLA PFPGQGLTDI
ALHFKWTKEK GSRAVEYILE MADNEEFAGA RRMEAVILED SEVGYYFPRK EELPSHGGRW
FARVREKAGD RWSRTADFCI DMEHGRKPVK RRINAQNPWF TVFDYSEHEP AEVWEMLPED
LKAYTGMGLI ASYKAKADLV IRYMMDEDAK GYLWHLGALG PHETQFGKYC ITSLCEIEYV
MQHSRNLVST GFVEQYLGTK DEGYWRNEYF FRLLALCAKY GIPFIYSDGN RNNLELAAMI
KRPFFMDKMR EYSDYFIFSF KQNHAHGAYS CFGAILGAWM DGACCEIGVQ PENWYWNDAG
FRDRPGECHG YLQGNEQQIT ACMTAEMLLT GLSIGAAHYS CEGESWLIER GTDGRLAWSA
QGTAALSLFR AIVSHKLISD KQEVLKKIHM AVDYEGWSPE ELGDAWTGGI LREVFEPVYH
IRNGFELMPK ESRFFYLPLV TDRRDAFKGM EKLKAGEMDR EEAQIFLSGA YQETGYGNAY
YAVYPELIIV MNSRENEEES QWFCIPGGDD FLMRMQGALS LWQYIVVKKK DRGYLFHVNT
EKGKNLCFRL YFSHRPVWTE TSDQVNILWQ DDFLEIHVQG DGNPVEFGAA DMQEHLPRPE
QAVRNPAPEE TILLSDLPWS VLEAADGCVL QKDACADTAF GRLPLAVDHL RYSRGLSMGN
KTRITWELDG QYRGLSFWYG FDMDAWMPKI LDRETIIWDR ADKSISMRAR LFGDGKELFC
SPELTSTGGV HEGAVDLQGI NRLELVVDGA VVSEDPEARV YLDILNPVLD L
//