ID W4PE35_9BACE Unreviewed; 787 AA.
AC W4PE35;
DT 19-MAR-2014, integrated into UniProtKB/TrEMBL.
DT 19-MAR-2014, sequence version 1.
DT 24-JAN-2024, entry version 26.
DE RecName: Full=beta-N-acetylhexosaminidase {ECO:0000256|ARBA:ARBA00012663};
DE EC=3.2.1.52 {ECO:0000256|ARBA:ARBA00012663};
GN ORFNames=JCM6294_920 {ECO:0000313|EMBL:GAE18071.1};
OS Bacteroides pyogenes DSM 20611 = JCM 6294.
OC Bacteria; Bacteroidota; Bacteroidia; Bacteroidales; Bacteroidaceae;
OC Bacteroides.
OX NCBI_TaxID=1121100 {ECO:0000313|EMBL:GAE18071.1, ECO:0000313|Proteomes:UP000018842};
RN [1] {ECO:0000313|Proteomes:UP000018842}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=JCM 6294 {ECO:0000313|Proteomes:UP000018842};
RA Sakamoto M., Oshima K., Suda W., Kitamura K., Iida T., Hattori M.,
RA Ohkuma M.;
RT "Draft Genome Sequences of Three Strains of Bacteroides pyogenes Isolated
RT from a Cat and Swine.";
RL Genome Announc.2:e01242-13(2014).
CC -!- SIMILARITY: Belongs to the glycosyl hydrolase 20 family.
CC {ECO:0000256|ARBA:ARBA00006285}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:GAE18071.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; BAIR01000005; GAE18071.1; -; Genomic_DNA.
DR AlphaFoldDB; W4PE35; -.
DR STRING; 1121100.GCA_000428105_00526; -.
DR eggNOG; COG3525; Bacteria.
DR Proteomes; UP000018842; Unassembled WGS sequence.
DR GO; GO:0004563; F:beta-N-acetylhexosaminidase activity; IEA:UniProtKB-EC.
DR GO; GO:0102148; F:N-acetyl-beta-D-galactosaminidase activity; IEA:UniProtKB-EC.
DR GO; GO:0005975; P:carbohydrate metabolic process; IEA:InterPro.
DR CDD; cd06563; GH20_chitobiase-like; 1.
DR Gene3D; 3.30.379.10; Chitobiase/beta-hexosaminidase domain 2-like; 1.
DR Gene3D; 2.60.120.260; Galactose-binding domain-like; 1.
DR Gene3D; 3.20.20.80; Glycosidases; 1.
DR InterPro; IPR025705; Beta_hexosaminidase_sua/sub.
DR InterPro; IPR000421; FA58C.
DR InterPro; IPR026876; Fn3_assoc_repeat.
DR InterPro; IPR008979; Galactose-bd-like_sf.
DR InterPro; IPR015883; Glyco_hydro_20_cat.
DR InterPro; IPR017853; Glycoside_hydrolase_SF.
DR InterPro; IPR029018; Hex-like_dom2.
DR InterPro; IPR015882; HEX_bac_N.
DR PANTHER; PTHR22600; BETA-HEXOSAMINIDASE; 1.
DR PANTHER; PTHR22600:SF21; BETA-HEXOSAMINIDASE A; 1.
DR Pfam; PF00754; F5_F8_type_C; 1.
DR Pfam; PF13287; Fn3_assoc; 1.
DR Pfam; PF00728; Glyco_hydro_20; 1.
DR Pfam; PF02838; Glyco_hydro_20b; 1.
DR PRINTS; PR00738; GLHYDRLASE20.
DR SUPFAM; SSF51445; (Trans)glycosidases; 1.
DR SUPFAM; SSF55545; beta-N-acetylhexosaminidase-like domain; 1.
DR SUPFAM; SSF49785; Galactose-binding domain-like; 1.
DR PROSITE; PS50022; FA58C_3; 1.
DR PROSITE; PS51257; PROKAR_LIPOPROTEIN; 1.
PE 3: Inferred from homology;
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801}.
FT DOMAIN 622..762
FT /note="F5/8 type C"
FT /evidence="ECO:0000259|PROSITE:PS50022"
FT ACT_SITE 346
FT /note="Proton donor"
FT /evidence="ECO:0000256|PIRSR:PIRSR625705-1"
SQ SEQUENCE 787 AA; 88603 MW; 8DDFFF2C3010A733 CRC64;
MKYLYLLTTN FYFMKQLLRL TGCLAIAGLA VSCQSVQKEA DYRIIPLPQE IATAQGNPFV
LKSGVKILYP EGNEKMQRNA EFLAGYLKEA TGKSFSMEAG TEGKHAIVLT LGLDAENPEA
YRLSVAESGI TVAAPTEAGV FYGIQSLRKS LPVAMNAEIS LPAAEINDYP RFSYRGAHFD
VARHFFTLDE VKTYIDMMVL HNMNRLHWHL TEDQGWRLEI KKYPKLTEIG SVRKETMVGK
NFDKFDGKPH GGFYTQEEAK EIVKYAAERY ITVIPEIDLP GHMQAALAAY PELGCTGGPY
EVWTKWGVSD NVLCAGNDRT LEFIDDVLTE VMEIFPSEYI HVGGDECPKT QWEKCPKCQA
RIKALGLKSD KEHTKEERLQ SFVINHAEKF LNEHGRQIIG WDEILEGGLA PNATVMSWRG
EAGGIEAAKQ NHDVIMTPNT YLYFDYYQSK DTKNEPLAIG GYLPVERVYG YEPMPSSLTP
EQQKYIKGVQ ANLWTEYIPT FEQAQYMVLP RWAALAEVQW TMPDKKNYED FLSRLPRLIE
WYDAEEYNYA KHVFDVKAEF TPNTEDGTLD VTLVTVDNAP IHYTLDGSEP TASSPKYEGV
LKLKEDTRLM AVAIRPTGNS RVVSEEFHFN KASMKPIVAN QPVNRQYMFK GAPSLVDGLR
GGDNYNTGRW IAFRNNDMDM TIDLRQPTEI SSVSIAAFAA KGDWVFDARG LAVEVSDDGK
SFTKVASEEY PAMKQSDNDG ICEHKLAFTP VKTRFVRVVA LSEKAIPSWH GGKGKPGFLF
VDEIVID
//