ID G7YIR8_CLOSI Unreviewed; 1498 AA.
AC G7YIR8;
DT 25-JAN-2012, integrated into UniProtKB/TrEMBL.
DT 25-JAN-2012, sequence version 1.
DT 27-MAR-2024, entry version 47.
DE RecName: Full=beta-N-acetylhexosaminidase {ECO:0000256|ARBA:ARBA00012663};
DE EC=3.2.1.52 {ECO:0000256|ARBA:ARBA00012663};
DE Flags: Fragment;
GN ORFNames=CLF_108961 {ECO:0000313|EMBL:GAA52851.1};
OS Clonorchis sinensis (Chinese liver fluke).
OC Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Platyhelminthes; Trematoda;
OC Digenea; Opisthorchiida; Opisthorchiata; Opisthorchiidae; Clonorchis.
OX NCBI_TaxID=79923 {ECO:0000313|EMBL:GAA52851.1, ECO:0000313|Proteomes:UP000008909};
RN [1] {ECO:0000313|EMBL:GAA52851.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Henan {ECO:0000313|EMBL:GAA52851.1};
RX PubMed=22023798; DOI=10.1186/gb-2011-12-10-r107;
RA Wang X., Chen W., Huang Y., Sun J., Men J., Liu H., Luo F., Guo L., Lv X.,
RA Deng C., Zhou C., Fan Y., Li X., Huang L., Hu Y., Liang C., Hu X., Xu J.,
RA Yu X.;
RT "The draft genome of the carcinogenic human liver fluke Clonorchis
RT sinensis.";
RL Genome Biol. 12:R107-R107(2011).
RN [2]
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=Henan;
RA Wang X., Huang Y., Chen W., Liu H., Guo L., Chen Y., Luo F., Zhou W.,
RA Sun J., Mao Q., Liang P., Zhou C., Tian Y., Men J., Lv X., Huang L.,
RA Zhou J., Hu Y., Li R., Zhang F., Lei H., Li X., Hu X., Liang C., Xu J.,
RA Wu Z., Yu X.;
RT "The genome and transcriptome sequence of Clonorchis sinensis provide
RT insights into the carcinogenic liver fluke.";
RL Submitted (OCT-2011) to the EMBL/GenBank/DDBJ databases.
CC -!- CATALYTIC ACTIVITY:
CC Reaction=Hydrolysis of terminal non-reducing N-acetyl-D-hexosamine
CC residues in N-acetyl-beta-D-hexosaminides.; EC=3.2.1.52;
CC Evidence={ECO:0000256|ARBA:ARBA00001231};
CC -!- SIMILARITY: Belongs to the glycosyl hydrolase 20 family.
CC {ECO:0000256|ARBA:ARBA00006285}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; DF143364; GAA52851.1; -; Genomic_DNA.
DR Proteomes; UP000008909; Unassembled WGS sequence.
DR GO; GO:0043226; C:organelle; IEA:UniProt.
DR GO; GO:0004563; F:beta-N-acetylhexosaminidase activity; IEA:UniProtKB-EC.
DR GO; GO:0102148; F:N-acetyl-beta-D-galactosaminidase activity; IEA:UniProtKB-EC.
DR GO; GO:0005975; P:carbohydrate metabolic process; IEA:InterPro.
DR CDD; cd04416; NDPk_TX; 2.
DR Gene3D; 3.30.379.10; Chitobiase/beta-hexosaminidase domain 2-like; 1.
DR Gene3D; 3.40.30.10; Glutaredoxin; 1.
DR Gene3D; 3.20.20.80; Glycosidases; 1.
DR Gene3D; 3.30.70.141; Nucleoside diphosphate kinase-like domain; 3.
DR InterPro; IPR025705; Beta_hexosaminidase_sua/sub.
DR InterPro; IPR015883; Glyco_hydro_20_cat.
DR InterPro; IPR017853; Glycoside_hydrolase_SF.
DR InterPro; IPR029018; Hex-like_dom2.
DR InterPro; IPR029019; HEX_eukaryotic_N.
DR InterPro; IPR034907; NDK-like_dom.
DR InterPro; IPR036850; NDK-like_dom_sf.
DR InterPro; IPR036249; Thioredoxin-like_sf.
DR InterPro; IPR013766; Thioredoxin_domain.
DR PANTHER; PTHR46135; NME/NM23 FAMILY MEMBER 8; 1.
DR PANTHER; PTHR46135:SF3; THIOREDOXIN DOMAIN-CONTAINING PROTEIN; 1.
DR Pfam; PF00728; Glyco_hydro_20; 1.
DR Pfam; PF14845; Glycohydro_20b2; 1.
DR Pfam; PF00334; NDK; 3.
DR Pfam; PF00085; Thioredoxin; 1.
DR PRINTS; PR00738; GLHYDRLASE20.
DR SMART; SM00562; NDK; 3.
DR SUPFAM; SSF51445; (Trans)glycosidases; 1.
DR SUPFAM; SSF55545; beta-N-acetylhexosaminidase-like domain; 1.
DR SUPFAM; SSF54919; Nucleoside diphosphate kinase, NDK; 3.
DR SUPFAM; SSF52833; Thioredoxin-like; 1.
PE 3: Inferred from homology;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Glycosidase {ECO:0000256|ARBA:ARBA00023295};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Reference proteome {ECO:0000313|Proteomes:UP000008909}.
FT DOMAIN 211..351
FT /note="Nucleoside diphosphate kinase-like"
FT /evidence="ECO:0000259|SMART:SM00562"
FT DOMAIN 382..523
FT /note="Nucleoside diphosphate kinase-like"
FT /evidence="ECO:0000259|SMART:SM00562"
FT DOMAIN 524..669
FT /note="Nucleoside diphosphate kinase-like"
FT /evidence="ECO:0000259|SMART:SM00562"
FT COILED 165..208
FT /evidence="ECO:0000256|SAM:Coils"
FT ACT_SITE 1397
FT /note="Proton donor"
FT /evidence="ECO:0000256|PIRSR:PIRSR625705-1"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:GAA52851.1"
SQ SEQUENCE 1498 AA; 170224 MW; E1EA6AA2589340F1 CRC64;
PCPMLKLPDS TQRWPALHRN RMTVPCYLIS FSTMNHEPRH ERRRMAKKKV EVALQEEIET
QEEWESALQR EGLISTSYCS QNSLTVIDIY QDWAGPCKAA AGIFRRMKTE LNDDLLNFAI
AKADTIDSLV KYRGKCEPCF LFFGCGKLVA AIRGVNPPEL EKNILEKLKQ EHEVLRGEAE
RVEIRDPVFL AQELAEEEER RRKEEEEEVP QEVTIAVLKP DIVQSGRTDE LIAELEGKGI
SVIRRISYTF TKEEAEEFYV KLKGEPYYKS LVDFMISGPS EILLCAKGAE GVIEDLKGLV
GPAISEASEK EPGLRAKYAS DKIRNAIHAP DSKDEAAREL AFFFPDFEPP IVTVRRPRVQ
VTEDDIGERQ LSALSSGFGE GIQRTVALLR PKAYSMYKDS ILEKIKEAGF VVASQKEVTL
SKEQAEDYYK EHRGETYFGE LTTMMSSGPC LALLLARQDA VDTWRKLLGP KDVAEAKATA
PESLRAQYVS EDKEDMADGK SINLIHGSAS VEEAQHDIER FFPVERTLAA VKPDAYANRD
EIIEMIKSAG FHVAARKDTQ LDEKMAAQLY ENVKDKPFFD DLVRQMTSGR TLFMVLTRED
AIAGWRQLMG PTDPDKAADE VPASSQTTEP SIRAAFGRSI LENAVHGSSS AEQASATIQL
IFGDPSIGMP EKSDKQFLRP SDFYLTTARP FDLVYESNFM VKNNGGVDTY GCELSSQTIS
LFLLSMVEKG GGPAEAVKGV FLRLDLKRAC STWVVAVGRN VLWTCTDMCF EALSSKLAFL
GTVPDLELHD NHLDTDRNSG FTIKLESHRF TSLSERDIDL NFQILSLSTD CISLRAVVEI
PCILSVVTQG FTSSVYDWIL PEFTESFATI DPYTSFGQLY LVLVARVFCY GSMQHRSIGT
VGLTFVGVEI CGSSFRLFSL GVDWRPVEIR VSQFFLDKLL QSPEVDGETC PDSMSSMTER
LNSCVNAENS WGANFEIDFN VESIPKTGTT EKDDQKKNPY IWDRHTITPY DKLVYENSYW
AFTNLPSVRI PVLRAPFPTF GMVMPLPYWW SGTERFYPVD VNGLEFQLVG VDNYILRSAI
SRCKQVITSR SGLSVYPNRW THQEPLLADW TKEYQTNKRT STRVAGTAGA DLDIAERPTF
WWSEKRVQWF YQIFKSPQQQ HQPSTPLHRI RIYVRSSGKD WPSLQMDESY AVLVDGEQIF
LVANETWGAL RGLESLSQLM WRTSDMTQVY INQTYIFDKP RFPHRGLLVD TSRHFISKSI
LLVNLEAMAY NKLNVLHWHI VDDNSFPYQS QTFPSLSQKG AWHKRQVYTQ HDIKEIVEFA
RLRGIRVIPE FDIPGHTRSL AYSKPELLAQ CQGYEDNTVY FGPLNPFINE TYQFIENFLI
EMFNLFPDEY IHLGGDEVQP ACWDADLEMV RTQAKLNLQG ALTLDYFWKR VQNIITELGN
RKPANRRKIV VWQEVAAQVL ELIIDCPAGQ DDSEAVGLKF YRIYDNTFRA FPKFSLTM
//