ID A0A3B1IWS4_ASTMX Unreviewed; 751 AA.
AC A0A3B1IWS4;
DT 05-DEC-2018, integrated into UniProtKB/TrEMBL.
DT 05-DEC-2018, sequence version 1.
DT 27-MAR-2024, entry version 23.
DE SubName: Full=N-acetyl-alpha-glucosaminidase {ECO:0000313|Ensembl:ENSAMXP00000034532.1};
GN Name=NAGLU {ECO:0000313|Ensembl:ENSAMXP00000034532.1};
OS Astyanax mexicanus (Blind cave fish) (Astyanax fasciatus mexicanus).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; Characiformes;
OC Characoidei; Characidae; Astyanax.
OX NCBI_TaxID=7994 {ECO:0000313|Ensembl:ENSAMXP00000034532.1, ECO:0000313|Proteomes:UP000018467};
RN [1] {ECO:0000313|Proteomes:UP000018467}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=female {ECO:0000313|Proteomes:UP000018467};
RA Jeffery W., Warren W., Wilson R.K.;
RL Submitted (MAR-2013) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Proteomes:UP000018467}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=female {ECO:0000313|Proteomes:UP000018467};
RX PubMed=25329095; DOI=10.1038/ncomms6307;
RA McGaugh S.E., Gross J.B., Aken B., Blin M., Borowsky R., Chalopin D.,
RA Hinaux H., Jeffery W.R., Keene A., Ma L., Minx P., Murphy D., O'Quin K.E.,
RA Retaux S., Rohner N., Searle S.M., Stahl B.A., Tabin C., Volff J.N.,
RA Yoshizawa M., Warren W.C.;
RT "The cavefish genome reveals candidate genes for eye loss.";
RL Nat. Commun. 5:5307-5307(2014).
RN [3] {ECO:0000313|Ensembl:ENSAMXP00000034532.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_007236414.1; XM_007236352.2.
DR AlphaFoldDB; A0A3B1IWS4; -.
DR STRING; 7994.ENSAMXP00000034532; -.
DR Ensembl; ENSAMXT00000030037.1; ENSAMXP00000034532.1; ENSAMXG00000043894.1.
DR GeneID; 103022100; -.
DR KEGG; amex:103022100; -.
DR CTD; 4669; -.
DR GeneTree; ENSGT00390000005900; -.
DR InParanoid; A0A3B1IWS4; -.
DR OrthoDB; 1112009at2759; -.
DR Proteomes; UP000018467; Unassembled WGS sequence.
DR Bgee; ENSAMXG00000043894; Expressed in intestine and 9 other cell types or tissues.
DR GO; GO:0016787; F:hydrolase activity; IEA:UniProtKB-KW.
DR Gene3D; 3.30.379.10; Chitobiase/beta-hexosaminidase domain 2-like; 1.
DR Gene3D; 3.20.20.80; Glycosidases; 1.
DR Gene3D; 1.20.120.670; N-acetyl-b-d-glucoasminidase; 1.
DR InterPro; IPR029018; Hex-like_dom2.
DR InterPro; IPR007781; NAGLU.
DR InterPro; IPR024732; NAGLU_C.
DR InterPro; IPR024240; NAGLU_N.
DR InterPro; IPR024733; NAGLU_tim-barrel.
DR PANTHER; PTHR12872; ALPHA-N-ACETYLGLUCOSAMINIDASE; 1.
DR PANTHER; PTHR12872:SF1; ALPHA-N-ACETYLGLUCOSAMINIDASE; 1.
DR Pfam; PF05089; NAGLU; 1.
DR Pfam; PF12972; NAGLU_C; 1.
DR Pfam; PF12971; NAGLU_N; 1.
PE 4: Predicted;
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Reference proteome {ECO:0000313|Proteomes:UP000018467};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..25
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 26..751
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5017457841"
FT DOMAIN 42..124
FT /note="Alpha-N-acetylglucosaminidase N-terminal"
FT /evidence="ECO:0000259|Pfam:PF12971"
FT DOMAIN 139..474
FT /note="Alpha-N-acetylglucosaminidase tim-barrel"
FT /evidence="ECO:0000259|Pfam:PF05089"
FT DOMAIN 482..739
FT /note="Alpha-N-acetylglucosaminidase C-terminal"
FT /evidence="ECO:0000259|Pfam:PF12972"
SQ SEQUENCE 751 AA; 85800 MW; C2D483E3F519725C CRC64;
MTAARVSALL PLLVVLAVVA ASADGALENI RATADDETQS RAVAELLRRL LGDRAREFVV
SVNRSLSAGG LDVCELRSTR NNRVVAVGSS GVAAATGIYN YLKYFCNCHV SWSGDQLDLP
RPLPPLTGVL RIQSPHRFRY YQNVCTSSYS MVWWDWPRWQ REIDWMALNG INLPLAFLGQ
EALWQEVYMS LGLNQTELNQ FFTGPAFLAW NRMGNLFAWG GPLPQSWHLK QLSLQLKILD
RMRAFGMIPV LPAFAGIVPH GITRLFPQAN VTKLAPWSRF NCSYSCAYVL DPRDPLFRRI
GSMFLTLVVR LFGTDHVYNT DTFNEQTPAS SDPTYLSSIS HAIFTTMTSV DPQAVWLMQA
WLFINEPDFW KPPQVKALLH GVPLGRMIVL DLFADSVPAY SFTQSFYGQP FIWCMLHNFG
GNRNLFGTVE SVNLGPFEAL RFPNSTLVGL GMAPEGIEQN PVVYELMSEM AWRKEPVNLV
KWASLYASRR YGNMNESLTA AWRLLFRSVY NCTIPGYKNH NRDPLVRRPS LKLKTDVWYD
PADLYEAWKL LFEAAPSLVS VETFRYDLVD VTRQALQLLA FEFYKEIRDS FQAQKLPELL
VAGGVLVYDL IPELDRLLSS DQHFLLGVWL EQARSFGLDE QEAQLYDINA RNQITLWGPD
GEILDYANKD WAGLMEDYYL QRWGLFVNTL VECLDRGRPF KQDTFNQAVF QVENGFVYNQ
RKYPSKPRGD TYEIARLIFL KYYPHALKRL K
//