ID A0A174FPA4_9BACE Unreviewed; 735 AA.
AC A0A174FPA4;
DT 07-SEP-2016, integrated into UniProtKB/TrEMBL.
DT 07-SEP-2016, sequence version 1.
DT 24-JAN-2024, entry version 32.
DE SubName: Full=Alpha-N-acetylglucosaminidase {ECO:0000313|EMBL:CUO52092.1, ECO:0000313|EMBL:KAA5503819.1};
DE EC=3.2.1.50 {ECO:0000313|EMBL:CUO52092.1};
GN ORFNames=ERS852494_00087 {ECO:0000313|EMBL:CUO52092.1}, F2Y31_00775
GN {ECO:0000313|EMBL:KAA5503819.1};
OS Bacteroides caccae.
OC Bacteria; Bacteroidota; Bacteroidia; Bacteroidales; Bacteroidaceae;
OC Bacteroides.
OX NCBI_TaxID=47678 {ECO:0000313|EMBL:CUO52092.1, ECO:0000313|Proteomes:UP000095657};
RN [1] {ECO:0000313|EMBL:CUO52092.1, ECO:0000313|Proteomes:UP000095657}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=2789STDY5834880 {ECO:0000313|EMBL:CUO52092.1,
RC ECO:0000313|Proteomes:UP000095657};
RG Pathogen Informatics;
RL Submitted (SEP-2015) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:KAA5503819.1, ECO:0000313|Proteomes:UP000368418}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=BIOML-A19 {ECO:0000313|EMBL:KAA5503819.1,
RC ECO:0000313|Proteomes:UP000368418};
RX PubMed=31477907; DOI=10.1038/s41591-019-0559-3;
RA Poyet M., Groussin M., Gibbons S.M., Avila-Pacheco J., Jiang X.,
RA Kearney S.M., Perrotta A.R., Berdy B., Zhao S., Lieberman T.D.,
RA Swanson P.K., Smith M., Roesemann S., Alexander J.E., Rich S.A., Livny J.,
RA Vlamakis H., Clish C., Bullock K., Deik A., Scott J., Pierce K.A.,
RA Xavier R.J., Alm E.J.;
RT "A library of human gut bacterial isolates paired with longitudinal
RT multiomics data enables mechanistic microbiome research.";
RL Nat. Med. 25:1442-1452(2019).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CZAI01000001; CUO52092.1; -; Genomic_DNA.
DR EMBL; VVYD01000001; KAA5503819.1; -; Genomic_DNA.
DR STRING; 47678.ERS852494_00087; -.
DR Proteomes; UP000095657; Unassembled WGS sequence.
DR Proteomes; UP000368418; Unassembled WGS sequence.
DR GO; GO:0004561; F:alpha-N-acetylglucosaminidase activity; IEA:UniProtKB-EC.
DR GO; GO:0005975; P:carbohydrate metabolic process; IEA:UniProt.
DR Gene3D; 3.30.379.10; Chitobiase/beta-hexosaminidase domain 2-like; 1.
DR Gene3D; 3.20.20.80; Glycosidases; 1.
DR Gene3D; 1.20.120.670; N-acetyl-b-d-glucoasminidase; 1.
DR InterPro; IPR029018; Hex-like_dom2.
DR InterPro; IPR007781; NAGLU.
DR InterPro; IPR024732; NAGLU_C.
DR InterPro; IPR024240; NAGLU_N.
DR InterPro; IPR024733; NAGLU_tim-barrel.
DR PANTHER; PTHR12872; ALPHA-N-ACETYLGLUCOSAMINIDASE; 1.
DR PANTHER; PTHR12872:SF1; ALPHA-N-ACETYLGLUCOSAMINIDASE; 1.
DR Pfam; PF05089; NAGLU; 1.
DR Pfam; PF12972; NAGLU_C; 1.
DR Pfam; PF12971; NAGLU_N; 1.
PE 4: Predicted;
KW Glycosidase {ECO:0000313|EMBL:CUO52092.1};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801, ECO:0000313|EMBL:CUO52092.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000095657};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..22
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 23..735
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5039781481"
FT DOMAIN 29..107
FT /note="Alpha-N-acetylglucosaminidase N-terminal"
FT /evidence="ECO:0000259|Pfam:PF12971"
FT DOMAIN 121..454
FT /note="Alpha-N-acetylglucosaminidase tim-barrel"
FT /evidence="ECO:0000259|Pfam:PF05089"
FT DOMAIN 464..727
FT /note="Alpha-N-acetylglucosaminidase C-terminal"
FT /evidence="ECO:0000259|Pfam:PF12972"
SQ SEQUENCE 735 AA; 85521 MW; E08DEBD3A1C8DE1A CRC64;
MKYMKKLSLF ILACICTLGS YADDVAVFNA LVNRLLPDYS SQITGKKLPE GKNDYFQLSS
VGDKIEIAGN NANSMAVGLN YYLKYYCNTN VSWFVDDHLD MPATLPAVDK KVTVDARCKD
RFFLNYCTFG YTMPWWQWED WEHFIDWMAL NGINLPLAIT GQESIWLKVW TKLGLSESEV
RNYFTGPAHL PWHRMLNIDY WQGNLPMSWL DGQEELQKKI VARERELNMK PVLPAFAGHV
PQELKRIYPD AKITKLGAWA GYSDQYACSF LDPMDPLFTK IQKMFLEEQN SIYGTDHIYG
IDLFNELEAP SYEPSYLRRV SRQVYQSLEK ADKKAVWLQM TWLFWNEKKD WTNERIKAYI
TAFPSKKSLL LDYYCERHEV WQQTDKYFGV PYIWCYLGNF GGNTVLVGNL YDINKRLENT
FANGGKNFEG LGSTLEGFDC NPFVYNYVFE KAWDMDIHKD VPRWTEELAV RRIGKDNAKG
KAAWKLLIDS IYADPSRPGQ CAMLSIRPTF GKFKTYYANP RFRYSNKTLL TILGLMLEAD
GKGASYSFDV VNVTRQLLGN YFWAVFKDYE KAYQKGDFKT MKAKEQLMLG ILSDMDRVLS
TQSAFLMGKW ISDARKLGAN EKEKLYFEQN ARNLLTTWGE KASLLNDYAS RSWSGLISTF
YAERWKMFFA AVDRAVLAGQ NFDDAKYEDY KKDVTEYEER WWKDCIGTFP EQPVGNSVII
SKELYDKYKP LIEEM
//