ID A0A369HFM2_9HYPO Unreviewed; 785 AA.
AC A0A369HFM2;
DT 07-NOV-2018, integrated into UniProtKB/TrEMBL.
DT 07-NOV-2018, sequence version 1.
DT 27-MAR-2024, entry version 22.
DE RecName: Full=Alpha-N-acetylglucosaminidase {ECO:0008006|Google:ProtNLM};
DE Flags: Fragment;
GN ORFNames=CP533_5286 {ECO:0000313|EMBL:RDA94032.1};
OS Ophiocordyceps camponoti-saundersi (nom. inval.).
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Sordariomycetes;
OC Hypocreomycetidae; Hypocreales; Ophiocordycipitaceae; Ophiocordyceps.
OX NCBI_TaxID=2039874 {ECO:0000313|EMBL:RDA94032.1, ECO:0000313|Proteomes:UP000253071};
RN [1] {ECO:0000313|EMBL:RDA94032.1, ECO:0000313|Proteomes:UP000253071}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=BCC 79314 {ECO:0000313|EMBL:RDA94032.1,
RC ECO:0000313|Proteomes:UP000253071};
RA Kobmoo N., Wichadakul D., Arnamnart N., Rodriguez De La Vega R.C.,
RA Luangsa-Ard J.-J., Giraud T.;
RT "A genome scan of diversifying selection in the zombie-ant fungus
RT Ophiocordyceps unilateralis complex supports a role of enterotoxins in
RT coevolution and host-specificity.";
RL Submitted (OCT-2017) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:RDA94032.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; PDHQ01000018; RDA94032.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A369HFM2; -.
DR STRING; 2039874.A0A369HFM2; -.
DR Proteomes; UP000253071; Unassembled WGS sequence.
DR GO; GO:0016787; F:hydrolase activity; IEA:UniProtKB-KW.
DR Gene3D; 3.30.379.10; Chitobiase/beta-hexosaminidase domain 2-like; 1.
DR Gene3D; 3.20.20.80; Glycosidases; 1.
DR Gene3D; 1.20.120.670; N-acetyl-b-d-glucoasminidase; 1.
DR InterPro; IPR017853; Glycoside_hydrolase_SF.
DR InterPro; IPR029018; Hex-like_dom2.
DR InterPro; IPR007781; NAGLU.
DR InterPro; IPR024732; NAGLU_C.
DR InterPro; IPR024240; NAGLU_N.
DR InterPro; IPR024733; NAGLU_tim-barrel.
DR PANTHER; PTHR12872; ALPHA-N-ACETYLGLUCOSAMINIDASE; 1.
DR PANTHER; PTHR12872:SF1; ALPHA-N-ACETYLGLUCOSAMINIDASE; 1.
DR Pfam; PF05089; NAGLU; 1.
DR Pfam; PF12972; NAGLU_C; 1.
DR Pfam; PF12971; NAGLU_N; 1.
DR SUPFAM; SSF51445; (Trans)glycosidases; 1.
PE 4: Predicted;
KW Glycosidase {ECO:0000256|ARBA:ARBA00023295};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Reference proteome {ECO:0000313|Proteomes:UP000253071};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..17
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 18..785
FT /note="Alpha-N-acetylglucosaminidase"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5016836007"
FT DOMAIN 31..123
FT /note="Alpha-N-acetylglucosaminidase N-terminal"
FT /evidence="ECO:0000259|Pfam:PF12971"
FT DOMAIN 138..485
FT /note="Alpha-N-acetylglucosaminidase tim-barrel"
FT /evidence="ECO:0000259|Pfam:PF05089"
FT DOMAIN 495..775
FT /note="Alpha-N-acetylglucosaminidase C-terminal"
FT /evidence="ECO:0000259|Pfam:PF12972"
FT REGION 631..654
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 640..654
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 785
FT /evidence="ECO:0000313|EMBL:RDA94032.1"
SQ SEQUENCE 785 AA; 87278 MW; A54F4DC6166EC986 CRC64;
MIINFLFFLL LFVTASATTA AAAAAATATE GIAALADRIL GPGRGLSFDF ALSTEPELWS
RWHQAGNDSY VVAVVEDGRI RIEGTTLSAL ARGLRHYATD TLHLDLYWFI GPGRQHLPPG
PLPPPKQTLR GSSLVPWRYN LNTVTFSYSF VWYDWDEWEK LLDWSAWRGI NVQLAWVGYE
MIYLQSFRAM GLSDDEIIPF FTGPAFQSWN RLGNVRSSWG GLGDLPLGWI QGQDTLQRRI
VRRMVELGIT PVLPAFPGFV PDALLRVRPN STNSVTRVSW TDVPEAVGGI LFLSPLDPVY
AELQQQFVRF QMEAYGNVSN IYALDQFNEL KPPVSGHTEE TLRAISRATD AALSAANPAA
VWLMQGWLFF NDQSFWSPSR VDAYLGGVAD DKALVVLDLF AESEPQWNRT RSFAGRPWVW
CMLHTFGSNM NLYGRASAVV SGLAVAQSSS SSSSSATMLL GLGLSPEGYG ANEVIYDLVL
DQAWSATSTS QLPDWFSSWA ELRYAGVVVV PRPLIRAWQL LGAHVYDCKD EAIPSAGVGV
YQLSPRLQGL VGRTRHFPSP TALHYDARLL RRVWRLMLAA AAEKAALWTQ PAFQLDLVDV
TRQILSNAFT DRYEALIAAF RAAMTPPAAP TLAPLPFRRS RSDSDSDSDK SGEVAERGAD
LLRLLDSLDL VLSAEPAYRL SKWLGDARRS AAQVVDADDL FAFNARSQIT VWLAEKHELN
DYAARAWAGL TRSYYRPRWE VFVDGLVQAC SVGSINESAI VARIGDFERQ WQYRGFDGEA
RPRPR
//