ID W5JWB0_ANODA Unreviewed; 678 AA.
AC W5JWB0;
DT 19-MAR-2014, integrated into UniProtKB/TrEMBL.
DT 19-MAR-2014, sequence version 1.
DT 27-MAR-2024, entry version 53.
DE RecName: Full=Peptide-N(4)-(N-acetyl-beta-glucosaminyl)asparagine amidase {ECO:0000256|ARBA:ARBA00018546};
DE EC=3.5.1.52 {ECO:0000256|ARBA:ARBA00012158};
DE AltName: Full=Peptide:N-glycanase {ECO:0000256|ARBA:ARBA00032901};
GN ORFNames=AND_000932 {ECO:0000313|EMBL:ETN67264.1};
OS Anopheles darlingi (Mosquito).
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Culicidae;
OC Anophelinae; Anopheles.
OX NCBI_TaxID=43151 {ECO:0000313|EMBL:ETN67264.1};
RN [1] {ECO:0000313|EMBL:ETN67264.1, ECO:0000313|Proteomes:UP000000673}
RP NUCLEOTIDE SEQUENCE.
RX PubMed=20920257; DOI=10.1186/1471-2164-11-529;
RA Mendes N.D., Freitas A.T., Vasconcelos A.T., Sagot M.F.;
RT "Combination of measures distinguishes pre-miRNAs from other stem-loops in
RT the genome of the newly sequenced Anopheles darlingi.";
RL BMC Genomics 11:529-529(2010).
RN [2] {ECO:0000313|EMBL:ETN67264.1}
RP NUCLEOTIDE SEQUENCE.
RA Almeida L.G., Nicolas M.F., Souza R.C., Vasconcelos A.T.R.;
RL Submitted (MAY-2010) to the EMBL/GenBank/DDBJ databases.
RN [3] {ECO:0000313|EMBL:ETN67264.1}
RP NUCLEOTIDE SEQUENCE.
RX PubMed=23761445;
RA Marinotti O., Cerqueira G.C., de Almeida L.G., Ferro M.I., Loreto E.L.,
RA Zaha A., Teixeira S.M., Wespiser A.R., Almeida E Silva A.,
RA Schlindwein A.D., Pacheco A.C., Silva A.L., Graveley B.R., Walenz B.P.,
RA Lima Bde A., Ribeiro C.A., Nunes-Silva C.G., de Carvalho C.R., Soares C.M.,
RA de Menezes C.B., Matiolli C., Caffrey D., Araujo D.A., de Oliveira D.M.,
RA Golenbock D., Grisard E.C., Fantinatti-Garboggini F., de Carvalho F.M.,
RA Barcellos F.G., Prosdocimi F., May G., Azevedo Junior G.M., Guimaraes G.M.,
RA Goldman G.H., Padilha I.Q., Batista Jda S., Ferro J.A., Ribeiro J.M.,
RA Fietto J.L., Dabbas K.M., Cerdeira L., Agnez-Lima L.F., Brocchi M.,
RA de Carvalho M.O., Teixeira Mde M., Diniz Maia Mde M., Goldman M.H.,
RA Cruz Schneider M.P., Felipe M.S., Hungria M., Nicolas M.F., Pereira M.,
RA Montes M.A., Cantao M.E., Vincentz M., Rafael M.S., Silverman N.,
RA Stoco P.H., Souza R.C., Vicentini R., Gazzinelli R.T., Neves Rde O.,
RA Silva R., Astolfi-Filho S., Maciel T.E., Urmenyi T.P., Tadei W.P.,
RA Camargo E.P., de Vasconcelos A.T.;
RT "The genome of Anopheles darlingi, the main neotropical malaria vector.";
RL Nucleic Acids Res. 41:7387-7400(2013).
RN [4] {ECO:0000313|EnsemblMetazoa:ADAC000932-PA}
RP IDENTIFICATION.
RG EnsemblMetazoa;
RL Submitted (JUN-2015) to UniProtKB.
CC -!- FUNCTION: Specifically deglycosylates the denatured form of N-linked
CC glycoproteins in the cytoplasm and assists their proteasome-mediated
CC degradation. Cleaves the beta-aspartyl-glucosamine (GlcNAc) of the
CC glycan and the amide side chain of Asn, converting Asn to Asp. Prefers
CC proteins containing high-mannose over those bearing complex type
CC oligosaccharides. Can recognize misfolded proteins in the endoplasmic
CC reticulum that are exported to the cytosol to be destroyed and
CC deglycosylate them, while it has no activity toward native proteins.
CC Deglycosylation is a prerequisite for subsequent proteasome-mediated
CC degradation of some, but not all, misfolded glycoproteins.
CC {ECO:0000256|ARBA:ARBA00024870}.
CC -!- CATALYTIC ACTIVITY:
CC Reaction=Hydrolysis of an N(4)-(acetyl-beta-D-glucosaminyl)asparagine
CC residue in which the glucosamine residue may be further glycosylated,
CC to yield a (substituted) N-acetyl-beta-D-glucosaminylamine and a
CC peptide containing an aspartate residue.; EC=3.5.1.52;
CC Evidence={ECO:0000256|ARBA:ARBA00001650};
CC -!- COFACTOR:
CC Name=Zn(2+); Xref=ChEBI:CHEBI:29105;
CC Evidence={ECO:0000256|ARBA:ARBA00001947};
CC -!- SUBCELLULAR LOCATION: Cytoplasm {ECO:0000256|ARBA:ARBA00004496}.
CC -!- SIMILARITY: Belongs to the transglutaminase-like superfamily. PNGase
CC family. {ECO:0000256|ARBA:ARBA00009390, ECO:0000256|PROSITE-
CC ProRule:PRU00731}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ADMH02000243; ETN67264.1; -; Genomic_DNA.
DR AlphaFoldDB; W5JWB0; -.
DR STRING; 43151.W5JWB0; -.
DR EnsemblMetazoa; ADAC000932-RA; ADAC000932-PA; ADAC000932.
DR VEuPathDB; VectorBase:ADAC000932; -.
DR VEuPathDB; VectorBase:ADAR2_008655; -.
DR eggNOG; KOG0909; Eukaryota.
DR HOGENOM; CLU_030187_1_0_1; -.
DR OMA; DLQDVTW; -.
DR OrthoDB; 5051at2759; -.
DR Proteomes; UP000000673; Unassembled WGS sequence.
DR GO; GO:0005737; C:cytoplasm; IEA:UniProtKB-SubCell.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0000224; F:peptide-N4-(N-acetyl-beta-glucosaminyl)asparagine amidase activity; IEA:UniProtKB-EC.
DR GO; GO:0006516; P:glycoprotein catabolic process; IEA:InterPro.
DR CDD; cd09212; PUB; 1.
DR Gene3D; 1.20.58.2190; -; 1.
DR Gene3D; 2.20.25.10; -; 1.
DR Gene3D; 3.10.620.30; -; 1.
DR Gene3D; 2.60.120.1020; Peptide N glycanase, PAW domain; 1.
DR InterPro; IPR008979; Galactose-bd-like_sf.
DR InterPro; IPR038765; Papain-like_cys_pep_sf.
DR InterPro; IPR038680; PAW_sf.
DR InterPro; IPR006588; Peptide_N_glycanase_PAW_dom.
DR InterPro; IPR036339; PUB-like_dom_sf.
DR InterPro; IPR018997; PUB_domain.
DR InterPro; IPR002931; Transglutaminase-like.
DR PANTHER; PTHR12143; PEPTIDE N-GLYCANASE PNGASE -RELATED; 1.
DR PANTHER; PTHR12143:SF19; PEPTIDE-N(4)-(N-ACETYL-BETA-GLUCOSAMINYL)ASPARAGINE AMIDASE; 1.
DR Pfam; PF04721; PAW; 1.
DR Pfam; PF09409; PUB; 1.
DR Pfam; PF01841; Transglut_core; 1.
DR SMART; SM00613; PAW; 1.
DR SMART; SM00580; PUG; 1.
DR SMART; SM00460; TGc; 1.
DR SUPFAM; SSF54001; Cysteine proteinases; 1.
DR SUPFAM; SSF49785; Galactose-binding domain-like; 1.
DR SUPFAM; SSF143503; PUG domain-like; 1.
DR PROSITE; PS51398; PAW; 1.
PE 3: Inferred from homology;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Cytoplasm {ECO:0000256|ARBA:ARBA00022490};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Reference proteome {ECO:0000313|Proteomes:UP000000673};
KW Zinc {ECO:0000256|ARBA:ARBA00022833}.
FT DOMAIN 482..678
FT /note="PAW"
FT /evidence="ECO:0000259|PROSITE:PS51398"
FT REGION 103..170
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 203..248
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 112..137
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 148..170
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 678 AA; 77712 MW; 49EC024651889AA7 CRC64;
MAALNQALVL ALETSNPRES YLVATETLLR LLDNIIREPQ NAKYRTVRLE NPSIKEKLLS
VVGMKQLMLG IGFIEANGTL TVPPNVLLAS LRKYREFIHE RKELIKNPPK PSTSIDKDSP
APTTVNAKSN GTGGGRTVAD GTSDVDMASF VPSATGSAND SRSAIDSATT APATPVIRAS
RPFLARIEFP RVLPPGNSLL QQLELLSDQV MQYEDDLLQA SGRSLIPLER LRARAKEKQT
QWRRLLQQGA DSTEQEPSAE DLLVEELTAW FRGEFFRWVN ALPCTVCGNE QTQLVQSTVE
DGVRVEVYRC CGQLRRFYRY NDVEKLLHTR RGRCGEWANC FTFLCRCLGY EARYVFSTGD
HVWTEVWSER RQRWIHVDPC ENVLDAPLMY EHGWRKEITF VFAFAHDDVQ DVSWRYSNDH
TNLVQRRRAL CRESTLLDAV WKLRGKRRAK LSADRVAALR RRTFDECLEL LACGQRVPTP
GELEGRSSGS LEWRLQRGEQ QVNTRYMFIP TADEVQAKQF NIRYCCATDR YERFLKGSSN
RVNERITETS NGWQSLQYMS RNIFRKEEHD WQMVYLARTE GTEEATIEWY FDFSVQGLRV
KQIELRFGQE TYESAKVELY FIKADGSRSE ALSSLCECGK FTLQARLSGG SSWQHAQLFR
QSKASTDECP FEMNVILM
//