ID A0A2I3RN98_PANTR Unreviewed; 584 AA.
AC A0A2I3RN98;
DT 28-FEB-2018, integrated into UniProtKB/TrEMBL.
DT 28-FEB-2018, sequence version 1.
DT 27-MAR-2024, entry version 26.
DE SubName: Full=Glucosamine (N-acetyl)-6-sulfatase {ECO:0000313|Ensembl:ENSPTRP00000066176.1};
GN Name=GNS {ECO:0000313|Ensembl:ENSPTRP00000066176.1,
GN ECO:0000313|VGNC:VGNC:13094};
OS Pan troglodytes (Chimpanzee).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hominidae;
OC Pan.
OX NCBI_TaxID=9598 {ECO:0000313|Ensembl:ENSPTRP00000066176.1, ECO:0000313|Proteomes:UP000002277};
RN [1] {ECO:0000313|Ensembl:ENSPTRP00000066176.1, ECO:0000313|Proteomes:UP000002277}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=16136131; DOI=10.1038/nature04072;
RG Chimpanzee sequencing and analysis consortium;
RT "Initial sequence of the chimpanzee genome and comparison with the human
RT genome.";
RL Nature 437:69-87(2005).
RN [2] {ECO:0000313|Ensembl:ENSPTRP00000066176.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- COFACTOR:
CC Name=Ca(2+); Xref=ChEBI:CHEBI:29108;
CC Evidence={ECO:0000256|ARBA:ARBA00001913};
CC -!- PTM: The conversion to 3-oxoalanine (also known as C-formylglycine,
CC FGly), of a serine or cysteine residue in prokaryotes and of a cysteine
CC residue in eukaryotes, is critical for catalytic activity.
CC {ECO:0000256|PIRSR:PIRSR036666-50}.
CC -!- SIMILARITY: Belongs to the sulfatase family.
CC {ECO:0000256|ARBA:ARBA00008779}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AACZ04013035; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR RefSeq; XP_003952183.1; XM_003952134.3.
DR AlphaFoldDB; A0A2I3RN98; -.
DR Ensembl; ENSPTRT00000086822.1; ENSPTRP00000066176.1; ENSPTRG00000005176.5.
DR GeneID; 467054; -.
DR KEGG; ptr:467054; -.
DR CTD; 2799; -.
DR VGNC; VGNC:13094; GNS.
DR GeneTree; ENSGT00940000158420; -.
DR InParanoid; A0A2I3RN98; -.
DR OrthoDB; 1365192at2759; -.
DR Proteomes; UP000002277; Chromosome 12.
DR Bgee; ENSPTRG00000005176; Expressed in adult mammalian kidney and 21 other cell types or tissues.
DR GO; GO:0005764; C:lysosome; IEA:Ensembl.
DR GO; GO:0005539; F:glycosaminoglycan binding; IBA:GO_Central.
DR GO; GO:0008449; F:N-acetylglucosamine-6-sulfatase activity; IBA:GO_Central.
DR GO; GO:0030203; P:glycosaminoglycan metabolic process; IEA:InterPro.
DR CDD; cd16147; G6S; 1.
DR Gene3D; 3.40.720.10; Alkaline Phosphatase, subunit A; 1.
DR InterPro; IPR017850; Alkaline_phosphatase_core_sf.
DR InterPro; IPR012251; GlcNAc_6-SO4ase.
DR InterPro; IPR024607; Sulfatase_CS.
DR InterPro; IPR000917; Sulfatase_N.
DR PANTHER; PTHR43108:SF5; N-ACETYLGLUCOSAMINE-6-SULFATASE; 1.
DR PANTHER; PTHR43108; N-ACETYLGLUCOSAMINE-6-SULFATASE FAMILY MEMBER; 1.
DR Pfam; PF00884; Sulfatase; 1.
DR PIRSF; PIRSF036666; G6S; 2.
DR SUPFAM; SSF53649; Alkaline phosphatase-like; 1.
DR PROSITE; PS51257; PROKAR_LIPOPROTEIN; 1.
DR PROSITE; PS00523; SULFATASE_1; 1.
DR PROSITE; PS00149; SULFATASE_2; 1.
PE 3: Inferred from homology;
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Reference proteome {ECO:0000313|Proteomes:UP000002277};
KW Signal {ECO:0000256|ARBA:ARBA00022729}.
FT DOMAIN 79..416
FT /note="Sulfatase N-terminal"
FT /evidence="ECO:0000259|Pfam:PF00884"
FT MOD_RES 123
FT /note="3-oxoalanine (Cys)"
FT /evidence="ECO:0000256|PIRSR:PIRSR036666-50"
SQ SEQUENCE 584 AA; 65751 MW; 218896164706B12C CRC64;
MRLLPLAPGR LRRGSPRHLP SCSPALLLLV LGGCLGVFGV AAGTRRPNVV LLLTDDQDEV
LGGMAFSEYS TTNCQTGLQG EELRQEHLRT YLAPSRTPLK KTKALIGEMG MTFSSAYVPS
ALCCPSRASI LTGKYPHNHH VVNNTLEGNC SSKSWQKIQE PNTFPAILRS MCGYQTFFAG
KYLNEYGAPD AGGLEHVPLG WSYWYALEKN SKYYNYTLSI NGKARKHGEN YSVDYLTDVL
ANVSLDFLDY KSNFEPFFMM IATPAPHSPW TAAPQYQKAF QNVFAPRNKN FNIHGTNKHW
LIRQAKTPMT NSSIQFLDNA FRKRWQTLLS VDDLVEKLVK RLEFTGELNN TYIFYTSDNG
YHTGQFSLPI DKRQLYEFDI KVPLLVRGPG IKPNQTSKML VANIDLGPTI LDIAGYDLNK
TQMDGMSLLP ILRGASNLTW RSDVLVEYQG EGRNVTDPTC PSLSPGVSQC FPDCVCEDAY
NNTYACVRTM SALWNLQYCE FDDQEVFVEV YNLTADPDQI TNIAKTIDPE LLGKMNYRLM
MLQSCSGPTC RTPGVFDPGY RFDPRLMFSN RGSVRTRRFS KHLL
//