ID G1RKF0_NOMLE Unreviewed; 638 AA.
AC G1RKF0;
DT 19-OCT-2011, integrated into UniProtKB/TrEMBL.
DT 28-FEB-2018, sequence version 3.
DT 27-MAR-2024, entry version 51.
DE SubName: Full=Sulfatase 1 {ECO:0000313|Ensembl:ENSNLEP00000013707.3};
GN Name=SULF1 {ECO:0000313|Ensembl:ENSNLEP00000013707.3};
OS Nomascus leucogenys (Northern white-cheeked gibbon) (Hylobates leucogenys).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Primates; Haplorrhini; Catarrhini; Hylobatidae;
OC Nomascus.
OX NCBI_TaxID=61853 {ECO:0000313|Ensembl:ENSNLEP00000013707.3, ECO:0000313|Proteomes:UP000001073};
RN [1] {ECO:0000313|Ensembl:ENSNLEP00000013707.3, ECO:0000313|Proteomes:UP000001073}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RG Gibbon Genome Sequencing Consortium;
RL Submitted (OCT-2012) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSNLEP00000013707.3}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- COFACTOR:
CC Name=Ca(2+); Xref=ChEBI:CHEBI:29108;
CC Evidence={ECO:0000256|ARBA:ARBA00001913};
CC -!- SUBCELLULAR LOCATION: Cell surface {ECO:0000256|ARBA:ARBA00004241}.
CC Endoplasmic reticulum {ECO:0000256|ARBA:ARBA00004240}. Golgi apparatus,
CC Golgi stack {ECO:0000256|ARBA:ARBA00004348}.
CC -!- SIMILARITY: Belongs to the sulfatase family.
CC {ECO:0000256|ARBA:ARBA00008779}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ADFV01083784; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; ADFV01083785; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; ADFV01083786; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; ADFV01083787; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR EMBL; ADFV01083788; -; NOT_ANNOTATED_CDS; Genomic_DNA.
DR AlphaFoldDB; G1RKF0; -.
DR Ensembl; ENSNLET00000014385.3; ENSNLEP00000013707.3; ENSNLEG00000011274.3.
DR GeneTree; ENSGT00940000157544; -.
DR HOGENOM; CLU_006332_2_3_1; -.
DR Proteomes; UP000001073; Chromosome 16.
DR GO; GO:0009986; C:cell surface; IEA:UniProtKB-SubCell.
DR GO; GO:0005783; C:endoplasmic reticulum; IEA:UniProtKB-SubCell.
DR GO; GO:0005795; C:Golgi stack; IEA:UniProtKB-SubCell.
DR Gene3D; 3.40.720.10; Alkaline Phosphatase, subunit A; 1.
DR InterPro; IPR017850; Alkaline_phosphatase_core_sf.
DR InterPro; IPR024609; Extracellular_sulfatase_C.
DR InterPro; IPR000917; Sulfatase_N.
DR PANTHER; PTHR43108:SF1; EXTRACELLULAR SULFATASE SULF-1; 1.
DR PANTHER; PTHR43108; N-ACETYLGLUCOSAMINE-6-SULFATASE FAMILY MEMBER; 1.
DR Pfam; PF12548; DUF3740; 2.
DR Pfam; PF00884; Sulfatase; 1.
DR SUPFAM; SSF53649; Alkaline phosphatase-like; 2.
PE 3: Inferred from homology;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Golgi apparatus {ECO:0000256|ARBA:ARBA00023034};
KW Reference proteome {ECO:0000313|Proteomes:UP000001073}.
FT DOMAIN 35..128
FT /note="Sulfatase N-terminal"
FT /evidence="ECO:0000259|Pfam:PF00884"
FT DOMAIN 289..433
FT /note="Extracellular sulfatase C-terminal"
FT /evidence="ECO:0000259|Pfam:PF12548"
FT DOMAIN 436..494
FT /note="Extracellular sulfatase C-terminal"
FT /evidence="ECO:0000259|Pfam:PF12548"
FT REGION 263..289
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 326..362
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 397..424
FT /evidence="ECO:0000256|SAM:Coils"
SQ SEQUENCE 638 AA; 73962 MW; 9CF02216A181D242 CRC64;
TPSYNYAPNM DKHWIMQYTG PMLPIHMEFT NILQRKRLQT LMSVDDSVER LYNMLMETGE
LENTYIIYTA DHGYHIGQFG LVKGKSMPYD FDIRVPFFIR GPSVEPGSIV PQIVLNIDLA
PTILDIAGLD TPPDVDGKSV LKLLDPEKPG NRFRTNKKAK IWRDTFLVER GKFLRKKEES
SKNIQQSNHL PKYERVKELC QQARYQTACE QPGQKWQCIE DTSGKLRIHK CKGPSDLLTV
RQSTRNLYAR GFHDKDKECS CRESGYRASR SQRKSQRQFL RNQGTPKYKP RFVHTRQTRS
LSVEFEGEIY DINLEEEELQ VLQPRNTAKR HDEGHKGPRA LQASSGGNRG GMLADSSNTV
GPPTTVRVTH KCFILPNDTI HCERELYQSA RAWKDHKAYI DKEIEALQDK IKNLREVRGH
LKRRKPEECS CSKQSYYNKE KGVKKQEKLK SHLHPFKEAA QEVDSKLQLF KENRRRKKER
KEKRRQRKGE ECSLPGLTCF THDNNHWQTA PFWNLGSFCA CTSSNNNTYW CLRTVNETHN
FLFCEFATGF LEYFDMNTDP YQLTNTVHTV ERGILNQLHV QLMELRSCQG YKQCNPRPKN
LDVGNKDGGS YGLHRMLLFL ASYSAPLTSR AMVSDASF
//