ID K9DWW6_9BACE Unreviewed; 527 AA.
AC K9DWW6;
DT 06-FEB-2013, integrated into UniProtKB/TrEMBL.
DT 06-FEB-2013, sequence version 1.
DT 27-MAR-2024, entry version 37.
DE RecName: Full=Sulfatase N-terminal domain-containing protein {ECO:0000259|Pfam:PF00884};
GN ORFNames=HMPREF9447_02902 {ECO:0000313|EMBL:EKU89464.1};
OS Bacteroides oleiciplenus YIT 12058.
OC Bacteria; Bacteroidota; Bacteroidia; Bacteroidales; Bacteroidaceae;
OC Bacteroides.
OX NCBI_TaxID=742727 {ECO:0000313|EMBL:EKU89464.1, ECO:0000313|Proteomes:UP000009872};
RN [1] {ECO:0000313|EMBL:EKU89464.1, ECO:0000313|Proteomes:UP000009872}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=YIT 12058 {ECO:0000313|EMBL:EKU89464.1,
RC ECO:0000313|Proteomes:UP000009872};
RG The Broad Institute Genome Sequencing Platform;
RA Earl A., Ward D., Feldgarden M., Gevers D., Morotomi M., Walker B.,
RA Young S.K., Zeng Q., Gargeya S., Fitzgerald M., Haas B., Abouelleil A.,
RA Alvarado L., Arachchi H.M., Berlin A.M., Chapman S.B., Goldberg J.,
RA Griggs A., Gujja S., Hansen M., Howarth C., Imamovic A., Larimer J.,
RA McCowen C., Montmayeur A., Murphy C., Neiman D., Pearson M., Priest M.,
RA Roberts A., Saif S., Shea T., Sisk P., Sykes S., Wortman J., Nusbaum C.,
RA Birren B.;
RT "The Genome Sequence of Bacteroides oleiciplenus YIT 12058.";
RL Submitted (SEP-2012) to the EMBL/GenBank/DDBJ databases.
CC -!- PTM: The conversion to 3-oxoalanine (also known as C-formylglycine,
CC FGly), of a serine or cysteine residue in prokaryotes and of a cysteine
CC residue in eukaryotes, is critical for catalytic activity.
CC {ECO:0000256|PIRSR:PIRSR600917-52}.
CC -!- SIMILARITY: Belongs to the sulfatase family.
CC {ECO:0000256|ARBA:ARBA00008779}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EKU89464.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ADLF01000013; EKU89464.1; -; Genomic_DNA.
DR RefSeq; WP_009130440.1; NZ_JH992942.1.
DR AlphaFoldDB; K9DWW6; -.
DR STRING; 742727.HMPREF9447_02902; -.
DR PATRIC; fig|742727.4.peg.2968; -.
DR eggNOG; COG3119; Bacteria.
DR HOGENOM; CLU_006332_10_3_10; -.
DR OrthoDB; 9765065at2; -.
DR Proteomes; UP000009872; Unassembled WGS sequence.
DR GO; GO:0008484; F:sulfuric ester hydrolase activity; IEA:UniProt.
DR CDD; cd16143; ARS_like; 1.
DR Gene3D; 3.30.1120.10; -; 1.
DR Gene3D; 3.40.720.10; Alkaline Phosphatase, subunit A; 1.
DR InterPro; IPR017850; Alkaline_phosphatase_core_sf.
DR InterPro; IPR024607; Sulfatase_CS.
DR InterPro; IPR000917; Sulfatase_N.
DR PANTHER; PTHR42693:SF33; ARYLSULFATASE; 1.
DR PANTHER; PTHR42693; ARYLSULFATASE FAMILY MEMBER; 1.
DR Pfam; PF00884; Sulfatase; 1.
DR SUPFAM; SSF53649; Alkaline phosphatase-like; 1.
DR PROSITE; PS00523; SULFATASE_1; 1.
DR PROSITE; PS00149; SULFATASE_2; 1.
PE 3: Inferred from homology;
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..24
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 25..527
FT /note="Sulfatase N-terminal domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5003929052"
FT DOMAIN 36..386
FT /note="Sulfatase N-terminal"
FT /evidence="ECO:0000259|Pfam:PF00884"
FT REGION 500..527
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT MOD_RES 85
FT /note="3-oxoalanine (Ser)"
FT /evidence="ECO:0000256|PIRSR:PIRSR600917-52"
SQ SEQUENCE 527 AA; 58287 MW; 1E3D5A303A1982AA CRC64;
MKNITCLGLL SVPFCLGQHA NAQAALQPEM SSPQKPNIIF ILADDMGYGD VSAFNKDSHI
HTQHIDRMAA NGIMFTDAHS CSAVSTPTRY GILTGRYNWR SSLKSGVLYG YSRPIIPTTR
STIASMLKAN GYVTACIGKW HLGWNWGTNP GHEKPNADAL TDEDVDYSKP IANGPADLGF
DYFYGFCGSL DMAPYVYIEN RQPTTTQVTS VPAGKKPGFW RAGAIGNDFN HQDCLPNLTN
RAVDYINRHA NAEHPFFLYL PLPAPHTPIL PAESFKGKTG LGDYGDFALM VDDVVGQVRE
ALQRNGIDSN TIVVFTTDNG CSPAAGIPDM TAKGHHANYI WRGMKADLFD GGHRVPTILE
WADGATKGIC EQTICLNDFY ASFAALNNYP LKDNEAEDSY NILPLIRNPH YAETIREATV
HHSIRGEFAI RKGDWKLLCS PSSGGWSYPK PGVDKEAIAK LPPLQLYNMK EDPTESRNVY
SEHPEIVNKL KDLLQKYIRE GRSTPGRSQP NDAVNKWEQI KGIMQDD
//