ID E2N934_9BACE Unreviewed; 646 AA.
AC E2N934;
DT 30-NOV-2010, integrated into UniProtKB/TrEMBL.
DT 30-NOV-2010, sequence version 1.
DT 27-MAR-2024, entry version 41.
DE SubName: Full=Arylsulfatase {ECO:0000313|EMBL:EEF91580.1};
DE EC=3.1.6.- {ECO:0000313|EMBL:EEF91580.1};
GN ORFNames=BACCELL_00779 {ECO:0000313|EMBL:EEF91580.1};
OS Bacteroides cellulosilyticus DSM 14838.
OC Bacteria; Bacteroidota; Bacteroidia; Bacteroidales; Bacteroidaceae;
OC Bacteroides.
OX NCBI_TaxID=537012 {ECO:0000313|EMBL:EEF91580.1, ECO:0000313|Proteomes:UP000003711};
RN [1] {ECO:0000313|EMBL:EEF91580.1, ECO:0000313|Proteomes:UP000003711}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=DSM 14838 {ECO:0000313|EMBL:EEF91580.1,
RC ECO:0000313|Proteomes:UP000003711};
RA Fulton L., Clifton S., Fulton B., Xu J., Minx P., Pepin K.H., Johnson M.,
RA Bhonagiri V., Nash W.E., Mardis E.R., Wilson R.K.;
RL Submitted (DEC-2008) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:EEF91580.1, ECO:0000313|Proteomes:UP000003711}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=DSM 14838 {ECO:0000313|EMBL:EEF91580.1,
RC ECO:0000313|Proteomes:UP000003711};
RA Sudarsanam P., Ley R., Guruge J., Turnbaugh P.J., Mahowald M., Liep D.,
RA Gordon J.;
RT "Draft genome sequence of Bacteroides cellulosilyticus (DSM 14838).";
RL Submitted (JAN-2009) to the EMBL/GenBank/DDBJ databases.
CC -!- PTM: The conversion to 3-oxoalanine (also known as C-formylglycine,
CC FGly), of a serine or cysteine residue in prokaryotes and of a cysteine
CC residue in eukaryotes, is critical for catalytic activity.
CC {ECO:0000256|PIRSR:PIRSR600917-52}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EEF91580.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ACCH01000070; EEF91580.1; -; Genomic_DNA.
DR AlphaFoldDB; E2N934; -.
DR HOGENOM; CLU_006332_7_2_10; -.
DR Proteomes; UP000003711; Unassembled WGS sequence.
DR GO; GO:0016788; F:hydrolase activity, acting on ester bonds; IEA:UniProt.
DR CDD; cd16027; SGSH; 1.
DR Gene3D; 3.40.720.10; Alkaline Phosphatase, subunit A; 1.
DR Gene3D; 1.25.10.10; Leucine-rich Repeat Variant; 1.
DR InterPro; IPR017850; Alkaline_phosphatase_core_sf.
DR InterPro; IPR011989; ARM-like.
DR InterPro; IPR016024; ARM-type_fold.
DR InterPro; IPR000917; Sulfatase_N.
DR PANTHER; PTHR43751:SF3; N-ACETYLGALACTOSAMINE 6-SULFATASE (GALNS); 1.
DR PANTHER; PTHR43751; SULFATASE; 1.
DR Pfam; PF00884; Sulfatase; 1.
DR SUPFAM; SSF53649; Alkaline phosphatase-like; 1.
DR SUPFAM; SSF48371; ARM repeat; 1.
PE 4: Predicted;
KW Hydrolase {ECO:0000313|EMBL:EEF91580.1}.
FT DOMAIN 23..300
FT /note="Sulfatase N-terminal"
FT /evidence="ECO:0000259|Pfam:PF00884"
FT MOD_RES 71
FT /note="3-oxoalanine (Ser)"
FT /evidence="ECO:0000256|PIRSR:PIRSR600917-52"
SQ SEQUENCE 646 AA; 72903 MW; F6CEBB64D267BE2E CRC64;
MCTAFCGLYS SQTEAADKKN ERPNILWLTF EDTSAYEFGC YGNGDVHTPN ADSLAARGIQ
FMNAWSVAPQ SSAARSSLIT GCYSSTYGMD IHPVPYDTPA DIFFPQRLRE AGYYCTNNSK
THYNSTTDNK SCWDECNNKA SYNSTKRGKD QPFFAVYNTV TSHMGRIRTF HTDGRRDYTQ
EGIFPQQLTL PSYVPDLPEV RSDYAGHLEA VQDVDTWLGI FLKDLETRGL EDNTIIFFFS
DHGGCVARGK GYLYESGLKV PMIAYFPPKW RHLANGAKGK ENGLVNFTDL GPTVLSLAGV
KPPKNMQGKA LYGKFASKEK REVQFALAAN QLHHFMPVRA VTDGRFKYIR SYIPYRQFAL
RNYYQWGMPS NKAWDKLVLG GHNTNPDWKQ TFEPHPAEML FDLEKDPDEL HDLSAIPEYA
ETLYKMRQAL SDHIRTTHDL GFFLPNSRTG HILYEKVRKE KYPLDELYGL VEIAGTATVA
SLPMLEKAIA SPLPEMRFWG VVGYANLARE KQINTCPQAL LALLQDENPY IASEAAYAVV
YLGKAQEGIA RLITPAQEKD RKIGYSSLEC LSLDPEMRDY IRPFLSELKE AAENLPRLEN
EDAGLMARGI LVNLGEMDIK DLHGPEAYKK GLKLNYGRRA MVPLPN
//