ID F7KDK2_9FIRM Unreviewed; 491 AA.
AC F7KDK2;
DT 21-SEP-2011, integrated into UniProtKB/TrEMBL.
DT 21-SEP-2011, sequence version 1.
DT 24-JAN-2024, entry version 44.
DE RecName: Full=Sulfatase N-terminal domain-containing protein {ECO:0000259|Pfam:PF00884};
GN ORFNames=HMPREF0994_03947 {ECO:0000313|EMBL:EGN38712.1};
OS Lachnospiraceae bacterium 3_1_57FAA_CT1.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Lachnospiraceae.
OX NCBI_TaxID=658086 {ECO:0000313|EMBL:EGN38712.1, ECO:0000313|Proteomes:UP000003336};
RN [1] {ECO:0000313|Proteomes:UP000003336}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=3_1_57FAA_CT1 {ECO:0000313|Proteomes:UP000003336};
RG The Broad Institute Genome Sequencing Platform;
RA Earl A., Ward D., Feldgarden M., Gevers D., Daigneault M., Strauss J.,
RA Allen-Vercoe E., Young S.K., Zeng Q., Gargeya S., Fitzgerald M., Haas B.,
RA Abouelleil A., Alvarado L., Arachchi H.M., Berlin A., Brown A.,
RA Chapman S.B., Chen Z., Dunbar C., Freedman E., Gearin G., Gellesch M.,
RA Goldberg J., Griggs A., Gujja S., Heiman D., Howarth C., Larson L., Lui A.,
RA MacDonald P.J.P., Mehta T., Montmayeur A., Murphy C., Neiman D.,
RA Pearson M., Priest M., Roberts A., Saif S., Shea T., Shenoy N., Sisk P.,
RA Stolte C., Sykes S., Wortman J., Nusbaum C., Birren B.;
RT "The Genome Sequence of Lachnospiraceae bacterium 2_1_58FAA.";
RL Submitted (MAY-2011) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:EGN38712.1, ECO:0000313|Proteomes:UP000003336}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=3_1_57FAA_CT1 {ECO:0000313|EMBL:EGN38712.1,
RC ECO:0000313|Proteomes:UP000003336};
RG The Broad Institute Genomics Platform;
RA Earl A., Ward D., Feldgarden M., Gevers D., Daigneault M., Strauss J.,
RA Allen-Vercoe E., Walker B., Young S., Zeng Q., Gargeya S., Fitzgerald M.,
RA Haas B., Abouelleil A., Allen A.W., Alvarado L., Arachchi H.M.,
RA Berlin A.M., Chapman S.B., Gainer-Dewar J., Goldberg J., Griggs A.,
RA Gujja S., Hansen M., Howarth C., Imamovic A., Ireland A., Larimer J.,
RA McCowan C., Murphy C., Pearson M., Poon T.W., Priest M., Roberts A.,
RA Saif S., Shea T., Sisk P., Sykes S., Wortman J., Nusbaum C., Birren B.;
RT "The Genome Sequence of Lachnospiraceae bacterium 3-1-57FAA CT1.";
RL Submitted (APR-2013) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the sulfatase family.
CC {ECO:0000256|ARBA:ARBA00008779}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EGN38712.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ACTP02000018; EGN38712.1; -; Genomic_DNA.
DR AlphaFoldDB; F7KDK2; -.
DR STRING; 658086.HMPREF0994_03947; -.
DR PATRIC; fig|658086.3.peg.4298; -.
DR eggNOG; COG3119; Bacteria.
DR HOGENOM; CLU_006332_9_2_9; -.
DR Proteomes; UP000003336; Unassembled WGS sequence.
DR GO; GO:0008484; F:sulfuric ester hydrolase activity; IEA:InterPro.
DR Gene3D; 3.40.720.10; Alkaline Phosphatase, subunit A; 1.
DR InterPro; IPR017850; Alkaline_phosphatase_core_sf.
DR InterPro; IPR024607; Sulfatase_CS.
DR InterPro; IPR000917; Sulfatase_N.
DR PANTHER; PTHR45953; IDURONATE 2-SULFATASE; 1.
DR PANTHER; PTHR45953:SF1; IDURONATE 2-SULFATASE; 1.
DR Pfam; PF00884; Sulfatase; 1.
DR SUPFAM; SSF53649; Alkaline phosphatase-like; 1.
DR PROSITE; PS00149; SULFATASE_2; 1.
PE 3: Inferred from homology;
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Reference proteome {ECO:0000313|Proteomes:UP000003336}.
FT DOMAIN 5..333
FT /note="Sulfatase N-terminal"
FT /evidence="ECO:0000259|Pfam:PF00884"
FT REGION 459..491
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 491 AA; 57432 MW; 2B89C02A00A8B514 CRC64;
MSKRKVLFLM TDSQRADMLG CYGNPDMHTP NLDRLAEQGI RFDKAYTTQP LCQPARAGIF
LGCYPHSCAS WSNSMGVSGN VQSIGKRLSD HGVHTAYIGK WHLDGGDYFG LGRCPEGWDA
DYWYDMRNYL EELTPEERVI SRRTDSIEKY DIKEEFTFGH RVADRAIDFL KNYNDEDFFC
VASFDEPHGP YLCPKKYVDM YADYKLPRPG NFDDTLEDKP EFQRLWARPF RHLTEDPEFR
ASHKEFFGCN TYVDYEIGRV LEAAEKYAPD AVIIYTSDHG DMLFSHSLTG KGPAPYEENA
RIPLLIKGFG SRVDENPVSH INLAPAVFEM MGIDIPKAFE GKSLYPELAD GSVRVNDHIF
VEYGRYEVDH DGFGSFQPMR CIFDGRYKLV INLLDTDEMY DLQEDPDEMV NLIHEERLSQ
VRNKLHEALI DQMYRTRDPF RSYHWEDRPW HRMERDWDSR HMTRQRENEE YEPRQLDYDT
GLEMESAVRK K
//