ID F4KX60_HALH1 Unreviewed; 484 AA.
AC F4KX60;
DT 28-JUN-2011, integrated into UniProtKB/TrEMBL.
DT 28-JUN-2011, sequence version 1.
DT 27-MAR-2024, entry version 56.
DE SubName: Full=Sulphatase-modifying factor protein {ECO:0000313|EMBL:AEE48288.1};
GN OrderedLocusNames=Halhy_0376 {ECO:0000313|EMBL:AEE48288.1};
OS Haliscomenobacter hydrossis (strain ATCC 27775 / DSM 1100 / LMG 10767 / O).
OC Bacteria; Bacteroidota; Saprospiria; Saprospirales; Haliscomenobacteraceae;
OC Haliscomenobacter.
OX NCBI_TaxID=760192 {ECO:0000313|EMBL:AEE48288.1, ECO:0000313|Proteomes:UP000008461};
RN [1] {ECO:0000313|EMBL:AEE48288.1, ECO:0000313|Proteomes:UP000008461}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC 27775 / DSM 1100 / LMG 10767 / O
RC {ECO:0000313|Proteomes:UP000008461};
RX PubMed=21886862; DOI=10.4056/sigs.1964579;
RG US DOE Joint Genome Institute (JGI-PGF);
RA Daligault H., Lapidus A., Zeytun A., Nolan M., Lucas S., Del Rio T.G.,
RA Tice H., Cheng J.F., Tapia R., Han C., Goodwin L., Pitluck S., Liolios K.,
RA Pagani I., Ivanova N., Huntemann M., Mavromatis K., Mikhailova N., Pati A.,
RA Chen A., Palaniappan K., Land M., Hauser L., Brambilla E.M., Rohde M.,
RA Verbarg S., Goker M., Bristow J., Eisen J.A., Markowitz V., Hugenholtz P.,
RA Kyrpides N.C., Klenk H.P., Woyke T.;
RT "Complete genome sequence of Haliscomenobacter hydrossis type strain (O).";
RL Stand. Genomic Sci. 4:352-360(2011).
RN [2]
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=DSM 1100;
RG US DOE Joint Genome Institute (JGI-PGF);
RA Lucas S., Han J., Lapidus A., Bruce D., Goodwin L., Pitluck S., Peters L.,
RA Kyrpides N., Mavromatis K., Ivanova N., Ovchinnikova G., Pagani I.,
RA Daligault H., Detter J.C., Han C., Land M., Hauser L., Markowitz V.,
RA Cheng J.-F., Hugenholtz P., Woyke T., Wu D., Verbarg S., Frueling A.,
RA Brambilla E., Klenk H.-P., Eisen J.A.;
RT "Complete sequence of chromosome of Haliscomenobacter hydrossis DSM 1100.";
RL Submitted (APR-2011) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CP002691; AEE48288.1; -; Genomic_DNA.
DR RefSeq; WP_013762852.1; NC_015510.1.
DR AlphaFoldDB; F4KX60; -.
DR STRING; 760192.Halhy_0376; -.
DR GeneID; 78196372; -.
DR KEGG; hhy:Halhy_0376; -.
DR eggNOG; COG0265; Bacteria.
DR eggNOG; COG1262; Bacteria.
DR HOGENOM; CLU_563562_0_0_10; -.
DR OrthoDB; 9768004at2; -.
DR Proteomes; UP000008461; Chromosome.
DR Gene3D; 2.40.10.120; -; 1.
DR Gene3D; 3.90.1580.10; paralog of FGE (formylglycine-generating enzyme); 1.
DR InterPro; IPR016187; CTDL_fold.
DR InterPro; IPR009003; Peptidase_S1_PA.
DR InterPro; IPR005532; SUMF_dom.
DR InterPro; IPR042095; SUMF_sf.
DR PANTHER; PTHR23150:SF37; FGE-SULFATASE DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR23150; SULFATASE MODIFYING FACTOR 1, 2; 1.
DR Pfam; PF03781; FGE-sulfatase; 1.
DR Pfam; PF13365; Trypsin_2; 1.
DR SUPFAM; SSF56436; C-type lectin-like; 1.
DR SUPFAM; SSF50494; Trypsin-like serine proteases; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000008461};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..36
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 37..484
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5003312248"
FT DOMAIN 264..483
FT /note="Sulfatase-modifying factor enzyme"
FT /evidence="ECO:0000259|Pfam:PF03781"
SQ SEQUENCE 484 AA; 53818 MW; 96CD4FB071FC9CF8 CRC64;
MKSSTLNTTP FCYPQSNFWR GSLALLLLFW STLMSAQDPI SPVTESLKPN VVAIKASFAD
GTEEKGFGFI TAEQNGRLFL ATAAHVVRGP DKDKSAQHIR VKFLNDISWY PATFKAQWDK
EDLALLELPK PSFVQWQPNC ADFAPGTYRK VHFIGLNGNE PRWVDPGLDG NIFEDKDHEL
NFAIGTIRPG TSGAPLITEM GIVGLITQDE GGISTALKLT QIKTLFSGGG QYPYFALQLL
GGVVTPPPIN TNVPVNVPQA DEYGLVLVKG GTFTMGCTSE QGSDCYDDEK TTHRVILSDF
HIGKYEVTQA QWRKVMGSDP PNLYFKGCDQ CPVEGVSWED IQKFLRKLNA QTGKIYRLPT
EAEWEYAARG GNQSKGYKYA GSNSFTDVAW FEDNSGNKPH PVGTKKANEL GLYDMSGNVW
EWCQDWYSDY SSNTQTNPTG AGSGSYRVYR GGNWFLDEWA CRVSYRNYST SGAHYKYLGF
RLAL
//