ID E9T1C4_RHOHA Unreviewed; 779 AA.
AC E9T1C4;
DT 31-MAY-2011, integrated into UniProtKB/TrEMBL.
DT 31-MAY-2011, sequence version 1.
DT 27-MAR-2024, entry version 44.
DE SubName: Full=Arylsulfatase {ECO:0000313|EMBL:EGD24225.1};
DE EC=3.1.6.- {ECO:0000313|EMBL:EGD24225.1};
GN ORFNames=HMPREF0724_12433 {ECO:0000313|EMBL:EGD24225.1};
OS Prescottella equi ATCC 33707.
OC Bacteria; Actinomycetota; Actinomycetes; Mycobacteriales; Nocardiaceae;
OC Prescottella.
OX NCBI_TaxID=525370 {ECO:0000313|EMBL:EGD24225.1, ECO:0000313|Proteomes:UP000004245};
RN [1] {ECO:0000313|EMBL:EGD24225.1, ECO:0000313|Proteomes:UP000004245}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC 33707 {ECO:0000313|EMBL:EGD24225.1,
RC ECO:0000313|Proteomes:UP000004245};
RA Muzny D., Qin X., Buhay C., Dugan-Rocha S., Ding Y., Chen G., Hawes A.,
RA Holder M., Jhangiani S., Johnson A., Khan Z., Li Z., Liu W., Liu X.,
RA Perez L., Shen H., Wang Q., Watt J., Xi L., Xin Y., Zhou J., Deng J.,
RA Jiang H., Liu Y., Qu J., Song X.-Z., Zhang L., Villasana D., Johnson A.,
RA Liu J., Liyanage D., Lorensuhewa L., Robinson T., Song A., Song B.-B.,
RA Dinh H., Thornton R., Coyle M., Francisco L., Jackson L., Javaid M.,
RA Korchina V., Kovar C., Mata R., Mathew T., Ngo R., Nguyen L., Nguyen N.,
RA Okwuonu G., Ongeri F., Pham C., Simmons D., Wilczek-Boney K., Hale W.,
RA Jakkamsetti A., Pham P., Ruth R., San Lucas F., Warren J., Zhang J.,
RA Zhao Z., Zhou C., Zhu D., Lee S., Bess C., Blankenburg K., Forbes L.,
RA Fu Q., Gubbala S., Hirani K., Jayaseelan J.C., Lara F., Munidasa M.,
RA Palculict T., Patil S., Pu L.-L., Saada N., Tang L., Weissenberger G.,
RA Zhu Y., Hemphill L., Shang Y., Youmans B., Ayvaz T., Ross M.,
RA Santibanez J., Aqrawi P., Gross S., Joshi V., Fowler G., Nazareth L.,
RA Reid J., Worley K., Petrosino J., Highlander S., Gibbs R.;
RL Submitted (JAN-2011) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EGD24225.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ADNW02000010; EGD24225.1; -; Genomic_DNA.
DR RefSeq; WP_005515627.1; NZ_CM001149.1.
DR AlphaFoldDB; E9T1C4; -.
DR STRING; 43767.A6I91_14630; -.
DR HOGENOM; CLU_006332_11_0_11; -.
DR OrthoDB; 9777306at2; -.
DR Proteomes; UP000004245; Chromosome.
DR GO; GO:0016787; F:hydrolase activity; IEA:UniProtKB-KW.
DR CDD; cd16025; PAS_like; 1.
DR Gene3D; 3.30.1120.10; -; 1.
DR Gene3D; 3.40.720.10; Alkaline Phosphatase, subunit A; 1.
DR InterPro; IPR017850; Alkaline_phosphatase_core_sf.
DR InterPro; IPR013320; ConA-like_dom_sf.
DR InterPro; IPR000917; Sulfatase_N.
DR PANTHER; PTHR42693; ARYLSULFATASE FAMILY MEMBER; 1.
DR PANTHER; PTHR42693:SF43; BLL2667 PROTEIN; 1.
DR Pfam; PF00884; Sulfatase; 1.
DR SUPFAM; SSF53649; Alkaline phosphatase-like; 1.
DR SUPFAM; SSF49899; Concanavalin A-like lectins/glucanases; 1.
PE 4: Predicted;
KW Hydrolase {ECO:0000313|EMBL:EGD24225.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000004245}.
FT DOMAIN 41..454
FT /note="Sulfatase N-terminal"
FT /evidence="ECO:0000259|Pfam:PF00884"
SQ SEQUENCE 779 AA; 85942 MW; 56979BA5289E64B6 CRC64;
MNRHSIPIVR TDPQISTAVD YRNQTEPFSR PQPVRPPSGA PNVLLVMLDD VGFGAPSAFG
GPCRTPTAER LAADGVKYTR FHTTALCSPT RAATLTGRNH HSVGFGVIAE QATAAPGYNG
TRPDSAATVA RILQGNGYAT GAFGKMHQTP TWEISEAGPF DRWPTREGFD RFYGFLGAES
DQFAPVLYRD FTVVDPPCTP EEGYHMSEDL VDRAIEWIES VGTMDPDKPW FCYLPFGACH
APLQVPDSYL DKYRGEFDHG WDRQREITLE RQKQLGVVPP ETELAPWSGD LPHWDDLDEG
QKKVSARLME LYAAFLEHTD DQVGRLVDRL QESGALDNTL VLYMLGDNGA SAEGGMEGSF
NYLAGLNGYK QTTAEVLERF DELGTPTSYP HYPASWALAL DTPYQWAKQA ASHYGGTRNG
LIAHWPKGIS ETGLRHQWHH CVDITPTILE AVGVPAPDSV DGVPQKPMEG VAMNYTFTDA
NAADRHTTQY FEIYGNRGIY DNGWTAVTLH RAPWLMATYG LQLPTFDEDR WELYDTSVDW
SQARDLADEF PEKLAELQQK FLVEAARYQV LPLDDRTVTR NAAPADRPPH PLRGRTSITL
YPHMNGLPEK AAPPFFNRSY VLTASLETGG EPCEGVLASV GGSFGGLALY VAASKPVFCY
NFSGGGTTFV RPDVELTADT REVRLEFDYD GGGIGLGGLA RLFVDDVEVG SGRVEQTTRA
IFSMNEQLDI GVNRGSPVCE DIVGRFAFTG RLHHVRVDLP GEGRPETADE RRRVALATH
//