ID E7S7N8_9STRE Unreviewed; 1187 AA.
AC E7S7N8;
DT 05-APR-2011, integrated into UniProtKB/TrEMBL.
DT 05-APR-2011, sequence version 1.
DT 27-MAR-2024, entry version 65.
DE SubName: Full=Gram-positive signal peptide protein, YSIRK family {ECO:0000313|EMBL:EFW00482.1};
DE EC=3.2.1.52 {ECO:0000313|EMBL:EFW00482.1};
GN ORFNames=HMPREF9421_0074 {ECO:0000313|EMBL:EFW00482.1};
OS Streptococcus australis ATCC 700641.
OC Bacteria; Bacillota; Bacilli; Lactobacillales; Streptococcaceae;
OC Streptococcus.
OX NCBI_TaxID=888833 {ECO:0000313|EMBL:EFW00482.1, ECO:0000313|Proteomes:UP000002814};
RN [1] {ECO:0000313|EMBL:EFW00482.1, ECO:0000313|Proteomes:UP000002814}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC 700641 {ECO:0000313|EMBL:EFW00482.1,
RC ECO:0000313|Proteomes:UP000002814};
RA Muzny D., Qin X., Deng J., Jiang H., Liu Y., Qu J., Song X.-Z., Zhang L.,
RA Thornton R., Coyle M., Francisco L., Jackson L., Javaid M., Korchina V.,
RA Kovar C., Mata R., Mathew T., Ngo R., Nguyen L., Nguyen N., Okwuonu G.,
RA Ongeri F., Pham C., Simmons D., Wilczek-Boney K., Hale W., Jakkamsetti A.,
RA Pham P., Ruth R., San Lucas F., Warren J., Zhang J., Zhao Z., Zhou C.,
RA Zhu D., Lee S., Bess C., Blankenburg K., Forbes L., Fu Q., Gubbala S.,
RA Hirani K., Jayaseelan J.C., Lara F., Munidasa M., Palculict T., Patil S.,
RA Pu L.-L., Saada N., Tang L., Weissenberger G., Zhu Y., Hemphill L.,
RA Shang Y., Youmans B., Ayvaz T., Ross M., Santibanez J., Aqrawi P.,
RA Gross S., Joshi V., Fowler G., Nazareth L., Reid J., Worley K.,
RA Petrosino J., Highlander S., Gibbs R.;
RL Submitted (DEC-2010) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the glycosyl hydrolase 20 family.
CC {ECO:0000256|ARBA:ARBA00006285}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EFW00482.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AEQR01000001; EFW00482.1; -; Genomic_DNA.
DR AlphaFoldDB; E7S7N8; -.
DR eggNOG; COG3525; Bacteria.
DR eggNOG; COG3583; Bacteria.
DR HOGENOM; CLU_005832_0_0_9; -.
DR Proteomes; UP000002814; Unassembled WGS sequence.
DR GO; GO:0004563; F:beta-N-acetylhexosaminidase activity; IEA:UniProtKB-EC.
DR GO; GO:0102148; F:N-acetyl-beta-D-galactosaminidase activity; IEA:UniProtKB-EC.
DR GO; GO:0005975; P:carbohydrate metabolic process; IEA:InterPro.
DR CDD; cd06564; GH20_DspB_LnbB-like; 2.
DR Gene3D; 1.20.1270.90; AF1782-like; 2.
DR Gene3D; 3.20.20.80; Glycosidases; 2.
DR Gene3D; 2.20.230.10; Resuscitation-promoting factor rpfb; 1.
DR InterPro; IPR011098; G5_dom.
DR InterPro; IPR015883; Glyco_hydro_20_cat.
DR InterPro; IPR017853; Glycoside_hydrolase_SF.
DR InterPro; IPR019931; LPXTG_anchor.
DR InterPro; IPR005877; YSIRK_signal_dom.
DR NCBIfam; TIGR01167; LPXTG_anchor; 1.
DR NCBIfam; TIGR01168; YSIRK_signal; 1.
DR PANTHER; PTHR43678:SF1; BETA-N-ACETYLHEXOSAMINIDASE; 1.
DR PANTHER; PTHR43678; PUTATIVE (AFU_ORTHOLOGUE AFUA_2G00640)-RELATED; 1.
DR Pfam; PF07501; G5; 1.
DR Pfam; PF00728; Glyco_hydro_20; 2.
DR Pfam; PF00746; Gram_pos_anchor; 1.
DR Pfam; PF04650; YSIRK_signal; 1.
DR SMART; SM01208; G5; 1.
DR SUPFAM; SSF51445; (Trans)glycosidases; 2.
DR PROSITE; PS51109; G5; 1.
DR PROSITE; PS50847; GRAM_POS_ANCHORING; 1.
PE 3: Inferred from homology;
KW Cell wall {ECO:0000256|ARBA:ARBA00022512};
KW Glycosidase {ECO:0000313|EMBL:EFW00482.1};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801, ECO:0000313|EMBL:EFW00482.1};
KW Peptidoglycan-anchor {ECO:0000256|ARBA:ARBA00023088};
KW Reference proteome {ECO:0000313|Proteomes:UP000002814};
KW Secreted {ECO:0000256|ARBA:ARBA00022512};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..33
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 34..1187
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5003221870"
FT DOMAIN 1005..1082
FT /note="G5"
FT /evidence="ECO:0000259|PROSITE:PS51109"
FT DOMAIN 1156..1187
FT /note="Gram-positive cocci surface proteins LPxTG"
FT /evidence="ECO:0000259|PROSITE:PS50847"
FT REGION 35..64
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 110..134
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1082..1159
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1140..1159
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1187 AA; 130340 MW; 24F60A1ADC4F920C CRC64;
MMKSHQPRYA IRKYAVGVAS VLVGFFASGQ VVAADTPSVT ESPATTLPTQ PSEGDFPQSA
TEETAQPVVE KPIVPAPIVD QSQPTLPDRE LRASDLDRLI REEAISVTEA DGTAQPVTAA
TETPAKDPEQ EEKLAKKKVI SIDAGRKYFS PDQIKEIIDE ASKTGYTDLH LLVGNDGLRF
VLDDMSLQVG DASYSSQAVK EAVEKGTKAY YDDPNGTALT QAQMDEILAY AKSKNIGVIP
TINSPGHMDA ILTAMEQLGI ENPHFNYFGT KKSERTVDLD NQKALDFTKA LVDKYAAYFS
GKTEIFNIGL DEYANDATDA HGWQVLQASK HWPDEGYPDK GYEKFIQYAN DLAAIVKKHK
MKPMAFNDGI YYNGDTSYGT FDKDIIVSYW TGGWNGYDVA SSKFLSELGH QILNTNDAWY
YVLGRDKAGS GWYNLDQGLE GISKSAIDVV QKNDGAKVPF IGGMVAAWAD TPSATYKKEL
LFKLMHAFAD KNADYFVADP EVVEKALAEA PTDLDHYTPE SLVAFTAAKK ALEGVGAETT
RAEAKELIAS LKAAQESLVY TESYAKELAD KEAAEKLAKS KVISIDAGRK YFSLDQLKRI
VDKASELGYS DLHLLVGNDG MRFVLDDMTV EANGKTYASD DVKQALLEGT KAYYDDPNGQ
ALTQAEMDEL LAYATSKGIG LIPAVNSPGH MDAILVAMEK LGIEHPQATF DTVSKTTMDL
TNEEAVNFTK ALIGKYMDYF KGKSKIFNYG TDEYANDATN AQGWYYLKWY ELYGKFADYS
NSLAAMAREK GLQPMAFNDG FYYGDEDDVS FDKDVLISYW SKGWWGYNLA SPQYLADKGY
KFLNTNGDWY YVLGHRGDQS YPLDKAIQHS ETVPFEQLAS TKYPDVKLPV SGSMLGIWGD
EPANEYKEEE VFQVMEAFAN HNKDYFKADY TALRAALANV PTDLSIYTEE SRSALATVLD
QLNWKISRAH QEKVAEQVEQ VTQALSQLKP ITQVGSQAED DVRALVADKP SLEVVEKELD
FEVVERTNPE LAKGERKVVQ AGVKGQQREF VAVSALDESH QVVGTEVSKE AVAEIVEVGT
KEAEVITPTP GLQDGPSPQP DLPQVDQGGH HQVRPVTPEV KPLVNAQPAP VASEQKTVEE
AQPAPVQDQT PSSHQLPQTG QESVLGLALL GAILGATGMS LKGRKED
//