ID S3AII9_9STRE Unreviewed; 1582 AA.
AC S3AII9;
DT 18-SEP-2013, integrated into UniProtKB/TrEMBL.
DT 18-SEP-2013, sequence version 1.
DT 24-JAN-2024, entry version 56.
DE SubName: Full=YSIRK family gram-positive signal peptide {ECO:0000313|EMBL:EPD87986.1};
GN ORFNames=HMPREF1481_01077 {ECO:0000313|EMBL:EPD87986.1};
OS Streptococcus sp. HPH0090.
OC Bacteria; Bacillota; Bacilli; Lactobacillales; Streptococcaceae;
OC Streptococcus.
OX NCBI_TaxID=1203590 {ECO:0000313|EMBL:EPD87986.1, ECO:0000313|Proteomes:UP000014396};
RN [1] {ECO:0000313|EMBL:EPD87986.1, ECO:0000313|Proteomes:UP000014396}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=HPH0090 {ECO:0000313|EMBL:EPD87986.1,
RC ECO:0000313|Proteomes:UP000014396};
RG The Broad Institute Genomics Platform;
RA Earl A., Ward D., Feldgarden M., Gevers D., Schmidt T.M., Dover J., Dai D.,
RA Walker B., Young S., Zeng Q., Gargeya S., Fitzgerald M., Haas B.,
RA Abouelleil A., Allen A.W., Alvarado L., Arachchi H.M., Berlin A.M.,
RA Chapman S.B., Gainer-Dewar J., Goldberg J., Griggs A., Gujja S., Hansen M.,
RA Howarth C., Imamovic A., Ireland A., Larimer J., McCowan C., Murphy C.,
RA Pearson M., Poon T.W., Priest M., Roberts A., Saif S., Shea T., Sisk P.,
RA Sykes S., Wortman J., Nusbaum C., Birren B.;
RT "The Genome Sequence of Streptococcus sp. HPH0090.";
RL Submitted (APR-2013) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Cytoplasm, cytosol
CC {ECO:0000256|ARBA:ARBA00004514}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EPD87986.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ATCD01000001; EPD87986.1; -; Genomic_DNA.
DR RefSeq; WP_016466831.1; NZ_KE150464.1.
DR STRING; 1203590.HMPREF1481_01077; -.
DR PATRIC; fig|1203590.3.peg.1048; -.
DR eggNOG; COG3583; Bacteria.
DR eggNOG; COG4724; Bacteria.
DR HOGENOM; CLU_003704_0_0_9; -.
DR Proteomes; UP000014396; Unassembled WGS sequence.
DR GO; GO:0005829; C:cytosol; IEA:UniProtKB-SubCell.
DR GO; GO:0033925; F:mannosyl-glycoprotein endo-beta-N-acetylglucosaminidase activity; IEA:UniProtKB-EC.
DR GO; GO:0008152; P:metabolic process; IEA:UniProtKB-KW.
DR Gene3D; 2.60.40.3630; -; 2.
DR Gene3D; 1.20.120.1850; Ebh helix bundles repeating unit (S and A modules); 1.
DR Gene3D; 2.60.120.260; Galactose-binding domain-like; 2.
DR Gene3D; 3.20.20.80; Glycosidases; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 1.
DR Gene3D; 2.20.230.10; Resuscitation-promoting factor rpfb; 2.
DR InterPro; IPR032979; ENGase.
DR InterPro; IPR011098; G5_dom.
DR InterPro; IPR005201; Glyco_hydro_85.
DR InterPro; IPR022038; Ig-like_bact.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR009063; Ig/albumin-bd_sf.
DR InterPro; IPR019931; LPXTG_anchor.
DR InterPro; IPR005877; YSIRK_signal_dom.
DR NCBIfam; TIGR01167; LPXTG_anchor; 1.
DR NCBIfam; TIGR01168; YSIRK_signal; 1.
DR PANTHER; PTHR13246:SF1; CYTOSOLIC ENDO-BETA-N-ACETYLGLUCOSAMINIDASE; 1.
DR PANTHER; PTHR13246; ENDO BETA N-ACETYLGLUCOSAMINIDASE; 1.
DR Pfam; PF07523; Big_3; 2.
DR Pfam; PF07554; FIVAR; 1.
DR Pfam; PF07501; G5; 2.
DR Pfam; PF03644; Glyco_hydro_85; 1.
DR Pfam; PF00746; Gram_pos_anchor; 1.
DR Pfam; PF04650; YSIRK_signal; 1.
DR SMART; SM01208; G5; 2.
DR SUPFAM; SSF46997; Bacterial immunoglobulin/albumin-binding domains; 1.
DR PROSITE; PS51109; G5; 2.
DR PROSITE; PS50847; GRAM_POS_ANCHORING; 1.
PE 4: Predicted;
KW Cell wall {ECO:0000256|ARBA:ARBA00022512};
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Cytoplasm {ECO:0000256|ARBA:ARBA00022490};
KW Glycosidase {ECO:0000256|ARBA:ARBA00023295};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Peptidoglycan-anchor {ECO:0000256|ARBA:ARBA00023088};
KW Secreted {ECO:0000256|ARBA:ARBA00022512};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..37
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 38..1582
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5004517346"
FT DOMAIN 1351..1427
FT /note="G5"
FT /evidence="ECO:0000259|PROSITE:PS51109"
FT DOMAIN 1440..1516
FT /note="G5"
FT /evidence="ECO:0000259|PROSITE:PS51109"
FT DOMAIN 1550..1582
FT /note="Gram-positive cocci surface proteins LPxTG"
FT /evidence="ECO:0000259|PROSITE:PS50847"
FT REGION 48..148
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1517..1555
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 1250..1300
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 48..82
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 83..97
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 98..114
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 126..148
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1536..1551
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1582 AA; 174885 MW; C111692AC579BD18 CRC64;
MKNPFFEKRC RYSIRKLSVG ACSLMIGSVL FANQALAEEV ALPANETTTT AVATEQPSNN
TTEQPSTATT EQPITTSPEV LKQLEETENK VSEQPVSEEK PTLDQLTATD KEATRPVTET
PKVEEVPATT ENKPEEKATV SDVPKKEEKS LRPKEIKFDT WEDLLKWEPG AREDDAINRS
SVELAKRYRG HVVNEKANKD AKVEALANTN SKAKDHASVG GEEFKAYAFD YWQYLNSMVF
WEGLVPTPDV IDAGHRNGVP VLGTLFFNWS NSIADQEKFA AALQQDEDGT FPIARKLVDL
AKYYGFDGYF INQETTGDIV TPLGKKMRDF MLYTKEYAAQ VNHPVKYSWY DAMTYEYGRY
HEDGLGEYNY QFMEKEGDKV PADHFFANFN WTKEKNDYSV TMAQWLGRSQ YDVFAGLELQ
QGGSYKTKVK WDALFDENGK LRLSLGLFAP DTITSLGKTG EDYHKNENIF FTGYQGDPTG
QKPDDKDWYG IANLVADRTP AVGRTFTTSF NTGHGKKWFV DGKVSKDSEW NYRSVSGILP
TWRWWQTSSG AKLQADYDFE DAYNGGNSLK FAGDLAENTN QDVRLYSTKL EVTDKTKLRV
AHKGGKGSKV YVEFATQKNY TYGGENTRKE LTLSDDWTKD EFDLSALAGQ TIYGIKLTFE
NTAALKGYQF NLGELTVTDN QEAPQAPTAL KVAKQSLKNA QEAEAIVQFT GNKDADFYEV
YEKDGDTWRL LTGSSATTIY LPKVSRSASA TGTSQELKVV AVGKNGLRSE AATTKFEWGM
TVQDTSLPRP LAENIVPGAT VIGSTFPDTE GGEGIEGMLN GTITSLSDKW SSAQLSGSVD
IRLTQPRRVV RWVMDHAGAG GESVNDGLMN TKDFDLYYKD EAGEWKLAKA VRGNRAHVSD
ITLDSPITAQ EWRLHVITSD NGTPWKAIRI YNWKMYEALD TESQNIPMAK VAARVLTDNK
IQLGFSEVPA GATITVYDKP DSKTPIATLN TVVGGDLATD SLNFEKRPSL LYYRTQLPDK
EISNTLAVTI PQDERKIKAV SLEVAPKKTT YQDGEELNLK GGLLRVKYEG EEADEVVNLS
HAGVVVNGYD AHRHGEQELT VTYLGLPVAG SFKVQVTGGE TGPKEVAALY ISKQPKIDYL
VGNALDLSEG RFKVLYDDET ETEHSFTDEG VEITGYDSQK TGRQKLQLHY QGQTVEFDVL
VSPKAAINDE YLKQEITSAQ GRKGTTAYTF ADTEKQAALV EKIDAAKAIL ENHDASQEAV
NQALNDLKQA GADLNGVQVY QAAKEELETL LGKVREKNAD DQIIAQAETL IGSTNPTPEA
FADMKEKLNK KLTPAEESHH VGSLEDETAP TVEALPELVI ETETQAFERQ ERPSGDLLLG
ERRVVQAGVE GQTRRLIEVD AKGKRTLLKT EVVKEAVAEI TEVGKKVESR VQPVDGVKDL
TISNPALVIE EETVAYEHEE RVNPGLKAGE RRLVQAGVEG LRRNLVEVNA EGNRSLKGSE
VLRETVTEIV EVGPTVEAQE DKPLTPAPVA QIAKQEPAKE EAEKPEGKQL PSTGEEDANL
LALGLVGVLG GFGLLAQKKK ED
//