ID E0PS09_STRMT Unreviewed; 2567 AA.
AC E0PS09;
DT 02-NOV-2010, integrated into UniProtKB/TrEMBL.
DT 02-NOV-2010, sequence version 1.
DT 24-JAN-2024, entry version 59.
DE SubName: Full=Gram-positive signal peptide protein, YSIRK family {ECO:0000313|EMBL:EFM31223.1};
GN Name=cshA {ECO:0000313|EMBL:EFM31223.1};
GN ORFNames=HMPREF8571_1326 {ECO:0000313|EMBL:EFM31223.1};
OS Streptococcus mitis ATCC 6249.
OC Bacteria; Bacillota; Bacilli; Lactobacillales; Streptococcaceae;
OC Streptococcus.
OX NCBI_TaxID=864567 {ECO:0000313|EMBL:EFM31223.1, ECO:0000313|Proteomes:UP000003823};
RN [1] {ECO:0000313|EMBL:EFM31223.1, ECO:0000313|Proteomes:UP000003823}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC 6249 {ECO:0000313|EMBL:EFM31223.1,
RC ECO:0000313|Proteomes:UP000003823};
RA Muzny D., Qin X., Deng J., Jiang H., Liu Y., Qu J., Song X.-Z., Zhang L.,
RA Thornton R., Coyle M., Francisco L., Jackson L., Javaid M., Korchina V.,
RA Kovar C., Mata R., Mathew T., Ngo R., Nguyen L., Nguyen N., Okwuonu G.,
RA Ongeri F., Pham C., Simmons D., Wilczek-Boney K., Hale W., Jakkamsetti A.,
RA Pham P., Ruth R., San Lucas F., Warren J., Zhang J., Zhao Z., Zhou C.,
RA Zhu D., Lee S., Bess C., Blankenburg K., Forbes L., Fu Q., Gubbala S.,
RA Hirani K., Jayaseelan J.C., Lara F., Munidasa M., Palculict T., Patil S.,
RA Pu L.-L., Saada N., Tang L., Weissenberger G., Zhu Y., Hemphill L.,
RA Shang Y., Youmans B., Ayvaz T., Ross M., Santibanez J., Aqrawi P.,
RA Gross S., Joshi V., Fowler G., Nazareth L., Reid J., Worley K.,
RA Petrosino J., Highlander S., Gibbs R.;
RL Submitted (JUL-2010) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EFM31223.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AEEN01000013; EFM31223.1; -; Genomic_DNA.
DR RefSeq; WP_000513888.1; NZ_GL397179.1.
DR eggNOG; COG2373; Bacteria.
DR HOGENOM; CLU_000552_0_0_9; -.
DR OrthoDB; 2243937at2; -.
DR Proteomes; UP000003823; Unassembled WGS sequence.
DR InterPro; IPR026395; CshA_fibril.
DR InterPro; IPR045474; GEVED.
DR InterPro; IPR019931; LPXTG_anchor.
DR InterPro; IPR005877; YSIRK_signal_dom.
DR NCBIfam; TIGR04225; CshA_fibril_rpt; 18.
DR NCBIfam; TIGR01167; LPXTG_anchor; 1.
DR NCBIfam; TIGR01168; YSIRK_signal; 1.
DR Pfam; PF19076; CshA_repeat; 18.
DR Pfam; PF20009; GEVED; 1.
DR Pfam; PF00746; Gram_pos_anchor; 1.
DR Pfam; PF04650; YSIRK_signal; 1.
DR PROSITE; PS50847; GRAM_POS_ANCHORING; 1.
PE 4: Predicted;
KW Cell wall {ECO:0000256|ARBA:ARBA00022512};
KW Peptidoglycan-anchor {ECO:0000256|ARBA:ARBA00023088};
KW Secreted {ECO:0000256|ARBA:ARBA00022512};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..41
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 42..2567
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5003138721"
FT DOMAIN 2531..2567
FT /note="Gram-positive cocci surface proteins LPxTG"
FT /evidence="ECO:0000259|PROSITE:PS50847"
FT REGION 59..206
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 755..782
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 902..926
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1113..1134
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1316..1338
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1437..1457
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1518..1539
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1704..1741
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2112..2138
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2309..2358
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 59..124
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1113..1133
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1316..1335
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1518..1537
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1704..1739
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2310..2336
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2567 AA; 270657 MW; B2AFBAA62729CCB0 CRC64;
MGKDLFNSHF HKFSIRKLNV GVCSVLLSTL VLLGVTSQVS ADETSVTNSL NEVAKTNLDK
TTGTSISPTD SKTSEKSETP SVVESSQPTK ETATSNSGTE PADKKNNDQS VISETSQSPV
EKPVTSPEDK SRESAKPETA AAPIIAEPTE TSSATEQSTR SRRVRRDTQA TTVAPASYAG
ADDATPVPRT SKPELSESEK KESTQLAKQI NWIDFSDTAS LKNLDPQGGF KIGTTYTKEI
SPGYVVTLTV TELKPFQSTE IYKKRVEGTS SAGTYDPNAT NNYLKNWKDY GKTPPPVSGQ
AQDKWSTIGG QGFDTKGHKT QIIVPVDGVN WGVKFKIEAT YRGKKVKPAV VMADGEDANP
GEYGIFTTNG EGWEYVGEWM KGPRAKGPYT VTTEELVKQA DKTTRGGLLI LKDKTVDWNK
FLSPDTVTGG LGSQVFGPIV SASKAVPVVM TRGASEVGFY VATAGQQALM MGFLVVDGGD
APESYGEAHH TISTRDSITN VQINQPYLGS TEADIDVDSK NNWTSDDRED VSDEGSQQLL
TADQYSNTND LLDLNKAKNG TYTLKIKANP NGNAKAYVKA WVDFNNNGKF DDNEGSVVKE
ITANGDHTLT FNAIPSLSGG LVDQLGMRVR IATNAGDIEK PTGMAFSGEV EDMLVRRTYP
PQGEKKETTG LQGETQNATV HFTPKGPDRS DFVTNASMSS QAPQILDNQG NVLTPTNGNT
YVRPEGTYVV TTNGDDIDVA FTPNADFTGT AEGINIRRTD SNGSSTNWQS TDASNPNKND
ILNNMDGRYI PTVRKIPTYE STGVQGQEQN KNLIFNDGDS AKTPVTPDAS RPATFVNANG
QPITGNSVPA TSNGQSVGTY ELDPNTGQVT FKPNKNFVGT PDPVTVQVND SNGVPYRAHY
NPTVTKVTPT GTNATSTGPQ GVPQTGTPTF QGGDPLVPID ETVEPTFADG SKEKSIPGQG
TYTIAPDGTV TFTPDKQFVG KPDPITVKRV DKNGTPVTAT YSPEFTKVTP TGTGDKTEGL
QGQVQEGKVT FTPGHDSVPF PADSTPLFDN GTTVKELPNV GKFEVDADGK VTFTPDKQFK
GETPELELTR VDANGTPVTV KYQAVVKEVV PTGTNATSTG PQGVPQTGTP SFQGGDPLVP
IDETVEPTFA DGSKEKVIPG QGTYTIAPDG TVTFIPDKQF VGKPDPVTVK RVDKNGTPVI
ATYSPEFTKV TPTGTGATST GPQGVPQTGT PTFQGGDPLV PIDETVEPTF ADGSKEKVIP
GQGTYTIAPD GTVTFIPDKQ FVGSPVPVTV KRVDKNGTPV TATYSPEFTK VTPTGIGATS
TGPQGLPQTG TPSFQGGDPL VPIDETVEPT FADGSKEKSI SGQGTYTIAP DGTVTFTPDK
QFVGSPDPIT VKRVDKNGTP VTATYSPEFT KVTPTGTGDK TEGLQGQVQE GKVTFTPGHD
SVPFPADSTP LFDNGTTVKE VPNVGKFEVD ADGKVSFTPD KQFKGETPEL ELTRVDANGT
PVTVKYQAVV KEVVPTSTDA TSNGIQGQPQ KGTPTFTEGN PLVPIDDTKP MTFEDGQSTK
IVSGVGEYSI NPDGSITFTP EKQYVGTPDP VTVKRVDKNG TPVTATYTPT VTKVTPTSTN
ATSTGPQGVP QTGTPTFQGG NPLVPIDETV EPVFEDGSKE KTIPGQGTYT IAPDGTVTFT
PDKQFVGNPD PVTVKRVDKN GTPVTATYSP EFTKVTPTGT GATSTGPQGL PQTGTPTFQG
GDPLVPIDET VEPTFEDGSK EKSIPGQGTY TIAPDGTVTF IPDKQFVGSP VPVTVKRVDK
NGTPVTATYS PEFTKVTPTG TGDKTEGLQG QVQEGKVSFT PGHDSVSFPA DSTPLFDNGT
AVKEVPNVGK FEVDADGKVT FTPDKQFKGE TPELELTRVD ANGTPVTVKY QAVVKEVTPT
GTNATSSGPQ GVPQTGTPSF QGGDPLVPID ETVEPTFEDG SKEKSIPGQG TYTIAPDGTV
TFTPDKQFVG KADPVTVKRV DKNGTPVTAT YTPTVTKVTP TSTNATSTGP QGVPQTGTPT
FQGGDPLVPI DETVEPTFAD GSKEKVIPGQ GTYTIAPDGT VTFIPDKQFV GKPDPVTVKR
VDKNGTPVTA TYSPEFTKVT PTGTGATSTG PQGVPQTGTP TFQGGDPLVP IDETVEPTFA
DGSKEKVIPG QGTYTIAPDG TVTFIPDKQF VGKPDPVTVK RVDKNGTPVT ATYSPEFTKV
TPTGKDTSSV NIKGVPQTGT PTFQGGDPLV PINETVEPTF EDESTEKVIP GEGTYVISPD
GTVTFTPEAN FVGKGTGVTI VRKDKNGTPV TASYRPTVVD PSTGHDTTST GAKGRPQVAT
PTFEGHINPT VPPTFEDGST TMVVPGEGSY TIDKDGKITF TPEADFVGTA KGLVVKRLDM
YGNVVTAHYT PTVLGQTQVS DATSEGLKGQ TQTGKPNFTG DVDLTVPPTF EDGTTEKVVP
GEGTYVISPD GTVTFTPEAD FVGQAKGVKV IRKDRNGNIL SGFYTPTVVE LPVVKNPEQQ
GITEERTTKT LPNTGSEETS HLTAGLLAAL SGMGLISLAQ RKKSEEE
//