ID K8MQY4_9STRE Unreviewed; 3068 AA.
AC K8MQY4;
DT 06-FEB-2013, integrated into UniProtKB/TrEMBL.
DT 06-FEB-2013, sequence version 1.
DT 24-JAN-2024, entry version 54.
DE SubName: Full=YSIRK family Gram-positive signal peptide {ECO:0000313|EMBL:EKS20772.1};
GN ORFNames=HMPREF9186_00339 {ECO:0000313|EMBL:EKS20772.1};
OS Streptococcus sp. F0442.
OC Bacteria; Bacillota; Bacilli; Lactobacillales; Streptococcaceae;
OC Streptococcus.
OX NCBI_TaxID=999425 {ECO:0000313|EMBL:EKS20772.1, ECO:0000313|Proteomes:UP000001140};
RN [1] {ECO:0000313|EMBL:EKS20772.1, ECO:0000313|Proteomes:UP000001140}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=F0442 {ECO:0000313|EMBL:EKS20772.1,
RC ECO:0000313|Proteomes:UP000001140};
RG The Broad Institute Genome Sequencing Platform;
RA Earl A., Ward D., Feldgarden M., Gevers D., Blanton J.M., Baranova O.V.,
RA Mathney J., Dewhirst F.E., Izard J., Tanner A.C., Walker B., Young S.K.,
RA Zeng Q., Gargeya S., Fitzgerald M., Haas B., Abouelleil A., Alvarado L.,
RA Arachchi H.M., Berlin A., Chapman S.B., Goldberg J., Griggs A., Gujja S.,
RA Hansen M., Howarth C., Imamovic A., Larimer J., McCowen C., Montmayeur A.,
RA Murphy C., Neiman D., Pearson M., Priest M., Roberts A., Saif S., Shea T.,
RA Sisk P., Sykes S., Wortman J., Nusbaum C., Birren B.;
RT "The Genome Sequence of Streptococcus sp. F0442.";
RL Submitted (APR-2012) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EKS20772.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AGZZ01000002; EKS20772.1; -; Genomic_DNA.
DR STRING; 999425.HMPREF9186_00339; -.
DR PATRIC; fig|999425.3.peg.334; -.
DR eggNOG; COG2373; Bacteria.
DR eggNOG; COG4886; Bacteria.
DR HOGENOM; CLU_000552_0_0_9; -.
DR OrthoDB; 2243937at2; -.
DR Proteomes; UP000001140; Unassembled WGS sequence.
DR InterPro; IPR026395; CshA_fibril.
DR InterPro; IPR040683; CshA_NR2.
DR InterPro; IPR045474; GEVED.
DR InterPro; IPR019931; LPXTG_anchor.
DR InterPro; IPR027579; SSSPR51_Rpt.
DR InterPro; IPR005877; YSIRK_signal_dom.
DR NCBIfam; TIGR04225; CshA_fibril_rpt; 20.
DR NCBIfam; TIGR01167; LPXTG_anchor; 1.
DR NCBIfam; TIGR04308; repeat_SSSPR51; 2.
DR NCBIfam; TIGR01168; YSIRK_signal; 1.
DR Pfam; PF18651; CshA_NR2; 1.
DR Pfam; PF19076; CshA_repeat; 20.
DR Pfam; PF20009; GEVED; 1.
DR Pfam; PF00746; Gram_pos_anchor; 1.
DR Pfam; PF18877; SSSPR-51; 2.
DR Pfam; PF04650; YSIRK_signal; 1.
DR PROSITE; PS50847; GRAM_POS_ANCHORING; 1.
PE 4: Predicted;
KW Cell wall {ECO:0000256|ARBA:ARBA00022512};
KW Peptidoglycan-anchor {ECO:0000256|ARBA:ARBA00023088};
KW Secreted {ECO:0000256|ARBA:ARBA00022512};
KW Signal {ECO:0000256|ARBA:ARBA00022729}.
FT DOMAIN 3034..3068
FT /note="Gram-positive cocci surface proteins LPxTG"
FT /evidence="ECO:0000259|PROSITE:PS50847"
FT REGION 40..81
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 112..209
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 422..448
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1218..1237
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1322..1343
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1518..1560
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1738..1770
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1931..1973
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2038..2058
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2345..2370
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2473..2492
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2570..2598
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2660..2698
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2767..2803
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2922..3043
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 112..142
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1218..1234
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1322..1342
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1518..1551
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1738..1765
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2345..2359
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2473..2491
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2570..2591
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2660..2694
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2780..2794
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2939..2953
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2974..3012
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 3029..3043
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 3068 AA; 319983 MW; 54E1BD4F596A6C22 CRC64;
MGKDLFNDRI SRFSIRKLNV GVCSVLLGTL VMVGTAASAA AEENTDTTSE SVAAVTAASE
APEASTATAT SATSTAETTS TAATTYEASP AVTAAATSTA PASTSQASST STAASAASTA
PATSTTATPK LEATTPANKA AEATTTDKKK EVAAGVEAPA TETTAVTAET SGPKRRNRRA
LGDANDPNLI ADDVEDATST PKVEKPGFTT NVDAKSMASQ ISWLDFGDVA NWTGTTTVLV
PKSEVKPTEN LEEKMALQVG STYTKEIMPG YVVTVKVKSL KPFQATEIYK KRMENQGATE
AEKATYDPNA KNGYVSGVTS NAAKQAFNDG EEAKVIADPQ NNWTEIRFEN IDTKTKKTTI
SSALNGGNIG VQFEISATFR GKTVKPAIVM ADGESANPGE LVMFTTNGQG WQHIGEWKKY
TRPSTSETYS PQDTENLFGP NPKFNNTNLN QLRRSTEVGP EKKPVAWKYF GNPDQVTGGL
GTGIFGPNIS AGNYTVPIVM TRGASEVGLY VASGGKQSAM LGFFPIDEGD APESYGKAMH
TIATVDGVTG AKVNQPYLGN VSPDMDENTS LDWFGDDKAT TADEGIDQLL PDELKGTTNE
MIKMDRTRPG NYKLTVEAHT DGASEAHIYG WVDFNQNGKF DEDERSNLAT ITQDGTVELT
FANSKTYIDP SVKELGARVR IAKKATEIES PTGMAFSGEV EDFRTQITHP PKGEFKETSG
PQGAKQTATV TFTARGEHKY ELNSSAVIDE TVEPYIVDKD GNRATLDADG YYVVPGQGKY
KITANGKDVD VEFIPEDNFL GTADGISIRR SDNNGYDTGW STKFPDQEPN INGQLNTMDG
QYVPTVTPIE IEGVDKTSTD VQGATQKETP TFNTTATNSN GDKIAITPSA EYPAKLVDPA
TGLTTDAPSV TVEGEGTYTI NPSTGEVTFT PDPSFTGTAK GVDVSLSAPV GRNKDGKIQQ
DYIKTATAKY TPTVTPITVT PTDKVSADVQ NVPQTQTPTF DLSNDKTAKI TSKKLVDPAT
GQPTDETTVT VAGEGSYTID PTTGAVTFTP EKDFVGTAKG VTVQATATIT NANGKTATIT
SDATYTPTVV PAVPTANPAT SKDIQGATQT GTPTFAGTTV QVNGEDKAIT IKDNSYTLLD
KDGNEVSSTP AFAEDGTTEI GTFSIDPATG QVTFTPTDKS YTGKVTPAKV QAESSNGIKV
DTTYTPEIVP VTPTATPAET KDVQGATQTG KPEFKGGTVT VDGVEKTVEI NEDVPATFDD
GSTTKTVEGV GTYTVAADGT VTFVPEKSFV GTAPAVTVVR EDKNGTKASA TYTPTVLPVT
PTATPAETTD IQGATQKSKP EFKGGTVTVD GVEKTVEINE DVPATFDDGS TTKTVEGVGT
YTVAPDGTVT FVPEKSFVGT APAVTVVRED KNGTKASATY TPTVTPVKPT ATPVETTDIQ
GATQTGKPVF TEGDSRVPMN DDVPATFDDG STTKTVDGVG TYTVAADGTV TFVPEKSFVG
TAPAVTVVRE DKNGTKASAT YTPTVTPVTP TATPVETTGK QGQTQTGKPE FTEGDSRVPM
NEDVPATFDD GSTTKTVDGV GTYTVAADGT VTFVPEKSFT GKAPAVTVVR EDKNGTKASA
TYTPTVTPVT PTATPAESEA PQGLVQTGTV TFTEGDPVAP IDKNTITLLD ENGQPAASVE
AKSPAGDVIG TYTVDKDTGV VTFTPTDKSY SGDVVPVKVQ AADTNGTTVE TTYTPKITPV
VPTSEDATST DIQGATQTGK PTFTEGNPNV PIDEDTPATF EDGSTTKTVD GEGTYTVAPD
GTVTFVPEKS FTGTATGVTV KRVDKNGTEI TAKYTPTVTP VTPTATPVET TDIQGATQTG
KPVFTEGDSR VPMNDDVPAT FDDGSTTKTV DGVGTYTVAA DGTVTFVPEK SFVGTAPAVT
VVREDKNGTK ASATYTPTVT PVTPTAEDTT STDKQGQTQT GTPTFTPGNP NVPMDNDTPA
TFEDGSTTKT IPGEGTYTVA PDGTVTFVPE KSFTGEGTGV TVKRVDKNGT PVTAKYTPTV
TPVTPTATPA ESEAPQGVVQ TGTVTFTEGD PVAPIDKDTI TLLDENGQPA ESVVAKSPEG
KEIGTFTVDK ETGLVTFTPT DKSYSGDVVP VKVQGKDTNG TVAETTYTPK ITPVVPTAEP
AISTDIQGKT QTGTPSFTPG NPAIPMDDDV PATFEDGSTT KTIPGEGTYT VAPDGTVTFV
PEKSFTGTGT GVTVKRVDKN GTPVTATYTP TVTPVIPTAE PATSTDKQGQ IQTGKPTFTA
GHENVPMNDA VPATFEDGST TKEIPTVGIY TVAADGTVTF TPDPRFVGEA PAVTVVREDK
NGTKASATYT PTVTPITPTA KPARSEAPQG IVQTGKPTFT EGDPIAPMNP STLKLYDDSG
NPTDQLDAFS PAGDIIGIFT VDKEKGEVIF TPTNKAYSGE VLPVGVSMED ANGTYARTTY
TPYITPVVPT AEPATSTDIQ GATQTGKPSF TEGNPAVPMD DDVPATFVDG STTKVVPGEG
TYTVAPDGTV TFVPEKSFVG TASGIVVQRQ DKNGTVVKAM YTPTVTPVTP TGTPAVSTDV
QGATQTGKPE FTEGDSRVPL NDEVSATFDD GSTTKTVEGV GTYTVAPDGT VTFVPEKSFV
GTAPAVTVVR EDVNGTKASA TYTPTVTPVT PTAEPATTTD IQGQTQSGKP TFTPGNPSVP
MDDEVPATFE DGSTTKVIPG EGTYTVAPDG TVTFVPEKSF TGTGTGVTVK RVDKNGTPVT
AKYTPTVTPV APTGEPVTSI GKKGATQTGK PSFTEGDSRV PMNDKVPATF EDGSTTKTIP
GVGTYTVAAD GTVTFTPEPE FTGTAPAVTV VREDVNGTKA SATYTPTVLP ITKFVDKEGK
EIPGYPTVDG EEPKAEIPGY RFVETKKLPN GDTEHVYEKV TTSYVDENGD PIPGNPTEDG
EQPKKEIPGY DFVKTVVDKD GNTQHIYKKT VTPTPTPDPT PTPEPQPQPT PQPQPTPQPQ
PTPQPKPEEP TIPIVPETKE EVKYIDPQNP TAQLPNTGTK ESSSAGLAIF SALAGLSLFG
FAKRKKED
//