ID T0V239_9STRE Unreviewed; 1566 AA.
AC T0V239;
DT 16-OCT-2013, integrated into UniProtKB/TrEMBL.
DT 16-OCT-2013, sequence version 1.
DT 27-MAR-2024, entry version 39.
DE SubName: Full=Putative cell-wall-anchored protein SasA(LPnTGmotif) {ECO:0000313|EMBL:EQC75949.1};
GN ORFNames=HSISM1_239 {ECO:0000313|EMBL:EQC75949.1};
OS Streptococcus sp. HSISM1.
OC Bacteria; Bacillota; Bacilli; Lactobacillales; Streptococcaceae;
OC Streptococcus.
OX NCBI_TaxID=1316408 {ECO:0000313|EMBL:EQC75949.1};
RN [1] {ECO:0000313|EMBL:EQC75949.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=HSISM1 {ECO:0000313|EMBL:EQC75949.1};
RA Van den Bogert B., Boekhorst J., Herrmann R., Smid E.J., Zoetendal E.G.,
RA Kleerebezem M.;
RT "Comparative Genomics Analysis of Streptococcus Isolates from the Human
RT Small Intestine Reveals their Adaptation to a Highly Dynamic Ecosystem.";
RL PLoS ONE 8:E83418-E83418(2013).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EQC75949.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ASKI01000080; EQC75949.1; -; Genomic_DNA.
DR PATRIC; fig|1316408.3.peg.1825; -.
DR eggNOG; COG5295; Bacteria.
DR HOGENOM; CLU_003760_0_0_9; -.
DR Proteomes; UP000015933; Chromosome.
DR GO; GO:0016887; F:ATP hydrolysis activity; IEA:InterPro.
DR InterPro; IPR044574; ARIP4-like.
DR InterPro; IPR022263; KxYKxGKxW.
DR InterPro; IPR019931; LPXTG_anchor.
DR InterPro; IPR026465; Ser_adhes_glycop_N.
DR NCBIfam; TIGR03715; KxYKxGKxW; 1.
DR NCBIfam; TIGR01167; LPXTG_anchor; 1.
DR NCBIfam; TIGR04224; ser_adhes_Nterm; 1.
DR PANTHER; PTHR45797:SF1; HELICASE ARIP4; 1.
DR PANTHER; PTHR45797; RAD54-LIKE; 1.
DR Pfam; PF00746; Gram_pos_anchor; 1.
DR Pfam; PF19258; KxYKxGKxW_sig; 1.
DR PROSITE; PS50847; GRAM_POS_ANCHORING; 1.
PE 4: Predicted;
KW Cell wall {ECO:0000256|ARBA:ARBA00022512};
KW Peptidoglycan-anchor {ECO:0000256|ARBA:ARBA00023088};
KW Secreted {ECO:0000256|ARBA:ARBA00022512}.
FT DOMAIN 1531..1566
FT /note="Gram-positive cocci surface proteins LPxTG"
FT /evidence="ECO:0000259|PROSITE:PS50847"
FT REGION 115..144
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 735..1535
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 764..1535
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1566 AA; 156516 MW; BBAA699B5FEE0EB6 CRC64;
MFFKRSNGEF RETDRVTRFK LIKSGKNWLR AATSNFGLLK VIRGQVEETV VAEVREDEVS
ATEMTSRGLL KGIVAAGAVL GAATVANTAK ADETGSAVAT ASELSSEALA VQGSTVLGTT
STTESTTEST TESASASTSA STSTSVSVSV SESASLSEQG SAELSAALSE SAVATASEAT
SETTTTVSKS EDKVVLEQNS SEAALLNKIA GDYSATVSAP EKKAALDAAI AKVQTELTAS
NSLINANASA QAYADQRERL SKSVDDMMAT LTAAGFTGNT TVNGSPAVYA QLDAIATSST
NPSGVAVTPG MSDPNGASLT DRTQAIPSGY AADPTSGRLT FGIWNLSSYN QLYNKNYYIT
LSVDKTNSAT NPNVFVRLVD KTTGSEVANT TLSNAVAAKE INLGDLSSQK YYPYLVYNAS
TDGNPSSVDI IIKHNEDLNP SGFDYKTEQV YDYLTPANQG EKKPNVEIAV PSTKMNQTTH
YKVVDTASST FDASRSKADT TTQSYKPTGN ETELASYTQT GIQGQDYTAS NPRSFEGYVL
YQQADSSTMS GELGNTVGTK YAELKGGRQH YYAKRIREVV ANDGSTVTKI YVLDPSAVST
YNESVMANNT DTTGYTLVYT TPVIKPGEKY IPAATNMDAD KRLVSSKNGD YQLAVSPWHN
PDKPNEGLVY ITGWYTAGHH GDVNYMFIDE PKGSASAVPG GNTQNVIGGR TKYGIDAATG
KIVTDSLGNP VKSTGGFGNQ YSIPDANEKP SGDTVHYYRK MDDSESASQS QSQSNSQSLS
ESEIVSKSQS VSQSQSTSEK TSKSISESAS ASLSESTRES ASTSTSQSAS LSTSASTSAS
TSASQSASLS TSASTSASTS ASQSASLSTS ASTSASTSAS QSASLSTSAS TSASTSASQS
ASLSTSASVS ASTTQSNSES NTESASQSIS ESRSESASIS LSESVSLSES RSESASISAS
ESASLSESRS ESASTSASES ASTSASQSAS LSTSASESAS TSASQSASLS RSASESASTS
ASQSASLSTS ASESASTSAS QSASLSTSAS ESASTSASQS ASLSTSASES ASTSASQSAS
LSTSASASES ASQSISESQS ESASVSLSES VSVSESLSES ASTSASESAS TSASQSASLS
TSASESASTS ASQSASLSTS ASESASTSAS QSASLSTSAS ESASTSASQS ASLSTSASAS
ESASQSISES QSESASVSLS ESVSVSESLS ESASTSASES ASTSASQSAS LSTSASESAS
TSASQSASLS TSASESASTS ASQSASLSTS ASESASTSAS QSASLSTSAS ESASTSVSQS
ASLSTSASES ASTSASQSAS LSTSASESAS TSASQSASLS TSASESASTS ASQSASLSTS
ASESASTSVS QSVSASESTS TSVSQSVSAS ESTSTSVSQS VSASESTSTS VSQSVSASES
TSTSVSTSVS VSQSQSQSGS TSGSGSYSNS MSQSGSTSAS GSYSNSMSQS GSTSASGSYS
NSMSQSASGS ESLSTSESAN GSQKHSESVA LPNTGETTSV TSALLGAVAG LAGAAVLGRR
KKEDEK
//