ID S3AEP5_9STRE Unreviewed; 2220 AA.
AC S3AEP5;
DT 18-SEP-2013, integrated into UniProtKB/TrEMBL.
DT 18-SEP-2013, sequence version 1.
DT 24-JAN-2024, entry version 51.
DE SubName: Full=Adhesin isopeptide-forming adherence domain-containing protein {ECO:0000313|EMBL:EPD87638.1};
GN ORFNames=HMPREF1481_00727 {ECO:0000313|EMBL:EPD87638.1};
OS Streptococcus sp. HPH0090.
OC Bacteria; Bacillota; Bacilli; Lactobacillales; Streptococcaceae;
OC Streptococcus.
OX NCBI_TaxID=1203590 {ECO:0000313|EMBL:EPD87638.1, ECO:0000313|Proteomes:UP000014396};
RN [1] {ECO:0000313|EMBL:EPD87638.1, ECO:0000313|Proteomes:UP000014396}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=HPH0090 {ECO:0000313|EMBL:EPD87638.1,
RC ECO:0000313|Proteomes:UP000014396};
RG The Broad Institute Genomics Platform;
RA Earl A., Ward D., Feldgarden M., Gevers D., Schmidt T.M., Dover J., Dai D.,
RA Walker B., Young S., Zeng Q., Gargeya S., Fitzgerald M., Haas B.,
RA Abouelleil A., Allen A.W., Alvarado L., Arachchi H.M., Berlin A.M.,
RA Chapman S.B., Gainer-Dewar J., Goldberg J., Griggs A., Gujja S., Hansen M.,
RA Howarth C., Imamovic A., Ireland A., Larimer J., McCowan C., Murphy C.,
RA Pearson M., Poon T.W., Priest M., Roberts A., Saif S., Shea T., Sisk P.,
RA Sykes S., Wortman J., Nusbaum C., Birren B.;
RT "The Genome Sequence of Streptococcus sp. HPH0090.";
RL Submitted (APR-2013) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EPD87638.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ATCD01000001; EPD87638.1; -; Genomic_DNA.
DR RefSeq; WP_016466062.1; NZ_KE150464.1.
DR PATRIC; fig|1203590.3.peg.698; -.
DR eggNOG; COG2304; Bacteria.
DR eggNOG; COG4932; Bacteria.
DR HOGENOM; CLU_000474_0_0_9; -.
DR Proteomes; UP000014396; Unassembled WGS sequence.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR Gene3D; 2.60.40.3050; -; 6.
DR Gene3D; 2.60.40.740; -; 4.
DR InterPro; IPR026345; Adh_isopep-form_adh_dom.
DR InterPro; IPR019931; LPXTG_anchor.
DR InterPro; IPR022464; Strep_pil_isopept_link.
DR InterPro; IPR038174; Strep_pil_link_sf.
DR InterPro; IPR005877; YSIRK_signal_dom.
DR NCBIfam; TIGR04228; isopep_sspB_C2; 4.
DR NCBIfam; TIGR01167; LPXTG_anchor; 1.
DR NCBIfam; TIGR03786; strep_pil_rpt; 5.
DR NCBIfam; TIGR01168; YSIRK_signal; 1.
DR Pfam; PF17998; AgI_II_C2; 4.
DR Pfam; PF12892; FctA; 6.
DR Pfam; PF00746; Gram_pos_anchor; 1.
DR Pfam; PF04650; YSIRK_signal; 1.
DR PROSITE; PS50847; GRAM_POS_ANCHORING; 1.
PE 4: Predicted;
KW Cell wall {ECO:0000256|ARBA:ARBA00022512};
KW Membrane {ECO:0000256|SAM:Phobius};
KW Peptidoglycan-anchor {ECO:0000256|ARBA:ARBA00023088};
KW Secreted {ECO:0000256|ARBA:ARBA00022512};
KW Signal {ECO:0000256|ARBA:ARBA00022729};
KW Transmembrane {ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|SAM:Phobius}.
FT TRANSMEM 2194..2213
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 2186..2220
FT /note="Gram-positive cocci surface proteins LPxTG"
FT /evidence="ECO:0000259|PROSITE:PS50847"
FT REGION 37..171
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 835..862
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2147..2195
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 50..69
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 83..160
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 835..850
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2159..2182
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2220 AA; 244813 MW; DA1BA9D6A8BDAD59 CRC64;
MIRKEQQRFS LRKYKVGVAS VFLGTTLSFM MANGGAVKAS ELPTQPASEE KTELVKKDVS
SDDSSKKEQD VVANVATTEL AEGNKAEEVK PASEAKTEEA KSEPVSSEKA EEVKPASTET
EIAKKETAKP VEKAEETKPA VSEGDKPKVR NRRATTEDAV SGDHNSKPVS VSTYLKDGET
VDPAITNPNG AKVVSQTVPS GYARKEGDWY TYSIIDLTRF NERYNTNYYT RAYKRFDDST
ETTVELIDKT TGNVVETKTL SASSGIQKFT TTTAASNGQL TVKYDYNKGL GAGPGKTDEP
FIQFGYEVGA SIQALVNPKN EAEQKLYTDV YNARTSTDII NVVEPAYNGR TITDSNAKIP
KFVEKPTYYR VVDKNNATFN ANKTDKTVQD YVPNGKEVDL AKYATKAMEG QHFTASGERQ
FDGYKLYQTA NPDSTTGFVS RPYVVGTKFM DAERAGIKRI KEIVGEDGSV VIRVYLLDPK
QQSKRSDGTL STDGYMLLAE TKPIKPGDYN KEDLVVKKSP LNTIAFTDNK GVNHPNGVEV
PFDFQTAAGY TPKKTVFVPF LGDGIGHLSP NSQLENGAYV QIGTNVDLLN SLTPYKPTVY
YYVKQEPVEV TPEVEKQLDG RVLTDGEFTF KLTEEKSSPD KHEETVTNKD GKATFSKLTF
NKVGTYKYKI TEQKGSDANV DYDAMTVTMT VTVTENSKGD LVASVKYSSE GGFSTSADDK
IFNNYVVAPV KVKFDFTKKL AGRELKDGEF NFVLKNSDGE EVQTVANKAD GTVTFTEIAF
DNTKVGTHTY TVEEVVPAAK EAGMVYDAMK ATITVEVSKK GHALTTVTNV VSTGGVDDNG
ASTDGTEDKV FNNKITPPET PEFQPEKFVL NKEKYDITGT KLVDDDNELT DEYTETNADP
YADKTTNNEP ENINTKTVER GDKLVYQVWL DTRNFTDRNN IQSVGISDTY DAEKVTVNAA
DIKAYDSVTG EEVTSKFDIK VENGVITATS KASMNKSLGD ADNTQVIDTT KFEFGRYYKF
DIPATVKSDV KGGVDIENKA NQIVHVYNPV SKSVETPEKP TQKRVNTVPI TVDLKFTKKL
EGRTLATGEF SFVLKKDNVE IETVKNDAEG KITFKTLKFG KDDLGKTYTY TVSEVPGTDS
TVTYDDMVAT VTVSVSHDGT AKAIVANVTD APDKEFNNVV TPPEEPKFQP KKYVLDQEGF
DIKGDSLLDD DKELTDSVTD TNNNPYADRH DNNEEHNINT LGVRKGERIV YQVWLDTTKF
NTNNKDNIQT VGITDDYDET KVDVNASDIK AYDSVTGADV TNKFDIKVEN GIITATLKDG
FTKSLGDADN TQIIDTTKFE FGRYYKFDIP ATVKDSVVAG ADIENTASQL VHYYNPSTKK
VEKPVKPTEK RVNSVPIDIE FNFTKKLEGR ELKAGEFSFV LKDSEGNEIE TVKNDKDGKV
KFEPLSFMKG DEGTHKYTVE EVAGTDGTVT YDTMKAEITV EISYDGTAKA LVKTVTDAPD
KEFNNVVTPP EEPKFQPEKY VLNTAKFSIT DNKLLDDDVE LTDKYGETNT DPYVDGTSNN
EAENINTKTV KRGDKVYYQV WLDTTKFDAN NKDNIQSVSI TDDFDETKVD VDASAIKAYD
SVTGADVTDK FDIKVENGVM TATLKAGFTK SLGDADNTQI IDTTKFEFGR YYKFDIPATV
KADVAGGVDI ENTAAQIVNY YNPTTKKVEK PEKPTEKRVN SVPIEVEFNF TKKLEGRELK
AGEFSFVLKD SEGTEIETVQ NDKDGKVKFA TLEFTKAQVG THKYTVEEVA GTDGTVTYDT
MKAEITVEVE HDGKAKALVK TVTDAPDKEF NNTVTPPETP EFNPEKYILN EEKFDITGTK
LLDDDKELTD KVADTNKDPY ADKADNNEAQ NINTKTLHKG DKVVYQVWLD TTKFTEAHNI
QSVGITDKYD SENLDVNVAD IKAYDSVTGE DVTAKFDISI VDGVITATSK AELTKSLGDA
ENTQVIDTAK LAFGRYYKFD ILARIKDTAK EGVDIENTAS QIVHQYDPTK KSVEKPEKPT
EKRVVNIPVK VEFNFTKKLE GRALKAGEFT FVLKDKDGNV IETVSNDAEG KIKFSALEFK
RGEEGTYIYH VEEVQGTEAG VEYDKMVATV GVTVTKEGKV LTLTTQMPED TEFNNKVTPP
TPPVTPPTPP TPVTPPTPEK PKGRELPNTG EQSTAGVAAL GAALGLVGLG LVAKSKRRED
//