ID A0A1Y4UQU5_9FIRM Unreviewed; 1676 AA.
AC A0A1Y4UQU5;
DT 30-AUG-2017, integrated into UniProtKB/TrEMBL.
DT 30-AUG-2017, sequence version 1.
DT 24-JAN-2024, entry version 25.
DE RecName: Full=Gram-positive cocci surface proteins LPxTG domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=B5E58_03455 {ECO:0000313|EMBL:OUQ59503.1};
OS Tyzzerella sp. An114.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Lachnospiraceae;
OC Tyzzerella.
OX NCBI_TaxID=1965545 {ECO:0000313|EMBL:OUQ59503.1, ECO:0000313|Proteomes:UP000196526};
RN [1] {ECO:0000313|Proteomes:UP000196526}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=An114 {ECO:0000313|Proteomes:UP000196526};
RA Medvecky M., Cejkova D., Polansky O., Karasova D., Kubasova T., Cizek A.,
RA Rychlik I.;
RT "Function of individual gut microbiota members based on whole genome
RT sequencing of pure cultures obtained from chicken caecum.";
RL Submitted (APR-2017) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OUQ59503.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; NFLT01000003; OUQ59503.1; -; Genomic_DNA.
DR OrthoDB; 1747537at2; -.
DR Proteomes; UP000196526; Unassembled WGS sequence.
DR CDD; cd00222; CollagenBindB; 1.
DR Gene3D; 2.60.40.740; -; 3.
DR Gene3D; 2.60.40.1140; Collagen-binding surface protein Cna, B-type domain; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 2.
DR InterPro; IPR008966; Adhesion_dom_sf.
DR InterPro; IPR008454; Collagen-bd_Cna-like_B-typ_dom.
DR InterPro; IPR047589; DUF11_rpt.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR019931; LPXTG_anchor.
DR InterPro; IPR041033; Prealbumin-like.
DR NCBIfam; TIGR01451; B_ant_repeat; 1.
DR NCBIfam; TIGR01167; LPXTG_anchor; 1.
DR Pfam; PF05738; Cna_B; 1.
DR Pfam; PF00746; Gram_pos_anchor; 1.
DR Pfam; PF17802; SpaA; 2.
DR SUPFAM; SSF49401; Bacterial adhesins; 1.
DR SUPFAM; SSF49478; Cna protein B-type domain; 2.
PE 4: Predicted;
KW Cell wall {ECO:0000256|ARBA:ARBA00022512};
KW Peptidoglycan-anchor {ECO:0000256|ARBA:ARBA00023088};
KW Reference proteome {ECO:0000313|Proteomes:UP000196526};
KW Secreted {ECO:0000256|ARBA:ARBA00022512}.
FT DOMAIN 1292..1366
FT /note="Prealbumin-like fold"
FT /evidence="ECO:0000259|Pfam:PF17802"
FT DOMAIN 1410..1498
FT /note="CNA-B"
FT /evidence="ECO:0000259|Pfam:PF05738"
FT DOMAIN 1522..1594
FT /note="Prealbumin-like fold"
FT /evidence="ECO:0000259|Pfam:PF17802"
FT DOMAIN 1641..1674
FT /note="Gram-positive cocci surface proteins LPxTG"
FT /evidence="ECO:0000259|Pfam:PF00746"
FT REGION 303..329
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1676 AA; 188919 MW; AD36B2338224D595 CRC64;
MNWNLDKMVK QLYNTKIKRR RKFALLSLLS FIVLASTFYV LAFPAITAEK PKCGIEEHIH
TGQCYERVLK CNNPDHLEAG SIICEHEGVV IHTHDSNCYN DNGELICNLD EVSEHIHDDS
CFAEEKELVC DIEESLAHVH DDSCYAEDKV LTCTEDESEG HSHDDSCYDE EGNVICGMDE
SEGHSHDENC YTTEKVIVCQ EDTEIEGHVH DDSCYNTKLV PVCGQDEIFE HKHTDECYLD
GELNCGHIEA VKHQHDSSCF VPEKSIEGEV GQITTPSGHT DECYETKLIC DKEEHTHTDE
CYDEEVAGGG GGGRPDEERP SLGDYDFEND MENHEEDESF ALAYKPSFDE EQSLVNKGVS
MLMSFMAKNS PEGADIAKDF TDWITDVKIQ KKDGDTWIDA TDIYFGDIVR CIIEYKVPKD
EVTVENPYIK YQIPEGLIPN ADMEGKVLSY DSSFNPIEVG NYWITDDGLI LIKFYDAFIK
DGESFRGDVS FQGKMEKEDG EGGYEIDFGF DGEAVNVGPK PVKKDFTVSK SGVQGENNTV
NYTVTISSEN GTNDSINITD TFSFGKLGIS YNDGSFQIIK TNAYGVTENV TGLYTVTINP
PQDKISSGSF TINGLAPLEA GEKYDIKYTI THTPVSESGE GNGEIIIKNT ASVTSGEDTR
EAVFEFFLSK KMIEKTGYQD MKNNSIVWII DINKDLRDIS GYTVYDHLPE GLYVTGDIEI
KNNSDNTSVT IKEFPYTFPE GSNDWYTVTF RTAVSDDIPQ GAYKEYTNKA ELIKDNDTYS
SDYTVKYNNN SGLKKEFLGN YKPIYDGWGT ESGEEYQWKT TIYLPENGIT KDTKFEYTDT
MGEYLWTKAD ESINMLDIKK VTSDGRSVSL KSNEYSVEYY DAKGNICKSE EDDVKSFKVI
FKTDIDNSFE RIELTYYTYA DYSEIPEGEN FTFVNKGNVV IGENTHNVES THDYEKIAPV
KKVPGYVDEN GDIQYTVGNM KVDFDKSDRK LYYKLIVNSG KDKSGDITVT DTLPEGTKFN
SSDCKFSFVL SDGSESNTIH IDGSTYDVND IADFSYDEDK NKVEFKFRQG QNKFEFTVKY
SLSFDEDSFW DNAANSEKGY ENIVITDGYE TSITQTVKRE DIVIMKIGEQ GPEGEGNRIK
YNVTINPSGD DLLTDEHVLK LIDTIEIDET EIKDISLISE TVKLYRYDSS LPNHKGNEIS
KDEYSVSYDS KLRKLEVTVP DETPCILEYE YFVNPVSNQE TFEILNKANL VGVGNVEDED
SIIFDTSKSI ASAIKSRVVI KKVDSENMSI VLPGAKFKIK EYDLSSNSWI DVGDEYTTDN
KGEIIFNIDD NDSIIKENTL YMIEEIAPPD GYQIISPAYA YFTVGKGSIQ DVKNSMRETV
EKAGIPVEDV NYFTEVGGSI YFENENAAIM VEKKWQNESG EEIEAPVEYI EVNLMRKIQG
ALTGELVETV RLSEENQWKY SWNDLLKFDS NGNRIYYYVE ELNVPDGFEV SYSVSSDGVL
SGTITVINKF KSAGFDIIKV DETDSPLYGV VFKLYKADDS WNQLGDPIKT ITTDTNGKAV
FDNLSRGKYL LYETETLAGY EKPENPWKIE IVENKDTNKL EVKFVESPEG PLGVIEAEKI
NSNPSNPEND LYKIVNHFVT PELPQTGGEG KYIYTISGIL LMVGALTINS KIRKEG
//