ID W1UDP7_CLOBU Unreviewed; 614 AA.
AC W1UDP7;
DT 19-MAR-2014, integrated into UniProtKB/TrEMBL.
DT 19-MAR-2014, sequence version 1.
DT 24-JAN-2024, entry version 33.
DE SubName: Full=Surface protective antigen SpaA {ECO:0000313|EMBL:ETI91862.1};
GN ORFNames=Q607_CBUC00018G0055 {ECO:0000313|EMBL:ETI91862.1};
OS Clostridium butyricum DORA_1.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Clostridiaceae;
OC Clostridium.
OX NCBI_TaxID=1403941 {ECO:0000313|EMBL:ETI91862.1, ECO:0000313|Proteomes:UP000018876};
RN [1] {ECO:0000313|EMBL:ETI91862.1, ECO:0000313|Proteomes:UP000018876}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=DORA_1 {ECO:0000313|Proteomes:UP000018876};
RA Brown C.T., Sharon I., Thomas B.C., Castelle C.J., Morowitz M.J.,
RA Banfield J.F.;
RT "A Varibaculum cambriense genome reconstructed from a premature infant gut
RT community with otherwise low bacterial novelty that shifts toward anaerobic
RT metabolism during the third week of life.";
RL Submitted (DEC-2013) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:ETI91862.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AZLX01000018; ETI91862.1; -; Genomic_DNA.
DR AlphaFoldDB; W1UDP7; -.
DR PATRIC; fig|1403941.3.peg.220; -.
DR Proteomes; UP000018876; Unassembled WGS sequence.
DR Gene3D; 2.10.270.10; Cholin Binding; 3.
DR InterPro; IPR025883; Cadherin-like_b_sandwich.
DR InterPro; IPR018337; Cell_wall/Cho-bd_repeat.
DR Pfam; PF12733; Cadherin-like; 2.
DR Pfam; PF01473; Choline_bind_1; 5.
DR SUPFAM; SSF69360; Cell wall binding repeat; 1.
DR PROSITE; PS51170; CW; 4.
PE 4: Predicted;
KW Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT DOMAIN 181..269
FT /note="Cadherin-like beta sandwich"
FT /evidence="ECO:0000259|Pfam:PF12733"
FT DOMAIN 283..373
FT /note="Cadherin-like beta sandwich"
FT /evidence="ECO:0000259|Pfam:PF12733"
FT REPEAT 445..464
FT /note="Cell wall-binding"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00591"
FT REPEAT 465..484
FT /note="Cell wall-binding"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00591"
FT REPEAT 553..572
FT /note="Cell wall-binding"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00591"
FT REPEAT 573..592
FT /note="Cell wall-binding"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00591"
FT REGION 376..400
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 499..532
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 499..515
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 614 AA; 69289 MW; 6A83BD006FE6FC2A CRC64;
MNKRIKKIIA LTVAFSAVTT IAPKTFLGLT SKPAYAASYT ARNGELTSLV IKSTNGDKLS
LKDGYNGDTV KLSDEKEYFV KLTDDSEGIE IDAEAKGSDR IVRIFLTDDS DATAYKPGDE
LYLAKGNTTI YVRTYASLSD FRQAKDKDKD VSNCEEEYEV HVKKTTESSY EDTTQDPVYL
KELSLNKEDI TFLKQRTTYN VKVASSVDEI KITAEPEDDS SRVRIDGSLV DEDDNYRKTI
SLDKGKNEIK IKVTDDKDNQ RVYTLNITRG SSSAGDSQGD VYLSSLELDE ADLDFEEDKT
SYEVDVDEDV SKILVTAEPE DEEYLVTING SEVNSGDEYE KKVSLSKGKN TITVVVEDEV
EDEKRTYKIT VNRGTVTDDE DKDDTEDTDD KSDDSSSDDK NTVGWVKVDN DWKYRGEDRK
FYVNKWLYDK DQGVYCYLKE DGFRATGWLQ EGGNWYLLDS KGAMLTGWQY TGGQWYYLQS
SGAMKTGWLK EEKTVTVEDT STATDKKEDE KTSTSGSTNK TDTSKTDDKK EETTAKTKKV
ETWYYLQANG IMKTGWFLDN DKWYYMNLSG AMQIGWVIDN NSKYYLDQSG VMATGTKTID
GKEYKFTTSG ALIS
//