GenomeNet

Database: UniProt
Entry: W1UDP7_CLOBU
LinkDB: W1UDP7_CLOBU
Original site: W1UDP7_CLOBU 
ID   W1UDP7_CLOBU            Unreviewed;       614 AA.
AC   W1UDP7;
DT   19-MAR-2014, integrated into UniProtKB/TrEMBL.
DT   19-MAR-2014, sequence version 1.
DT   24-JAN-2024, entry version 33.
DE   SubName: Full=Surface protective antigen SpaA {ECO:0000313|EMBL:ETI91862.1};
GN   ORFNames=Q607_CBUC00018G0055 {ECO:0000313|EMBL:ETI91862.1};
OS   Clostridium butyricum DORA_1.
OC   Bacteria; Bacillota; Clostridia; Eubacteriales; Clostridiaceae;
OC   Clostridium.
OX   NCBI_TaxID=1403941 {ECO:0000313|EMBL:ETI91862.1, ECO:0000313|Proteomes:UP000018876};
RN   [1] {ECO:0000313|EMBL:ETI91862.1, ECO:0000313|Proteomes:UP000018876}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=DORA_1 {ECO:0000313|Proteomes:UP000018876};
RA   Brown C.T., Sharon I., Thomas B.C., Castelle C.J., Morowitz M.J.,
RA   Banfield J.F.;
RT   "A Varibaculum cambriense genome reconstructed from a premature infant gut
RT   community with otherwise low bacterial novelty that shifts toward anaerobic
RT   metabolism during the third week of life.";
RL   Submitted (DEC-2013) to the EMBL/GenBank/DDBJ databases.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:ETI91862.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AZLX01000018; ETI91862.1; -; Genomic_DNA.
DR   AlphaFoldDB; W1UDP7; -.
DR   PATRIC; fig|1403941.3.peg.220; -.
DR   Proteomes; UP000018876; Unassembled WGS sequence.
DR   Gene3D; 2.10.270.10; Cholin Binding; 3.
DR   InterPro; IPR025883; Cadherin-like_b_sandwich.
DR   InterPro; IPR018337; Cell_wall/Cho-bd_repeat.
DR   Pfam; PF12733; Cadherin-like; 2.
DR   Pfam; PF01473; Choline_bind_1; 5.
DR   SUPFAM; SSF69360; Cell wall binding repeat; 1.
DR   PROSITE; PS51170; CW; 4.
PE   4: Predicted;
KW   Repeat {ECO:0000256|ARBA:ARBA00022737}.
FT   DOMAIN          181..269
FT                   /note="Cadherin-like beta sandwich"
FT                   /evidence="ECO:0000259|Pfam:PF12733"
FT   DOMAIN          283..373
FT                   /note="Cadherin-like beta sandwich"
FT                   /evidence="ECO:0000259|Pfam:PF12733"
FT   REPEAT          445..464
FT                   /note="Cell wall-binding"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00591"
FT   REPEAT          465..484
FT                   /note="Cell wall-binding"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00591"
FT   REPEAT          553..572
FT                   /note="Cell wall-binding"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00591"
FT   REPEAT          573..592
FT                   /note="Cell wall-binding"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00591"
FT   REGION          376..400
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          499..532
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        499..515
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   614 AA;  69289 MW;  6A83BD006FE6FC2A CRC64;
     MNKRIKKIIA LTVAFSAVTT IAPKTFLGLT SKPAYAASYT ARNGELTSLV IKSTNGDKLS
     LKDGYNGDTV KLSDEKEYFV KLTDDSEGIE IDAEAKGSDR IVRIFLTDDS DATAYKPGDE
     LYLAKGNTTI YVRTYASLSD FRQAKDKDKD VSNCEEEYEV HVKKTTESSY EDTTQDPVYL
     KELSLNKEDI TFLKQRTTYN VKVASSVDEI KITAEPEDDS SRVRIDGSLV DEDDNYRKTI
     SLDKGKNEIK IKVTDDKDNQ RVYTLNITRG SSSAGDSQGD VYLSSLELDE ADLDFEEDKT
     SYEVDVDEDV SKILVTAEPE DEEYLVTING SEVNSGDEYE KKVSLSKGKN TITVVVEDEV
     EDEKRTYKIT VNRGTVTDDE DKDDTEDTDD KSDDSSSDDK NTVGWVKVDN DWKYRGEDRK
     FYVNKWLYDK DQGVYCYLKE DGFRATGWLQ EGGNWYLLDS KGAMLTGWQY TGGQWYYLQS
     SGAMKTGWLK EEKTVTVEDT STATDKKEDE KTSTSGSTNK TDTSKTDDKK EETTAKTKKV
     ETWYYLQANG IMKTGWFLDN DKWYYMNLSG AMQIGWVIDN NSKYYLDQSG VMATGTKTID
     GKEYKFTTSG ALIS
//
DBGET integrated database retrieval system