GenomeNet

Database: UniProt
Entry: D3AC84_9CLOT
ID   D3AC84_9CLOT            Unreviewed;       305 AA.
AC   D3AC84;
DT   23-MAR-2010, integrated into UniProtKB/TrEMBL.
DT   23-MAR-2010, sequence version 1.
DT   24-JAN-2024, entry version 39.
DE   SubName: Full=Cell wall-binding repeat protein {ECO:0000313|EMBL:EFD00562.1};
GN   ORFNames=CLOSTHATH_01212 {ECO:0000313|EMBL:EFD00562.1};
OS   Hungatella hathewayi DSM 13479.
OC   Bacteria; Bacillota; Clostridia; Eubacteriales; Clostridiaceae; Hungatella.
OX   NCBI_TaxID=566550 {ECO:0000313|EMBL:EFD00562.1, ECO:0000313|Proteomes:UP000004968};
RN   [1] {ECO:0000313|EMBL:EFD00562.1, ECO:0000313|Proteomes:UP000004968}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=DSM 13479 {ECO:0000313|EMBL:EFD00562.1,
RC   ECO:0000313|Proteomes:UP000004968};
RA   Weinstock G., Sodergren E., Clifton S., Fulton L., Fulton B., Courtney L.,
RA   Fronick C., Harrison M., Strong C., Farmer C., Delahaunty K., Markovic C.,
RA   Hall O., Minx P., Tomlinson C., Mitreva M., Nelson J., Hou S., Wollam A.,
RA   Pepin K.H., Johnson M., Bhonagiri V., Nash W.E., Warren W., Chinwalla A.,
RA   Mardis E.R., Wilson R.K.;
RL   Submitted (JAN-2010) to the EMBL/GenBank/DDBJ databases.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:EFD00562.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; ACIO01000079; EFD00562.1; -; Genomic_DNA.
DR   RefSeq; WP_006771754.1; NZ_GG667619.1.
DR   AlphaFoldDB; D3AC84; -.
DR   HOGENOM; CLU_896513_0_0_9; -.
DR   Proteomes; UP000004968; Unassembled WGS sequence.
DR   Gene3D; 2.10.270.10; Cholin Binding; 1.
DR   InterPro; IPR018337; Cell_wall/Cho-bd_repeat.
DR   Pfam; PF01473; Choline_bind_1; 1.
DR   Pfam; PF19127; Choline_bind_3; 1.
DR   SUPFAM; SSF69360; Cell wall binding repeat; 1.
DR   PROSITE; PS51170; CW; 2.
PE   4: Predicted;
KW   Repeat {ECO:0000256|ARBA:ARBA00022737}; Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..25
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           26..305
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5003040067"
FT   REPEAT          45..65
FT                   /note="Cell wall-binding"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00591"
FT   REPEAT          66..86
FT                   /note="Cell wall-binding"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00591"
FT   REGION          133..162
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        133..160
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   305 AA;  32523 MW;  7DCF6E3B0DCBF246 CRC64;
     MRMKKLAVMA LATALSVGSV MTASAAWLQE GSNWRYQNDN GTFQTGTWFR DVDGRWYHFD
     NNGIMQKGWF QDADGKWYFL AYNGVMQVGL IKVDNQVYYM NASGDLFLGD MTINGTTYNF
     GLYGTTNGQP NVPSTATYGG NGNQSLPGGG SSSNGGGSTA TPAEKVEGAV NEVKNAAKES
     IKGAESVIAG IEVSDPVTKG DAAVVEVKVD VIDIKDTDDA ELVKGSIASV VDTTISELDG
     AEKVAVSIPG ISKSFTVDEL RGDKLDDLLD NYVTPDFYKD HKNSSVTVTV PVNGVNVTYT
     ISLAK
//