ID A0A1H3CBU9_9FIRM Unreviewed; 510 AA.
AC A0A1H3CBU9;
DT 22-NOV-2017, integrated into UniProtKB/TrEMBL.
DT 22-NOV-2017, sequence version 1.
DT 24-JAN-2024, entry version 18.
DE RecName: Full=Carbohydrate-binding domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=SAMN02910264_00020 {ECO:0000313|EMBL:SDX51637.1};
OS Ruminococcaceae bacterium YAD3003.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Oscillospiraceae.
OX NCBI_TaxID=1520816 {ECO:0000313|EMBL:SDX51637.1, ECO:0000313|Proteomes:UP000199204};
RN [1] {ECO:0000313|EMBL:SDX51637.1, ECO:0000313|Proteomes:UP000199204}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=YAD3003 {ECO:0000313|EMBL:SDX51637.1,
RC ECO:0000313|Proteomes:UP000199204};
RA de Groot N.N.;
RL Submitted (OCT-2016) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; FNPA01000001; SDX51637.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A1H3CBU9; -.
DR STRING; 1520816.SAMN02910264_00020; -.
DR OrthoDB; 9812829at2; -.
DR Proteomes; UP000199204; Unassembled WGS sequence.
DR InterPro; IPR025584; Cthe_2159.
DR Pfam; PF14262; Cthe_2159; 1.
DR PROSITE; PS51257; PROKAR_LIPOPROTEIN; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000199204};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..25
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 26..510
FT /note="Carbohydrate-binding domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5038595552"
FT REGION 482..510
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 510 AA; 51081 MW; 92133B1EEC576C5B CRC64;
MKPVKTEYKA LAAITAGTIA VLSIAACSIA SSNTPDTTVP SESETIVETT FSDNDLNAGY
DEGEAQIITL NGTSASSDSS SVKTEGSVVT ITGKGTYVIT GTLNDGYIVV DAGDSDDIRI
VLDNADITSS DYAAIYCLNA DNVYITLADG SSNSLSSTGE FDSKDSNSVD GAIFAKTDIT
INGSGTLKIN STDHGIVGKD DVTITGGGIS IKSSSDGIQA NDSVSVKDAV INIVCGKDGI
QADNDDDKTK GYVYIASGSI TISAGDDGIT ASSSLQIDDG TIDITGSYEG LEGRYVTVNG
GTISIVSSDD GINAVSSTSS GEMFQADDSK LCINGGEIYV NTQGDGVDSN GTFEMTGGIL
TVMGPTSGAN GSLDVNGSAV ISGGTVVMAG ASGMATNFTD ATQGTALLTV GNQSEGSEIT
VTDSTGNVIL SATSDCSYQT VLVSSPDMEK DGTYTVTAGS YSETITLSGY LYGAGGGFGG
GVPGGDPGQM PAGNSGDFPS GGPGGMHPPF
//