GenomeNet

Database: UniProt
Entry: R5PB23_9BACT
LinkDB: R5PB23_9BACT
Original site: R5PB23_9BACT 
ID   R5PB23_9BACT            Unreviewed;       521 AA.
AC   R5PB23;
DT   24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT   24-JUL-2013, sequence version 1.
DT   28-FEB-2018, entry version 20.
DE   SubName: Full=Carbohydrate binding domain protein {ECO:0000313|EMBL:CCZ13730.1};
GN   ORFNames=BN465_00292 {ECO:0000313|EMBL:CCZ13730.1};
OS   Prevotella sp. CAG:1092.
OC   Bacteria; Bacteroidetes; Bacteroidia; Bacteroidales; Prevotellaceae;
OC   Prevotella; environmental samples.
OX   NCBI_TaxID=1262919 {ECO:0000313|EMBL:CCZ13730.1, ECO:0000313|Proteomes:UP000017987};
RN   [1] {ECO:0000313|EMBL:CCZ13730.1, ECO:0000313|Proteomes:UP000017987}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=MGS:1092 {ECO:0000313|Proteomes:UP000017987};
RA   Nielsen H.B., Almeida M., Juncker A.S., Rasmussen S., Li J.,
RA   Sunagawa S., Plichta D., Gautier L., Le Chatelier E., Peletier E.,
RA   Bonde I., Nielsen T., Manichanh C., Arumugam M., Batto J.,
RA   Santos M.B.Q.D., Blom N., Borruel N., Burgdorf K.S., Boumezbeur F.,
RA   Casellas F., Dore J., Guarner F., Hansen T., Hildebrand F., Kaas R.S.,
RA   Kennedy S., Kristiansen K., Kultima J.R., Leonard P., Levenez F.,
RA   Lund O., Moumen B., Le Paslier D., Pons N., Pedersen O., Prifti E.,
RA   Qin J., Raes J., Tap J., Tims S., Ussery D.W., Yamada T.,
RA   MetaHit consortium, Renault P., Sicheritz-Ponten T., Bork P., Wang J.,
RA   Brunak S., Ehrlich S.D.;
RT   "Dependencies among metagenomic species, viruses, plasmids and units
RT   of genetic variation.";
RL   Submitted (NOV-2012) to the EMBL/GenBank/DDBJ databases.
CC   -!- CAUTION: The sequence shown here is derived from an
CC       EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is
CC       preliminary data. {ECO:0000313|EMBL:CCZ13730.1}.
CC   -----------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution-NoDerivs License
CC   -----------------------------------------------------------------------
DR   EMBL; CAZL010000492; CCZ13730.1; -; Genomic_DNA.
DR   Proteomes; UP000017987; Unassembled WGS sequence.
DR   GO; GO:0016798; F:hydrolase activity, acting on glycosyl bonds; IEA:InterPro.
DR   Gene3D; 2.60.120.260; -; 1.
DR   InterPro; IPR003305; CenC_carb-bd.
DR   InterPro; IPR008979; Galactose-bd-like_sf.
DR   Pfam; PF02018; CBM_4_9; 1.
DR   SUPFAM; SSF49785; SSF49785; 1.
PE   4: Predicted;
KW   Complete proteome {ECO:0000313|Proteomes:UP000017987};
KW   Reference proteome {ECO:0000313|Proteomes:UP000017987};
KW   Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL        1     21       {ECO:0000256|SAM:SignalP}.
FT   CHAIN        22    521       {ECO:0000256|SAM:SignalP}.
FT                                /FTId=PRO_5004388481.
FT   DOMAIN      360    489       CBM-cenC. {ECO:0000259|Pfam:PF02018}.
SQ   SEQUENCE   521 AA;  56035 MW;  B3D7CD18C8F4330E CRC64;
     MKKIALSIMA LAIAAVTFTS CEDVPAPYDY PGTGGGGSTT EGVYLNQSFA KDLGDFKSFG
     TNDNIAWTID YSSACITGYK DFNGDGTKTN EAGVTYLVSP EIDLTKASKA YIEMNHAMKY
     ERADVNANNT LLISKDYTDD PTKATWTPIA YPTTGLNDAS TKEFVFVTSA ANIPAEFIGQ
     KVRIAFRHTC TDKQSSTWEI KTLSVKEGEV ENGGGEVTPT PTPGEGTGEG TEASPYDVTK
     ALAIITSGNI PASEVYVSGI VSSISEIETA NYGNATYNIS VDGTTTSEQL IVYHGFYLGG
     EKFTSNDQLK VGDKVVVYGK LKQFYEKKEI DYNNKIVSLN GKKAEEGGGE VTPPAPTGDN
     LLANGNCETW DGTTPVNWKT TSTAGNASLK QSTDAHGGSY SISVGFDASK NKRLGYKEIT
     LKAGTYKFSF YAKSTTADKS QCRPGYVFVT DGVADKNYNY RTDYEPLNNS TWTLVSYEFT
     LTETKTICLV VMNPKTTAYA TAQDILVDDA SLVTSNGGFA E
//
DBGET integrated database retrieval system