GenomeNet

Database: UniProt
Entry: E9SF23_RUMAL
LinkDB: E9SF23_RUMAL
Original site: E9SF23_RUMAL 
ID   E9SF23_RUMAL            Unreviewed;       740 AA.
AC   E9SF23;
DT   03-MAY-2011, integrated into UniProtKB/TrEMBL.
DT   03-MAY-2011, sequence version 1.
DT   07-JUN-2017, entry version 25.
DE   SubName: Full=Carbohydrate binding domain protein {ECO:0000313|EMBL:EGC02156.1};
GN   ORFNames=CUS_5679 {ECO:0000313|EMBL:EGC02156.1};
OS   Ruminococcus albus 8.
OC   Bacteria; Firmicutes; Clostridia; Clostridiales; Ruminococcaceae;
OC   Ruminococcus.
OX   NCBI_TaxID=246199 {ECO:0000313|EMBL:EGC02156.1, ECO:0000313|Proteomes:UP000004259};
RN   [1] {ECO:0000313|EMBL:EGC02156.1, ECO:0000313|Proteomes:UP000004259}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=8 {ECO:0000313|EMBL:EGC02156.1,
RC   ECO:0000313|Proteomes:UP000004259};
RA   Nelson K.E., Sutton G., Torralba M., Durkin S., Harkins D.,
RA   Montgomery R., Ziemer C., Klaassens E., Ocuiv P., Morrison M.;
RL   Submitted (FEB-2011) to the EMBL/GenBank/DDBJ databases.
CC   -!- CAUTION: The sequence shown here is derived from an
CC       EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is
CC       preliminary data. {ECO:0000313|EMBL:EGC02156.1}.
CC   -----------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution-NoDerivs License
CC   -----------------------------------------------------------------------
DR   EMBL; ADKM02000110; EGC02156.1; -; Genomic_DNA.
DR   RefSeq; WP_002851541.1; NZ_ADKM02000110.1.
DR   ProteinModelPortal; E9SF23; -.
DR   STRING; 246199.CUS_5679; -.
DR   EnsemblBacteria; EGC02156; EGC02156; CUS_5679.
DR   eggNOG; ENOG4105DT9; Bacteria.
DR   eggNOG; ENOG410XPHB; LUCA.
DR   OrthoDB; POG091H0GD8; -.
DR   BioCyc; RALB246199:G11YK-2504-MONOMER; -.
DR   Proteomes; UP000004259; Unassembled WGS sequence.
DR   GO; GO:0030246; F:carbohydrate binding; IEA:InterPro.
DR   GO; GO:0004553; F:hydrolase activity, hydrolyzing O-glycosyl compounds; IEA:InterPro.
DR   GO; GO:0000272; P:polysaccharide catabolic process; IEA:InterPro.
DR   Gene3D; 2.115.10.20; -; 1.
DR   Gene3D; 2.60.120.260; -; 2.
DR   InterPro; IPR006584; Cellulose-bd_IV.
DR   InterPro; IPR003305; CenC_carb-bd.
DR   InterPro; IPR005084; CMB_fam6.
DR   InterPro; IPR016134; Dockerin_dom.
DR   InterPro; IPR018247; EF_Hand_1_Ca_BS.
DR   InterPro; IPR008979; Galactose-bd-like.
DR   InterPro; IPR006710; Glyco_hydro_43.
DR   InterPro; IPR023296; Glyco_hydro_beta-prop.
DR   Pfam; PF02018; CBM_4_9; 1.
DR   Pfam; PF03422; CBM_6; 1.
DR   Pfam; PF04616; Glyco_hydro_43; 1.
DR   SMART; SM00606; CBD_IV; 1.
DR   SUPFAM; SSF49785; SSF49785; 2.
DR   SUPFAM; SSF63446; SSF63446; 1.
DR   SUPFAM; SSF75005; SSF75005; 1.
DR   PROSITE; PS51766; DOCKERIN; 1.
DR   PROSITE; PS00018; EF_HAND_1; 1.
PE   4: Predicted;
KW   Complete proteome {ECO:0000313|Proteomes:UP000004259};
KW   Reference proteome {ECO:0000313|Proteomes:UP000004259};
KW   Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL        1     30       {ECO:0000256|SAM:SignalP}.
FT   CHAIN        31    740       {ECO:0000256|SAM:SignalP}.
FT                                /FTId=PRO_5003243640.
FT   DOMAIN      202    269       Dockerin. {ECO:0000259|PROSITE:PS51766}.
SQ   SEQUENCE   740 AA;  80895 MW;  D47C200C1B67F474 CRC64;
     MKNRSGAKKL TTAVLAGTML FAQSAFMSDA SVKADAASSD GIYFHDNFED SSGNWEARGD
     GEILLSGRHP FKGTNALLVK DRTKAWQGIQ MALDPSIFQA GQSYSFNVFV DYEDGDDTEY
     FLFSMQYTDG SGTTKYLHIA EGSTSRGKYL QLSNPSFRIP SDASDVYIYV ETSEVTGNFY
     IDEAIIAQDG VKIGSDGVSE YKTDHRGDID LDGKVNIYDL VLAKRGMMSG FENELSRKAA
     DIDGSGEVDN EDISTLQDFV LRKIDKFPEI KIKVDFTEMA NKFGNVNLAA SYKKSNEHNP
     LISQYFGADP GVMEYNGRVY IFMTDDHLLY KNGQLTDIEY GSINCLRCIS SDDLVNWTDH
     GLIKAAGQNG LCKWGGNSWA PTACHKKING KEKFFLYFAN GGNGIAVLEA DSPTGPWRDP
     IGKALISRST PNCGNVEWLF DPAVLVDDDG TGYLYFGGGV PSGQNSHPKT ARCVKLSDDM
     TSIVGTPQTI DAPYLFEDSG IHKFNGKYYY SYCSNFNTYG NQYGMTSGAI NYMVSDSPLG
     PFTYKGEAFK GISTFFGTGG NNHHTFFKFN NQWYLTYHAQ YLQDSMGLKG GYRSTHIDKV
     NINSDGTIQA VTGTKSGVSQ IKSFDPYRTN RAATFSHQGG ITISGSGNSV VQANKGSWFR
     VSGVDCGYNA QEMTIKASSP NGCIVKVCTG SANGTPVAYA EIPAGSSMQE FTVPVKDLSG
     KNDLYFVFNN TVSVDSWSLS
//
DBGET integrated database retrieval system