ID R6BGG7_9FIRM Unreviewed; 959 AA.
AC R6BGG7;
DT 24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT 24-JUL-2013, sequence version 1.
DT 27-MAR-2024, entry version 33.
DE RecName: Full=CBM6 domain-containing protein {ECO:0000259|PROSITE:PS51175};
GN ORFNames=BN708_00739 {ECO:0000313|EMBL:CDA63368.1};
OS Firmicutes bacterium CAG:56.
OC Bacteria; Bacillota.
OX NCBI_TaxID=1263031 {ECO:0000313|EMBL:CDA63368.1, ECO:0000313|Proteomes:UP000018123};
RN [1] {ECO:0000313|EMBL:CDA63368.1, ECO:0000313|Proteomes:UP000018123}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=MGS:56 {ECO:0000313|Proteomes:UP000018123};
RA Nielsen H.B., Almeida M., Juncker A.S., Rasmussen S., Li J., Sunagawa S.,
RA Plichta D., Gautier L., Le Chatelier E., Peletier E., Bonde I., Nielsen T.,
RA Manichanh C., Arumugam M., Batto J., Santos M.B.Q.D., Blom N., Borruel N.,
RA Burgdorf K.S., Boumezbeur F., Casellas F., Dore J., Guarner F., Hansen T.,
RA Hildebrand F., Kaas R.S., Kennedy S., Kristiansen K., Kultima J.R.,
RA Leonard P., Levenez F., Lund O., Moumen B., Le Paslier D., Pons N.,
RA Pedersen O., Prifti E., Qin J., Raes J., Tap J., Tims S., Ussery D.W.,
RA Yamada T., MetaHit consortium, Renault P., Sicheritz-Ponten T., Bork P.,
RA Wang J., Brunak S., Ehrlich S.D.;
RT "Dependencies among metagenomic species, viruses, plasmids and units of
RT genetic variation.";
RL Submitted (NOV-2012) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:CDA63368.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CBCG010000129; CDA63368.1; -; Genomic_DNA.
DR AlphaFoldDB; R6BGG7; -.
DR Proteomes; UP000018123; Unassembled WGS sequence.
DR GO; GO:0030246; F:carbohydrate binding; IEA:InterPro.
DR CDD; cd04080; CBM6_cellulase-like; 1.
DR Gene3D; 2.60.120.260; Galactose-binding domain-like; 1.
DR Gene3D; 3.80.10.10; Ribonuclease Inhibitor; 1.
DR InterPro; IPR006584; Cellulose-bd_IV.
DR InterPro; IPR005084; CMB_fam6.
DR InterPro; IPR008979; Galactose-bd-like_sf.
DR InterPro; IPR026906; LRR_5.
DR InterPro; IPR032675; LRR_dom_sf.
DR InterPro; IPR014867; Spore_coat_CotH_CotH2/3/7.
DR PANTHER; PTHR45661:SF3; RICH REPEAT DOMAIN PROTEIN, PUTATIVE-RELATED; 1.
DR PANTHER; PTHR45661; SURFACE ANTIGEN; 1.
DR Pfam; PF03422; CBM_6; 1.
DR Pfam; PF08757; CotH; 1.
DR Pfam; PF13306; LRR_5; 1.
DR SMART; SM00606; CBD_IV; 1.
DR SUPFAM; SSF49785; Galactose-binding domain-like; 1.
DR PROSITE; PS51175; CBM6; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000018123};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..30
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 31..959
FT /note="CBM6 domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5004413367"
FT DOMAIN 612..734
FT /note="CBM6"
FT /evidence="ECO:0000259|PROSITE:PS51175"
FT REGION 735..806
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 750..792
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 959 AA; 105904 MW; 3E428AAE299E0E1F CRC64;
MKSAAKTPGK LLAGLCLILF FCLMPVRSYA AEQQLSEITK LYVTGIYGSE TLPQQKVNWQ
YADKGYYLCM PSGADLSNVQ LHFTENSDGN ENADSYVMIG GQKINCGDST DLSGTSKLQI
SLAGGKSVSV NIVKSAEIPA MFIQTASGTL DKVHASKDNK EKGTMALVKA DRSVDFDGNL
KQIKGRGNST WGFAKKPYNI KLENSSDLLG MGKGKGWCLL ANYADRSLLR NRIVYNLAEE
TGIPFTMDSR NIDLYINGDY MGSYLITEKI EIGKTRVNIT DLEEATSKAN DNADLETYEQ
KGTNDYKAGT QKWVDIPNDP EDITGGYLLE LELGERYKDE TSGFVTTGGQ AVTMKCPECV
SENQIKYISE FYQNMENALY SKDGYTTDSK GERHALSDYI DIESLARMYL LQEFSMNLDS
GITSFYLYKD SDLTGDGKLH AAPVWDFDVA LGNYTSRNGT DFTDPTQWWA KISRMYDNSS
KYNVMAQAVQ HEEVWNKVKE LWQSEFMPAI KYILGESTAY TATKIKTLDA YKAEVSASAA
MNFIQWRDLM ENPWNHSSES FVNTGLTYDD NIEYLRNFMT KRCDFLNRNL GGESTGTDPD
TSQKVTVIED DTRFYAAKNI LNSSGDVRAE DTSDEGGGEN LGYCNQGSLT EYKINVTTAG
TYNVTARVAS NDGTGAFSIY VDNQKIATYR AVNTGGWQVW TTTDAQEITL EAGEHSFAIY
FEESGMNLNW LQFTRETGTD PKPDPKPDPD PDPTPTPTPD PDPTPDPNPT PTPTPDPTPT
PDQTPTPTPS PKPDSSKDQN TVTLTKGSIC QDAKGILKYR ITKMAAKNGT AEVIGIQKKS
GKVTIPSTIT VQGITFKVTA IAEKAFRNDK NLKSVVIGSN VKKIGKQAFE KCRKLSSVTF
KGKKAPTIGK AAFKGIKKKT SVQVAGSMKK SQVKKLQNRM KTAAPSVKIT YKKKITVRF
//