ID A0A3G3JW00_9BACL Unreviewed; 1379 AA.
AC A0A3G3JW00;
DT 13-FEB-2019, integrated into UniProtKB/TrEMBL.
DT 13-FEB-2019, sequence version 1.
DT 27-MAR-2024, entry version 21.
DE SubName: Full=Carbohydrate-binding protein {ECO:0000313|EMBL:AYQ72412.1};
GN ORFNames=EAV92_07405 {ECO:0000313|EMBL:AYQ72412.1};
OS Cohnella candidum.
OC Bacteria; Bacillota; Bacilli; Bacillales; Paenibacillaceae; Cohnella.
OX NCBI_TaxID=2674991 {ECO:0000313|EMBL:AYQ72412.1, ECO:0000313|Proteomes:UP000269097};
RN [1] {ECO:0000313|EMBL:AYQ72412.1, ECO:0000313|Proteomes:UP000269097}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=18JY8-7 {ECO:0000313|EMBL:AYQ72412.1,
RC ECO:0000313|Proteomes:UP000269097};
RA Srinivasan S., Kim M.K.;
RT "Genome Sequence of Cohnella sp.";
RL Submitted (OCT-2018) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the glycosyl hydrolase 43 family.
CC {ECO:0000256|ARBA:ARBA00009865}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CP033433; AYQ72412.1; -; Genomic_DNA.
DR KEGG; coh:EAV92_07405; -.
DR Proteomes; UP000269097; Chromosome.
DR GO; GO:0030246; F:carbohydrate binding; IEA:InterPro.
DR GO; GO:0004553; F:hydrolase activity, hydrolyzing O-glycosyl compounds; IEA:InterPro.
DR GO; GO:0005975; P:carbohydrate metabolic process; IEA:InterPro.
DR CDD; cd04080; CBM6_cellulase-like; 4.
DR CDD; cd08991; GH43_HoAraf43-like; 1.
DR Gene3D; 2.60.120.260; Galactose-binding domain-like; 4.
DR InterPro; IPR010496; 3-keto-disaccharide_hydrolase.
DR InterPro; IPR006584; Cellulose-bd_IV.
DR InterPro; IPR005084; CMB_fam6.
DR InterPro; IPR008979; Galactose-bd-like_sf.
DR InterPro; IPR006710; Glyco_hydro_43.
DR InterPro; IPR023296; Glyco_hydro_beta-prop_sf.
DR PANTHER; PTHR42812; BETA-XYLOSIDASE; 1.
DR PANTHER; PTHR42812:SF5; ENDO-ARABINASE; 1.
DR Pfam; PF06439; 3keto-disac_hyd; 1.
DR Pfam; PF03422; CBM_6; 4.
DR Pfam; PF04616; Glyco_hydro_43; 1.
DR SMART; SM00606; CBD_IV; 4.
DR SUPFAM; SSF75005; Arabinanase/levansucrase/invertase; 1.
DR SUPFAM; SSF49785; Galactose-binding domain-like; 4.
DR PROSITE; PS51175; CBM6; 4.
PE 3: Inferred from homology;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Reference proteome {ECO:0000313|Proteomes:UP000269097};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..23
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 24..1379
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5017977317"
FT DOMAIN 490..630
FT /note="CBM6"
FT /evidence="ECO:0000259|PROSITE:PS51175"
FT DOMAIN 692..819
FT /note="CBM6"
FT /evidence="ECO:0000259|PROSITE:PS51175"
FT DOMAIN 829..969
FT /note="CBM6"
FT /evidence="ECO:0000259|PROSITE:PS51175"
FT DOMAIN 993..1119
FT /note="CBM6"
FT /evidence="ECO:0000259|PROSITE:PS51175"
FT COILED 1328..1358
FT /evidence="ECO:0000256|SAM:Coils"
SQ SEQUENCE 1379 AA; 150410 MW; 49E1460AAAC28AE6 CRC64;
MLKRSITVLL CVLLLFASQS AFAYKNPASL PDEWGQYGLG DPYVLKFNGT YYLYVSTRDT
DIGVKVWSSP DLVNWSYGGL AATDPVTKGA YAPEVVYWNG YFYMYTSPAG NGHYVLRSES
PTGPFTVQTG NLGHSIDGDV FIDDDGKWYF YHAGGSGILA APMPTPTTIG NDISTGASMN
GWTEGSTLFK RDGKYYMTYT GNHVFSKGYR VDYAVSTNPL TGFVRDKDNP ILISTEGPTV
GLGHNTVVIG PNLDARYIIY HNLEGKGIVG PLRHMGMDRI VFNGDKMSVL GPTATDQPDP
EMPDFEDRFN GAELSPAWTN EDGASWRIDS GLLRMQAAGN SRLLTTAAAG NSYTAEFNLH
LADSSNIGDS RLGIVFSYKG DNDFGTALLD PGSNSLQTQF RVNGVDLGWV ASSLPDGYDY
TKLHNIRVEK AGDTFKIYVD GMLKQTRSVM LGGGSIGYVT ENAKANFGYV AFSNDVNGSS
DFDAAKPVPG QVEAVHYLSG GEGVGYHDQT PGTIGEYRGD GVDTGKANDG TSHVTAFEKG
EWLQYRLNIA ETGKYDIDLS YSDAALGSKV QLTLDGGQPL GEFELAEDTD AENAGWQTAQ
LNGVELPAGV HILKVAAVSG TISLDKLTFH RHVDVPVLYD NFDDGQDDGW QRYEGFWSVK
SDIAEEEQAY DAYHPIPGAI GTVHYISGGE GVGYHDNTLQ NIGGQYRGDS VDIRSNPVSG
GYNVGWNQAG EWLKFNIDVA EAGTYNAEFK VATTLDGGQI RLWLDDTTDL TGVVDIPKTG
DWNIWNVVRK SGVTLPAGKH TLKAETVRGE FDFAGIALTR TNQPIPVPGI IEAEHYKPGG
EGVGYHDLTP ANIGGKYRSD SVDIRSAASG DQIVGWNQTG EWLKYNIHVA EDGMYDLDLW
AATTFTDAQV RLWLDDTTDL TGVLGVPASG DWNKWKSVKK SGILLPAGDH ELKVEIVKGE
FDMTRLSLTQ FDKPHVLPGF IQAVDYKTGG EGTGYHDLTP ANIGSEYRND AVDIRRNAGG
SYNVGWNATG EWLAYSVDIA KAGVYNVDFV TATSMDGAKV RLWLDDATDL TGEVDVPNSV
TWENWATTTK QGVYLPAGTH TLKVETVKGE FDFQGIRIHQ NPVEPKVAHT GEYQAAVGSF
AKTMIGDANW SDYSVESDIK LISNSGDGGI LFRASNPAHG TELNQNNADF VQGYVAYLNK
QGVHLGKLNY SWTYLTGVPL EVKTGEWKHV KVVAQGARIL VYVDDMSKPV IDYTDRSGNP
FTHGKVGMRT FMNETSYDNF TVLPAPINAP AIGKLLDWFS SAGQLDEKTW KLLDKELEHA
TDAVTKNKPD WKQGIKHLEN LLKELNKEKK QQEVSDQAKT TLNRSINALI EVWSAILGK
//