ID A0A1S9CFF6_9FIRM Unreviewed; 1582 AA.
AC A0A1S9CFF6;
DT 10-MAY-2017, integrated into UniProtKB/TrEMBL.
DT 10-MAY-2017, sequence version 1.
DT 24-JAN-2024, entry version 17.
DE RecName: Full=Carbohydrate-binding domain-containing protein {ECO:0000259|Pfam:PF06452};
GN ORFNames=ATN31_10645 {ECO:0000313|EMBL:OON95647.1};
OS Epulopiscium sp. AS2M-Bin001.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Lachnospiraceae;
OC Candidatus Epulonipiscium.
OX NCBI_TaxID=1764958 {ECO:0000313|EMBL:OON95647.1, ECO:0000313|Proteomes:UP000190284};
RN [1] {ECO:0000313|EMBL:OON95647.1, ECO:0000313|Proteomes:UP000190284}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=AS2M-Bin001 {ECO:0000313|EMBL:OON95647.1};
RA Seilhamer J.J.;
RL Submitted (AUG-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OON95647.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LNZM01000061; OON95647.1; -; Genomic_DNA.
DR STRING; 1764958.ATN31_10645; -.
DR Proteomes; UP000190284; Unassembled WGS sequence.
DR GO; GO:0030246; F:carbohydrate binding; IEA:InterPro.
DR GO; GO:0004553; F:hydrolase activity, hydrolyzing O-glycosyl compounds; IEA:InterPro.
DR GO; GO:0016052; P:carbohydrate catabolic process; IEA:InterPro.
DR Gene3D; 2.60.40.1190; -; 6.
DR Gene3D; 2.60.120.430; Galactose-binding lectin; 1.
DR InterPro; IPR010502; Carb-bd_dom_fam9.
DR Pfam; PF06452; CBM9_1; 6.
DR SUPFAM; SSF49344; CBD9-like; 6.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000190284};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..23
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 24..1582
FT /note="Carbohydrate-binding domain-containing protein"
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5012684545"
FT DOMAIN 45..220
FT /note="Carbohydrate-binding"
FT /evidence="ECO:0000259|Pfam:PF06452"
FT DOMAIN 274..439
FT /note="Carbohydrate-binding"
FT /evidence="ECO:0000259|Pfam:PF06452"
FT DOMAIN 489..664
FT /note="Carbohydrate-binding"
FT /evidence="ECO:0000259|Pfam:PF06452"
FT DOMAIN 721..895
FT /note="Carbohydrate-binding"
FT /evidence="ECO:0000259|Pfam:PF06452"
FT DOMAIN 950..1124
FT /note="Carbohydrate-binding"
FT /evidence="ECO:0000259|Pfam:PF06452"
FT DOMAIN 1194..1369
FT /note="Carbohydrate-binding"
FT /evidence="ECO:0000259|Pfam:PF06452"
SQ SEQUENCE 1582 AA; 176506 MW; EF79C4DC4C7DD3E0 CRC64;
MKKHLSALLI AGAISLSATN LFAEEAHPVK MSTTVFGSPD ILESSIDPIW DEAEPINTFR
AKKGEITDET SFPIVKTMWD KNYLYILAEI EDSTIFKNTD PKALHESDCT EYYFNPLKDR
ENNTYDDSEF WIKIYPDGTL ESHQNAPEGI EATAFLTKVG YVTQVKVPHT NYNAMGEITV
GFDLQINNAS KETGKRDLIL GWNDMINVAY KDPSVVGELL FEPEPNPIIA RPTMEYETLI
PAVNQNVKVT PDGHIIKTIN AIYGNPLIES DSSTVDPIWN TVVAATDFHV KRGDKVTPSS
IKTMWNEEYF YVLAEVTDIE IFKHETMIHE SDNTELHIDL DLSRAATYDE GDFWLKIFPD
GFAQNQGEIP EGTITNAIIT ETGYITQYKI PYAAIAGERI GFDIQINDAD KATGQRHNSS
GWNDTTPDTW KTLANVGELY FMPPAQNTAV KYIPIVSTSV AELSTGEEHP VKNIEVGYGS
PADFTSVANL DPAWENATRI SDFHSKKGET TESSSVDILW DEEYIYLLGI IEDDEIYKNI
NPTALHESDY LEVYFNPLID RSNGKYDQEE FWIKIHPDGT LEKHANAPEG IVNFAEVTET
GYVVGAKVPH SFYDASSGTT IGFDLQIGDA NQITMKRDSI VGWNDTINSA WKNPDVVGTL
TLLEKGAIPK NAEIQAIAAN VKYVPIVTTT ETKISTGEDL VVKNLEIKYG YPAEFTSIEE
LDPAWADSTL MNDFFDKKET DVESKPATLN FLWSEDYLHV LAQVEDDEIY KNPEKLYESD
YVEIYFNPLV DRSNGTYDKE EFWIKIHPDG LLETHANAPE GIINHGQITE NGYVVEARIP
HSFYEAKSGT TIGLDLQIGN ASAATTVRDT VIGWNDTIGT AWKNPDVVGT LTLLGEGEIP
KNAVIQMPVT NYRPIVTTTE TKISTGEDLV VKNLEIKYGS PAEFTSIENL DPIWNSSTLM
NDFFNKKDIN VVAKPATLNF LWDEKYLHVL AQVEDDEIYK NPEKLYESDY VEMYFNPLVD
RSNGTYDKEE FWIKIYPDGL LETHANAPDG IINHGQVTEN GYVVTASIPH TFYEAKSGTT
LGLDLQIGNA SSATMDRDTV IGWNDTIGTA WKNPDVVGTL TLLGEGEIPK NAVAIQGDQE
IIISTPREPI GPIVIPEPVV TVLDRQISSG QVIPVKHTTV VYGRPKITES STSLDPIWET
ATVLNELRAK RGAINENTDM AEVRLLWDED YLYVAGIIED SEIFRKDESP GDSDNLELFF
NPLVDRSNGK YDSEEYWLKI YPNGEIENHA NLPEGVVASA YLTKTGYVAQ AKIPHSEYRA
EAGTTIGFDF QVNNGSKDIN TRHTILGWND PLNAAYANPD VLGELTLMGR RGEFGNYVPP
ITVEKEIIPL TFLDPSLLIE ILPDPIAYSE MPEFNFDENE TKNWQVNGTE NTKSSIVTVN
GDRAVRIEMN KRQTDGVGIS TLMLAPEQNA WSLGSDSNSI SAKIINPSQQ PIQVRMKVTD
YFGNEKMNYI TVNPSDIEMM EAELGEAGVF DSSWGQYEGH SGKGIDKTQI TKIEFFIPED
MLDVMPGIES AEFIIDAVKA TK
//