ID A0A373LPB6_9FIRM Unreviewed; 2453 AA.
AC A0A373LPB6;
DT 07-NOV-2018, integrated into UniProtKB/TrEMBL.
DT 07-NOV-2018, sequence version 1.
DT 27-MAR-2024, entry version 26.
DE RecName: Full=Carbohydrate-binding protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=DW006_05245 {ECO:0000313|EMBL:RGF51538.1};
OS Eubacterium sp. AF36-5BH.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Eubacteriaceae;
OC Eubacterium.
OX NCBI_TaxID=2293108 {ECO:0000313|EMBL:RGF51538.1, ECO:0000313|Proteomes:UP000262191};
RN [1] {ECO:0000313|EMBL:RGF51538.1, ECO:0000313|Proteomes:UP000262191}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=AF36-5BH {ECO:0000313|EMBL:RGF51538.1,
RC ECO:0000313|Proteomes:UP000262191};
RA Zou Y., Xue W., Luo G.;
RT "A genome reference for cultivated species of the human gut microbiota.";
RL Submitted (AUG-2018) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:RGF51538.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; QTVG01000004; RGF51538.1; -; Genomic_DNA.
DR OrthoDB; 9816455at2; -.
DR Proteomes; UP000262191; Unassembled WGS sequence.
DR GO; GO:0030246; F:carbohydrate binding; IEA:InterPro.
DR GO; GO:0016798; F:hydrolase activity, acting on glycosyl bonds; IEA:UniProtKB-KW.
DR GO; GO:0008152; P:metabolic process; IEA:UniProtKB-KW.
DR CDD; cd04084; CBM6_xylanase-like; 2.
DR Gene3D; 2.60.120.260; Galactose-binding domain-like; 8.
DR InterPro; IPR006584; Cellulose-bd_IV.
DR InterPro; IPR005084; CMB_fam6.
DR InterPro; IPR000421; FA58C.
DR InterPro; IPR008979; Galactose-bd-like_sf.
DR Pfam; PF03422; CBM_6; 3.
DR Pfam; PF00754; F5_F8_type_C; 4.
DR SMART; SM00606; CBD_IV; 3.
DR SUPFAM; SSF49785; Galactose-binding domain-like; 7.
DR PROSITE; PS51175; CBM6; 3.
DR PROSITE; PS50022; FA58C_3; 4.
PE 4: Predicted;
KW Glycosidase {ECO:0000256|ARBA:ARBA00023295};
KW Hydrolase {ECO:0000256|ARBA:ARBA00023295};
KW Reference proteome {ECO:0000313|Proteomes:UP000262191}.
FT DOMAIN 50..198
FT /note="F5/8 type C"
FT /evidence="ECO:0000259|PROSITE:PS50022"
FT DOMAIN 229..389
FT /note="F5/8 type C"
FT /evidence="ECO:0000259|PROSITE:PS50022"
FT DOMAIN 413..540
FT /note="CBM6"
FT /evidence="ECO:0000259|PROSITE:PS51175"
FT DOMAIN 723..854
FT /note="CBM6"
FT /evidence="ECO:0000259|PROSITE:PS51175"
FT DOMAIN 865..1009
FT /note="CBM6"
FT /evidence="ECO:0000259|PROSITE:PS51175"
FT DOMAIN 1005..1094
FT /note="F5/8 type C"
FT /evidence="ECO:0000259|PROSITE:PS50022"
FT DOMAIN 1146..1304
FT /note="F5/8 type C"
FT /evidence="ECO:0000259|PROSITE:PS50022"
FT REGION 1168..1187
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1168..1182
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2453 AA; 275403 MW; 227B7B58C102D4E3 CRC64;
MKGLKGLFWK KGVAVILSVA MVLTGVTITP QNVKADAAIT ATNDVTAVIT DDSKTGYSSE
KDGDLYNYVR YGASAVADDK TGDHVANNAI DGNADTRWAN EAKTKGHYIT IDLKQSYEIS
KINISWEVAS SIDYKIEVSN DGVLFKSVTA VSLNNYTTAK NRIDTLSLKN TVIGRYVRIT
DVGEKYITDS SNNRQYGISI WDVGIFGKDV KATRESAVEA TNSFKKTTEG ETFQDIIDSK
YYNYTRYNGV SATASGQETN DRTPKAAIDN DKSTRWSATN GDSTYYVVDL GETYNAEKLY
LSWESANATV YNIYKSVNGT DYSLVTTVNA IGIYENTATN GNRVDNIEFG SVSARYIKVQ
AVQRCYKGAN YGGGQYNGMS LYEVGVYGKD MEKSYEKKAF NQFKVGNKET IGNVIEAENT
DEIDSKIKKE NGENASGQNL GGITNNTWAE YNINFDRKTS RIYLRYSVKE GNGGTVKVYV
DDSTMSTTPV ASIDVNSTGG WSNYTDISQE VVVPSGNHKI YLKFVTDKGC VCNLDYFKFE
FAPESISNTA DLHQAENAHA FVRGSAAQSS TVQSSDKYSG GKAVGSMNTW IERDRSYLTT
YVKAENAGYY QFEVQYAGTM TTLLQYRINS SDNDKWKSKT ISSVDSNWEN TNRATMQIEL
RKGLNKIDIS GAVWKWAAGT ETGRTDGKNV NAEWLNIDSF SLTYKGENKK AFNSVLNTDL
KGNKIQAEDF NESSGDIKVE GESDSFVDGK NLGGLTNGKW AEYNINFDRK VSKIYLRNSV
RDGNGGKVEV YVDDSTMSGT PAATVTTSST GSDWSNYVDT ATAISIPSGN HEIYLKFVAD
GSKAVCNLDW FQFEYEPETV KDSGDIHEAE NAHGFVQGDA ETEHTLQADG KFSNNLAVGG
MNAWPDNGRA YLTSYVHVKH PGTYKLTVAY ASGSNKDTNI DCRVNSVNDG DWKSISAPTT
GGWTTVKKIT TEVTLNRGVN VIDITGAANI PYAESDAWQQ VNVDYFTLER VPEDGDLAYG
KKVDVSGSQS GFDGSNAVDE DEDTRWASEK EGDQAYLIVD LEKLYEIEKV NIMFEKAYPN
DFQILVSKDK TNWTVARTVR GFKTTEVDKK VYESDGVCLG KARYVKVRCL NMAYNKAMSI
RDIRIYGTKI KGQLSDLALN AEVKVSSSDS SAETSNPANA VDGLDNTRWG APKTDSNPWY
EINLGQQCRI DSVDLKFERA YPKSFRIQIS DDGKTWKDYK TIKDWTEPGN STEIEKSEYY
ANLEFGNSIH MGDVKTQYIR LYADAKIRDN NWGVSIYEFE VWGTEVAKKD YWSNQAKKTY
GIYLVSKLQN TENNGLIDSS LVQGDVIGNG DIYDVVYEEG KDIYFYVNPR ELYYKQADHQ
ICWSSSDSGK NLWGATSHNE NILKYGTQQE ATVKYTIPAG IDFGNSDYVE TEVGCQIYNK
SDLEKENASP KFELVFKVRI WKSTIVIQDK LSENGELCVK NPENGATYQW QKSVDGNTWS
DVYEKRYDLQ ILSGSGNDVV NVAHDLGGGQ YYRVRKVGTT QWSHPYKVQY YNDIQNGDFE
YPAMFSTDED DKAFPFEPNG DEQQYPNGYE GLVWKTTAPG WMSRQKNKIG HDIEIVNGRK
LKTSGEEEQV GQFSVTQDEM YKNNEHGDQF AELNCENVGA LYQDILTTPN SQCYWDLDYA
GRWCQNSMYV VAMSSKDAQN YTTTEQIEAL IRRDDVKAIT TNQEGTKGTT ITLSDGVTAT
LWKVTSKKTA GQWNDHNGIY NVPSGKKNYL TRFFFVSAEG AKRNDGDTPD KTVGSLLDNV
KFQQKKDYVI EYYVNGVKND GLTVKGTVNP YDRVSIPVPK AVSNYTLYDA KISGENDTES
KAFYVDDKDR NMTVAYNHNV LKLYYKSGIV VANVKIQGLE ELPEGYSVVV NLKNTSGTVL
QTHTMSQNEF TKIEKPGGGG VEGYFDTVTF DGTNLTVGDT YTVEEIIVKN NYLPYYLETV
DKNGTKEDVG IDKINTDKLS YTATFTYSVG TDNSVNFINI YNPIRKITIS KNVSGNMVDE
NEKNKSFNFS IEIKKDGKAI DAAKIIDNKL KNTATGQYTF ALKHSEKVEL YVYNNCKVTV
TEDDYSKKPN VNTTIFSDDY YVTSWEKDSK VTDGRIISID SVNSDQELVC NNVYNDLGDV
EVQGFQMNGN KDSGGVSEFS PSFRVVCRVS RNTIKTKKVS KFGVIYAMPK ATDGKTRAEL
EEMMKIDKAD NKTIKVHEET KDGIYPNWTT KSDSQYSTKY WQYYALTFKC LDYMFYTLQE
NITVRAYAKM ADGTYEYGNN IYTVNMYEIA DYLYKNQKMS TLKGHNFLYN NVLNLVTINN
NKLDILKSMM KALDVTSTSN QYYNILNKLY KDMNNYVYCQ QEYKGKYQER EEFVPKTLTS
EEQKKLLSAL NGKTGTSYSD INDWIYNQTE KQGKYKGYYK KVQYAWDNGI YVK
//