ID A0A0Q4HGE5_9MICO Unreviewed; 804 AA.
AC A0A0Q4HGE5;
DT 20-JAN-2016, integrated into UniProtKB/TrEMBL.
DT 20-JAN-2016, sequence version 1.
DT 27-MAR-2024, entry version 29.
DE SubName: Full=Carbohydrate-binding protein {ECO:0000313|EMBL:KQM84428.1};
GN ORFNames=ASE68_03255 {ECO:0000313|EMBL:KQM84428.1};
OS Agromyces sp. Leaf222.
OC Bacteria; Actinomycetota; Actinomycetes; Micrococcales; Microbacteriaceae;
OC Agromyces.
OX NCBI_TaxID=1735688 {ECO:0000313|EMBL:KQM84428.1, ECO:0000313|Proteomes:UP000050813};
RN [1] {ECO:0000313|EMBL:KQM84428.1, ECO:0000313|Proteomes:UP000050813}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Leaf222 {ECO:0000313|EMBL:KQM84428.1,
RC ECO:0000313|Proteomes:UP000050813};
RA Gilbert D.G.;
RL Submitted (OCT-2015) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|EMBL:KQM84428.1, ECO:0000313|Proteomes:UP000050813}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Leaf222 {ECO:0000313|EMBL:KQM84428.1,
RC ECO:0000313|Proteomes:UP000050813};
RA Schulze-Lefert P.;
RT "Functional overlap of the Arabidopsis leaf and root microbiotas.";
RL Submitted (NOV-2015) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the glycosyl hydrolase 43 family.
CC {ECO:0000256|ARBA:ARBA00009865}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KQM84428.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LMKQ01000001; KQM84428.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A0Q4HGE5; -.
DR STRING; 1735688.ASE68_03255; -.
DR Proteomes; UP000050813; Unassembled WGS sequence.
DR GO; GO:0030246; F:carbohydrate binding; IEA:InterPro.
DR GO; GO:0004553; F:hydrolase activity, hydrolyzing O-glycosyl compounds; IEA:InterPro.
DR GO; GO:0005975; P:carbohydrate metabolic process; IEA:UniProtKB-KW.
DR CDD; cd04084; CBM6_xylanase-like; 1.
DR CDD; cd09003; GH43_XynD-like; 1.
DR Gene3D; 2.60.120.260; Galactose-binding domain-like; 2.
DR InterPro; IPR006584; Cellulose-bd_IV.
DR InterPro; IPR003305; CenC_carb-bd.
DR InterPro; IPR005084; CMB_fam6.
DR InterPro; IPR008979; Galactose-bd-like_sf.
DR InterPro; IPR006710; Glyco_hydro_43.
DR InterPro; IPR023296; Glyco_hydro_beta-prop_sf.
DR PANTHER; PTHR43772:SF2; BETA-1,4-XYLOSIDASE (EUROFUNG); 1.
DR PANTHER; PTHR43772; ENDO-1,4-BETA-XYLANASE; 1.
DR Pfam; PF02018; CBM_4_9; 1.
DR Pfam; PF03422; CBM_6; 1.
DR Pfam; PF04616; Glyco_hydro_43; 1.
DR SMART; SM00606; CBD_IV; 1.
DR SUPFAM; SSF75005; Arabinanase/levansucrase/invertase; 1.
DR SUPFAM; SSF49785; Galactose-binding domain-like; 2.
DR PROSITE; PS51175; CBM6; 1.
PE 3: Inferred from homology;
KW Carbohydrate metabolism {ECO:0000256|ARBA:ARBA00023277};
KW Glycosidase {ECO:0000256|ARBA:ARBA00023295};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Polysaccharide degradation {ECO:0000256|ARBA:ARBA00022651};
KW Signal {ECO:0000256|SAM:SignalP};
KW Xylan degradation {ECO:0000256|ARBA:ARBA00022651}.
FT SIGNAL 1..28
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 29..804
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5038972031"
FT DOMAIN 571..702
FT /note="CBM6"
FT /evidence="ECO:0000259|PROSITE:PS51175"
SQ SEQUENCE 804 AA; 84123 MW; EC60F5D9EAE339B2 CRC64;
MQQQRRRRVP RRTLAGLALA ALMTASFAAT TSYAAEEELI LNGGFENGTT SWFVNNGNGT
DGGTLATTTD AYSGANAASV TGRTTTGSGP MQDLTGKVSA GQTYALKARI KYENANGPAT
KQFFATMHYG GSTYTNLVNV TATKGQWAYF NGTFTIPAGQ SVSTARLFFE TPWTQTPSTA
PDTHLMDFKL DDVSVIGAPP PAAPSKTIEV VGKLPGEHNP LLGHKFGADG FGFVDNGRVY
MYMTNDTQGY APDPVTGISA GINYGSINQV TLISSTDLVN WTDHGEIQVA GPNGVAPFTG
NSWAPGMAKK VVNGVEKYFL YYANGGGSSN VITGDSPIGP WTSERTSTLI NGSTPGAQGV
SWLFDPAPFV DDDGEEYLFF GGGPASTSMP AAERFNNPKN IRVIELGDDM VSTEGTAAVV
DAPVAFEASQ VFKRGDKYYL SYSSHFGGSD FGGNQTPLAG YPGGGQIGYM MSEDPMSWPK
EAYAGVMFPN QSQFFGAGTG GNNHQSVFEY EGKYYFTYHA PTLNKRINGN TTQGYRSAHI
EELTFKEDGT VNQVVGTYAG VDQVRDFDPF RVFEAETLGW TKGIATAKVA GGSAEFGATA
PNLVVKDVDN GDWTGLSSVA FGENGAATVT AKVKPLQTGG KIQVRLGAPD GPVAGTIEVD
GAAGAWTEVT AELEGATGSQ DVYFSYTGPS GDLFEVDTIE FAEADAAPSL DVAVIAGTRC
VAGKAVVTAQ ATNNDDVAIS VSFDSSFGAK SFASVAPGKN AVHAFTTRLT NVPAASVTAH
VSASVDGAPV SVEVEAPYDA KSCG
//