ID A0A011W0Y0_RUMAL Unreviewed; 775 AA.
AC A0A011W0Y0;
DT 11-JUN-2014, integrated into UniProtKB/TrEMBL.
DT 11-JUN-2014, sequence version 1.
DT 27-MAR-2024, entry version 33.
DE SubName: Full=Glycosyl hydrolase family 43 {ECO:0000313|EMBL:EXM40453.1};
GN ORFNames=RASY3_00735 {ECO:0000313|EMBL:EXM40453.1};
OS Ruminococcus albus SY3.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Oscillospiraceae;
OC Ruminococcus.
OX NCBI_TaxID=1341156 {ECO:0000313|EMBL:EXM40453.1, ECO:0000313|Proteomes:UP000021369};
RN [1] {ECO:0000313|EMBL:EXM40453.1, ECO:0000313|Proteomes:UP000021369}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=SY3 {ECO:0000313|EMBL:EXM40453.1,
RC ECO:0000313|Proteomes:UP000021369};
RA Dassa B., Borovok I., Lamed R., Flint H., Yeoman C.J., White B.,
RA Bayer E.A.;
RT "Rumen cellulosomics: divergent fiber-degrading strategies revealed by
RT comparative genome-wide analysis of six Ruminococcal strains.";
RL Submitted (JUN-2013) to the EMBL/GenBank/DDBJ databases.
CC -!- SIMILARITY: Belongs to the glycosyl hydrolase 43 family.
CC {ECO:0000256|ARBA:ARBA00009865}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EXM40453.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JEOB01000001; EXM40453.1; -; Genomic_DNA.
DR RefSeq; WP_051506264.1; NZ_JEOB01000001.1.
DR AlphaFoldDB; A0A011W0Y0; -.
DR PATRIC; fig|1341156.4.peg.684; -.
DR OrthoDB; 9801455at2; -.
DR Proteomes; UP000021369; Unassembled WGS sequence.
DR GO; GO:0030246; F:carbohydrate binding; IEA:InterPro.
DR GO; GO:0004553; F:hydrolase activity, hydrolyzing O-glycosyl compounds; IEA:InterPro.
DR GO; GO:0000272; P:polysaccharide catabolic process; IEA:InterPro.
DR CDD; cd04084; CBM6_xylanase-like; 1.
DR CDD; cd14256; Dockerin_I; 1.
DR CDD; cd09003; GH43_XynD-like; 1.
DR Gene3D; 1.10.1330.10; Dockerin domain; 1.
DR Gene3D; 2.60.120.260; Galactose-binding domain-like; 2.
DR InterPro; IPR006584; Cellulose-bd_IV.
DR InterPro; IPR003305; CenC_carb-bd.
DR InterPro; IPR002105; Dockerin_1_rpt.
DR InterPro; IPR016134; Dockerin_dom.
DR InterPro; IPR036439; Dockerin_dom_sf.
DR InterPro; IPR008979; Galactose-bd-like_sf.
DR InterPro; IPR006710; Glyco_hydro_43.
DR InterPro; IPR023296; Glyco_hydro_beta-prop_sf.
DR PANTHER; PTHR43772:SF2; BETA-1,4-XYLOSIDASE (EUROFUNG); 1.
DR PANTHER; PTHR43772; ENDO-1,4-BETA-XYLANASE; 1.
DR Pfam; PF02018; CBM_4_9; 1.
DR Pfam; PF00404; Dockerin_1; 1.
DR Pfam; PF04616; Glyco_hydro_43; 1.
DR SMART; SM00606; CBD_IV; 1.
DR SUPFAM; SSF75005; Arabinanase/levansucrase/invertase; 1.
DR SUPFAM; SSF49785; Galactose-binding domain-like; 2.
DR SUPFAM; SSF63446; Type I dockerin domain; 1.
DR PROSITE; PS51766; DOCKERIN; 1.
PE 3: Inferred from homology;
KW Carbohydrate metabolism {ECO:0000256|ARBA:ARBA00023277};
KW Glycosidase {ECO:0000256|ARBA:ARBA00023295};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801, ECO:0000313|EMBL:EXM40453.1};
KW Polysaccharide degradation {ECO:0000256|ARBA:ARBA00022651};
KW Reference proteome {ECO:0000313|Proteomes:UP000021369};
KW Signal {ECO:0000256|SAM:SignalP};
KW Xylan degradation {ECO:0000256|ARBA:ARBA00022651}.
FT SIGNAL 1..20
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 21..775
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5039100946"
FT DOMAIN 212..279
FT /note="Dockerin"
FT /evidence="ECO:0000259|PROSITE:PS51766"
SQ SEQUENCE 775 AA; 83689 MW; 6481E5AC6BF7AC25 CRC64;
MKKHKKILAV LMALSCFSAA VPQYTVPVTT QAAVYDQPNN YGWYFNFGFE GTTDGFSARG
SASIASSSDT AFVGSSSLYV SNRTSAWQGA SYKLGSNFKA GTEYSFSAIV SYLDGYDTDT
FHLTLQYDGS DGTTYYDKIA TETVTKGQWT QLANTNYKLP SGASNMQIYV ETDSSTTSFY
IDEVIGAVAG TGILGPEGGS SSSGSSGKTG KGGMLLGDIN FDGDIDAMDI VLSERGMLRG
FDDKNAEKAA DVDQSGVFEN EDVDLLKQFI TNRITEFPVA AKAVDVSAME AKFKNINIRK
SYKYDGENNP ISTTRFGADP GWMVYDGRLY IYTTNDAFET RSDGQYQENT YNSGTINCLS
TADMVNWTDH GAIPIAARNG RTQNGCCSWA SAAWAPDACW KMINGKPKFF LYFANSGGGI
GVVTADSPTG PWSDPLGHAL LSHNSPNCSN VEWMFDPGVY YDENTNEAYL FFGGGRKNGV
PAATPGTGRV VKLGNDMISL AGTPVSMNIP YLFEDSSVIK IGDTWYYSYC SNWNVGNQTI
NGVNFGNADI LYMTSKTPLD ANSWKLSGNV FKNTGSQRID NGGNNHHSII YFKGKYYVAY
HSRQYAIRKI KAENIKVYSS NGQLSADGNY RSTQINECTF NNGKLSCSGD MKGVSQIEWL
NPYTTVQAET MSNQSDDVQV NGLRNTTVSM KSGSWLKVSG VNFSKGVNTI TIKGSGSGTV
IKVCTGSPSG DVIGYVELNG SMSENTVAAI NSVSGQKDLY FVASGNATLD SWSIG
//