ID W4V8Z5_9FIRM Unreviewed; 765 AA.
AC W4V8Z5;
DT 19-MAR-2014, integrated into UniProtKB/TrEMBL.
DT 19-MAR-2014, sequence version 1.
DT 27-MAR-2024, entry version 34.
DE SubName: Full=Rhamnogalacturonan lyase {ECO:0000313|EMBL:GAE89677.1};
GN ORFNames=JCM21531_3225 {ECO:0000313|EMBL:GAE89677.1};
OS Acetivibrio straminisolvens JCM 21531.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Oscillospiraceae;
OC Acetivibrio.
OX NCBI_TaxID=1294263 {ECO:0000313|EMBL:GAE89677.1, ECO:0000313|Proteomes:UP000019109};
RN [1] {ECO:0000313|EMBL:GAE89677.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=JCM 21531 {ECO:0000313|EMBL:GAE89677.1};
RA Yuki M., Oshima K., Suda W., Sakamoto M., Kitamura K., Iida T., Hattori M.,
RA Ohkuma M.;
RT "Draft Genome Sequence of Clostridium straminisolvens Strain JCM 21531T,
RT Isolated from a Cellulose-Degrading Bacterial Community.";
RL Genome Announc. 2:e00110-14(2014).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:GAE89677.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; BAVR01000042; GAE89677.1; -; Genomic_DNA.
DR AlphaFoldDB; W4V8Z5; -.
DR STRING; 1294263.JCM21531_3225; -.
DR Proteomes; UP000019109; Unassembled WGS sequence.
DR GO; GO:0030246; F:carbohydrate binding; IEA:InterPro.
DR GO; GO:0004553; F:hydrolase activity, hydrolyzing O-glycosyl compounds; IEA:InterPro.
DR GO; GO:0016829; F:lyase activity; IEA:UniProtKB-KW.
DR GO; GO:0030245; P:cellulose catabolic process; IEA:UniProtKB-KW.
DR CDD; cd04082; CBM35_pectate_lyase-like; 1.
DR CDD; cd10318; RGL11; 1.
DR Gene3D; 1.10.1330.10; Dockerin domain; 1.
DR Gene3D; 2.60.120.260; Galactose-binding domain-like; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 1.
DR InterPro; IPR005084; CMB_fam6.
DR InterPro; IPR002105; Dockerin_1_rpt.
DR InterPro; IPR016134; Dockerin_dom.
DR InterPro; IPR036439; Dockerin_dom_sf.
DR InterPro; IPR008979; Galactose-bd-like_sf.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR028994; Integrin_alpha_N.
DR InterPro; IPR041624; RGI_lyase.
DR InterPro; IPR034641; RGL11.
DR InterPro; IPR049366; RGL11_C.
DR PANTHER; PTHR43118; RHAMNOGALACTURONAN LYASE (EUROFUNG); 1.
DR PANTHER; PTHR43118:SF1; RHAMNOGALACTURONAN LYASE (EUROFUNG); 1.
DR Pfam; PF03422; CBM_6; 1.
DR Pfam; PF00404; Dockerin_1; 1.
DR Pfam; PF18370; RGI_lyase; 1.
DR Pfam; PF21348; RGL11_C; 1.
DR SUPFAM; SSF49785; Galactose-binding domain-like; 1.
DR SUPFAM; SSF69318; Integrin alpha N-terminal domain; 1.
DR SUPFAM; SSF63446; Type I dockerin domain; 1.
DR PROSITE; PS51175; CBM6; 1.
DR PROSITE; PS51766; DOCKERIN; 1.
PE 4: Predicted;
KW Carbohydrate metabolism {ECO:0000256|ARBA:ARBA00023001};
KW Cellulose degradation {ECO:0000256|ARBA:ARBA00023001};
KW Lyase {ECO:0000313|EMBL:GAE89677.1};
KW Polysaccharide degradation {ECO:0000256|ARBA:ARBA00023001};
KW Reference proteome {ECO:0000313|Proteomes:UP000019109}.
FT DOMAIN 1..34
FT /note="Dockerin"
FT /evidence="ECO:0000259|PROSITE:PS51766"
FT DOMAIN 45..169
FT /note="CBM6"
FT /evidence="ECO:0000259|PROSITE:PS51175"
FT REGION 171..190
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 176..190
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 765 AA; 83084 MW; 8983124703ACEA0C CRC64;
MQVAFDLNGD SKTDSTDLAI LKRYLLNIID KFPVGSVIES GTQKVKYQAE DATLYKAFEE
TVNAGYDGRS YVNYDNEPGG YIEWNVNVSN SGAYKLTFRY ANGTSNNRPM EIRVNSNLIA
GSLDFNPTSA WTVWNDQSIV VALNSGKNVI RATGIAAEGG PNVDYLEVAS TNETPAPTPT
PTPTPSPTPT YVPVTPPTGA RQMENLDRGV VAVKVNNGVF VSWRMLGTDP SNISFNLYRN
GTKVNSSPIT GATNYVDTAG TTSSTYTVRP IINGQELAAS KAASVWAQNY LQIPIQAPGS
AYQANDCSAG DLDGDGEYEI VLKWEPDNAK DNSQSGYTDN VYLDAYKLNG TRLWRIDLGR
NIRAGAHYTQ FMVYDLDGDG KAEVACKTAD GTRDGRGNVI GNPNADYRNS SGYILSGPEY
LTVFNGQTGA AITTVDYDPP RGNVSSWGDN YGNRVDRFLA CIAYLDGQRP SLVMCRGYYT
RSVLVAWDFR NGKLTKRWVF DGNNYSGYNG QGNHNLSVAD VDGDGKDEII YGACTIDDNG
RGLYTSSGLG HGDAMHVSDL NPNRPGLEIW SCFESSGGAA LRDAKTGQVL FRWNRSSDTG
RACAADITAS SPGAELWASG SPLFSCTGQN LGSAPSQINF VIWWDGDELR ELLDGTTISK
FGGGTLLSAS GCASNNGTKS TPCLQADLFG DWREEVIFRT TDNKYLRIYT TTAVTNRRIY
TLMHDPVYRL GIAWQNVAYN QPPHTSFFIG AGMAEPPKPN IYLVP
//