ID A0A117RXP0_9ACTN Unreviewed; 972 AA.
AC A0A117RXP0;
DT 13-APR-2016, integrated into UniProtKB/TrEMBL.
DT 13-APR-2016, sequence version 1.
DT 24-JAN-2024, entry version 23.
DE SubName: Full=Cellulose 1,4-beta-cellobiosidase {ECO:0000313|EMBL:KUO15457.1};
GN ORFNames=AQJ91_40895 {ECO:0000313|EMBL:KUO15457.1};
OS Streptomyces dysideae.
OC Bacteria; Actinomycetota; Actinomycetes; Kitasatosporales;
OC Streptomycetaceae; Streptomyces.
OX NCBI_TaxID=909626 {ECO:0000313|EMBL:KUO15457.1, ECO:0000313|Proteomes:UP000053260};
RN [1] {ECO:0000313|EMBL:KUO15457.1, ECO:0000313|Proteomes:UP000053260}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=RV15 {ECO:0000313|EMBL:KUO15457.1,
RC ECO:0000313|Proteomes:UP000053260};
RA Ruckert C., Abdelmohsen U.R., Winkler A., Hentschel U., Kalinowski J.,
RA Kampfer P., Glaeser S.;
RT "Draft genome sequence of Streptomyces sp. RV15, isolated from a marine
RT sponge.";
RL Submitted (OCT-2015) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KUO15457.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LMXB01000111; KUO15457.1; -; Genomic_DNA.
DR RefSeq; WP_067032948.1; NZ_KQ949116.1.
DR AlphaFoldDB; A0A117RXP0; -.
DR STRING; 909626.AQJ91_40895; -.
DR OrthoDB; 33861at2; -.
DR Proteomes; UP000053260; Unassembled WGS sequence.
DR GO; GO:0008810; F:cellulase activity; IEA:InterPro.
DR GO; GO:0030247; F:polysaccharide binding; IEA:UniProtKB-UniRule.
DR GO; GO:0030245; P:cellulose catabolic process; IEA:InterPro.
DR Gene3D; 1.50.10.10; -; 1.
DR Gene3D; 2.60.40.290; -; 1.
DR Gene3D; 2.170.160.10; Endo-1,4-beta-glucanase f. Domain 2; 1.
DR Gene3D; 4.10.870.10; Endo-1,4-beta-glucanase f. Domain 3; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 1.
DR InterPro; IPR008928; 6-hairpin_glycosidase_sf.
DR InterPro; IPR012341; 6hp_glycosidase-like_sf.
DR InterPro; IPR001919; CBD2.
DR InterPro; IPR008965; CBM2/CBM3_carb-bd_dom_sf.
DR InterPro; IPR012291; CBM2_carb-bd_dom_sf.
DR InterPro; IPR023309; Endo-1-4-beta-glucanase_dom2.
DR InterPro; IPR027390; Endoglucanase_F_dom3.
DR InterPro; IPR000556; Glyco_hydro_48F.
DR InterPro; IPR013783; Ig-like_fold.
DR Pfam; PF17957; Big_7; 1.
DR Pfam; PF00553; CBM_2; 1.
DR Pfam; PF02011; Glyco_hydro_48; 1.
DR PRINTS; PR00844; GLHYDRLASE48.
DR SMART; SM00637; CBD_II; 1.
DR SUPFAM; SSF49384; Carbohydrate-binding domain; 1.
DR SUPFAM; SSF48208; Six-hairpin glycosidases; 1.
DR PROSITE; PS51173; CBM2; 1.
PE 4: Predicted;
KW Carbohydrate metabolism {ECO:0000256|ARBA:ARBA00023277};
KW Glycosidase {ECO:0000256|ARBA:ARBA00023295};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Reference proteome {ECO:0000313|Proteomes:UP000053260};
KW Signal {ECO:0000256|ARBA:ARBA00022729, ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..37
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 38..972
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5007155862"
FT DOMAIN 35..144
FT /note="CBM2"
FT /evidence="ECO:0000259|PROSITE:PS51173"
SQ SEQUENCE 972 AA; 103652 MW; E9D96E4CB9E789A6 CRC64;
MDPGRKRRAV RRLWTAVAAA FALPLSMLAT GTTTANAASV QCSVDYKTND WGSGFTADLT
LTNRGTTAID GWTLTYGYSG NQKLANGWNG TWSQAGQTIT VKNASHNGTV AAGDAVSAGA
QFSYSGTNTA PTDFAVNGTS CTGAHQPPIT VLPSPTAGAT YSRGDAVPLA ATAAAADGAT
ISKVEFYDDT TLLGTDTSAP YTHSASGLTV GSHSLLAKAY DSQGASAEST PVGITVATGP
AVVASTTQLA VQQGKTATYD VKLSTQPSAN VTVTTSRTGG NTGLSVTGGA SLTFTPSNWN
TAQKVTVTAD SSGTGAATFE STAAGHGKAA VTVTQIAASK GYDARFLELY GKITNPANGY
FSPEGIPYHS VETLIVEAPD HGHETTSEAY SYLLWLQARY GKITGDWTKF NGAWEIMEKY
MIPTHADQPT NSFYTASKPA TYGPEEDTPQ QYPTALDPSV PVGADPLASE LKSAYNTDDV
YGMHWIQDVD NTYGFGNTPG GKCEAGPTET GPSYMNTFQR GPQESVWETV PQPTCDAFKY
GGKNGYLDLF TGDKSYAKQW KYTTAPDADA RAVQSAYWAD LWAKQQGKGS AVSGTVAKAA
KMGDYLRYAM YDKYFKKIGN CVGPSSCPAG TGKDASHYLM SWYYAWGGST DTSGGWAWRI
GSSYAHSGYQ NPLAAYALSS HADLKPKSST GAADWSKSLQ RQMEFYRWLQ SDEGAIAGGS
TNSWAGRYAS PPAGKSTFYG MYYDQQPVYH DPPSNQWFGF QAWSMERVAE YYQQTGNAAA
KAVLDKWVKW ALSKTTINPD GSYQIPSTLQ WSGQPDTWNP SSPGANSELH VTVADYTNDV
GVAAAYAKTL TYYGAKSGDA KAKSTAKALL DGMWDHHQDS LGIAVPETRS DYSRFNDSVY
VPSGWTGKMP NGETIDSSST FSSIRSFYKN DPAWSKIEAY LKGGAAPVFT YHRFWAQADI
ALAMGSYAEL LE
//