ID A0A2W1LDK6_9BACL Unreviewed; 998 AA.
AC A0A2W1LDK6;
DT 12-SEP-2018, integrated into UniProtKB/TrEMBL.
DT 12-SEP-2018, sequence version 1.
DT 24-JAN-2024, entry version 16.
DE SubName: Full=Cellulose 1,4-beta-cellobiosidase {ECO:0000313|EMBL:PZD96160.1};
GN ORFNames=DNH61_09645 {ECO:0000313|EMBL:PZD96160.1};
OS Paenibacillus sambharensis.
OC Bacteria; Bacillota; Bacilli; Bacillales; Paenibacillaceae; Paenibacillus.
OX NCBI_TaxID=1803190 {ECO:0000313|EMBL:PZD96160.1, ECO:0000313|Proteomes:UP000249522};
RN [1] {ECO:0000313|EMBL:PZD96160.1, ECO:0000313|Proteomes:UP000249522}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=SMB1 {ECO:0000313|EMBL:PZD96160.1,
RC ECO:0000313|Proteomes:UP000249522};
RA Pinnaka A.K., Singh H., Kaur M.;
RT "Paenibacillus imtechensis sp. nov.";
RL Submitted (JUN-2018) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PZD96160.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; QKRB01000042; PZD96160.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A2W1LDK6; -.
DR OrthoDB; 33861at2; -.
DR Proteomes; UP000249522; Unassembled WGS sequence.
DR GO; GO:0008810; F:cellulase activity; IEA:InterPro.
DR GO; GO:0030248; F:cellulose binding; IEA:InterPro.
DR GO; GO:0030245; P:cellulose catabolic process; IEA:InterPro.
DR CDD; cd00063; FN3; 1.
DR Gene3D; 1.50.10.10; -; 3.
DR Gene3D; 2.60.40.710; Endoglucanase-like; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 1.
DR InterPro; IPR008928; 6-hairpin_glycosidase_sf.
DR InterPro; IPR012341; 6hp_glycosidase-like_sf.
DR InterPro; IPR008965; CBM2/CBM3_carb-bd_dom_sf.
DR InterPro; IPR001956; CBM3.
DR InterPro; IPR036966; CBM3_sf.
DR InterPro; IPR003961; FN3_dom.
DR InterPro; IPR036116; FN3_sf.
DR InterPro; IPR000556; Glyco_hydro_48F.
DR InterPro; IPR013783; Ig-like_fold.
DR Pfam; PF00942; CBM_3; 1.
DR Pfam; PF02011; Glyco_hydro_48; 2.
DR PRINTS; PR00844; GLHYDRLASE48.
DR SMART; SM01067; CBM_3; 1.
DR SMART; SM00060; FN3; 1.
DR SUPFAM; SSF49384; Carbohydrate-binding domain; 1.
DR SUPFAM; SSF49265; Fibronectin type III; 1.
DR SUPFAM; SSF48208; Six-hairpin glycosidases; 1.
DR PROSITE; PS51172; CBM3; 1.
DR PROSITE; PS50853; FN3; 1.
PE 4: Predicted;
KW Reference proteome {ECO:0000313|Proteomes:UP000249522};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..26
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 27..998
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5038595994"
FT DOMAIN 749..840
FT /note="Fibronectin type-III"
FT /evidence="ECO:0000259|PROSITE:PS50853"
FT DOMAIN 845..998
FT /note="CBM3"
FT /evidence="ECO:0000259|PROSITE:PS51172"
FT REGION 133..156
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 998 AA; 109600 MW; 4DBD58633C88CF76 CRC64;
MIASSKRKII ALMLTLTMLL SLLTSAAAGA ERVQSAAALD NVHQERFLQM YEQIKDPASG
YFSPEGIPYH AIETLMSEAP DYGHMTTSEA YSYWFWLEAL YGYYTGDWTH LEAAWDNMEK
FIIPSAKEQP TMGYYNPSSP ATSAGEKPQP DMYPSQIDGK YPAGRDPLDA ELKAAYGNND
TYLMHWLVDV DNFYGFGNTL NPSHTATYVN TFQRGEQESV WEAVPHPSQD DHSFGKPGEG
FMTLFTKETA APAKQWRYTN ATDAEGRAVQ VMYWAKEMGY NNPVYLDKAK KMGDYLRYGM
FDKYFMKIGS SANGTPAPGT GKDAAHYLMS WYTAWGGGLG DNGTGNWAWR IGASHVHHAY
QNVVAAYALS QPEGGLIPKS ATAQQDWNTS LKRQLEFYTW LQSSEGAIGG GATNSWDGDY
KTYPAGVSRF YDMAYDEDPV YHDPPSNTWF GFQTWPLERV AELYYIMASS GDTSSENFKM
AKQVIENWID WSIDYVFVNE KPVTDADGYY LDANGSRILG GQDPAVATVS QPGQFYVPGS
QEWQGQPDTW KGYSSFTGNP GFHVVTKDPS QDVGVLGNYI KALTFYAAGT KAENGDFSAL
GSQAKTTAEE LLDVAWGHND GIGITLPESR ADYYRYFTKE VYIPNGWSGK FGQGNTIPGT
AGVPSDPAKG GNGVYISYAD LRPDIKNDPQ WSYLEQLYKS SYNPVTKKWE NGTPTFTYHR
FWSQVDMATA YAEYDRLIAS GDTGSEPTAP VVPAGLRAAA GDGKVELSWN NAAGAAEYVV
KRSEASGGPY TVIATQAGLS YTDTNVTNGK TYYYVVSAAN AQGESANSAE VTAVPAAAPE
EPEEPVGDLK LQYKAGDTNA TDNQFKPHLR IVNAGTSSVP LSDLKIRYYY TVDGDKPQTF
NCDWATVGCS NLSGSLVKMD KAVEGADYYL EIGFAAGAGS VAAGGNTGEM QIRINKNDWS
SFNEADDFSY DATKSAFADW NKVALYHNGK LVWGLEPK
//