ID A0A3S0CY60_9BACL Unreviewed; 1406 AA.
AC A0A3S0CY60;
DT 10-APR-2019, integrated into UniProtKB/TrEMBL.
DT 10-APR-2019, sequence version 1.
DT 27-MAR-2024, entry version 13.
DE SubName: Full=DUF1080 domain-containing protein {ECO:0000313|EMBL:RTE11494.1};
GN ORFNames=EJQ19_01480 {ECO:0000313|EMBL:RTE11494.1};
OS Paenibacillus whitsoniae.
OC Bacteria; Bacillota; Bacilli; Bacillales; Paenibacillaceae; Paenibacillus.
OX NCBI_TaxID=2496558 {ECO:0000313|EMBL:RTE11494.1, ECO:0000313|Proteomes:UP000276128};
RN [1] {ECO:0000313|EMBL:RTE11494.1, ECO:0000313|Proteomes:UP000276128}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=MER 54 {ECO:0000313|EMBL:RTE11494.1,
RC ECO:0000313|Proteomes:UP000276128};
RA Seuylemezian A., Vaishampayan P.;
RT "Bacillus ochoae sp. nov., Paenibacillus whitsoniae sp. nov., Paenibacillus
RT spiritus sp. nov. Isolated from the Mars Exploration Rover during
RT spacecraft assembly.";
RL Submitted (DEC-2018) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:RTE11494.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; RXHU01000007; RTE11494.1; -; Genomic_DNA.
DR OrthoDB; 9758662at2; -.
DR Proteomes; UP000276128; Unassembled WGS sequence.
DR GO; GO:0008810; F:cellulase activity; IEA:InterPro.
DR GO; GO:0000272; P:polysaccharide catabolic process; IEA:UniProtKB-KW.
DR CDD; cd02850; E_set_Cellulase_N; 1.
DR Gene3D; 1.50.10.10; -; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 1.
DR InterPro; IPR010496; 3-keto-disaccharide_hydrolase.
DR InterPro; IPR008928; 6-hairpin_glycosidase_sf.
DR InterPro; IPR012341; 6hp_glycosidase-like_sf.
DR InterPro; IPR004197; Cellulase_Ig-like.
DR InterPro; IPR001701; Glyco_hydro_9.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR014756; Ig_E-set.
DR Pfam; PF06439; 3keto-disac_hyd; 1.
DR Pfam; PF02927; CelD_N; 1.
DR Pfam; PF00759; Glyco_hydro_9; 1.
DR SUPFAM; SSF81296; E set domains; 1.
DR SUPFAM; SSF48208; Six-hairpin glycosidases; 1.
PE 4: Predicted;
KW Carbohydrate metabolism {ECO:0000256|ARBA:ARBA00023277};
KW Polysaccharide degradation {ECO:0000256|ARBA:ARBA00023326};
KW Reference proteome {ECO:0000313|Proteomes:UP000276128};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..37
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 38..1406
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5018756082"
FT DOMAIN 313..391
FT /note="Cellulase Ig-like"
FT /evidence="ECO:0000259|Pfam:PF02927"
FT DOMAIN 434..853
FT /note="Glycoside hydrolase family 9"
FT /evidence="ECO:0000259|Pfam:PF00759"
FT DOMAIN 1238..1403
FT /note="3-keto-disaccharide hydrolase"
FT /evidence="ECO:0000259|Pfam:PF06439"
SQ SEQUENCE 1406 AA; 151382 MW; C9CDE37632A98380 CRC64;
MSSTTRTKGR ARKFWCTCLG VLLVLNTLLL APSLSSAAVT PLPGPGNPLL YDDFSGGGIY
KQNWMNWYNQ AGGTGTFSKT TVDSRQVGLF TQTPASSSSW AKFQPMNETV DLTGYRYLNV
TMKNPGYPNS QIRVAIGDGT TTYNLTGGWV AVPTTWTTSQ FDLDALTPAL NKKTAKLEIW
LRQSGGTYGE TLIDDIFAST ASSGTAPTLT GSMTANSAGG YNQNTSFTFK ATYADADNQK
PFALQLVIDD TVYDMREDDN NDITYTDGKT YTYITKLPVG THTYYFRTTD LTSNQVSTAV
QSGLVVANTT QAIDVNVSQA GYNTGAFKNA KVTAQLPLTD LSYVVKNGAS VVASGNMTYE
GLFWNKHVYS IDFSSITASG TDYTVVTNGI SSYPFAIQAN IWSNYKDEMT AFYRILRASV
ATSDAYPAGY SSVAPSAAIF HGAGHLDDAQ SADSTTHYDL TGSWYDAGDY GKYAGNQWVG
TEIALAYIRN ADAVSSKFDN DNNGIPDLVD EAVFGSEYLI KFANQLGGAM YDLKNNASFV
HPEKSTDNIN GTADDRKLSG LGVGGSAKAA GTLAATARAI HTAIAKGDIA PAKVATLTTF
ANNCEAAAVT FYNYVVANPD GPVGSYATRG GIPNSKLLAD VELYLLTNNN AYKNAATANI
NTLTFNDLAS TNYWDMRPMS LAEFYPVADS TTQSHIQQLL KEQVDFFLTS TDDTPYSVLN
QFKNFGVNEP HVSYLGDLMR YYELFHDPAA LRAVEKGIYW VFGENPWNIS WVSGIGSDFV
TYPHTRLDET ANTSTGTGVV FPGAMVSGPN MKDTKDKLSV SPWYVDRSLY SDDTNQWRYN
EFSISIQAGL LYTIMGLSAN ASASSAGGTT PAALPITSPT IGDYVRGNVT VFSGSAAGLT
NLEYSTTGTT GTFVPMSVSG AVYAATINET ASPAYANKRV DIRGTDAAGR QSYSSTHYTV
AAPLPDPSTP LLYDDFGGGG LWGSVGGSGE WVNWYTQNGG TASFERTTLD GLTVGTFKQT
PTATNSYAKF QPWHDVVDLS GYRYLNFKVK NPNYANLRMK IELNDGSRTY NLTGGWVAVP
TTWTTLSYNL DALTPVVNKK KATLAIWLNQ TSVGYGEMFM DEIQASNVAS GTAPTLSGIS
VDHASGDVET AFTFNTTYTD ADNEKPFSME LILDGVVHQM EAINPADNTY SDGKAYQFTT
KLPVGVHSYY FHTTDTTSNA VSSAVQSGPV VSQELFSSDF NNGTAAGWTP TSGTWSVQSG
QYSGQAGSSN SYSIAGDASW TDYTLEAKVN VTNNINGNKD AGLLVRYTDA DNYYLLLLKN
NDRSGRKMEL IKSVAGVKTS LAFTNPSIAA DTFYTYKIVL GGSHISVYQD GVLQFSVVDT
SLANGKIGAR TYANTKAYFD DVVVTR
//