ID A0A3A1UUJ2_9BACL Unreviewed; 543 AA.
AC A0A3A1UUJ2;
DT 05-DEC-2018, integrated into UniProtKB/TrEMBL.
DT 05-DEC-2018, sequence version 1.
DT 24-JAN-2024, entry version 12.
DE RecName: Full=Endoglucanase {ECO:0000256|RuleBase:RU361166};
DE EC=3.2.1.4 {ECO:0000256|RuleBase:RU361166};
GN ORFNames=D3P08_16135 {ECO:0000313|EMBL:RIX51446.1};
OS Paenibacillus nanensis.
OC Bacteria; Bacillota; Bacilli; Bacillales; Paenibacillaceae; Paenibacillus.
OX NCBI_TaxID=393251 {ECO:0000313|EMBL:RIX51446.1, ECO:0000313|Proteomes:UP000266482};
RN [1] {ECO:0000313|EMBL:RIX51446.1, ECO:0000313|Proteomes:UP000266482}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=DSM 22867 {ECO:0000313|EMBL:RIX51446.1,
RC ECO:0000313|Proteomes:UP000266482};
RA Jurado V., Gutierrez-Patricio S., Gonzalez-Pimentel J.L., Miller A.Z.,
RA Laiz L., Saiz-Jimenez C.;
RT "Paenibacillus aracenensis nov. sp. isolated from a cave in southern
RT Spain.";
RL Submitted (SEP-2018) to the EMBL/GenBank/DDBJ databases.
CC -!- CATALYTIC ACTIVITY:
CC Reaction=Endohydrolysis of (1->4)-beta-D-glucosidic linkages in
CC cellulose, lichenin and cereal beta-D-glucans.; EC=3.2.1.4;
CC Evidence={ECO:0000256|RuleBase:RU361166};
CC -!- SIMILARITY: Belongs to the glycosyl hydrolase 9 (cellulase E) family.
CC {ECO:0000256|PROSITE-ProRule:PRU10060, ECO:0000256|RuleBase:RU361166}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:RIX51446.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; QXQA01000010; RIX51446.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A3A1UUJ2; -.
DR OrthoDB; 9758662at2; -.
DR Proteomes; UP000266482; Unassembled WGS sequence.
DR GO; GO:0008810; F:cellulase activity; IEA:UniProtKB-EC.
DR GO; GO:0030245; P:cellulose catabolic process; IEA:UniProtKB-KW.
DR CDD; cd02850; E_set_Cellulase_N; 1.
DR Gene3D; 1.50.10.10; -; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 1.
DR InterPro; IPR008928; 6-hairpin_glycosidase_sf.
DR InterPro; IPR012341; 6hp_glycosidase-like_sf.
DR InterPro; IPR004197; Cellulase_Ig-like.
DR InterPro; IPR001701; Glyco_hydro_9.
DR InterPro; IPR033126; Glyco_hydro_9_Asp/Glu_AS.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR014756; Ig_E-set.
DR PANTHER; PTHR22298; ENDO-1,4-BETA-GLUCANASE; 1.
DR PANTHER; PTHR22298:SF29; ENDOGLUCANASE; 1.
DR Pfam; PF02927; CelD_N; 1.
DR Pfam; PF00759; Glyco_hydro_9; 1.
DR SUPFAM; SSF81296; E set domains; 1.
DR SUPFAM; SSF48208; Six-hairpin glycosidases; 1.
DR PROSITE; PS00698; GH9_3; 1.
PE 3: Inferred from homology;
KW Carbohydrate metabolism {ECO:0000256|ARBA:ARBA00023277,
KW ECO:0000256|PROSITE-ProRule:PRU10060};
KW Cellulose degradation {ECO:0000256|RuleBase:RU361166};
KW Glycosidase {ECO:0000256|ARBA:ARBA00023295, ECO:0000256|PROSITE-
KW ProRule:PRU10060};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801, ECO:0000256|PROSITE-
KW ProRule:PRU10060};
KW Polysaccharide degradation {ECO:0000256|ARBA:ARBA00023326,
KW ECO:0000256|PROSITE-ProRule:PRU10060};
KW Reference proteome {ECO:0000313|Proteomes:UP000266482}.
FT DOMAIN 6..82
FT /note="Cellulase Ig-like"
FT /evidence="ECO:0000259|Pfam:PF02927"
FT DOMAIN 96..537
FT /note="Glycoside hydrolase family 9"
FT /evidence="ECO:0000259|Pfam:PF00759"
FT ACT_SITE 515
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU10060"
FT ACT_SITE 524
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU10060"
SQ SEQUENCE 543 AA; 59811 MW; 8E6BD71A3045C8B0 CRC64;
MDKTRQAGIA INQLGYRSKD VKIAIRRDGS GGFSLVDAGS GETVWQGETG QAEWNEASGG
MVSRGDFSGW QQPGVYRIET DKGEISYPFV IADDVYEEAH RALLKAFYFF RCGMELEERH
AGPWTHRACH LTPAIVYGEE EKRVDAHGGW HDAGDYGKYV GPGAKAVADL LLAEEFYPQA
FKRELGIPES GGFMPDALQE CRYELEWMLR MQDGATGGVY HKLTTLQFPP GDTMPEDDTA
ELYVSPISAT ATGCFAAAMA MAARRYRPYD EAFAASCLRA AEKAWDWLEG HPGYPGFRNP
PAVSTGEYGD EQDADERYWA AAELYRTTGE SKYHEAFRQL VDAVQPFGLF ELGWANMSGY
GTIAYLFSER EQSPALAAQL REGLLRRADE LAAVCAADGY GISLQPEQYI WGSNMLVMNH
AMLLLVANRL SGSAAYERHA IKHVHYLFGA NVLGMSYVTG LGTKPILYPH HRPSEGDGVD
APVPGLVSGG PNYRLQDDYA REHLAGKAPA ASFADVMESY STNEVTIYWN SPAVFVLSHF
VGL
//