GenomeNet

Database: UniProt
Entry: S2ZE29_9ACTN
ID   S2ZE29_9ACTN            Unreviewed;       747 AA.
AC   S2ZE29;
DT   18-SEP-2013, integrated into UniProtKB/TrEMBL.
DT   18-SEP-2013, sequence version 1.
DT   22-NOV-2017, entry version 20.
DE   RecName: Full=Endoglucanase {ECO:0000256|RuleBase:RU361166};
DE            EC=3.2.1.4 {ECO:0000256|RuleBase:RU361166};
GN   ORFNames=HMPREF1211_00763 {ECO:0000313|EMBL:EPD68217.1};
OS   Streptomyces sp. HGB0020.
OC   Bacteria; Actinobacteria; Streptomycetales; Streptomycetaceae;
OC   Streptomyces.
OX   NCBI_TaxID=1078086 {ECO:0000313|EMBL:EPD68217.1, ECO:0000313|Proteomes:UP000014410};
RN   [1] {ECO:0000313|EMBL:EPD68217.1, ECO:0000313|Proteomes:UP000014410}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=HGB0020 {ECO:0000313|EMBL:EPD68217.1,
RC   ECO:0000313|Proteomes:UP000014410};
RG   The Broad Institute Genomics Platform;
RA   Earl A., Ward D., Feldgarden M., Gevers D., Schmidt T.M., Dai D.,
RA   Dover J., Kim K., Walker B., Young S., Zeng Q., Gargeya S.,
RA   Fitzgerald M., Haas B., Abouelleil A., Allen A.W., Alvarado L.,
RA   Arachchi H.M., Berlin A.M., Chapman S.B., Gainer-Dewar J.,
RA   Goldberg J., Griggs A., Gujja S., Hansen M., Howarth C., Imamovic A.,
RA   Ireland A., Larimer J., McCowan C., Murphy C., Pearson M., Poon T.W.,
RA   Priest M., Roberts A., Saif S., Shea T., Sisk P., Sykes S.,
RA   Wortman J., Nusbaum C., Birren B.;
RT   "The Genome Sequence of Streptomyces sp. HGB0020.";
RL   Submitted (APR-2013) to the EMBL/GenBank/DDBJ databases.
CC   -!- CATALYTIC ACTIVITY: Endohydrolysis of (1->4)-beta-D-glucosidic
CC       linkages in cellulose, lichenin and cereal beta-D-glucans.
CC       {ECO:0000256|RuleBase:RU361166}.
CC   -!- SIMILARITY: Belongs to the glycosyl hydrolase 9 (cellulase E)
CC       family. {ECO:0000256|RuleBase:RU361166}.
CC   -!- CAUTION: The sequence shown here is derived from an
CC       EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is
CC       preliminary data. {ECO:0000313|EMBL:EPD68217.1}.
CC   -----------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution-NoDerivs License
CC   -----------------------------------------------------------------------
DR   EMBL; AGER01000003; EPD68217.1; -; Genomic_DNA.
DR   RefSeq; WP_016430739.1; NZ_KE150426.1.
DR   EnsemblBacteria; EPD68217; EPD68217; HMPREF1211_00763.
DR   PATRIC; fig|1078086.3.peg.777; -.
DR   OrthoDB; POG091H04TS; -.
DR   Proteomes; UP000014410; Unassembled WGS sequence.
DR   GO; GO:0008810; F:cellulase activity; IEA:UniProtKB-EC.
DR   GO; GO:0030245; P:cellulose catabolic process; IEA:UniProtKB-KW.
DR   CDD; cd02850; E_set_Cellulase_N; 1.
DR   Gene3D; 2.60.120.260; -; 1.
DR   Gene3D; 2.60.40.10; -; 1.
DR   InterPro; IPR008928; 6-hairpin_glycosidase-like.
DR   InterPro; IPR004197; Cellulase_Ig-like.
DR   InterPro; IPR003305; CenC_carb-bd.
DR   InterPro; IPR008979; Galactose-bd-like_sf.
DR   InterPro; IPR001701; Glyco_hydro_9.
DR   InterPro; IPR033126; Glyco_hydro_9_Asp/Glu_AS.
DR   InterPro; IPR013783; Ig-like_fold.
DR   InterPro; IPR014756; Ig_E-set.
DR   InterPro; IPR006311; TAT_signal.
DR   Pfam; PF02018; CBM_4_9; 1.
DR   Pfam; PF02927; CelD_N; 1.
DR   Pfam; PF00759; Glyco_hydro_9; 1.
DR   SUPFAM; SSF48208; SSF48208; 1.
DR   SUPFAM; SSF49785; SSF49785; 1.
DR   SUPFAM; SSF81296; SSF81296; 1.
DR   PROSITE; PS00698; GLYCOSYL_HYDROL_F9_2; 1.
DR   PROSITE; PS51318; TAT; 1.
PE   3: Inferred from homology;
KW   Carbohydrate metabolism {ECO:0000256|RuleBase:RU361166};
KW   Cellulose degradation {ECO:0000256|RuleBase:RU361166};
KW   Complete proteome {ECO:0000313|Proteomes:UP000014410};
KW   Glycosidase {ECO:0000256|RuleBase:RU361166};
KW   Hydrolase {ECO:0000256|RuleBase:RU361166};
KW   Polysaccharide degradation {ECO:0000256|RuleBase:RU361166};
KW   Reference proteome {ECO:0000313|Proteomes:UP000014410};
KW   Signal {ECO:0000256|RuleBase:RU361166}.
FT   SIGNAL        1     29       {ECO:0000256|RuleBase:RU361166}.
FT   CHAIN        30    747       Endoglucanase. {ECO:0000256|RuleBase:
FT                                RU361166}.
FT                                /FTId=PRO_5005146276.
FT   DOMAIN       33    155       CBM-cenC. {ECO:0000259|Pfam:PF02018}.
FT   DOMAIN      182    264       CelD_N. {ECO:0000259|Pfam:PF02927}.
SQ   SEQUENCE   747 AA;  80398 MW;  19165225E3DD9FA1 CRC64;
     MKRRRTVLLT VTALLGAALS ALPAGPAAAD EVEQLKNGTF DTTTEPWWTT ANVTAGLSDG
     QLCADIPGGT ANRWDATVGQ NDVTLVKGES YKFSFKASGS PGGHVVRAIV GLQVAPYDTY
     FEVSPQLSVS GDTYAYTFTS PVDTTQGQVA LQAGGSADAW RMCMDDASLL GGVPPEVYEP
     DTGPRVRVNQ VAYLPSGPKN ATLVTDAGAK LPWRLKNAGG RTVAHGWTVP RGTDVSSGQN
     VHSIDFGGYR KPGKGFTLVA DGETSRPFDI GTSAYEHLRL DSLKYYYTQR SGIAIRDDLR
     PGYGRAAGHV DVAPNQGDAN VPCRPGVCDY TLDVTGGWYD AGDHGKYVVN GGIATWELLS
     TYERELLART GESGTLRDGT LAIPESGNKV PDILDEARWE LEFLLKMQVP DGQPLAGMAH
     HKIHDEQWTG LPLLPSEDPQ KRELHPPSTQ ATLNLAATAA QAARLYRPYD RAFAARALTA
     ARRAWSAALA HPTMYASAAD GIGGGTYDDT NATDEFYWAA AELYLTTGEK QFADHVLNSP
     VHTADIFGPL GFDWGRTAAA GRLDLATVPS RLPGRDTVRR SVVEGADRYL ATLKSQPYGM
     PYAPPDNIYD WGSSHQILNN GVVLATAYDI TGSAKYRDGA VQGMDYILGR NALNMSYVTG
     YGEANSHNQH SRWYAHELDP NLPNPPHGTL AGGPNSSIQD PYAQSKLQGC VGQFCYIDDI
     QSWSTNETAI NWNAALARMA SFVADQR
//