ID K1QS65_CRAGI Unreviewed; 4990 AA.
AC K1QS65;
DT 28-NOV-2012, integrated into UniProtKB/TrEMBL.
DT 28-NOV-2012, sequence version 1.
DT 27-MAR-2024, entry version 65.
DE RecName: Full=cellulase {ECO:0000256|ARBA:ARBA00012601};
DE EC=3.2.1.4 {ECO:0000256|ARBA:ARBA00012601};
GN ORFNames=CGI_10016397 {ECO:0000313|EMBL:EKC24431.1};
OS Crassostrea gigas (Pacific oyster) (Crassostrea angulata).
OC Eukaryota; Metazoa; Spiralia; Lophotrochozoa; Mollusca; Bivalvia;
OC Autobranchia; Pteriomorphia; Ostreida; Ostreoidea; Ostreidae; Crassostrea.
OX NCBI_TaxID=29159 {ECO:0000313|EMBL:EKC24431.1};
RN [1] {ECO:0000313|EMBL:EKC24431.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=05x7-T-G4-1.051#20 {ECO:0000313|EMBL:EKC24431.1};
RX PubMed=22992520; DOI=10.1038/nature11413;
RA Zhang G., Fang X., Guo X., Li L., Luo R., Xu F., Yang P., Zhang L.,
RA Wang X., Qi H., Xiong Z., Que H., Xie Y., Holland P.W., Paps J., Zhu Y.,
RA Wu F., Chen Y., Wang J., Peng C., Meng J., Yang L., Liu J., Wen B.,
RA Zhang N., Huang Z., Zhu Q., Feng Y., Mount A., Hedgecock D., Xu Z., Liu Y.,
RA Domazet-Loso T., Du Y., Sun X., Zhang S., Liu B., Cheng P., Jiang X.,
RA Li J., Fan D., Wang W., Fu W., Wang T., Wang B., Zhang J., Peng Z., Li Y.,
RA Li N., Wang J., Chen M., He Y., Tan F., Song X., Zheng Q., Huang R.,
RA Yang H., Du X., Chen L., Yang M., Gaffney P.M., Wang S., Luo L., She Z.,
RA Ming Y., Huang W., Zhang S., Huang B., Zhang Y., Qu T., Ni P., Miao G.,
RA Wang J., Wang Q., Steinberg C.E., Wang H., Li N., Qian L., Zhang G., Li Y.,
RA Yang H., Liu X., Wang J., Yin Y., Wang J.;
RT "The oyster genome reveals stress adaptation and complexity of shell
RT formation.";
RL Nature 490:49-54(2012).
CC -!- CATALYTIC ACTIVITY:
CC Reaction=Endohydrolysis of (1->4)-beta-D-glucosidic linkages in
CC cellulose, lichenin and cereal beta-D-glucans.; EC=3.2.1.4;
CC Evidence={ECO:0000256|ARBA:ARBA00000966};
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the glycosyl hydrolase 9 (cellulase E) family.
CC {ECO:0000256|ARBA:ARBA00007072}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JH815810; EKC24431.1; -; Genomic_DNA.
DR HOGENOM; CLU_223351_0_0_1; -.
DR InParanoid; K1QS65; -.
DR GO; GO:0005654; C:nucleoplasm; IEA:UniProt.
DR GO; GO:0008810; F:cellulase activity; IEA:UniProtKB-EC.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0008168; F:methyltransferase activity; IEA:UniProtKB-KW.
DR GO; GO:0016491; F:oxidoreductase activity; IEA:InterPro.
DR GO; GO:0030245; P:cellulose catabolic process; IEA:UniProtKB-KW.
DR GO; GO:0032259; P:methylation; IEA:UniProtKB-KW.
DR CDD; cd15665; ePHD1_KMT2C_like; 1.
DR CDD; cd15509; PHD1_KMT2C_like; 1.
DR CDD; cd15594; PHD2_KMT2C; 1.
DR CDD; cd15512; PHD4_KMT2C_like; 1.
DR CDD; cd15513; PHD5_KMT2C_like; 1.
DR CDD; cd15514; PHD6_KMT2C_like; 1.
DR Gene3D; 1.50.10.10; -; 1.
DR Gene3D; 1.10.1280.10; Di-copper center containing domain from catechol oxidase; 2.
DR Gene3D; 1.10.30.10; High mobility group box domain; 1.
DR Gene3D; 2.60.40.10; Immunoglobulins; 1.
DR Gene3D; 3.30.40.10; Zinc/RING finger domain, C3HC4 (zinc finger); 7.
DR InterPro; IPR008928; 6-hairpin_glycosidase_sf.
DR InterPro; IPR012341; 6hp_glycosidase-like_sf.
DR InterPro; IPR008922; Di-copper_centre_dom_sf.
DR InterPro; IPR034732; EPHD.
DR InterPro; IPR001701; Glyco_hydro_9.
DR InterPro; IPR036910; HMG_box_dom_sf.
DR InterPro; IPR007110; Ig-like_dom.
DR InterPro; IPR013783; Ig-like_fold.
DR InterPro; IPR047004; KMT2C_PHD2.
DR InterPro; IPR002227; Tyrosinase_Cu-bd.
DR InterPro; IPR011011; Znf_FYVE_PHD.
DR InterPro; IPR001965; Znf_PHD.
DR InterPro; IPR019787; Znf_PHD-finger.
DR InterPro; IPR001841; Znf_RING.
DR InterPro; IPR013083; Znf_RING/FYVE/PHD.
DR PANTHER; PTHR45888; HL01030P-RELATED; 1.
DR PANTHER; PTHR45888:SF6; HL01030P-RELATED; 1.
DR Pfam; PF00759; Glyco_hydro_9; 1.
DR Pfam; PF00628; PHD; 3.
DR Pfam; PF00264; Tyrosinase; 2.
DR PRINTS; PR00092; TYROSINASE.
DR SMART; SM00249; PHD; 7.
DR SMART; SM00184; RING; 5.
DR SUPFAM; SSF48056; Di-copper centre-containing domain; 2.
DR SUPFAM; SSF57903; FYVE/PHD zinc finger; 6.
DR SUPFAM; SSF47095; HMG-box; 1.
DR SUPFAM; SSF48208; Six-hairpin glycosidases; 1.
DR PROSITE; PS51805; EPHD; 1.
DR PROSITE; PS50835; IG_LIKE; 1.
DR PROSITE; PS00497; TYROSINASE_1; 1.
DR PROSITE; PS00498; TYROSINASE_2; 2.
DR PROSITE; PS50016; ZF_PHD_2; 6.
DR PROSITE; PS50089; ZF_RING_2; 1.
PE 3: Inferred from homology;
KW Carbohydrate metabolism {ECO:0000256|ARBA:ARBA00023277};
KW Cellulose degradation {ECO:0000256|ARBA:ARBA00023001};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Methyltransferase {ECO:0000313|EMBL:EKC24431.1};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Polysaccharide degradation {ECO:0000256|ARBA:ARBA00023326};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Transferase {ECO:0000313|EMBL:EKC24431.1};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771}.
SQ SEQUENCE 4990 AA; 549445 MW; 98D4EBFBA307F9D5 CRC64;
MSSWQKGGFN LTEKGYYTLI VDYGNTGVAN MKSSLEEFRF NTVQIQKVGS GCLLKSFVKV
KLNVACSVYI MTSYFRPKAR EPEEVLVFPK EIEENNSISL ICAAYVGSPR GYIHIWRVFQ
NSNKSKLIYK SNFINNETEN CTKYINVTTT YTVTREDNGA VFRCSSQNNL TQGPGLSKYS
QNISVLWNMA SFFNAAILLG LLGLSSALLE PIPFPTELQE CYEYRSDNVT ASAESAIYIQ
NTCYRSFLTD QMTDGKVWSG ENLTQEGINY IDSLFRRIMA EADEVEKYKK LGGRQKRQTS
TRRFRQEVRS PGAFQPYANC VQRLQNETVE PASAGRNTYQ TLAAFHSGRT LGRAHNGPAF
LPWHRIYLLL LETECDAAIP YWDSGLDHDM VDPTTSILWS DQYFGNGDGE VMSGPFQDMR
TLLGTPIIRN YGTGDSSLFT KEGLRAVLSR RRFQDISEPL PRGSIYSLEG HHNGPHVWVG
GHISALNSAP WDPVFYMHHA YVDAVWARFR ELQIQNGFNP ETDYPRRPRQ GHRRNDVINF
GPYFELVTNL EAMANRFADL VTYEPFPVCE NNCNNSPHLY CDQLRLVCIS RIRTAVQTSV
AGIVAMGASR GVSSNIQSQS LARAVARGPL PVGRKFSDSP FIDIRNRPDN IGTARAAPQI
RRASSQVRTR ARRVQRAVSQ NAAFQENHLQ SVSSLERSFT NTFVIDGVVD VKRWVYIPVR
IVYSRSKNLK GIDPTLFGTV NQSQEKCQTA HSGASKVFVS SNGLNYYGMY TEYAIIDNRQ
PVYSTAMVVG VKNPEYGEGE TLFTAYDSCG RPCRPVCQTS VNGQRNYKGC NSALFTKAGV
RAVLSRRRFQ DISEPLPRGN IYSLERHHNG PHVWVGGHLS ALNSAVWDPV FFMHHAYIDA
VWARFRQLQI QSGINPETNY PSTSQRGHRP NDVISFGTLF ERVTNIQAMA NRFANLVTYE
AFPVCENNCN NSPDLYCDQN RRVCISRARA NSPATASGFV AMGAARGVSL DMQSQALDRA
MARGPLPVGE KFSDSPFHDI RNRPDNLGTA RVATQIRRAH SRVRRHSRRR RMQRRRVRRS
ATLNATLQHN HLQSVSALER SFTNTFIIDG VVDINRWVYI PVHVIYSRSQ NVQGIDPTLF
GTVDQIEDTC QTAHSGASKV FVTSNGLNYY GMYTEYAIID NRQPVYTTTM TVGVKNPEYG
TGETLFTAYD SCGRPCRPLC LTFVNGQRNN KGCSGAFRVS TSSPQMYSIT YSDALNSTLM
TYSQIDTHAD FSAPALTFLC DNKPETVKLE NTVEEAGTQL SVDTIESSVA ADINSDNSQS
STSVPPDITQ ADTYSLDDPY TSDPPDLNPE EDTNLPTKVI KKEGEYLTLP TYEAAELSDS
SGPPDITSAA YEKTEEVEDI EIKDYSYVDS PNSPAPLLTS LEEQGEFTDF PTELPEDPQN
IFPDTDPPPD LLHQEDLTPT EGYLGSSFPE SSLSESLLAV APGDILPVVS LPQINPNFFQ
QNLSPTLAQN IKRGPGRPRK DGQEPAPRRR SSSKPLAKSI VTKAQRILHN SAGISRKFFP
SAEQMDSGSQ SSLSQSELTM TIDTGEDTTQ YEMEEEMLIG VSESLDETSL ESQLPVRVCS
FCNCGERSLL GQGDIFRFEP TQGFNFRKTL AKNDKKPADF DERNQEKEGT TKPLTWRRNR
GPIKGVGERE REKSHSPRRT GNEEDQNILL GDELTFLGFP EDVEAMQVFE PTGHVWAHNR
CAAWSEGVTA GVDGSLLSVD QAVFNGLSQK CSYCRRYGAT ITCIYPECSK KYHYPCAAAG
ACFQDKKKLA ILCPDHSDQA ETIAGEEAFC VLCCQADKIG KQLFCTSCGH HYHGGCLHPS
VALSPEVRAG WQCPDCKVCQ MCRQPGEDSK MLVCDTCDKG YHTFCLKPVM TAIPKNGWKC
KNCRVCGDCG SRTPGSGPSS RWHLNYSVCD SCYQQRNKGL SCPLCGKAYR QFTQKAMIQC
GTCKKHVHAE CDDAIDNLML DRVRNEEQVD YMCSVCRNRD PEFSSPPHIH PHGEGCGIGS
KGGKLNAMSR KKIGVSSSSS SRSRGRPVAP EKKKKPPPSY SEGKRGSKTK MKSNQPGAQA
QISPAQPGEP LKKSQFPTED DDDGDDHPQT IILSNAQDKF VLDQDVCKSC GSFGRGEEGK
LIVCTQCGQC YHPYCASVKV TKVILSKGWR CLDCTVCEGC GKPHDEGRLL LCDECDISYH
IYCLDPPLDQ VPKGTWKCKW CVMCINCGTT TPGFGCNWQN NYTQCGPCRS KIDCPVCRHK
YQDDEMIIQC LQCNRWLHAL CDGLRSEDDM ERAADYDYQC LFCRPKTGKD GPLPPPPPPP
TPPPIEMEEP PTPPFYREPE PIPQKRYLMD GVYLSETGVQ HMQEITIQIP KVKRQRRNNK
RLSVDMLPGQ RLATQMSTEG DEEKDGMDEG SELSTPVTAE ARGEGDPLLL SPTSQAPSGI
PAEGEKKERK KRTTIGLGVG GFIAKARSRQ TNVKRQMSEV SAEMADGQPR PEGEDGEPVP
LLVEGEVKKQ RKRQARKKSQ LENSFPNYLQ EAFFGKDILD KSKAKVKQGH RADSDSDTES
RASTPNLPKD IPQPSLFPEL SQSNTKLPST LPSVGTSQSV LSSGASMVPS FPGTGDDINP
IGDVLPDLEL PHDDIFSIFK EGGKFLLSIL IAKLPMSEEQ PTSSQAGASG SGELPDILSI
ENIDVDLLPE HAEDLPPING QEVDDIFNGV LPPEEDPSQA AHGGGPQFPM PGSVPRLPPH
MPPHPGQMPP GMPQLPYGPD FRGMGGDQPP WQQVATEEEG GGSTSSRRNM LKWECDEDLG
ESATISAVLY CNLRHPEFKQ QYPDWSERVK KIAKAWRELS SDEKQPFLTQ DGKKPKTKVP
SMPMGPPLPP GPQGQMPYPP PAAADNLGSP VVLSPSNRVP SNPSMVTTPE GHSPGQMPGS
LPESPLSHPG TPQDPRMMVG HGDPYNPEGE TDDFAVMVHP GMRHPQGGPQ RPPMARNPSG
GMDPYSQDPY AKPPSTPQTP QKSPSKIQWP GQAQESDPYE NPPATPAEGM SSGHPGHPYD
PYAHPPHTPR PGMVPPQGIH RPPFTRQSSV PATQTSHSDD PYAFPPHTPG PRPGGDGQNP
DIVGRQQGAD MYPGMGDHLR SPYPPSSVPG SMGQPLPSTA PDIYRMAGQS GMRHPFRPPG
PPTSQSMARP DLYAQPLHPG MEPPREQSAG LQDKNSSGET SQQVRNILHK QAESRMRQGK
GSEPPQPPWE NHPQAFTSEE MTRFPVSRGA WPPAPPRVPG PRQIPHPTDM ENFGMRPPYS
EGVHPGQMRQ PRPHGPMMPE MYSPKHSPQM SPGQLPSLRP GQNPLAGHDP RSQFPNMMEH
RFPVPGQQVP PNQQQVPNIS GQPQTYKSYP PETGEQSRIN PGAMGEQRKM SDHDQREEEK
SVETKEEKKE LVEDKQMNEA DIEELLSSDG TFDIIKFVDT DTELNLDENK SIFDDLDDVE
SGSMDKGDSK EDMKMTKEKM DSDLDSSKSG GGIPDFQSKF LEFSQKKNEE RKIGAEGSGQ
MSEMDKKDQQ SISQIAALLQ QQSSEHMPPT LERRDSVKSD SSLPGKEQLP RTPGSGPLTP
SGKQPGVDQG FPGMGSRGPY SAGMHSPLHQ MQSIPTQPSP GSTYPPGQGP PTPGLPSPKI
MPSPRSNIPS PRTPSVQSPF NPMNSQQSPF PQTQSPFSPT VSSAPQSPYA ANKPQPSFSL
PVGSTNQPGY STVESGVPVQ SPGQRSPRGT ITPTNQYMQG TYGQPMMQGP PRATTLTPTP
TSILYGGQSG MAQHGMRNPV PASMNGSRPP FSGASAHPMG HSMETATSQM HSMDPSAVPH
MATSQSSHTP GDMQGHPPSS QSTATMPPYG MPGQRMPGIR PPMGIPTSSV PSSAPGALGG
GRPQLLQDQP LLIQDLLEQE KQEQQRQAQQ QAMGMMQRPP PEAMMGQPPR PGMPMGQYRP
RMEGMRHPVD PNWNVQRPYG QPEIPPGQPQ FVRMPGQQMP PRMPGPYVGQ PPGVVGPMGI
PTPPPPPPQP PMTGELTPEL ERQQQQYEDW LMKHGNYLEM QVKTLEQQIG KCKRTKKAIN
ARNRQAKKTG REPSANDATE LERVTQEQAG LQKQLESYRK QVKQHQMQTQ DYRTKKRERY
GQDWAFSQMP PVTGPAIMNV PPGGLQPRIP GQQGSARLTA SARQAYDEYM QDRLRQNQQI
PGAVGGPRPK HTVVEDNNPF SEEYQEREQR ERVGCLPKPG PEPEKPEMMR QMPPYFDPRT
IPYDQTGRFP APRMPGPAEP MQRFPGPAAT TVGTAEPRPP VQFTPQSETE KRIMEILNNS
AGLAARSQHE AVDKSPGKAK AGDTQGSPAT EQKPEGRKPE EGEGAQRVTS VSVPGPSTQP
TYTTAVSQPA YQSESNDPDD QQEEHILPMN VMIHQSIPQS AAIACSVPPV QVSQSQQMEA
ARIQESESMP QTAAKSVEGS VQSDQGHTQE RPPISGSSEA QPVSSVNSNV NTVQEPETVS
TNVQHLPPHS YPMTSLPGHP LPPPNTQERM SPRGMVPPAY MGAYGHFQGR MSPRQAHYGG
VPPTLMPVSQ NQSPRQSTTP SPGRYSPRQA SPSYPGGPSP GRYSPRQGSP GVSRPPSRPP
SQPGSGYAPP RQGPTPPITG YPSSTPPASP NVTYSPVTTF SAVSVDISSR VETSVAHTTI
NSRPNVSTST VQSVSQNTPT DSVAPPPATE SASAPVVEQS NLKLQDTNTT GDHVKFNLPM
AFSAHVLAYG LNRWKDGYSS SNQLQNMYEM LRTPLDYFMK CWRPQSQEYY AQVGNGAADH
AFWGRPEDMH MQRPAYKCTA SNGGCSDVEG ITVAALAAGS MAFKASDAAY SQRLLASAKS
LYDFANAHKG IYNKGPISDA TSYYGSTGYK DELCVAAMEL YKATKDAKYL NDAKANFEGN
DVAWALSWDD NHVMCELLLY EETKDNRYKG LVESFVRSYM PGGSVHQTPC GLAWRDQWGS
LRYAGIFQRE
//