GenomeNet

Database: UniProt
Entry: G3WJT8_SARHA
LinkDB: G3WJT8_SARHA
Original site: G3WJT8_SARHA 
ID   G3WJT8_SARHA            Unreviewed;       801 AA.
AC   G3WJT8;
DT   16-NOV-2011, integrated into UniProtKB/TrEMBL.
DT   07-APR-2021, sequence version 2.
DT   27-MAR-2024, entry version 67.
DE   RecName: Full=[histone H3]-lysine(27) N-trimethyltransferase {ECO:0000256|ARBA:ARBA00012186};
DE            EC=2.1.1.356 {ECO:0000256|ARBA:ARBA00012186};
GN   Name=EZH1 {ECO:0000313|Ensembl:ENSSHAP00000015693.2};
OS   Sarcophilus harrisii (Tasmanian devil) (Sarcophilus laniarius).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC   Metatheria; Dasyuromorphia; Dasyuridae; Sarcophilus.
OX   NCBI_TaxID=9305 {ECO:0000313|Ensembl:ENSSHAP00000015693.2, ECO:0000313|Proteomes:UP000007648};
RN   [1] {ECO:0000313|Ensembl:ENSSHAP00000015693.2, ECO:0000313|Proteomes:UP000007648}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=21709235; DOI=10.1073/pnas.1102838108;
RA   Miller W., Hayes V.M., Ratan A., Petersen D.C., Wittekindt N.E., Miller J.,
RA   Walenz B., Knight J., Qi J., Zhao F., Wang Q., Bedoya-Reina O.C.,
RA   Katiyar N., Tomsho L.P., Kasson L.M., Hardie R.A., Woodbridge P.,
RA   Tindall E.A., Bertelsen M.F., Dixon D., Pyecroft S., Helgen K.M.,
RA   Lesk A.M., Pringle T.H., Patterson N., Zhang Y., Kreiss A., Woods G.M.,
RA   Jones M.E., Schuster S.C.;
RT   "Genetic diversity and population structure of the endangered marsupial
RT   Sarcophilus harrisii (Tasmanian devil).";
RL   Proc. Natl. Acad. Sci. U.S.A. 108:12348-12353(2011).
RN   [2] {ECO:0000313|Ensembl:ENSSHAP00000015693.2}
RP   IDENTIFICATION.
RG   Ensembl;
RL   Submitted (NOV-2023) to UniProtKB.
CC   -!- CATALYTIC ACTIVITY:
CC       Reaction=L-lysyl(27)-[histone H3] + 3 S-adenosyl-L-methionine = 3 H(+)
CC         + N(6),N(6),N(6)-trimethyl-L-lysyl(27)-[histone H3] + 3 S-adenosyl-L-
CC         homocysteine; Xref=Rhea:RHEA:60292, Rhea:RHEA-COMP:15535, Rhea:RHEA-
CC         COMP:15548, ChEBI:CHEBI:15378, ChEBI:CHEBI:29969, ChEBI:CHEBI:57856,
CC         ChEBI:CHEBI:59789, ChEBI:CHEBI:61961; EC=2.1.1.356;
CC         Evidence={ECO:0000256|ARBA:ARBA00000090};
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   AlphaFoldDB; G3WJT8; -.
DR   STRING; 9305.ENSSHAP00000015693; -.
DR   Ensembl; ENSSHAT00000015821.2; ENSSHAP00000015693.2; ENSSHAG00000013372.2.
DR   eggNOG; KOG1079; Eukaryota.
DR   GeneTree; ENSGT00940000156604; -.
DR   HOGENOM; CLU_011342_0_0_1; -.
DR   InParanoid; G3WJT8; -.
DR   TreeFam; TF314509; -.
DR   Proteomes; UP000007648; Unassembled WGS sequence.
DR   GO; GO:0035098; C:ESC/E(Z) complex; IEA:Ensembl.
DR   GO; GO:0140951; F:histone H3K27 trimethyltransferase activity; IEA:UniProtKB-EC.
DR   GO; GO:0032259; P:methylation; IEA:UniProtKB-KW.
DR   CDD; cd00167; SANT; 1.
DR   CDD; cd19217; SET_EZH1; 1.
DR   Gene3D; 2.170.270.10; SET domain; 1.
DR   InterPro; IPR026489; CXC_dom.
DR   InterPro; IPR045318; EZH1/2-like.
DR   InterPro; IPR048358; EZH1/2_MCSS.
DR   InterPro; IPR021654; EZH1/EZH2.
DR   InterPro; IPR044438; EZH1_SET.
DR   InterPro; IPR041343; PRC2_HTH_1.
DR   InterPro; IPR041355; Pre-SET_CXC.
DR   InterPro; IPR001005; SANT/Myb.
DR   InterPro; IPR001214; SET_dom.
DR   InterPro; IPR046341; SET_dom_sf.
DR   InterPro; IPR033467; Tesmin/TSO1-like_CXC.
DR   PANTHER; PTHR45747; HISTONE-LYSINE N-METHYLTRANSFERASE E(Z); 1.
DR   PANTHER; PTHR45747:SF1; HISTONE-LYSINE N-METHYLTRANSFERASE EZH1; 1.
DR   Pfam; PF21358; Ezh2_MCSS; 1.
DR   Pfam; PF11616; EZH2_WD-Binding; 1.
DR   Pfam; PF18118; PRC2_HTH_1; 1.
DR   Pfam; PF18264; preSET_CXC; 1.
DR   Pfam; PF00856; SET; 1.
DR   SMART; SM01114; CXC; 1.
DR   SMART; SM00717; SANT; 1.
DR   SMART; SM00317; SET; 1.
DR   SUPFAM; SSF82199; SET domain; 1.
DR   PROSITE; PS51633; CXC; 1.
DR   PROSITE; PS50280; SET; 1.
PE   4: Predicted;
KW   Chromatin regulator {ECO:0000256|ARBA:ARBA00022853};
KW   Methyltransferase {ECO:0000256|ARBA:ARBA00022603};
KW   Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW   Reference proteome {ECO:0000313|Proteomes:UP000007648};
KW   Repressor {ECO:0000256|ARBA:ARBA00022491};
KW   S-adenosyl-L-methionine {ECO:0000256|ARBA:ARBA00022691};
KW   Transcription {ECO:0000256|ARBA:ARBA00023163};
KW   Transcription regulation {ECO:0000256|ARBA:ARBA00023015};
KW   Transferase {ECO:0000256|ARBA:ARBA00022679}.
FT   DOMAIN          558..660
FT                   /note="CXC"
FT                   /evidence="ECO:0000259|PROSITE:PS51633"
FT   DOMAIN          667..782
FT                   /note="SET"
FT                   /evidence="ECO:0000259|PROSITE:PS50280"
FT   REGION          207..235
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          374..440
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        389..403
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        414..437
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   801 AA;  91799 MW;  FF421093A5B10FD4 CRC64;
     MSKKTEDQRG KVACPKLNSK MDIPNSPTSK CITYWKRKVK SEYMRLRQLK RFQANMGAKA
     LFVANFAKVQ EKTQILNEDW KKLRIQPVQL MKPVSGHPFL KKCTVESNFP GFDSQDMLMR
     SLNTVALVPI MYSWSPLQQN FMVEDETVLC NIPYMGDEVK EEDETFIEEL INNYDGKVHG
     EEEMIPGSVL ISDAVFLELV DALNQYSDEE EDGHNNDSSE GKQEDSKEEL PVLRKRKRLT
     IEGNKKSSKK QFPNDMIFSA ISSMFPENGV PDDMKERYRE LTEVSDPNVL PPQCTPNIDG
     PCAKSVQREQ SLHSFHTLFC RRCFKYDCFL HPFHATPNVY KRKNKEIRIE PDPCGLDCFL
     WLEGAKEYAM LHNPRSKCSG RRRRRHQVVN ASSSNTSTSA VTETKEGDSD RDTGNDWASS
     SSEANSRCQT PTKQKASPAP PQLCVVEAPL EPVEWTGAEE SLFRVFHGTY FNNFCSIARL
     LGTKTCKQVF QFAVKESLIL KLPTNELMNP SQKKKRKHSR SKQFWVIEEI RLVHISPALS
     FTEHFPHNNT MRLWAAHCRK IQLKKDNSAT QVYNYQPCDH PDRPCDSTCP CIMTQNFCEK
     FCQCNPDCQN RFPGCRCKTQ CNTKQCPCYL AVRECDPDLC LTCGASEHWD CKVVSCKNCS
     IQRGLKKHLL LAPSDVAGWG TFIKESVQKN EFISEYCGEL ISQDEADRRG KVYDKYMSSF
     LFNLNNDFVV DATRKGNKIR FANHSVNPNC YAKVVMVNGD HRIGIFAKRA IQAGEELFFD
     YRYSQADALK YVGIERETDV L
//
DBGET integrated database retrieval system