ID C5WMG6_SORBI Unreviewed; 899 AA.
AC C5WMG6; A0A1B6QQ48;
DT 01-SEP-2009, integrated into UniProtKB/TrEMBL.
DT 01-SEP-2009, sequence version 1.
DT 27-MAR-2024, entry version 86.
DE RecName: Full=[histone H3]-lysine(27) N-trimethyltransferase {ECO:0000256|ARBA:ARBA00012186};
DE EC=2.1.1.356 {ECO:0000256|ARBA:ARBA00012186};
GN ORFNames=SORBI_3001G395500 {ECO:0000313|EMBL:KXG40039.2};
OS Sorghum bicolor (Sorghum) (Sorghum vulgare).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; Liliopsida; Poales; Poaceae; PACMAD clade;
OC Panicoideae; Andropogonodae; Andropogoneae; Sorghinae; Sorghum.
OX NCBI_TaxID=4558 {ECO:0000313|EMBL:ACV60617.1};
RN [1] {ECO:0000313|EMBL:KXG40039.2, ECO:0000313|Proteomes:UP000000768}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. BTx623 {ECO:0000313|Proteomes:UP000000768};
RX PubMed=19189423; DOI=10.1038/nature07723;
RA Paterson A.H., Bowers J.E., Bruggmann R., Dubchak I., Grimwood J.,
RA Gundlach H., Haberer G., Hellsten U., Mitros T., Poliakov A., Schmutz J.,
RA Spannagl M., Tang H., Wang X., Wicker T., Bharti A.K., Chapman J.,
RA Feltus F.A., Gowik U., Grigoriev I.V., Lyons E., Maher C.A., Martis M.,
RA Narechania A., Otillar R.P., Penning B.W., Salamov A.A., Wang Y., Zhang L.,
RA Carpita N.C., Freeling M., Gingle A.R., Hash C.T., Keller B., Klein P.,
RA Kresovich S., McCann M.C., Ming R., Peterson D.G., Mehboob-ur-Rahman,
RA Ware D., Westhoff P., Mayer K.F., Messing J., Rokhsar D.S.;
RT "The Sorghum bicolor genome and the diversification of grasses.";
RL Nature 457:551-556(2009).
RN [2] {ECO:0000313|EMBL:ACV60617.1}
RP NUCLEOTIDE SEQUENCE.
RA Arun A., Sharma R., Bhat V.;
RT "Isolation of Sorghum bicolor enhancer of zeste-like protein-3.";
RL Submitted (AUG-2009) to the EMBL/GenBank/DDBJ databases.
RN [3] {ECO:0000313|EMBL:KXG40039.2}
RP NUCLEOTIDE SEQUENCE.
RA Paterson A., Mullet J., Bowers J., Bruggmann R., Dubchak I., Grimwood J.,
RA Gundlach H., Haberer G., Hellsten U., Mitros T., Poliakov A., Schmutz J.,
RA Spannagl M., Tang H., Wang X., Wicker T., Bharti A., Chapman J., Feltus F.,
RA Gowik U., Grigoriev I., Lyons E., Maher C., Martis M., Narechania A.,
RA Otillar R., Penning B., Salamov A., Wang Y., Zhang L., Carpita N.,
RA Freeling M., Gingle A., Hash C., Keller B., Klein P., Kresovich S.,
RA Mccann M., Ming R., Peterson D., Rahman M., Ware D., Westhoff P., Mayer K.,
RA Messing J., Sims D., Jenkins J., Shu S., Rokhsar D.;
RT "WGS assembly of Sorghum bicolor.";
RL Submitted (FEB-2017) to the EMBL/GenBank/DDBJ databases.
RN [4] {ECO:0000313|Proteomes:UP000000768}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. BTx623 {ECO:0000313|Proteomes:UP000000768};
RX PubMed=29161754; DOI=10.1111/tpj.13781;
RA McCormick R.F., Truong S.K., Sreedasyam A., Jenkins J., Shu S., Sims D.,
RA Kennedy M., Amirebrahimi M., Weers B.D., McKinley B., Mattison A.,
RA Morishige D.T., Grimwood J., Schmutz J., Mullet J.E.;
RT "The Sorghum bicolor reference genome: improved assembly, gene annotations,
RT a transcriptome atlas, and signatures of genome organization.";
RL Plant J. 93:338-354(2018).
CC -!- CATALYTIC ACTIVITY:
CC Reaction=L-lysyl(27)-[histone H3] + 3 S-adenosyl-L-methionine = 3 H(+)
CC + N(6),N(6),N(6)-trimethyl-L-lysyl(27)-[histone H3] + 3 S-adenosyl-L-
CC homocysteine; Xref=Rhea:RHEA:60292, Rhea:RHEA-COMP:15535, Rhea:RHEA-
CC COMP:15548, ChEBI:CHEBI:15378, ChEBI:CHEBI:29969, ChEBI:CHEBI:57856,
CC ChEBI:CHEBI:59789, ChEBI:CHEBI:61961; EC=2.1.1.356;
CC Evidence={ECO:0000256|ARBA:ARBA00000090};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; GQ456953; ACV60617.1; -; mRNA.
DR EMBL; CM000760; KXG40039.2; -; Genomic_DNA.
DR RefSeq; XP_002465374.1; XM_002465329.1.
DR AlphaFoldDB; C5WMG6; -.
DR STRING; 4558.C5WMG6; -.
DR EnsemblPlants; KXG40039; KXG40039; SORBI_3001G395500.
DR GeneID; 8083989; -.
DR Gramene; KXG40039; KXG40039; SORBI_3001G395500.
DR KEGG; sbi:8083989; -.
DR eggNOG; KOG1079; Eukaryota.
DR HOGENOM; CLU_011060_0_0_1; -.
DR InParanoid; C5WMG6; -.
DR OMA; CMEVANC; -.
DR OrthoDB; 902834at2759; -.
DR Proteomes; UP000000768; Chromosome 1.
DR GO; GO:0005677; C:chromatin silencing complex; IEA:EnsemblPlants.
DR GO; GO:0005634; C:nucleus; IBA:GO_Central.
DR GO; GO:0031519; C:PcG protein complex; IEA:EnsemblPlants.
DR GO; GO:0003682; F:chromatin binding; IBA:GO_Central.
DR GO; GO:0046976; F:histone H3K27 methyltransferase activity; IBA:GO_Central.
DR GO; GO:0140951; F:histone H3K27 trimethyltransferase activity; IEA:UniProtKB-EC.
DR GO; GO:0003727; F:single-stranded RNA binding; IEA:EnsemblPlants.
DR GO; GO:1990110; P:callus formation; IEA:EnsemblPlants.
DR GO; GO:0031507; P:heterochromatin formation; IBA:GO_Central.
DR GO; GO:0032259; P:methylation; IEA:UniProtKB-KW.
DR GO; GO:1900055; P:regulation of leaf senescence; IEA:EnsemblPlants.
DR GO; GO:0048587; P:regulation of short-day photoperiodism, flowering; IEA:EnsemblPlants.
DR GO; GO:0009737; P:response to abscisic acid; IEA:EnsemblPlants.
DR GO; GO:0010048; P:vernalization response; IEA:EnsemblPlants.
DR CDD; cd00167; SANT; 1.
DR CDD; cd10519; SET_EZH; 1.
DR Gene3D; 2.170.270.10; SET domain; 1.
DR InterPro; IPR026489; CXC_dom.
DR InterPro; IPR045318; EZH1/2-like.
DR InterPro; IPR025778; Hist-Lys_N-MeTrfase_plant.
DR InterPro; IPR041355; Pre-SET_CXC.
DR InterPro; IPR001005; SANT/Myb.
DR InterPro; IPR001214; SET_dom.
DR InterPro; IPR046341; SET_dom_sf.
DR InterPro; IPR033467; Tesmin/TSO1-like_CXC.
DR PANTHER; PTHR45747; HISTONE-LYSINE N-METHYLTRANSFERASE E(Z); 1.
DR PANTHER; PTHR45747:SF14; HISTONE-LYSINE N-METHYLTRANSFERASE EZA1; 1.
DR Pfam; PF18264; preSET_CXC; 1.
DR Pfam; PF00856; SET; 1.
DR SMART; SM01114; CXC; 1.
DR SMART; SM00317; SET; 1.
DR SUPFAM; SSF82199; SET domain; 1.
DR PROSITE; PS51633; CXC; 1.
DR PROSITE; PS51576; SAM_MT43_EZ; 1.
DR PROSITE; PS50280; SET; 1.
PE 2: Evidence at transcript level;
KW Methyltransferase {ECO:0000256|ARBA:ARBA00022603};
KW Reference proteome {ECO:0000313|Proteomes:UP000000768};
KW S-adenosyl-L-methionine {ECO:0000256|ARBA:ARBA00022691};
KW Transferase {ECO:0000256|ARBA:ARBA00022679}.
FT DOMAIN 632..736
FT /note="CXC"
FT /evidence="ECO:0000259|PROSITE:PS51633"
FT DOMAIN 751..866
FT /note="SET"
FT /evidence="ECO:0000259|PROSITE:PS50280"
FT REGION 1..27
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 400..453
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 472..524
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 874..899
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..18
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 400..425
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 472..500
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 501..524
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 877..893
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 899 AA; 100625 MW; 498387F7A48F6121 CRC64;
MASSSKASDS SSQRSKRSDQ GMMGRDAAAA SAVSIHANLT QLIRQIKSGR LAYIKEKLEA
NRKTLQRHSC ALFDVAAAAE VASRGTYGGN ALSQRAAEGQ SRLAGSDLAN GIGERDVVYM
QEENLAAGTL ALSSSGAAAQ RTVVRFVKLP LVERIPPYTT WIFLDKNQRM ADDQSVVGRR
RIYYDPVGNE ALICSDSDEE IPEPEEEKHF FTEGEDQLIW RATQEHGLNR EVINVLCQFI
DATPSEIEER SEVLFEKNEK HSASSDKIES QLSLDKTMDA VLDSFDNLFC RRCLVFDCRL
HGCSQNLVFP CEKQPYSFEP AENKKPCGHQ CYLRFPQWRE GFQEMHDDGL GGCATYTMES
GTASHKVDVN IMSESEDSNR EKGNIRSMTL FGTSGSKIIS SVSAEESTTP PSADTSETEN
VPSDLPPSSL RKHKISKHGP RYRERSPGKR QKVFTSDISF ASNILNKLSI PEIRDTRPES
RESGGDKLRI LDESTKKTSS KDIYGENPTT TTENVGRESN KVSSTNNLSE HTLSCWSALE
RDLYLKGIEI FGKNSCLIAR NLLSGLKTCM EVANYMYNNG AAMAKRPLLN KSISGDFAET
EQDYMEQDMV ARTRIYRRRG RNRKLKYTWK SAGHPTVRKR IGDGKQWYTQ YNPCVCQQMC
GKDCPCVENG TCCEKYCGCS KSCKNKFRGC HCAKSQCRSR QCPCFAANRE CDPDVCRNCW
VSCGDGSLGE PPARGDGYQC GNMKLLLKQQ QRILLGRSDV AGWGAFIKNP VNKNDYLGEY
TGELISHKEA DKRGKIYDRA NSSFLFDLND QYVLDAYRKG DKLKFANHSS NPNCYAKVML
VAGDHRVGIY AKEHIEASDE LFYDYRYGPD QAPAWARRPE GSKKDEASVS HHRAHKVAR
//