ID A0A453A7M0_AEGTS Unreviewed; 1566 AA.
AC A0A453A7M0;
DT 08-MAY-2019, integrated into UniProtKB/TrEMBL.
DT 08-MAY-2019, sequence version 1.
DT 27-MAR-2024, entry version 27.
DE RecName: Full=Histone-lysine N-methyltransferase {ECO:0008006|Google:ProtNLM};
OS Aegilops tauschii subsp. strangulata (Goatgrass).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; Liliopsida; Poales; Poaceae; BOP clade;
OC Pooideae; Triticodae; Triticeae; Triticinae; Aegilops.
OX NCBI_TaxID=200361 {ECO:0000313|EnsemblPlants:AET2Gv20013400.7, ECO:0000313|Proteomes:UP000015105};
RN [1] {ECO:0000313|Proteomes:UP000015105}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. AL8/78 {ECO:0000313|Proteomes:UP000015105};
RX PubMed=25035499; DOI=10.1126/science.1250092;
RG International Wheat Genome Sequencing Consortium;
RA Marcussen T., Sandve S.R., Heier L., Spannagl M., Pfeifer M.,
RA Jakobsen K.S., Wulff B.B., Steuernagel B., Mayer K.F., Olsen O.A.;
RT "Ancient hybridizations among the ancestral genomes of bread wheat.";
RL Science 345:1250092-1250092(2014).
RN [2] {ECO:0000313|Proteomes:UP000015105}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. AL8/78 {ECO:0000313|Proteomes:UP000015105};
RX PubMed=29158546; DOI=10.1038/s41477-017-0067-8;
RA Zhao G., Zou C., Li K., Wang K., Li T., Gao L., Zhang X., Wang H., Yang Z.,
RA Liu X., Jiang W., Mao L., Kong X., Jiao Y., Jia J.;
RT "The Aegilops tauschii genome reveals multiple impacts of transposons.";
RL Nat. Plants 3:946-955(2017).
RN [3] {ECO:0000313|EnsemblPlants:AET2Gv20013400.7}
RP IDENTIFICATION.
RG EnsemblPlants;
RL Submitted (MAR-2019) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EnsemblPlants; AET2Gv20013400.7; AET2Gv20013400.7; AET2Gv20013400.
DR Gramene; AET2Gv20013400.7; AET2Gv20013400.7; AET2Gv20013400.
DR Proteomes; UP000015105; Chromosome 2D.
DR GO; GO:0005694; C:chromosome; IEA:UniProtKB-KW.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0046975; F:histone H3K36 methyltransferase activity; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0032259; P:methylation; IEA:UniProtKB-KW.
DR CDD; cd19172; SET_SETD2; 1.
DR Gene3D; 3.30.40.100; -; 1.
DR Gene3D; 2.170.270.10; SET domain; 1.
DR InterPro; IPR006560; AWS_dom.
DR InterPro; IPR003616; Post-SET_dom.
DR InterPro; IPR001214; SET_dom.
DR InterPro; IPR046341; SET_dom_sf.
DR InterPro; IPR044437; SETD2/Set2_SET.
DR InterPro; IPR011124; Znf_CW.
DR PANTHER; PTHR22884:SF413; HISTONE-LYSINE N-METHYLTRANSFERASE CG1716-RELATED; 1.
DR PANTHER; PTHR22884; SET DOMAIN PROTEINS; 1.
DR Pfam; PF17907; AWS; 1.
DR Pfam; PF00856; SET; 1.
DR Pfam; PF07496; zf-CW; 1.
DR SMART; SM00570; AWS; 1.
DR SMART; SM00508; PostSET; 1.
DR SMART; SM00317; SET; 1.
DR SUPFAM; SSF82199; SET domain; 1.
DR PROSITE; PS51215; AWS; 1.
DR PROSITE; PS50868; POST_SET; 1.
DR PROSITE; PS50280; SET; 1.
DR PROSITE; PS51050; ZF_CW; 1.
PE 4: Predicted;
KW Chromosome {ECO:0000256|ARBA:ARBA00022454};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Methyltransferase {ECO:0000256|ARBA:ARBA00022603};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000015105};
KW S-adenosyl-L-methionine {ECO:0000256|ARBA:ARBA00022691};
KW Transferase {ECO:0000256|ARBA:ARBA00022679};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771}.
FT DOMAIN 947..999
FT /note="CW-type"
FT /evidence="ECO:0000259|PROSITE:PS51050"
FT DOMAIN 1050..1100
FT /note="AWS"
FT /evidence="ECO:0000259|PROSITE:PS51215"
FT DOMAIN 1102..1219
FT /note="SET"
FT /evidence="ECO:0000259|PROSITE:PS50280"
FT DOMAIN 1227..1243
FT /note="Post-SET"
FT /evidence="ECO:0000259|PROSITE:PS50868"
FT REGION 1..25
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 406..454
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 508..540
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 697..733
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 754..792
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 850..876
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1295..1329
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1420..1439
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1509..1530
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 406..420
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 508..523
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 524..539
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 702..716
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 754..775
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 776..792
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 859..876
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1297..1329
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1566 AA; 171360 MW; 5BC46C13B2562D9E CRC64;
RGWSRRLERP RRPVHRQPVA PSGGGSRANY SGCIYDMVDK AAETSQREVA AEGVVLWDEA
LVKTSTEDLQ MRCKELRCYN DIPVMCLKKD VNCDLEKDGL WPKIEAEVST PAHQDSVPLN
FGCNLAVCLD GKAGEIGEVS EHRTGMERAA CGSQGGGMLS FDRGFWKGAV GDKNQFPRME
GCHENGGLSD LGNHDTDKFP QGADALSLID DNHELGRDCF LANIDEEVSF PVDEASIPSF
YQKSYMDVFV EDSKSCIEKL TQDSLEGDML SCERDARFRT EASGDENQHH RMAVSKGQVS
SICKDANSPS LNACGLFPEI EVLRQQADKE YKVFELPPEI YLARSSYNPP CLDGLCRSGK
ESSAVCLGHQ DSSGVKSRCP DHLVQELNAY NSSIDKPCSA NFVENANDEE SQNKISESLN
ASKRRNPRRA ASSRNRAPAE HDHQINKGSS STCKSKKVES SCSLVESTLI KFPSKTTKVR
SGINRPVNST AWGSLEKLVD GFGQNCEPST SNSHLISLEN GGRSNKRSGK KEQPIVRKAR
SSRCPKNKFP AFSVTRYAPD ELNGEPTFSV MDGAYGSAEG YIGNFPKLAP RAFLNVSDDA
HRSVQHMSIQ TDMQQLDRCL DSVAQETCPA YMCGEFAKSI SEPSLNNGGV GFSPDSVLEV
ASVTCENNTS ASHDVKLRGN PSYPAVLTES DLHASDLSVP DFGKNHASSS TDFEQQPKTV
RGDENTRSEE INQSHAIIGY VGEGKVQGLE KSNAVRKTKM LEKQKGRKKD GIKGNNIRDG
SSTKISSSEA SKYRVFSDDP SSLVSSGPLK FSSCFEVVTS ATQGISMHEH GWVQGPSVIG
KEKTSALNNV KSPRCKKSGG LRGKKDMVRD PHVKQESKKK NIADAIFIDS GSSTLPYQLA
TDLATSHTNE QGYRSPAIEY TFQNPAAIST ELPGNAAGST GGVSVPQPKR AAWACCDDCQ
KWRCIPSELA DVIGENRWTC KDNDDKAFAD CSIPQAKTNA EINAELELSD ASADEADKDG
SNSKASRAPS WTNLRSNTYL HRNRRNQSID ESMVCNCKPP QEGRMGCRDG CLNRMLNIEC
AKRTCPCEEQ CSNQQFQRRN YAKIAWFHSG KKGYGLKLQE EVSEGRFLIE YVGEVLDITT
YESRQRDYAS KGKKHFYFMA LDGGEVIDAC TKGNLGRFIN HSCSPNCRTE KWMVNGEVCI
GIFAMRNIKK GEELTFDYNY VRVSGAAPQK CFCGTAKCRG YIGGDISGSG ISTQHVAEAE
YFEPMVTYKD AEEMLGNACS HGANPIVVEL EHETSIQQED SNNCIPVTPD SEPHQTSPVT
PDSEPHQTSP ILFENSELEN SWEMWSPQDA EDPTRTPVHV PRTIDSTLQQ LPVYDTQPLE
FLPKAPNTMD GSKAPNVMNQ SARSSDLGQN LVVPGFHAKK KNNLKDQRDV KSSSCSTDNE
NTLGVEARLN NLLDRDGGIS RRKDSTNGYL RLLLFVTAAA RDNAAAAAAR DNAAASAARD
NAAAAAAHDA TAMEHENAAT PAERDNAGGT SKRFGYRVGM SSKFFVSSLS ELLFNLFQCK
GSVVNS
//