ID K7MYR2_SOYBN Unreviewed; 724 AA.
AC K7MYR2;
DT 09-JAN-2013, integrated into UniProtKB/TrEMBL.
DT 09-JAN-2013, sequence version 1.
DT 27-MAR-2024, entry version 70.
DE RecName: Full=SET domain-containing protein {ECO:0008006|Google:ProtNLM};
GN Name=100809678 {ECO:0000313|EnsemblPlants:KRG95725};
GN ORFNames=GLYMA_19G167900 {ECO:0000313|EMBL:KRG95723.1};
OS Glycine max (Soybean) (Glycine hispida).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; fabids; Fabales; Fabaceae; Papilionoideae; 50 kb inversion clade;
OC NPAAA clade; indigoferoid/millettioid clade; Phaseoleae; Glycine;
OC Glycine subgen. Soja.
OX NCBI_TaxID=3847 {ECO:0000313|EnsemblPlants:KRG95725};
RN [1] {ECO:0000313|EMBL:KRG95723.1, ECO:0000313|EnsemblPlants:KRG95725}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Williams 82 {ECO:0000313|EnsemblPlants:KRG95725};
RC TISSUE=Callus {ECO:0000313|EMBL:KRG95723.1};
RX PubMed=20075913; DOI=10.1038/nature08670;
RA Schmutz J., Cannon S.B., Schlueter J., Ma J., Mitros T., Nelson W.,
RA Hyten D.L., Song Q., Thelen J.J., Cheng J., Xu D., Hellsten U., May G.D.,
RA Yu Y., Sakurai T., Umezawa T., Bhattacharyya M.K., Sandhu D.,
RA Valliyodan B., Lindquist E., Peto M., Grant D., Shu S., Goodstein D.,
RA Barry K., Futrell-Griggs M., Abernathy B., Du J., Tian Z., Zhu L., Gill N.,
RA Joshi T., Libault M., Sethuraman A., Zhang X.-C., Shinozaki K.,
RA Nguyen H.T., Wing R.A., Cregan P., Specht J., Grimwood J., Rokhsar D.,
RA Stacey G., Shoemaker R.C., Jackson S.A.;
RT "Genome sequence of the palaeopolyploid soybean.";
RL Nature 463:178-183(2010).
RN [2] {ECO:0000313|EnsemblPlants:KRG95725}
RP IDENTIFICATION.
RC STRAIN=Williams 82 {ECO:0000313|EnsemblPlants:KRG95725};
RG EnsemblPlants;
RL Submitted (FEB-2018) to UniProtKB.
RN [3] {ECO:0000313|EMBL:KRG95723.1}
RP NUCLEOTIDE SEQUENCE.
RC TISSUE=Callus {ECO:0000313|EMBL:KRG95723.1};
RA Schmutz J., Cannon S., Schlueter J., Ma J., Mitros T., Nelson W., Hyten D.,
RA Song Q., Thelen J., Cheng J., Xu D., Hellsten U., May G., Yu Y.,
RA Sakurai T., Umezawa T., Bhattacharyya M., Sandhu D., Valliyodan B.,
RA Lindquist E., Peto M., Grant D., Shu S., Goodstein D., Barry K.,
RA Futrell-Griggs M., Abernathy B., Du J., Tian Z., Zhu L., Gill N., Joshi T.,
RA Libault M., Sethuraman A., Zhang X., Shinozaki K., Nguyen H., Wing R.,
RA Cregan P., Specht J., Grimwood J., Rokhsar D., Stacey G., Shoemaker R.,
RA Jackson S.;
RT "WGS assembly of Glycine max.";
RL Submitted (JUL-2018) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM000852; KRG95723.1; -; Genomic_DNA.
DR EMBL; CM000852; KRG95724.1; -; Genomic_DNA.
DR EMBL; CM000852; KRG95725.1; -; Genomic_DNA.
DR EMBL; CM000852; KRG95726.1; -; Genomic_DNA.
DR RefSeq; XP_006604505.1; XM_006604442.2.
DR RefSeq; XP_006604507.1; XM_006604444.2.
DR RefSeq; XP_006604508.1; XM_006604445.2.
DR AlphaFoldDB; K7MYR2; -.
DR SMR; K7MYR2; -.
DR STRING; 3847.K7MYR2; -.
DR PaxDb; 3847-GLYMA19G35120-2; -.
DR EnsemblPlants; KRG95723; KRG95723; GLYMA_19G167900.
DR EnsemblPlants; KRG95724; KRG95724; GLYMA_19G167900.
DR EnsemblPlants; KRG95725; KRG95725; GLYMA_19G167900.
DR EnsemblPlants; KRG95726; KRG95726; GLYMA_19G167900.
DR GeneID; 100809678; -.
DR Gramene; KRG95723; KRG95723; GLYMA_19G167900.
DR Gramene; KRG95724; KRG95724; GLYMA_19G167900.
DR Gramene; KRG95725; KRG95725; GLYMA_19G167900.
DR Gramene; KRG95726; KRG95726; GLYMA_19G167900.
DR KEGG; gmx:100809678; -.
DR eggNOG; KOG1082; Eukaryota.
DR InParanoid; K7MYR2; -.
DR OMA; QLPPPCN; -.
DR OrthoDB; 443713at2759; -.
DR Proteomes; UP000008827; Chromosome 19.
DR ExpressionAtlas; K7MYR2; baseline.
DR GO; GO:0005694; C:chromosome; IEA:UniProtKB-KW.
DR GO; GO:0005634; C:nucleus; IEA:InterPro.
DR GO; GO:0042054; F:histone methyltransferase activity; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR CDD; cd10538; SET_SETDB-like; 1.
DR Gene3D; 1.10.8.850; Histone-lysine N methyltransferase , C-terminal domain-like; 1.
DR Gene3D; 2.170.270.10; SET domain; 1.
DR InterPro; IPR007728; Pre-SET_dom.
DR InterPro; IPR001214; SET_dom.
DR InterPro; IPR046341; SET_dom_sf.
DR InterPro; IPR025776; SUVR4/1/2.
DR InterPro; IPR043017; WIYLD_dom_sf.
DR InterPro; IPR018848; WIYLD_domain.
DR PANTHER; PTHR46450:SF9; HISTONE-LYSINE N-METHYLTRANSFERASE SUVR2-LIKE PROTEIN; 1.
DR PANTHER; PTHR46450; INACTIVE HISTONE-LYSINE N-METHYLTRANSFERASE SUVR1-RELATED; 1.
DR Pfam; PF00856; SET; 1.
DR Pfam; PF10440; WIYLD; 1.
DR SMART; SM00468; PreSET; 1.
DR SMART; SM00317; SET; 1.
DR SUPFAM; SSF82199; SET domain; 1.
DR PROSITE; PS50867; PRE_SET; 1.
DR PROSITE; PS51580; SAM_MT43_3; 1.
DR PROSITE; PS50280; SET; 1.
PE 4: Predicted;
KW Chromosome {ECO:0000256|ARBA:ARBA00022454};
KW Reference proteome {ECO:0000313|Proteomes:UP000008827}.
FT DOMAIN 449..550
FT /note="Pre-SET"
FT /evidence="ECO:0000259|PROSITE:PS50867"
FT DOMAIN 553..686
FT /note="SET"
FT /evidence="ECO:0000259|PROSITE:PS50280"
FT REGION 81..138
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 150..188
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 286..305
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 81..99
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 290..305
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 724 AA; 80448 MW; 0890E816BCE42D3A CRC64;
MAPNPRVVAA FMAMSNLGIH ESKVKPVLKK LLKLYDKNWA LIEEESYRAL ADAIFEEEEN
KVNEPDQNIK NKDGVVDDVE AHTHEEPVRP LKRLRLRGQE GQSLRPLTSS GPSSAAFPLK
MPKLEDGTVP ESSSRLQPQS LAALSDGNAR IGAHHVPPQD AVVDKGKKPI SPQVTPRRRR
SLSEPLKEST VEGRAALLAN NKMPHPFILI KPKDEPVDDI PDYEIPLAVI PPDSPMGAVE
KQDVHDTVVS QCRDEDVEHE DVFPSSNEEA TSNVYVALSS MGEEQSVKIT QTDDVSKESE
TNDSSIVRGN KDSVIANGSI SVKSSSAVAE LQVPSSIPSP SDPDDAVLAP KKVAMNGFLQ
SDGGKELEDP ISPNSCTLVV VQKHQLTTDD VRAVHDVNDL TKGEERVKIS WVNNTTNDFP
PLFHYIPRNL VFRDAYVNIS LSRIGNEDCC STCMGNCVLS SNPCSCTNKT GGEFAYTAKG
LLKEEFLDEC IALSHDPQNY FYCKACPLER SKNDDCLEPC KGHLKRKFIK ECWSKCGCGK
HCGNRVVQRG ITCKLQVFLT SDGKGWGLRT LEDLPKGAFV CEFVGEILTL KELHERNLKY
PKNGKYTYPI LLDADWGSGT VKDREALCLY AASYGNAARF INHRCLDANL VEIPVEVEGP
THHYYHFAFF TSRKVAAQEE LTWDYGINFD EHDDQPIELF QCRCSSKFCR NIKRLNRSMR
SSSS
//