ID K7MBL3_SOYBN Unreviewed; 1226 AA.
AC K7MBL3;
DT 09-JAN-2013, integrated into UniProtKB/TrEMBL.
DT 09-JAN-2013, sequence version 1.
DT 27-MAR-2024, entry version 71.
DE RecName: Full=Histone-lysine N-methyltransferase {ECO:0008006|Google:ProtNLM};
GN Name=100806034 {ECO:0000313|EnsemblPlants:KRH12193};
GN ORFNames=GLYMA_15G158500 {ECO:0000313|EMBL:KRH12194.1};
OS Glycine max (Soybean) (Glycine hispida).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; fabids; Fabales; Fabaceae; Papilionoideae; 50 kb inversion clade;
OC NPAAA clade; indigoferoid/millettioid clade; Phaseoleae; Glycine;
OC Glycine subgen. Soja.
OX NCBI_TaxID=3847 {ECO:0000313|EMBL:KRH12194.1};
RN [1] {ECO:0000313|EMBL:KRH12194.1, ECO:0000313|EnsemblPlants:KRH12193}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Williams 82 {ECO:0000313|EnsemblPlants:KRH12193};
RC TISSUE=Callus {ECO:0000313|EMBL:KRH12194.1};
RX PubMed=20075913; DOI=10.1038/nature08670;
RA Schmutz J., Cannon S.B., Schlueter J., Ma J., Mitros T., Nelson W.,
RA Hyten D.L., Song Q., Thelen J.J., Cheng J., Xu D., Hellsten U., May G.D.,
RA Yu Y., Sakurai T., Umezawa T., Bhattacharyya M.K., Sandhu D.,
RA Valliyodan B., Lindquist E., Peto M., Grant D., Shu S., Goodstein D.,
RA Barry K., Futrell-Griggs M., Abernathy B., Du J., Tian Z., Zhu L., Gill N.,
RA Joshi T., Libault M., Sethuraman A., Zhang X.-C., Shinozaki K.,
RA Nguyen H.T., Wing R.A., Cregan P., Specht J., Grimwood J., Rokhsar D.,
RA Stacey G., Shoemaker R.C., Jackson S.A.;
RT "Genome sequence of the palaeopolyploid soybean.";
RL Nature 463:178-183(2010).
RN [2] {ECO:0000313|EnsemblPlants:KRH12193}
RP IDENTIFICATION.
RC STRAIN=Williams 82 {ECO:0000313|EnsemblPlants:KRH12193};
RG EnsemblPlants;
RL Submitted (FEB-2018) to UniProtKB.
RN [3] {ECO:0000313|EMBL:KRH12194.1}
RP NUCLEOTIDE SEQUENCE.
RC TISSUE=Callus {ECO:0000313|EMBL:KRH12194.1};
RA Schmutz J., Cannon S., Schlueter J., Ma J., Mitros T., Nelson W., Hyten D.,
RA Song Q., Thelen J., Cheng J., Xu D., Hellsten U., May G., Yu Y.,
RA Sakurai T., Umezawa T., Bhattacharyya M., Sandhu D., Valliyodan B.,
RA Lindquist E., Peto M., Grant D., Shu S., Goodstein D., Barry K.,
RA Futrell-Griggs M., Abernathy B., Du J., Tian Z., Zhu L., Gill N., Joshi T.,
RA Libault M., Sethuraman A., Zhang X., Shinozaki K., Nguyen H., Wing R.,
RA Cregan P., Specht J., Grimwood J., Rokhsar D., Stacey G., Shoemaker R.,
RA Jackson S.;
RT "WGS assembly of Glycine max.";
RL Submitted (JUL-2018) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM000848; KRH12193.1; -; Genomic_DNA.
DR EMBL; CM000848; KRH12194.1; -; Genomic_DNA.
DR RefSeq; XP_006597764.1; XM_006597701.2.
DR RefSeq; XP_006597765.1; XM_006597702.2.
DR RefSeq; XP_006597766.1; XM_006597703.2.
DR RefSeq; XP_006597767.1; XM_006597704.2.
DR RefSeq; XP_014623411.1; XM_014767925.1.
DR RefSeq; XP_014623412.1; XM_014767926.1.
DR AlphaFoldDB; K7MBL3; -.
DR SMR; K7MBL3; -.
DR STRING; 3847.K7MBL3; -.
DR PaxDb; 3847-GLYMA15G17030-2; -.
DR EnsemblPlants; KRH12193; KRH12193; GLYMA_15G158500.
DR EnsemblPlants; KRH12194; KRH12194; GLYMA_15G158500.
DR GeneID; 100806034; -.
DR Gramene; KRH12193; KRH12193; GLYMA_15G158500.
DR Gramene; KRH12194; KRH12194; GLYMA_15G158500.
DR KEGG; gmx:100806034; -.
DR eggNOG; KOG1080; Eukaryota.
DR InParanoid; K7MBL3; -.
DR OMA; TINGYMH; -.
DR OrthoDB; 950362at2759; -.
DR Proteomes; UP000008827; Chromosome 15.
DR GO; GO:0048188; C:Set1C/COMPASS complex; IBA:GO_Central.
DR GO; GO:0042800; F:histone H3K4 methyltransferase activity; IBA:GO_Central.
DR GO; GO:0003723; F:RNA binding; IEA:UniProtKB-KW.
DR GO; GO:0032259; P:methylation; IEA:UniProtKB-KW.
DR CDD; cd19169; SET_SETD1; 1.
DR Gene3D; 3.30.1490.40; -; 2.
DR Gene3D; 2.170.270.10; SET domain; 1.
DR InterPro; IPR003169; GYF.
DR InterPro; IPR035445; GYF-like_dom_sf.
DR InterPro; IPR003616; Post-SET_dom.
DR InterPro; IPR044570; Set1-like.
DR InterPro; IPR001214; SET_dom.
DR InterPro; IPR046341; SET_dom_sf.
DR InterPro; IPR037841; SET_SETD1A/B.
DR PANTHER; PTHR45814; HISTONE-LYSINE N-METHYLTRANSFERASE SETD1; 1.
DR PANTHER; PTHR45814:SF2; HISTONE-LYSINE N-METHYLTRANSFERASE SETD1; 1.
DR Pfam; PF02213; GYF; 1.
DR Pfam; PF00856; SET; 1.
DR SMART; SM00508; PostSET; 1.
DR SMART; SM00317; SET; 1.
DR SUPFAM; SSF55277; GYF domain; 1.
DR SUPFAM; SSF82199; SET domain; 1.
DR PROSITE; PS50829; GYF; 1.
DR PROSITE; PS50868; POST_SET; 1.
DR PROSITE; PS50280; SET; 1.
PE 4: Predicted;
KW Methyltransferase {ECO:0000256|ARBA:ARBA00022603};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000008827};
KW RNA-binding {ECO:0000256|ARBA:ARBA00022884};
KW S-adenosyl-L-methionine {ECO:0000256|ARBA:ARBA00022691};
KW Transferase {ECO:0000256|ARBA:ARBA00022679}.
FT DOMAIN 248..304
FT /note="GYF"
FT /evidence="ECO:0000259|PROSITE:PS50829"
FT DOMAIN 1087..1204
FT /note="SET"
FT /evidence="ECO:0000259|PROSITE:PS50280"
FT DOMAIN 1210..1226
FT /note="Post-SET"
FT /evidence="ECO:0000259|PROSITE:PS50868"
FT REGION 50..73
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 747..781
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 842..878
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 947..966
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 50..65
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 859..876
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1226 AA; 136074 MW; 3682ED368A6089E2 CRC64;
MVFSTVFLDE DDPFFSRKRP RVSDLGHQDD LLADAGISSD ISLFSHQDIE RGTGDFPSSS
NTDDKVDPDS GMEMSCPSNV NSGYVPVCST TGHILHMDQS FCGYVQQPAF VSGWMYVNEN
GQMCGPYIKE QLYEGLTSGF LPSELPVYPV INGTIMNPVP LNYFKQFPDH VSTGFAYLSL
GISGTRVPTL AVYEQDRFFE HAAPLAVNPD SQPVSKSHIN YCIKESNRLN SNSEAFKSLI
SCQMLGVECC WLYEDEKGMK HGPHSIKELI SWHRHGYLKD STVISHSDNK YDTFVLLSAV
NTLKGDISGT ICRSVSTSSE VGEMVNLIGE ISEDISSQLH MGIMKAARRV VLDGIIGDII
AEFVTEKKLK RHKLESADCT PENNMSKFSA EISKGSAISS DPASSHTLDD QTCHESSRLP
PAIIKSVGSI ENFWWSFAVV RKVLLDYSMQ VMWNAVFFDT LAEYLSSWRK KMLWSHPKPQ
PSANGCEDHT EKIESEALVF NPDSSESNVD GDNQFGVLTT EKNCPLLFSS PSSLKGGNLL
EGQKVSCPYV NSRDLTCILE SVENELHFSS KVSLADYIRS FVEKEVNKLI PFPEENKFNE
VAVGGTHFSG ILADKTSMKE ILNDKSVDPV KAGNSFGESA SGNHKSDIFS KAFKELCGYV
DDVVEEEIDD LPPGLEKSQT VVPHYNSKFR PSRSAESNPK ITEYVATALC RQKLHDEVLE
KWRLLFLNSV PKQVFISSST VKKHFKSDGH KKRKMADASK EHLNSATSGL GRVKEGAKSS
SEVPPVIGKY TYCRKKLSQK ELIFSKSVAE NDSRTGKQLV TKLRKHVSGD VGEAAEVKIA
SAKHGKTKMI KGKKDTSSKG KSSVSVNSSS HNDQLSLKNK AGRKVLKFSD DVKDFVKSNV
KKLSVSTNNS VGMKKVAKSD GCDRDDTVKE KTTSHCSREI QNATKKVTKS KRKHQMDGTS
SHPTKVLKIS NGGAYLGASK QVPVASRKSA KSKPLNLCPR SDGCARTSID GWEWHKWSRS
ASPAYKARVR GLPCVRNKCI DSENNLFQLS NGKGLSARTN RVKLRNLLAA AEGADLLKVP
QLKARKKHLR FQRSKIHDWG LVALEPIEAE DFVIEYIGEL IRPRISDIRE RQYEKMGIGS
SYLFRLDDGY VVDATKRGGI ARFINHSCEP NCYTKVISVE GQKKIFIYAK RHIAAGEEIT
YNYKFPLEEK KIPCNCGSRK CRGSLN
//