ID W1NIM4_AMBTC Unreviewed; 694 AA.
AC W1NIM4;
DT 19-MAR-2014, integrated into UniProtKB/TrEMBL.
DT 19-MAR-2014, sequence version 1.
DT 27-MAR-2024, entry version 61.
DE RecName: Full=SET domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=AMTR_s00008p00186080 {ECO:0000313|EMBL:ERM95358.1};
OS Amborella trichopoda.
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; Amborellales; Amborellaceae; Amborella.
OX NCBI_TaxID=13333 {ECO:0000313|EMBL:ERM95358.1, ECO:0000313|Proteomes:UP000017836};
RN [1] {ECO:0000313|Proteomes:UP000017836}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=24357323;
RG Amborella Genome Project;
RT "The Amborella genome and the evolution of flowering plants.";
RL Science 342:1241089-1241089(2013).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|PROSITE-ProRule:PRU00358}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KI397486; ERM95358.1; -; Genomic_DNA.
DR RefSeq; XP_006827942.1; XM_006827879.1.
DR AlphaFoldDB; W1NIM4; -.
DR STRING; 13333.W1NIM4; -.
DR EnsemblPlants; ERM95358; ERM95358; AMTR_s00008p00186080.
DR GeneID; 18423277; -.
DR Gramene; ERM95358; ERM95358; AMTR_s00008p00186080.
DR KEGG; atr:18423277; -.
DR eggNOG; KOG1082; Eukaryota.
DR HOGENOM; CLU_004556_0_0_1; -.
DR OMA; RICALNT; -.
DR OrthoDB; 5481936at2759; -.
DR Proteomes; UP000017836; Unassembled WGS sequence.
DR GO; GO:0005694; C:chromosome; IEA:UniProtKB-KW.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003690; F:double-stranded DNA binding; IBA:GO_Central.
DR GO; GO:0042054; F:histone methyltransferase activity; IBA:GO_Central.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR Gene3D; 2.170.270.10; SET domain; 1.
DR Gene3D; 2.30.280.10; SRA-YDG; 1.
DR InterPro; IPR025794; H3-K9-MeTrfase_plant.
DR InterPro; IPR003616; Post-SET_dom.
DR InterPro; IPR007728; Pre-SET_dom.
DR InterPro; IPR015947; PUA-like_sf.
DR InterPro; IPR001214; SET_dom.
DR InterPro; IPR046341; SET_dom_sf.
DR InterPro; IPR036987; SRA-YDG_sf.
DR InterPro; IPR003105; SRA_YDG.
DR PANTHER; PTHR45660; HISTONE-LYSINE N-METHYLTRANSFERASE SETMAR; 1.
DR PANTHER; PTHR45660:SF13; HISTONE-LYSINE N-METHYLTRANSFERASE SETMAR; 1.
DR Pfam; PF05033; Pre-SET; 1.
DR Pfam; PF02182; SAD_SRA; 1.
DR Pfam; PF00856; SET; 1.
DR SMART; SM00468; PreSET; 1.
DR SMART; SM00317; SET; 1.
DR SMART; SM00466; SRA; 1.
DR SUPFAM; SSF88697; PUA domain-like; 1.
DR SUPFAM; SSF82199; SET domain; 1.
DR PROSITE; PS50868; POST_SET; 1.
DR PROSITE; PS50867; PRE_SET; 1.
DR PROSITE; PS51575; SAM_MT43_SUVAR39_2; 1.
DR PROSITE; PS50280; SET; 1.
DR PROSITE; PS51015; YDG; 1.
PE 4: Predicted;
KW Chromosome {ECO:0000256|ARBA:ARBA00022454};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00358}; Reference proteome {ECO:0000313|Proteomes:UP000017836};
KW S-adenosyl-L-methionine {ECO:0000256|ARBA:ARBA00022691};
KW Transferase {ECO:0000256|ARBA:ARBA00022679}.
FT DOMAIN 230..388
FT /note="YDG"
FT /evidence="ECO:0000259|PROSITE:PS51015"
FT DOMAIN 461..520
FT /note="Pre-SET"
FT /evidence="ECO:0000259|PROSITE:PS50867"
FT DOMAIN 523..667
FT /note="SET"
FT /evidence="ECO:0000259|PROSITE:PS50280"
FT DOMAIN 681..694
FT /note="Post-SET"
FT /evidence="ECO:0000259|PROSITE:PS50868"
SQ SEQUENCE 694 AA; 78434 MW; DAD6B9268564203A CRC64;
MATTVVRKQV LIEQQEKSHL GGQIERRRVK TSGTNIHREC MEESVPDRVQ KAKLRVNRAR
ASNSVLESQS RAKVSNKRTI LLSELSPVKK QRVLSESNEK KQRVLSERNE GSKYQGRVLH
GSGKMAHRHV CCDIYSRLEP IPWGIEGETD LGLPQMAIKP RISLSERIPT IEEVRKTLRL
YQWAYRKFSQ DTEGNIFRGE RNPPNRPELW AMEFLREKNK FVNTGDPILG KVPGIEVGDE
FQFRAELIVV GLHRQRQAGI DCMRKNGTLL ATSVVISGGY ADNDDQGDVI IYSGHGDNAS
YVVKGKPKDQ KLERGNLALL NSKRFKTPVR VIRGFKKSKK CPLALGIQGF VNGRKPLMYT
YDGLYQVESH FIKRGSHGCD VFQFILRRQP NQPELVLNPF KQVDKFRRLA GLERVIVRDV
SKRMETKPIC VVNNINRDAP SQFEYIKKMI YPKIYSPSPH EGCSCIGRCS EIANCSCAIK
NGKEFPYSNG CLVESKPLVY ECGPSCTCHP SCGNRVSQGG IKFELQLFKT KLKGWGVRPL
GSIPSGSFIC EYLGEILTRK MVEKRVGPNE YFFDIGVNFT DKSFKDTLSF LVPESEPTEA
CDTVEIKGFT IDASINGNVA RFINQSCSPN LYAQSVLYDH GDKAVPHIML FAGENIRPFE
ELTIYYKFAL NHVCDNNSNR KKKKCHCGSS ECTG
//