ID A0A212FB81_DANPL Unreviewed; 1204 AA.
AC A0A212FB81;
DT 27-SEP-2017, integrated into UniProtKB/TrEMBL.
DT 27-SEP-2017, sequence version 1.
DT 27-MAR-2024, entry version 31.
DE SubName: Full=Histone-lysine N-methyltransferase EHMT1 isoform X1 {ECO:0000313|RefSeq:XP_032529542.1};
DE SubName: Full=Histone-lysine N-methyltransferase EHMT2 like protein {ECO:0000313|EMBL:OWR50996.1};
GN Name=LOC116779389 {ECO:0000313|RefSeq:XP_032529542.1};
GN ORFNames=KGM_208774 {ECO:0000313|EMBL:OWR50996.1};
OS Danaus plexippus plexippus.
OC Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota;
OC Neoptera; Endopterygota; Lepidoptera; Glossata; Ditrysia; Papilionoidea;
OC Nymphalidae; Danainae; Danaini; Danaina; Danaus; Danaus.
OX NCBI_TaxID=278856 {ECO:0000313|EMBL:OWR50996.1, ECO:0000313|Proteomes:UP000007151};
RN [1] {ECO:0000313|EMBL:OWR50996.1, ECO:0000313|Proteomes:UP000007151}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=F-2 {ECO:0000313|EMBL:OWR50996.1};
RX PubMed=22118469; DOI=10.1016/j.cell.2011.09.052;
RA Zhan S., Merlin C., Boore J.L., Reppert S.M.;
RT "The monarch butterfly genome yields insights into long-distance
RT migration.";
RL Cell 147:1171-1185(2011).
RN [2] {ECO:0000313|EMBL:OWR50996.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=F-2 {ECO:0000313|EMBL:OWR50996.1};
RA Zhan S., Reppert S.M.;
RT "MonarchBase: the monarch butterfly genome database.";
RL Submitted (OCT-2012) to the EMBL/GenBank/DDBJ databases.
RN [3] {ECO:0000313|RefSeq:XP_032529542.1}
RP IDENTIFICATION.
RG RefSeq;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AGBW02009363; OWR50996.1; -; Genomic_DNA.
DR RefSeq; XP_032529542.1; XM_032673651.1.
DR STRING; 278856.A0A212FB81; -.
DR EnsemblMetazoa; XM_032673651.1; XP_032529542.1; LOC116779389.
DR KEGG; dpl:KGM_208774; -.
DR eggNOG; KOG1082; Eukaryota.
DR OrthoDB; 5481936at2759; -.
DR Proteomes; UP000007151; Unassembled WGS sequence.
DR Proteomes; UP000596680; Chromosome 2.
DR GO; GO:0005694; C:chromosome; IEA:UniProtKB-KW.
DR GO; GO:0005634; C:nucleus; IEA:InterPro.
DR GO; GO:0140938; F:histone H3 methyltransferase activity; IEA:UniProt.
DR GO; GO:0002039; F:p53 binding; IEA:InterPro.
DR GO; GO:0016279; F:protein-lysine N-methyltransferase activity; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0032259; P:methylation; IEA:UniProtKB-KW.
DR CDD; cd20905; EHMT_ZBD; 1.
DR Gene3D; 1.25.40.20; Ankyrin repeat-containing domain; 2.
DR Gene3D; 2.170.270.10; SET domain; 1.
DR InterPro; IPR002110; Ankyrin_rpt.
DR InterPro; IPR036770; Ankyrin_rpt-contain_sf.
DR InterPro; IPR043550; EHMT1/EHMT2.
DR InterPro; IPR047762; EHMT_CRR.
DR InterPro; IPR007728; Pre-SET_dom.
DR InterPro; IPR001214; SET_dom.
DR InterPro; IPR046341; SET_dom_sf.
DR PANTHER; PTHR46307; G9A, ISOFORM B; 1.
DR PANTHER; PTHR46307:SF4; G9A, ISOFORM B; 1.
DR Pfam; PF00023; Ank; 1.
DR Pfam; PF12796; Ank_2; 1.
DR Pfam; PF21533; EHMT1-2_CRR; 1.
DR Pfam; PF05033; Pre-SET; 1.
DR Pfam; PF00856; SET; 1.
DR SMART; SM00248; ANK; 5.
DR SMART; SM00468; PreSET; 1.
DR SMART; SM00317; SET; 1.
DR SUPFAM; SSF48403; Ankyrin repeat; 2.
DR SUPFAM; SSF82199; SET domain; 1.
DR PROSITE; PS50297; ANK_REP_REGION; 3.
DR PROSITE; PS50088; ANK_REPEAT; 4.
DR PROSITE; PS50867; PRE_SET; 1.
DR PROSITE; PS50280; SET; 1.
PE 4: Predicted;
KW ANK repeat {ECO:0000256|PROSITE-ProRule:PRU00023};
KW Chromosome {ECO:0000256|ARBA:ARBA00022454};
KW Methyltransferase {ECO:0000313|EMBL:OWR50996.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000007151};
KW Transferase {ECO:0000313|EMBL:OWR50996.1}.
FT REPEAT 628..660
FT /note="ANK"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00023"
FT REPEAT 759..791
FT /note="ANK"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00023"
FT REPEAT 792..824
FT /note="ANK"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00023"
FT REPEAT 859..891
FT /note="ANK"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00023"
FT DOMAIN 977..1040
FT /note="Pre-SET"
FT /evidence="ECO:0000259|PROSITE:PS50867"
FT DOMAIN 1048..1173
FT /note="SET"
FT /evidence="ECO:0000259|PROSITE:PS50280"
FT REGION 1..316
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 346..384
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 679..735
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 8..51
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 67..83
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 91..130
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 141..156
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 173..187
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 189..235
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 245..267
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 268..294
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 346..372
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1204 AA; 133100 MW; 3F7B9D02A447C987 CRC64;
MLTDSENPVT KQDDDNDASP DNLTVRVDKI EVVKKPDKEA DAEDPKPRIV LTFRSEKSGA
RSSNMKIVST EEKHEDISPR RSIRRRNTIN YNYVKESDDD NISHADSDDD EPIESMTHKR
STRRRSKDFS DVIANAIARK EKSYNESSSV PTQRLSRRIK PTAKILANEE LRMGLESQNN
ARLGISTEKT TEEGVRTRRS AQVRNSESVT EKRSSKRKIH EDSTFESKDG DNSNKKLMHM
GTLGLKIAKE EDSSEAENRK TESSAHDGED EEIDDDTEVI SQLLQADEES ASDEDFCPDT
SKRRRSRRNC SPAPLRRSSR KANLGLYNYD AYQFDDIIDS DFEPKRKTTR THTEEPPSED
EPETKIASEA VGGEEEESEP AAPSEAATVV ATCLCEETSN VYAAPADLTE PVFCQAIEMV
EGVRVGCSHR AARAPGGELL ALRRPGLRAP YFLACKLHAA QLAKHMCCPT CGLFCTQGIF
YQCSKDHLFH VECGIGGEAR QRAGCPHCGV LSHRWQPLNT DYGRVRIDMH CSNKRVFLPD
QREQCTPAFL GFSSLDPALL DPEPTFPDDL LPLIPDVKKL IEAADDEDRD HCTAQNIYDL
IMTENDAEQV LTKIVRCDNI NECVPEASGG TLAHAAAVRG RLAPLSVLRA RGADLDAADS
SCRTPLMRAI QALLDKEHSE ETEFEGNEAE VSVKKEDEVV ANDEDKVKTE TEDKEDAEHE
LKDGQEVPED PSRPADDELL SVIKYLIAAG CDVNKQGPEG MSGLHMSCQY GGAAVCLMLL
EAGAAVDARD HGGWTPLVRA AENKHAAVVR LLLAAGADAA SCDNEGNQPI HWCTLAGDSR
CLAMILRAAP HATNAPNAHT DTPLHIAARE GHYSSVVVLL AHGARTDIEN SSGELPVEVC
SGPCHEAISM NMQMTLAVKD TMTRVKVITS DLSNGREPYP VSVVNEVDDA SPAAFTYVSQ
HVLTEHLTID NTIETMQGCE CAGGSCDGEC GCCVLSVRRW YRAGRLPPAF PHHDPPVMFE
CNYTCGCNMK RCTNRVVGRM ESAGSLNTPV QVFRTRTRGW GLRVLTRVSR GELLALYRGE
LVTSERADAR TDDQYMFALD LKPDLLEQCS DKTLLCVDAC RFGSAARFMN HSCRPSAAPV
RVFTSGRDLR LPHVAFFALR DLAPGDELTF DYGDKFWSVK SKWMKCECES PDCRYPTKME
EADT
//