ID I3K510_ORENI Unreviewed; 1303 AA.
AC I3K510;
DT 11-JUL-2012, integrated into UniProtKB/TrEMBL.
DT 17-JUN-2020, sequence version 2.
DT 27-MAR-2024, entry version 67.
DE SubName: Full=Euchromatic histone lysine methyltransferase 1 {ECO:0000313|Ensembl:ENSONIP00000016205.2};
GN Name=ehmt1 {ECO:0000313|Ensembl:ENSONIP00000016205.2};
OS Oreochromis niloticus (Nile tilapia) (Tilapia nilotica).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Neoteleostei; Acanthomorphata;
OC Ovalentaria; Cichlomorphae; Cichliformes; Cichlidae; African cichlids;
OC Pseudocrenilabrinae; Oreochromini; Oreochromis.
OX NCBI_TaxID=8128 {ECO:0000313|Ensembl:ENSONIP00000016205.2, ECO:0000313|Proteomes:UP000005207};
RN [1] {ECO:0000313|Proteomes:UP000005207}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RG Broad Institute Genome Assembly Team;
RG Broad Institute Sequencing Platform;
RA Di Palma F., Johnson J., Lander E.S., Lindblad-Toh K.;
RT "The Genome Sequence of Oreochromis niloticus (Nile Tilapia).";
RL Submitted (JAN-2012) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Ensembl:ENSONIP00000016205.2}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR RefSeq; XP_019216844.1; XM_019361299.1.
DR Ensembl; ENSONIT00000016220.2; ENSONIP00000016205.2; ENSONIG00000012876.2.
DR GeneID; 100700166; -.
DR CTD; 402830; -.
DR GeneTree; ENSGT00940000156002; -.
DR HOGENOM; CLU_005790_3_0_1; -.
DR OrthoDB; 5481936at2759; -.
DR Proteomes; UP000005207; Linkage group LG7.
DR GO; GO:0005694; C:chromosome; IEA:UniProtKB-KW.
DR GO; GO:0005634; C:nucleus; IEA:InterPro.
DR GO; GO:0140938; F:histone H3 methyltransferase activity; IEA:UniProt.
DR GO; GO:0002039; F:p53 binding; IEA:InterPro.
DR GO; GO:0016279; F:protein-lysine N-methyltransferase activity; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR CDD; cd20905; EHMT_ZBD; 1.
DR Gene3D; 1.25.40.20; Ankyrin repeat-containing domain; 1.
DR Gene3D; 2.170.270.10; SET domain; 1.
DR InterPro; IPR002110; Ankyrin_rpt.
DR InterPro; IPR036770; Ankyrin_rpt-contain_sf.
DR InterPro; IPR043550; EHMT1/EHMT2.
DR InterPro; IPR047762; EHMT_CRR.
DR InterPro; IPR007728; Pre-SET_dom.
DR InterPro; IPR001214; SET_dom.
DR InterPro; IPR046341; SET_dom_sf.
DR PANTHER; PTHR46307; G9A, ISOFORM B; 1.
DR PANTHER; PTHR46307:SF2; HISTONE-LYSINE N-METHYLTRANSFERASE EHMT1; 1.
DR Pfam; PF00023; Ank; 1.
DR Pfam; PF12796; Ank_2; 2.
DR Pfam; PF21533; EHMT1-2_CRR; 1.
DR Pfam; PF05033; Pre-SET; 1.
DR Pfam; PF00856; SET; 1.
DR PRINTS; PR01415; ANKYRIN.
DR SMART; SM00248; ANK; 6.
DR SMART; SM00468; PreSET; 1.
DR SMART; SM00317; SET; 1.
DR SUPFAM; SSF48403; Ankyrin repeat; 1.
DR SUPFAM; SSF82199; SET domain; 1.
DR PROSITE; PS50297; ANK_REP_REGION; 5.
DR PROSITE; PS50088; ANK_REPEAT; 5.
DR PROSITE; PS50867; PRE_SET; 1.
DR PROSITE; PS50280; SET; 1.
PE 4: Predicted;
KW ANK repeat {ECO:0000256|PROSITE-ProRule:PRU00023};
KW Chromosome {ECO:0000256|ARBA:ARBA00022454};
KW Reference proteome {ECO:0000313|Proteomes:UP000005207}.
FT REPEAT 775..807
FT /note="ANK"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00023"
FT REPEAT 808..840
FT /note="ANK"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00023"
FT REPEAT 841..865
FT /note="ANK"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00023"
FT REPEAT 875..907
FT /note="ANK"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00023"
FT REPEAT 941..973
FT /note="ANK"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00023"
FT DOMAIN 1064..1127
FT /note="Pre-SET"
FT /evidence="ECO:0000259|PROSITE:PS50867"
FT DOMAIN 1130..1245
FT /note="SET"
FT /evidence="ECO:0000259|PROSITE:PS50280"
FT REGION 1..49
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 82..473
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1276..1303
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 19..48
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 82..113
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 147..187
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 213..227
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 248..262
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 263..286
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 303..324
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 361..380
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 402..419
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1303 AA; 141469 MW; B3FD7FC26439F676 CRC64;
MEAMRRKQPA GLAKSCVDKG GSTKKEGLDP GSNGEEKSDG DKEVADRLGS SPAAEVMLNG
TESDDTSHKD LTTGNNKTVL LLNENGTSDT EPPHGSVTGS NGFILTKQQE QDGSAVVAPG
LGVSPHRTNW LPSGSPTGGH AAKTLPASAS GSAQSPGALR TAPSTGTKPG QGTRDTKNGT
SLSPSPVPAP VTVHRARKTM SRPAVSPAQK LLNRELREAK SAKMESHVAP DLQKSSPQSP
SQNHLPQSPP DTPSSAQTAS SASPAPVPPP QPPLGPAEPA APSASPAPVP AKLQLGSYSG
ALSSRKKKRR MGMYSLVPKK KTKVLKQRTV LEMFKELQQT AKSPEAKEVA SINGEKVGNA
SEDEESEDLE SEEEEQQQPS EEPESAAQET SESIAQVKDE QESEESGEEE LEEEGTESDL
STESGLKKKL KKKTKADSAW LRPSRKRKRR MKSKEPEVAV QPQAPTPAEP HDHKEYTQIV
PPVSLSKPSP SKDTSGSAVE EAQELPLCSC RMETPKSREI LILADRKCMA TESVDGQLTR
CQDAVAKHEM MRPSNSVQLL VLCEDHRNGM VKHQCCPGCG FFCRAGTFME CQPDVNISHR
FHRACASVLK GQSFCPHCGE EASKAKEVTI AKADTTSTVP PALAHGPATP GALEGRADTT
TGSSSCLTVG AEVSGRADSS LSVRSAHGFD TSAVPGSSRA APLQAGMATT ALSTLTPGPR
ETLESILVAL DTEKPKKLRF HPKQLYLSAK QGELKKVLLM LVDGIDPNFK MESQNKRTPL
HAAAEGGYKD ICHMLVQAGA NLDMCDEDQR TPLMEACENN HMEVVLYLLR AGASAMHKDV
EGFTCLHLAA KSGHYKIVEH LLSTGLIDIN CQDDGGWTPM IWATEYKHAD QVKLLLTKGA
DCSIRDKEEN ICLHWAAFSG SVEITELLLN AHCNLQAVNI HGDSPLHIAA RENRLDCITL
LLSRGADVFL KNREGETPPD CCSHNSKVWA ALQANRKERD AKNIRLSRAE EKALHSDIAL
GQERVPIPCV NAVDSEPYPD DYKYIPENCV TSPMNIDRNI THLQYCVCKE DCSASICMCG
QLSLRCWYDK SGRLLPEFCR EEPPLIFECN HACSCWRTCK NRVVQNGLRT RLQLFRTSKK
GWGVQALQDI PQGTFVCEYV GEIISEAEAE MRQNDAYLFS LDDKDLYCID ARFYGNISRF
LNHMCEPNLF ACRVFTKHQD LRFPHIAFFA SENIKAGEEL GFNYGDHFWE VKSKVFSCEC
GSSKCRYSSA AMASLQADST PEDQQQPSAS PDTSSSNSPS SPS
//