ID A0A3B1JKV4_ASTMX Unreviewed; 1345 AA.
AC A0A3B1JKV4;
DT 05-DEC-2018, integrated into UniProtKB/TrEMBL.
DT 05-DEC-2018, sequence version 1.
DT 27-MAR-2024, entry version 26.
DE SubName: Full=Euchromatic histone lysine methyltransferase 1 {ECO:0000313|Ensembl:ENSAMXP00000043027.1};
OS Astyanax mexicanus (Blind cave fish) (Astyanax fasciatus mexicanus).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Actinopterygii; Neopterygii; Teleostei; Ostariophysi; Characiformes;
OC Characoidei; Characidae; Astyanax.
OX NCBI_TaxID=7994 {ECO:0000313|Ensembl:ENSAMXP00000043027.1, ECO:0000313|Proteomes:UP000018467};
RN [1] {ECO:0000313|Proteomes:UP000018467}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=female {ECO:0000313|Proteomes:UP000018467};
RA Jeffery W., Warren W., Wilson R.K.;
RL Submitted (MAR-2013) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Proteomes:UP000018467}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=female {ECO:0000313|Proteomes:UP000018467};
RX PubMed=25329095; DOI=10.1038/ncomms6307;
RA McGaugh S.E., Gross J.B., Aken B., Blin M., Borowsky R., Chalopin D.,
RA Hinaux H., Jeffery W.R., Keene A., Ma L., Minx P., Murphy D., O'Quin K.E.,
RA Retaux S., Rohner N., Searle S.M., Stahl B.A., Tabin C., Volff J.N.,
RA Yoshizawa M., Warren W.C.;
RT "The cavefish genome reveals candidate genes for eye loss.";
RL Nat. Commun. 5:5307-5307(2014).
RN [3] {ECO:0000313|Ensembl:ENSAMXP00000043027.1}
RP IDENTIFICATION.
RG Ensembl;
RL Submitted (NOV-2023) to UniProtKB.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR STRING; 7994.ENSAMXP00000043027; -.
DR Ensembl; ENSAMXT00000045746.1; ENSAMXP00000043027.1; ENSAMXG00000008776.2.
DR GeneTree; ENSGT00940000156002; -.
DR InParanoid; A0A3B1JKV4; -.
DR OrthoDB; 5481936at2759; -.
DR Proteomes; UP000018467; Unassembled WGS sequence.
DR Bgee; ENSAMXG00000008776; Expressed in brain and 14 other cell types or tissues.
DR GO; GO:0005694; C:chromosome; IEA:UniProtKB-KW.
DR GO; GO:0005634; C:nucleus; IEA:InterPro.
DR GO; GO:0140938; F:histone H3 methyltransferase activity; IEA:UniProt.
DR GO; GO:0002039; F:p53 binding; IEA:InterPro.
DR GO; GO:0016279; F:protein-lysine N-methyltransferase activity; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR CDD; cd20905; EHMT_ZBD; 1.
DR Gene3D; 1.25.40.20; Ankyrin repeat-containing domain; 1.
DR Gene3D; 2.170.270.10; SET domain; 1.
DR InterPro; IPR002110; Ankyrin_rpt.
DR InterPro; IPR036770; Ankyrin_rpt-contain_sf.
DR InterPro; IPR043550; EHMT1/EHMT2.
DR InterPro; IPR047762; EHMT_CRR.
DR InterPro; IPR007728; Pre-SET_dom.
DR InterPro; IPR001214; SET_dom.
DR InterPro; IPR046341; SET_dom_sf.
DR PANTHER; PTHR46307; G9A, ISOFORM B; 1.
DR PANTHER; PTHR46307:SF2; HISTONE-LYSINE N-METHYLTRANSFERASE EHMT1; 1.
DR Pfam; PF00023; Ank; 1.
DR Pfam; PF12796; Ank_2; 2.
DR Pfam; PF21533; EHMT1-2_CRR; 1.
DR Pfam; PF05033; Pre-SET; 1.
DR Pfam; PF00856; SET; 1.
DR PRINTS; PR01415; ANKYRIN.
DR SMART; SM00248; ANK; 6.
DR SMART; SM00468; PreSET; 1.
DR SMART; SM00317; SET; 1.
DR SUPFAM; SSF48403; Ankyrin repeat; 1.
DR SUPFAM; SSF82199; SET domain; 1.
DR PROSITE; PS50297; ANK_REP_REGION; 5.
DR PROSITE; PS50088; ANK_REPEAT; 6.
DR PROSITE; PS50867; PRE_SET; 1.
DR PROSITE; PS50280; SET; 1.
PE 4: Predicted;
KW ANK repeat {ECO:0000256|PROSITE-ProRule:PRU00023};
KW Chromosome {ECO:0000256|ARBA:ARBA00022454};
KW Reference proteome {ECO:0000313|Proteomes:UP000018467}.
FT REPEAT 817..849
FT /note="ANK"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00023"
FT REPEAT 850..882
FT /note="ANK"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00023"
FT REPEAT 883..907
FT /note="ANK"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00023"
FT REPEAT 917..949
FT /note="ANK"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00023"
FT REPEAT 950..982
FT /note="ANK"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00023"
FT REPEAT 983..1015
FT /note="ANK"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00023"
FT DOMAIN 1105..1168
FT /note="Pre-SET"
FT /evidence="ECO:0000259|PROSITE:PS50867"
FT DOMAIN 1171..1288
FT /note="SET"
FT /evidence="ECO:0000259|PROSITE:PS50280"
FT REGION 15..38
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 59..307
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 326..348
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 369..514
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 685..763
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1315..1345
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 66..100
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 110..154
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 188..248
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 272..290
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 291..307
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 334..348
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 393..413
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 438..454
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 492..508
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 685..718
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 739..753
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1315..1338
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1345 AA; 146593 MW; 2EC23621883D319C CRC64;
MLRLSFRRAD QQRELLGHVT SKPGECLHAG GGGKRRTICS RLSRSPSVEA IVEMQPAGLA
RGVVGKRESI KTEGMKDSHS ESEEGADEDR SNRKTEASKA DGELNGTYES TEAGKAQQHV
SVQGSPTQPM ENGLSDTDPP HGSTVGSNGY ILSKRQEDQV PAAQHRTSWS PVGTSAIGHA
AKTLPPAVKS QPQGQDALRT VESKVPSQAE GSSEKETKNG TTSITSPPVT IHRARKTMSR
PASSQSLKLL HRETKEPKVV KDNGPSGADP GLPPQASEPP PTPQIPSTPP TQKQLPQSQT
DATPAVFNAP TVTTTQAALA ATAVISPPAK PHTASVLRKK KRKKLGTYSL IPKKKTKVLK
QRTVLEMFQN ISKSPPSPKL PKEAAQVNGE RVENESEDEE SEEDSEDEED QAGEQGGTAP
KEDNRLHSAS QPGVEHESEE SAEEEGEEEG TESDLSLESS LKKKLKKKAR GDNAWLRPAR
KRKKKLKSAS EGNSGMDVQP QSETLSAAQV PVPAEGKEYT EVPLDTLDLK AQDAVLSSPS
TEVSSTAESA ATDMVQELPL CSCRMETPKS REILTLADRK CMATESVDGQ LSRCQSAVLK
HEMMRPSNLV QLLVLCEDHR TGMIKHQCCP GCGYFCRAGT FMECQPEVNI SHRFHRSCAS
VLKGQSFCPH CGEEVSKAKE VTIAKADTTS TVPPSQGPST PGTLEGKADT TTGGPSRLSI
PGENRADSTL PKTPEAVEVS PTPGTSRSST TAGAGPPAGP PKETLETVLL ALDAEKPKKL
RFHPKQLYIS AKQGELQKVL LMLVDGIDPN FKMENQSKRT PLHVAAEAGH QEVCHMLVQA
GANLDMCDED QRTPLMEACE NNHLDTVNYL LRAGAIVSHK DAEGSTCLHL AAKIGHYTIV
EHLLSAGLVD INCQDDGGWT AMIWATEYKH IDIVKLLLSK GADPNIRDKE ENICLHWAAF
SGSVEIAQIL LDARCDLNTV NVHGDSPLHI ASRENRLECV TLFLPLGANV NLKNREGDSP
MECCSQNSKL WNALQANRKQ REASRDQSSS NQKLLNRDIA RGYERVPVPC VNIVDSEPCP
DDYKYVPDNC VTSPMNIDKN ITHLQYCVCK DDCSSASCMC GQLSLRCWYD QESRLLPEFC
CEEPPLIFEC NHACSCWRTC KNRVVQNGLR IRLQLFRTQK KGWGVRTLQD IPQGTFVCEY
VGEIISDAEA DVRENDSYLF SLDSKVGDMY CIDARFYGNI SRFINHHCEP NLFPCRVFTS
HQDLRFPHIA FFASKSISSG DELGFDYGDH FWDVKGKLFS CQCGSSQCKH SAAAIAQRQA
DSTPGEQQAS ALPDTSSSTT PPSPS
//