ID A0A1A6HT02_NEOLE Unreviewed; 412 AA.
AC A0A1A6HT02;
DT 05-OCT-2016, integrated into UniProtKB/TrEMBL.
DT 05-OCT-2016, sequence version 1.
DT 27-MAR-2024, entry version 35.
DE RecName: Full=Histone-lysine N-methyltransferase {ECO:0000256|PIRNR:PIRNR009343};
DE EC=2.1.1.355 {ECO:0000256|PIRNR:PIRNR009343};
GN ORFNames=A6R68_20657 {ECO:0000313|EMBL:OBS81135.1};
OS Neotoma lepida (Desert woodrat).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea;
OC Cricetidae; Neotominae; Neotoma.
OX NCBI_TaxID=56216 {ECO:0000313|EMBL:OBS81135.1, ECO:0000313|Proteomes:UP000092124};
RN [1] {ECO:0000313|EMBL:OBS81135.1, ECO:0000313|Proteomes:UP000092124}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=417 {ECO:0000313|EMBL:OBS81135.1};
RC TISSUE=Liver {ECO:0000313|EMBL:OBS81135.1};
RA Campbell M., Oakeson K.F., Yandell M., Halpert J.R., Dearing D.;
RT "The Draft Genome Sequence and Annotation of the Desert Woodrat Neotoma
RT lepida.";
RL Submitted (JUN-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- CATALYTIC ACTIVITY:
CC Reaction=L-lysyl(9)-[histone H3] + 3 S-adenosyl-L-methionine = 3 H(+) +
CC N(6),N(6),N(6)-trimethyl-L-lysyl(9)-[histone H3] + 3 S-adenosyl-L-
CC homocysteine; Xref=Rhea:RHEA:60276, Rhea:RHEA-COMP:15538, Rhea:RHEA-
CC COMP:15546, ChEBI:CHEBI:15378, ChEBI:CHEBI:29969, ChEBI:CHEBI:57856,
CC ChEBI:CHEBI:59789, ChEBI:CHEBI:61961; EC=2.1.1.355;
CC Evidence={ECO:0000256|ARBA:ARBA00036480,
CC ECO:0000256|PIRNR:PIRNR009343};
CC -!- SUBCELLULAR LOCATION: Chromosome, centromere
CC {ECO:0000256|ARBA:ARBA00004584}. Nucleus
CC {ECO:0000256|ARBA:ARBA00004123, ECO:0000256|PIRNR:PIRNR009343}.
CC -!- SIMILARITY: Belongs to the class V-like SAM-binding methyltransferase
CC superfamily. Histone-lysine methyltransferase family. Suvar3-9
CC subfamily. {ECO:0000256|PIRNR:PIRNR009343}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OBS81135.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LZPO01017290; OBS81135.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A1A6HT02; -.
DR STRING; 56216.A0A1A6HT02; -.
DR Proteomes; UP000092124; Unassembled WGS sequence.
DR GO; GO:0000775; C:chromosome, centromeric region; IEA:UniProtKB-SubCell.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0140949; F:histone H3K9 trimethyltransferase activity; IEA:UniProtKB-EC.
DR GO; GO:0008270; F:zinc ion binding; IEA:UniProtKB-UniRule.
DR GO; GO:0032259; P:methylation; IEA:UniProtKB-KW.
DR GO; GO:0045892; P:negative regulation of DNA-templated transcription; IEA:UniProt.
DR CDD; cd18639; CD_SUV39H1_like; 1.
DR CDD; cd10525; SET_SUV39H1; 1.
DR Gene3D; 2.40.50.40; -; 1.
DR Gene3D; 2.170.270.10; SET domain; 1.
DR InterPro; IPR016197; Chromo-like_dom_sf.
DR InterPro; IPR000953; Chromo/chromo_shadow_dom.
DR InterPro; IPR023780; Chromo_domain.
DR InterPro; IPR023779; Chromodomain_CS.
DR InterPro; IPR011381; H3-K9_MeTrfase_SUV39H1/2-like.
DR InterPro; IPR003616; Post-SET_dom.
DR InterPro; IPR007728; Pre-SET_dom.
DR InterPro; IPR001214; SET_dom.
DR InterPro; IPR046341; SET_dom_sf.
DR PANTHER; PTHR46223; HISTONE-LYSINE N-METHYLTRANSFERASE SUV39H; 1.
DR PANTHER; PTHR46223:SF1; HISTONE-LYSINE N-METHYLTRANSFERASE SUV39H1; 1.
DR Pfam; PF00385; Chromo; 1.
DR Pfam; PF05033; Pre-SET; 1.
DR Pfam; PF00856; SET; 1.
DR PIRSF; PIRSF009343; SUV39_SET; 1.
DR SMART; SM00298; CHROMO; 1.
DR SMART; SM00508; PostSET; 1.
DR SMART; SM00468; PreSET; 1.
DR SMART; SM00317; SET; 1.
DR SUPFAM; SSF54160; Chromo domain-like; 1.
DR SUPFAM; SSF82199; SET domain; 1.
DR PROSITE; PS00598; CHROMO_1; 1.
DR PROSITE; PS50013; CHROMO_2; 1.
DR PROSITE; PS50868; POST_SET; 1.
DR PROSITE; PS50867; PRE_SET; 1.
DR PROSITE; PS51579; SAM_MT43_SUVAR39_3; 1.
DR PROSITE; PS50280; SET; 1.
PE 3: Inferred from homology;
KW Biological rhythms {ECO:0000256|ARBA:ARBA00023108};
KW Cell cycle {ECO:0000256|ARBA:ARBA00023306};
KW Centromere {ECO:0000256|ARBA:ARBA00023328};
KW Chromatin regulator {ECO:0000256|ARBA:ARBA00022853,
KW ECO:0000256|PIRNR:PIRNR009343}; Chromosome {ECO:0000256|ARBA:ARBA00022454};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723,
KW ECO:0000256|PIRNR:PIRNR009343};
KW Methyltransferase {ECO:0000256|ARBA:ARBA00022603,
KW ECO:0000256|PIRNR:PIRNR009343};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PIRNR:PIRNR009343};
KW Reference proteome {ECO:0000313|Proteomes:UP000092124};
KW S-adenosyl-L-methionine {ECO:0000256|ARBA:ARBA00022691,
KW ECO:0000256|PIRNR:PIRNR009343};
KW Transferase {ECO:0000256|ARBA:ARBA00022679, ECO:0000256|PIRNR:PIRNR009343};
KW Zinc {ECO:0000256|PIRNR:PIRNR009343, ECO:0000256|PIRSR:PIRSR009343-2}.
FT DOMAIN 43..91
FT /note="Chromo"
FT /evidence="ECO:0000259|PROSITE:PS50013"
FT DOMAIN 179..240
FT /note="Pre-SET"
FT /evidence="ECO:0000259|PROSITE:PS50867"
FT DOMAIN 243..366
FT /note="SET"
FT /evidence="ECO:0000259|PROSITE:PS50280"
FT DOMAIN 396..412
FT /note="Post-SET"
FT /evidence="ECO:0000259|PROSITE:PS50868"
FT BINDING 181
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /ligand_label="2"
FT /evidence="ECO:0000256|PIRSR:PIRSR009343-2"
FT BINDING 181
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /ligand_label="1"
FT /evidence="ECO:0000256|PIRSR:PIRSR009343-2"
FT BINDING 183
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /ligand_label="1"
FT /evidence="ECO:0000256|PIRSR:PIRSR009343-2"
FT BINDING 186
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /ligand_label="1"
FT /evidence="ECO:0000256|PIRSR:PIRSR009343-2"
FT BINDING 186
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /ligand_label="3"
FT /evidence="ECO:0000256|PIRSR:PIRSR009343-2"
FT BINDING 194
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /ligand_label="1"
FT /evidence="ECO:0000256|PIRSR:PIRSR009343-2"
FT BINDING 195
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /ligand_label="2"
FT /evidence="ECO:0000256|PIRSR:PIRSR009343-2"
FT BINDING 222
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /ligand_label="3"
FT /evidence="ECO:0000256|PIRSR:PIRSR009343-2"
FT BINDING 222
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /ligand_label="2"
FT /evidence="ECO:0000256|PIRSR:PIRSR009343-2"
FT BINDING 226
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /ligand_label="2"
FT /evidence="ECO:0000256|PIRSR:PIRSR009343-2"
FT BINDING 228
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /ligand_label="3"
FT /evidence="ECO:0000256|PIRSR:PIRSR009343-2"
FT BINDING 232
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /ligand_label="3"
FT /evidence="ECO:0000256|PIRSR:PIRSR009343-2"
FT BINDING 254..256
FT /ligand="S-adenosyl-L-methionine"
FT /ligand_id="ChEBI:CHEBI:59789"
FT /evidence="ECO:0000256|PIRSR:PIRSR009343-1"
FT BINDING 323..324
FT /ligand="S-adenosyl-L-methionine"
FT /ligand_id="ChEBI:CHEBI:59789"
FT /evidence="ECO:0000256|PIRSR:PIRSR009343-1"
FT BINDING 326
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /ligand_label="4"
FT /evidence="ECO:0000256|PIRSR:PIRSR009343-2"
FT BINDING 365
FT /ligand="S-adenosyl-L-methionine"
FT /ligand_id="ChEBI:CHEBI:59789"
FT /evidence="ECO:0000256|PIRSR:PIRSR009343-1"
FT BINDING 400
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /ligand_label="4"
FT /evidence="ECO:0000256|PIRSR:PIRSR009343-2"
FT BINDING 401
FT /ligand="S-adenosyl-L-methionine"
FT /ligand_id="ChEBI:CHEBI:59789"
FT /evidence="ECO:0000256|PIRSR:PIRSR009343-1"
FT BINDING 402
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /ligand_label="4"
FT /evidence="ECO:0000256|PIRSR:PIRSR009343-2"
FT BINDING 407
FT /ligand="Zn(2+)"
FT /ligand_id="ChEBI:CHEBI:29105"
FT /ligand_label="4"
FT /evidence="ECO:0000256|PIRSR:PIRSR009343-2"
SQ SEQUENCE 412 AA; 47722 MW; A065536C1FCDF742 CRC64;
MAENLKGCSV CCKSSWNQLQ DLCRLAKLSC PALGISKKNL YDFEVEYLCD YKKIREQEYY
LVKWRGYPDS ENTWEPRQNL KCVRILKQFH KDLERELLRR HRRSKPPRHL DPNLANYLVQ
KAKQRRALQR WEQELNAKRG HLGRITVENE VDLDGPPRSF VYINEYRVGE GITLNQVAVG
CECQDCLLAP TGGCCPGASL HKFAYNDQGQ VRLKAGQPIY ECNSRCCCGY DCPNRVVQKG
IRYDLCIFRT NDGRGWGVRT LEKIRKNSFV MEYVGEIITS EEAERRGQIY DRQGATYLFD
LDYVEDVYTV DAAYYGNISH FVNHSCDPNL QVYNVFIDNL DERLPRIAFF ATRTIWAGEE
LTFDYNMQVD PVDMESTRMD SNFGLAGLPG SPKKRVRIEC KCGTAACRKY LF
//