GenomeNet

Database: UniProt
Entry: Q2UMH3
LinkDB: Q2UMH3
Original site: Q2UMH3 
ID   SET1_ASPOR              Reviewed;        1229 AA.
AC   Q2UMH3;
DT   09-JAN-2007, integrated into UniProtKB/Swiss-Prot.
DT   24-JAN-2006, sequence version 1.
DT   10-APR-2019, entry version 104.
DE   RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4 specific;
DE            EC=2.1.1.43;
DE   AltName: Full=COMPASS component SET1;
DE   AltName: Full=SET domain-containing protein 1;
GN   Name=set1; ORFNames=AO090003000002;
OS   Aspergillus oryzae (strain ATCC 42149 / RIB 40) (Yellow koji mold).
OC   Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes;
OC   Eurotiomycetidae; Eurotiales; Aspergillaceae; Aspergillus.
OX   NCBI_TaxID=510516;
RN   [1]
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=ATCC 42149 / RIB 40;
RX   PubMed=16372010; DOI=10.1038/nature04300;
RA   Machida M., Asai K., Sano M., Tanaka T., Kumagai T., Terai G.,
RA   Kusumoto K., Arima T., Akita O., Kashiwagi Y., Abe K., Gomi K.,
RA   Horiuchi H., Kitamoto K., Kobayashi T., Takeuchi M., Denning D.W.,
RA   Galagan J.E., Nierman W.C., Yu J., Archer D.B., Bennett J.W.,
RA   Bhatnagar D., Cleveland T.E., Fedorova N.D., Gotoh O., Horikawa H.,
RA   Hosoyama A., Ichinomiya M., Igarashi R., Iwashita K., Juvvadi P.R.,
RA   Kato M., Kato Y., Kin T., Kokubun A., Maeda H., Maeyama N.,
RA   Maruyama J., Nagasaki H., Nakajima T., Oda K., Okada K., Paulsen I.,
RA   Sakamoto K., Sawano T., Takahashi M., Takase K., Terabayashi Y.,
RA   Wortman J.R., Yamada O., Yamagata Y., Anazawa H., Hata Y., Koide Y.,
RA   Komori T., Koyama Y., Minetoki T., Suharnan S., Tanaka A., Isono K.,
RA   Kuhara S., Ogasawara N., Kikuchi H.;
RT   "Genome sequencing and analysis of Aspergillus oryzae.";
RL   Nature 438:1157-1161(2005).
CC   -!- FUNCTION: Catalytic component of the COMPASS (Set1C) complex that
CC       specifically mono-, di- and trimethylates histone H3 to form
CC       H3K4me1/2/3, which subsequently plays a role in telomere length
CC       maintenance and transcription elongation regulation.
CC       {ECO:0000250}.
CC   -!- CATALYTIC ACTIVITY:
CC       Reaction=L-lysyl-[histone] + S-adenosyl-L-methionine = H(+) +
CC         N(6)-methyl-L-lysyl-[histone] + S-adenosyl-L-homocysteine;
CC         Xref=Rhea:RHEA:10024, Rhea:RHEA-COMP:9845, Rhea:RHEA-COMP:9846,
CC         ChEBI:CHEBI:15378, ChEBI:CHEBI:29969, ChEBI:CHEBI:57856,
CC         ChEBI:CHEBI:59789, ChEBI:CHEBI:61929; EC=2.1.1.43;
CC   -!- SUBUNIT: Component of the COMPASS (Set1C) complex. {ECO:0000250}.
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000305}. Chromosome
CC       {ECO:0000305}.
CC   -!- SIMILARITY: Belongs to the class V-like SAM-binding
CC       methyltransferase superfamily. {ECO:0000255|PROSITE-
CC       ProRule:PRU00190}.
DR   EMBL; AP007155; BAE57242.1; -; Genomic_DNA.
DR   RefSeq; XP_001819244.1; XM_001819192.1.
DR   ProteinModelPortal; Q2UMH3; -.
DR   SMR; Q2UMH3; -.
DR   PRIDE; Q2UMH3; -.
DR   EnsemblFungi; BAE57242; BAE57242; AO090003000002.
DR   GeneID; 5992596; -.
DR   KEGG; aor:AO090003000002; -.
DR   HOGENOM; HOG000181654; -.
DR   KO; K11422; -.
DR   OMA; PNSRADP; -.
DR   Proteomes; UP000006564; Chromosome 2.
DR   GO; GO:0005694; C:chromosome; IEA:UniProtKB-SubCell.
DR   GO; GO:0048188; C:Set1C/COMPASS complex; IEA:InterPro.
DR   GO; GO:0018024; F:histone-lysine N-methyltransferase activity; IEA:UniProtKB-EC.
DR   GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR   Gene3D; 3.30.70.330; -; 1.
DR   InterPro; IPR024657; COMPASS_Set1_N-SET.
DR   InterPro; IPR012677; Nucleotide-bd_a/b_plait_sf.
DR   InterPro; IPR003616; Post-SET_dom.
DR   InterPro; IPR035979; RBD_domain_sf.
DR   InterPro; IPR017111; Set1.
DR   InterPro; IPR024636; SET_assoc.
DR   InterPro; IPR001214; SET_dom.
DR   PANTHER; PTHR22884:SF462; PTHR22884:SF462; 1.
DR   Pfam; PF11764; N-SET; 1.
DR   Pfam; PF00856; SET; 1.
DR   Pfam; PF11767; SET_assoc; 1.
DR   PIRSF; PIRSF037104; Histone_H3-K4_mtfrase_Set1_fun; 1.
DR   SMART; SM01291; N-SET; 1.
DR   SMART; SM00508; PostSET; 1.
DR   SMART; SM00317; SET; 1.
DR   SUPFAM; SSF54928; SSF54928; 1.
DR   PROSITE; PS50868; POST_SET; 1.
DR   PROSITE; PS51572; SAM_MT43_1; 1.
DR   PROSITE; PS50280; SET; 1.
PE   3: Inferred from homology;
KW   Chromatin regulator; Chromosome; Complete proteome; Methyltransferase;
KW   Nucleus; Reference proteome; S-adenosyl-L-methionine; Transferase.
FT   CHAIN         1   1229       Histone-lysine N-methyltransferase, H3
FT                                lysine-4 specific.
FT                                /FTId=PRO_0000269766.
FT   DOMAIN     1087   1204       SET. {ECO:0000255|PROSITE-
FT                                ProRule:PRU00190}.
FT   DOMAIN     1213   1229       Post-SET. {ECO:0000255|PROSITE-
FT                                ProRule:PRU00155}.
SQ   SEQUENCE   1229 AA;  137590 MW;  C1F3AB36366980C0 CRC64;
     MSRSSAGFAD FFPTAPSVLQ QKRFKVTRER PRPKAQIDSE HSDESSACPT ETRAILNLSN
     GGASLDSGQI SSTDLKKTSP ESSVEGSASS TAGDRSALSL SVAQHGANSH EARLDTLTPL
     TNAESSPPQK ANSPRNKIAE GIVANTTIDT KSGINPLHTP PTPQSQGRRT GSIRGYKLVY
     DPDTEKRSSS KEKRRKPRYV DIILSEQNNC PPDPRLGIPN YMRGAGCKQK RKYRPAPYTL
     KPWPYDASST IGPGPPAQIV ITGFDPLTPI APISALFSSF GDIGEINNRT DPITGRFLGI
     CSVKYKDSAS FRGGGPVLAA SAARRAYYEC RKEQRIGTRR IRVDLDRDGV VSERFVARTI
     ESQRMGQKSN LQSTEEVKSD SETKKNEPPP TAPKGPSGKT SVRPIVAIPE GPRANFLKPV
     MPSLVEEVPI LGQIKRDPYI FIAHCYVPVL STTVPHLKKR LKLFNWKDIR CDKTGYYIIF
     ENSRRGEEET ERCYKMCHMK PLFTYIMNME SQPYGNPSYE RSPSPERCRA EQRERAERER
     LKREVGLDIE EEKRQRAVDL DPCQEVLTII IRDLKDKLLE DVKSRIAAPA LYDYLDPDRH
     ALKRKTLGIA DPEGIKRPMF RIDDSFGTPD SRSGLSDARR PFSGSTPNIL ALPRIRKARH
     LGRTDTAFLD ERRKQPLRRR EVRPLYHRLQ QLHDVDDSDD EQRTPKDTDE QDSRPPSRMS
     SGTSESDDGD GFVSEALGLP VVELAGSGQN KEPDEILKDN QSVGESSQLE SNEISPELRK
     RKRASEELEA RKRQKEDDEL FGINPIAEAE VEGTQIIATP IAVDINLEVS EAALSILPKE
     SNDNRQETGE ANHLDFDGID VTSSTIEKDR RGILDPLDDI DNAAAREESR TEVGWRVSND
     EPRPIVDDDD AIIMDLDGWQ NLIKDDEDLH FLRDILVGYS ESNVGNLSAW AWRQKEIKAL
     NHPGDVGPLR GGTGIAGYYV PNTTGAARTE GRKRILESEK SKYLPHRIKV QKAREEREAR
     AKNDPHTAAV EAARVAAAKN ISKSTSRSTR VNNRRLIADI NAQKQALPTQ SGDGDVLRFN
     QLKKRKKPVR FARSAIHNWG LYAEENISAN DMIIEYVGEK VRQQVADMRE RQYLKSGIGS
     SYLFRIDENT VIDATKRGGI ARFINHSCTP NCTAKIIKVD GSKRIVIYAL RDIERDEELT
     YDYKFEREWD SDDRIPCLCG STGCKGFLN
//
DBGET integrated database retrieval system