GenomeNet

Database: UniProt
Entry: Q4PB36
LinkDB: Q4PB36
Original site: Q4PB36 
ID   SET1_USTMA              Reviewed;        1468 AA.
AC   Q4PB36; A0A0D1C6X9;
DT   09-JAN-2007, integrated into UniProtKB/Swiss-Prot.
DT   19-JUL-2005, sequence version 1.
DT   16-JAN-2019, entry version 90.
DE   RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4 specific;
DE            EC=2.1.1.43;
DE   AltName: Full=COMPASS component SET1;
DE   AltName: Full=SET domain-containing protein 1;
GN   Name=SET1; ORFNames=UMAG_02677;
OS   Ustilago maydis (strain 521 / FGSC 9021) (Corn smut fungus).
OC   Eukaryota; Fungi; Dikarya; Basidiomycota; Ustilaginomycotina;
OC   Ustilaginomycetes; Ustilaginales; Ustilaginaceae; Ustilago.
OX   NCBI_TaxID=237631;
RN   [1]
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=521 / FGSC 9021;
RX   PubMed=17080091; DOI=10.1038/nature05248;
RA   Kaemper J., Kahmann R., Boelker M., Ma L.-J., Brefort T.,
RA   Saville B.J., Banuett F., Kronstad J.W., Gold S.E., Mueller O.,
RA   Perlin M.H., Woesten H.A.B., de Vries R., Ruiz-Herrera J.,
RA   Reynaga-Pena C.G., Snetselaar K., McCann M., Perez-Martin J.,
RA   Feldbruegge M., Basse C.W., Steinberg G., Ibeas J.I., Holloman W.,
RA   Guzman P., Farman M.L., Stajich J.E., Sentandreu R.,
RA   Gonzalez-Prieto J.M., Kennell J.C., Molina L., Schirawski J.,
RA   Mendoza-Mendoza A., Greilinger D., Muench K., Roessel N., Scherer M.,
RA   Vranes M., Ladendorf O., Vincon V., Fuchs U., Sandrock B., Meng S.,
RA   Ho E.C.H., Cahill M.J., Boyce K.J., Klose J., Klosterman S.J.,
RA   Deelstra H.J., Ortiz-Castellanos L., Li W., Sanchez-Alonso P.,
RA   Schreier P.H., Haeuser-Hahn I., Vaupel M., Koopmann E., Friedrich G.,
RA   Voss H., Schlueter T., Margolis J., Platt D., Swimmer C., Gnirke A.,
RA   Chen F., Vysotskaia V., Mannhaupt G., Gueldener U.,
RA   Muensterkoetter M., Haase D., Oesterheld M., Mewes H.-W.,
RA   Mauceli E.W., DeCaprio D., Wade C.M., Butler J., Young S.K.,
RA   Jaffe D.B., Calvo S.E., Nusbaum C., Galagan J.E., Birren B.W.;
RT   "Insights from the genome of the biotrophic fungal plant pathogen
RT   Ustilago maydis.";
RL   Nature 444:97-101(2006).
RN   [2]
RP   GENOME REANNOTATION.
RC   STRAIN=521 / FGSC 9021;
RA   Gueldener U., Muensterkoetter M., Walter M.C., Mannhaupt G.,
RA   Kahmann R.;
RL   Submitted (SEP-2014) to the EMBL/GenBank/DDBJ databases.
CC   -!- FUNCTION: Catalytic component of the COMPASS (Set1C) complex that
CC       specifically mono-, di- and trimethylates histone H3 to form
CC       H3K4me1/2/3, which subsequently plays a role in telomere length
CC       maintenance and transcription elongation regulation.
CC       {ECO:0000250}.
CC   -!- CATALYTIC ACTIVITY:
CC       Reaction=L-lysyl-[histone] + S-adenosyl-L-methionine = H(+) +
CC         N(6)-methyl-L-lysyl-[histone] + S-adenosyl-L-homocysteine;
CC         Xref=Rhea:RHEA:10024, Rhea:RHEA-COMP:9845, Rhea:RHEA-COMP:9846,
CC         ChEBI:CHEBI:15378, ChEBI:CHEBI:29969, ChEBI:CHEBI:57856,
CC         ChEBI:CHEBI:59789, ChEBI:CHEBI:61929; EC=2.1.1.43;
CC   -!- SUBUNIT: Component of the COMPASS (Set1C) complex. {ECO:0000250}.
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000305}. Chromosome
CC       {ECO:0000305}.
CC   -!- SIMILARITY: Belongs to the class V-like SAM-binding
CC       methyltransferase superfamily. {ECO:0000255|PROSITE-
CC       ProRule:PRU00190}.
DR   EMBL; CM003145; KIS69337.1; -; Genomic_DNA.
DR   RefSeq; XP_011389058.1; XM_011390756.1.
DR   ProteinModelPortal; Q4PB36; -.
DR   SMR; Q4PB36; -.
DR   STRING; 5270.UM02677P0; -.
DR   PRIDE; Q4PB36; -.
DR   EnsemblFungi; KIS69337; KIS69337; UMAG_02677.
DR   GeneID; 23563368; -.
DR   KEGG; uma:UMAG_02677; -.
DR   EuPathDB; FungiDB:UMAG_02677; -.
DR   InParanoid; Q4PB36; -.
DR   KO; K11422; -.
DR   OMA; NCNAKIL; -.
DR   OrthoDB; 1017537at2759; -.
DR   Proteomes; UP000000561; Chromosome 6.
DR   Proteomes; UP000000561; Unassembled WGS sequence.
DR   GO; GO:0005694; C:chromosome; IEA:UniProtKB-SubCell.
DR   GO; GO:0048188; C:Set1C/COMPASS complex; IEA:InterPro.
DR   GO; GO:0018024; F:histone-lysine N-methyltransferase activity; IEA:UniProtKB-EC.
DR   GO; GO:0003723; F:RNA binding; IEA:UniProtKB-KW.
DR   InterPro; IPR024657; COMPASS_Set1_N-SET.
DR   InterPro; IPR003616; Post-SET_dom.
DR   InterPro; IPR017111; Set1.
DR   InterPro; IPR024636; SET_assoc.
DR   InterPro; IPR001214; SET_dom.
DR   PANTHER; PTHR22884:SF462; PTHR22884:SF462; 1.
DR   Pfam; PF11764; N-SET; 1.
DR   Pfam; PF00856; SET; 1.
DR   Pfam; PF11767; SET_assoc; 1.
DR   SMART; SM01291; N-SET; 1.
DR   SMART; SM00317; SET; 1.
DR   PROSITE; PS50868; POST_SET; 1.
DR   PROSITE; PS50280; SET; 1.
PE   3: Inferred from homology;
KW   Chromatin regulator; Chromosome; Complete proteome; Methyltransferase;
KW   Nucleus; Reference proteome; RNA-binding; S-adenosyl-L-methionine;
KW   Transferase.
FT   CHAIN         1   1468       Histone-lysine N-methyltransferase, H3
FT                                lysine-4 specific.
FT                                /FTId=PRO_0000269777.
FT   DOMAIN     1327   1444       SET. {ECO:0000255|PROSITE-
FT                                ProRule:PRU00190}.
FT   DOMAIN     1453   1468       Post-SET. {ECO:0000255|PROSITE-
FT                                ProRule:PRU00155}.
FT   COMPBIAS     27    275       Arg-rich.
FT   COMPBIAS    641    685       Pro-rich.
SQ   SEQUENCE   1468 AA;  164530 MW;  7CC6632973149DF9 CRC64;
     MPYSSQQNGY TSASTSRLSE QTSSHSRSSR EDRHLTEKGR RPPSPEARHR SDRDYDRRRS
     TEYVRDDDYR RSSRSSHDSR YADAYDHWRS ARSAYSPTPR DDRRDEARND LSSTKRHRSP
     EHSTSRLRHR SPESAHRRQN GTANRLDSKP DRGGDRKTGE ALDSGRSRWS QRAYEYDDWR
     NERPSARYER YRHDREPHRS RREDEYETKR SRDDSNGNSI YAPTRRSRSR SRSRSRSRDR
     YRSRDHSRER RRERSRDRSN GTYSSRDDRR PKADRSAHTI KRDEHSTRLN GTSEDSKDLR
     HESQRRVSAS VQSASEGPAS TPVARAVYIK HAEVDQEAPA PPTTRDYHSC PQRWPDQADS
     AVRASSAPNG SATAPSRSDR PPANGSSGRH SPRSLPTREK AEEARTSSTR RPSSQTNDNV
     NNSRDPLTQR KATSERSFGH VLLPHELPVE CRGKNYMATA TYKEGVKSIY KSAADKHLVD
     VDTRDPRRLG KKSSRYRESL HSASFRWDSN SRGKKPLPPP RNLVLTNLSG LLQPHQILLH
     ILPHGRIESS KLEIDPKIGQ SLGIFRVTFA HDFDEHGKPL ESMPAGQNPQ HGAKVAKAAC
     LALNGRMIGQ TRAQAFLDRD GEVIAERIKA KLAENEHKLR PTIVPPAPPA AASSSPATPS
     TTKQSMPPPQ VPRGPKVFMP AAPSPSYASS PASARANTDR YEYSATSHSR YRSSYEESRK
     LASSETYHRR RGTEEYDTYN RSKPYADAQV PAGSRSETRK DIKRPDEEIL NELRDKKRPY
     VHIPRPKNCD IDVTSVEAQL RSTAPIWVRE GQKGFYAAFH TSKEANQCKV VNETLTIGGY
     TLQVDVRSAP SQHAPSQQIR TPSGKHASVP LSMPAPPKQE RKAIDTGLRP PTADEKLKVD
     WSAAELQDAV FRMLQKELAD TFVRDVKSRV VGPYLTAYLK PDGEGGKMLA KATMKKPVIP
     TSINDHGTTL FEATGEARLP SFRKLAGAHP KKKASDADTT TSQAKRDQTD AKKKRGHTHR
     SKVHRDRDVS SSENESDDME RGMVVAARRN SYTRSKSSTK RRGAAAWLLE ASDAEAGTDD
     VDSTETDALS RSVSASVEPT GEEQIEVDVG AKAKKIPKVK AATVSKKKGT TAARKKLDVA
     PPEAVVEADQ GSETATPETD VPIKTAAAKA KVKPAKTSAK AKSALVDPFE AGLVEDSEDC
     HYLRLALEHL SRTGELASEH TLPDEIELEV EAEEQAMAAG GIPKHSTGSA RTEGYYRIPP
     EQKAMHLPDR NKATEDVDTS SNAQILQSAR NNRADSRRLV LGIEQHKRET ATDTDIFKFN
     QLRTRKKQLK FAKSPIHDWG LYAMELIPAG DMVIEYVGEV VRQQVADERE KQYERQGNFS
     TYLFRVDDDL VVDATHKGNI ARLMNHCCTP NCNAKILTLN GEKRIVLFAK TAIRAGEELT
     YDYKFQSSAD DEDAIPCLCG SPGCRRFL
//
DBGET integrated database retrieval system