Database: UniProt/SWISS-PROT
Entry: SET1_YARLI
ID   SET1_YARLI              Reviewed;        1170 AA.
AC   Q6CEK8;
DT   09-JAN-2007, integrated into UniProtKB/Swiss-Prot.
DT   16-AUG-2004, sequence version 1.
DT   28-MAR-2018, entry version 108.
DE   RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4 specific;
DE            EC=2.1.1.43;
DE   AltName: Full=COMPASS component SET1;
DE   AltName: Full=SET domain-containing protein 1;
GN   Name=SET1; OrderedLocusNames=YALI0B14883g;
OS   Yarrowia lipolytica (strain CLIB 122 / E 150) (Yeast) (Candida
OS   lipolytica).
OC   Eukaryota; Fungi; Dikarya; Ascomycota; Saccharomycotina;
OC   Saccharomycetes; Saccharomycetales; Dipodascaceae; Yarrowia.
OX   NCBI_TaxID=284591;
RN   [1]
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=CLIB 122 / E 150;
RX   PubMed=15229592; DOI=10.1038/nature02579;
RA   Dujon B., Sherman D., Fischer G., Durrens P., Casaregola S.,
RA   Lafontaine I., de Montigny J., Marck C., Neuveglise C., Talla E.,
RA   Goffard N., Frangeul L., Aigle M., Anthouard V., Babour A., Barbe V.,
RA   Barnay S., Blanchin S., Beckerich J.-M., Beyne E., Bleykasten C.,
RA   Boisrame A., Boyer J., Cattolico L., Confanioleri F., de Daruvar A.,
RA   Despons L., Fabre E., Fairhead C., Ferry-Dumazet H., Groppi A.,
RA   Hantraye F., Hennequin C., Jauniaux N., Joyet P., Kachouri R.,
RA   Kerrest A., Koszul R., Lemaire M., Lesur I., Ma L., Muller H.,
RA   Nicaud J.-M., Nikolski M., Oztas S., Ozier-Kalogeropoulos O.,
RA   Pellenz S., Potier S., Richard G.-F., Straub M.-L., Suleau A.,
RA   Swennen D., Tekaia F., Wesolowski-Louvel M., Westhof E., Wirth B.,
RA   Zeniou-Meyer M., Zivanovic Y., Bolotin-Fukuhara M., Thierry A.,
RA   Bouchier C., Caudron B., Scarpelli C., Gaillardin C., Weissenbach J.,
RA   Wincker P., Souciet J.-L.;
RT   "Genome evolution in yeasts.";
RL   Nature 430:35-44(2004).
CC   -!- FUNCTION: Catalytic component of the COMPASS (Set1C) complex that
CC       specifically mono-, di- and trimethylates histone H3 to form
CC       H3K4me1/2/3, which subsequently plays a role in telomere length
CC       maintenance and transcription elongation regulation.
CC       {ECO:0000250}.
CC   -!- CATALYTIC ACTIVITY: S-adenosyl-L-methionine + L-lysine-[histone] =
CC       S-adenosyl-L-homocysteine + N(6)-methyl-L-lysine-[histone].
CC   -!- SUBUNIT: Component of the COMPASS (Set1C) complex. {ECO:0000250}.
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000305}. Chromosome
CC       {ECO:0000305}.
CC   -!- SIMILARITY: Belongs to the class V-like SAM-binding
CC       methyltransferase superfamily. {ECO:0000255|PROSITE-
CC       ProRule:PRU00190}.
DR   EMBL; CR382128; CAG83155.1; -; Genomic_DNA.
DR   RefSeq; XP_500904.1; XM_500904.1.
DR   ProteinModelPortal; Q6CEK8; -.
DR   SMR; Q6CEK8; -.
DR   STRING; 4952.XP_500904.1; -.
DR   EnsemblFungi; CAG83155; CAG83155; YALI0_B14883g.
DR   GeneID; 2907346; -.
DR   KEGG; yli:YALI0B14883g; -.
DR   InParanoid; Q6CEK8; -.
DR   KO; K11422; -.
DR   OMA; PSCTAKI; -.
DR   OrthoDB; EOG092C3T9B; -.
DR   Proteomes; UP000001300; Chromosome B.
DR   GO; GO:0005694; C:chromosome; IEA:UniProtKB-SubCell.
DR   GO; GO:0048188; C:Set1C/COMPASS complex; IEA:InterPro.
DR   GO; GO:0018024; F:histone-lysine N-methyltransferase activity; IEA:UniProtKB-EC.
DR   GO; GO:0003723; F:RNA binding; IEA:UniProtKB-KW.
DR   Gene3D; 3.30.70.330; -; 1.
DR   InterPro; IPR024657; COMPASS_Set1_N-SET.
DR   InterPro; IPR012677; Nucleotide-bd_a/b_plait_sf.
DR   InterPro; IPR003616; Post-SET_dom.
DR   InterPro; IPR035979; RBD_domain_sf.
DR   InterPro; IPR000504; RRM_dom.
DR   InterPro; IPR017111; Set1.
DR   InterPro; IPR024636; SET_assoc.
DR   InterPro; IPR001214; SET_dom.
DR   PANTHER; PTHR22884:SF462; PTHR22884:SF462; 5.
DR   Pfam; PF11764; N-SET; 1.
DR   Pfam; PF00076; RRM_1; 1.
DR   Pfam; PF00856; SET; 1.
DR   Pfam; PF11767; SET_assoc; 1.
DR   PIRSF; PIRSF037104; Histone_H3-K4_mtfrase_Set1_fun; 1.
DR   SMART; SM01291; N-SET; 1.
DR   SMART; SM00508; PostSET; 1.
DR   SMART; SM00360; RRM; 2.
DR   SMART; SM00317; SET; 1.
DR   SUPFAM; SSF54928; SSF54928; 2.
DR   PROSITE; PS50868; POST_SET; 1.
DR   PROSITE; PS50102; RRM; 1.
DR   PROSITE; PS51572; SAM_MT43_1; 1.
DR   PROSITE; PS50280; SET; 1.
PE   3: Inferred from homology;
KW   Chromatin regulator; Chromosome; Complete proteome; Methyltransferase;
KW   Nucleus; Reference proteome; RNA-binding; S-adenosyl-L-methionine;
KW   Transferase.
FT   CHAIN         1   1170       Histone-lysine N-methyltransferase, H3
FT                                lysine-4 specific.
FT                                /FTId=PRO_0000269778.
FT   DOMAIN      403    489       RRM. {ECO:0000255|PROSITE-
FT                                ProRule:PRU00176}.
FT   DOMAIN     1029   1146       SET. {ECO:0000255|PROSITE-
FT                                ProRule:PRU00190}.
FT   DOMAIN     1154   1170       Post-SET. {ECO:0000255|PROSITE-
FT                                ProRule:PRU00155}.
FT   COMPBIAS     15    270       Arg-rich.
FT   COMPBIAS    808    837       Glu-rich.
SQ   SEQUENCE   1170 AA;  133489 MW;  27FC52DEC9755E44 CRC64;
     MANGSDTKPV PKGPRGARAD DTRTPPHPDR RGSDRDQYFS KDRSYGGRTD RDYSDYDRAY
     PPPRDYDRYG GGKYGKYDKY DKYDRYDRLD SHDYPPRRDR DYDRSRDARD TDSRDYGRLP
     PRDTRPPPRG GREYRYARDE RGWDRDRRDL DRDRDRDPRD RERDARDRDR DRDFRDRDHD
     PRDRDPKGYR PHSLDPLPSR DRDMPPRERE RELVRERDLF SRDRDLITGV RELARSPPPP
     LRDGRNRDRS NDRHDRSFDK HTRDRGTRDR DTKEKGSVIS TPQTAPTPRL TPTNGKPDLP
     TSQPPAPPSP PRLKTFPKEV WETVGRKNNT RLTYDPELSK DKQKGKRGIY EEVKKGASTT
     EDPRSKHPHY FKTSSKSNKK PYQRLPVPRF LYDANSIAPP PSTQIIVKGL SSLTTSKTIT
     AHFKTYGELE AVNMVEDPAT GSSVGACLVR FKVTKNNYEA AHECLKKAIR GQKTGRIDQA
     KYRVEPDEDG AKAKDIIKRV AARAAKKAMP VKQPPTAPAA DKSIMEKLPT PVPSARPSPK
     AAQLMKTSAY IFIAGKYLPS EKVFASDIRR ILRDFGWFDV LKEGDGFYVF FDNNRDTVEC
     YEAMNGRKVN GHQMAMAMIR LSRVAPKTKE SEENATEQAP KLPPKEEARK LIVQELANSL
     RKDIRERVIA AAIVEFLNPA RFTHIKQDPE PSAATEPNTV NASAARVDTP EPVQSSSAIP
     GFLPRFKIKR KGDPTKKKDA TKKNRKKISA RPMNHVLNDY YSDEEDSTRM STPIVPDTSA
     DAAELPIRKP RKKISQSKQR IMDFSSSEGS NESEEEEEVE DDMEEEEDET AQEELQEAST
     LDTSQQLTTA FGEILDWAPA HGFPQPVTAD KKGGALTTIS GFQALVKDDE DMELLQEALE
     GIEPEKINGE AWTWTHKHLN EAVAKENAEF PELAAPNAFV NSTGSWKSQG YFKIPEAAKS
     EYLPHRKKLN IPIDTLQMEN REKKKENASN SRVNRANNRR LVADINMQKQ LLSTETDVLN
     FNQLRKRKKP VKFARSAIHN WGLYAIEPIA ANEMIIEYVG EVVRQEIADL REARYMRSGI
     GSSYLFRVDE STVVDATKRG GIARFINHCC TPSCTAKIIK VEGQKRIVIY ASRDIAANEE
     LTYDYKFEKE IGEERIPCLC GAPGCKGYLN
//
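
The record above uses the Swiss-Prot flat-file layout: a two-letter line code, the field content from column 6 onward, a sequence block after the SQ line, and a terminating "//". The following is a minimal sketch of reading that layout, assuming Python with only the standard library. The names parse_flatfile, domains, declared_length and SAMPLE are hypothetical helpers for illustration and do not belong to any UniProt or GenomeNet tool. SAMPLE is a shortened excerpt of the entry that keeps only the first sequence line, so the parsed sequence is deliberately shorter than the 1170 AA declared on the SQ line.

```python
import re
from collections import defaultdict

# Shortened excerpt of the entry above (assumption: only the first
# sequence line is kept, so the sequence here is truncated).
SAMPLE = """\
ID   SET1_YARLI              Reviewed;        1170 AA.
AC   Q6CEK8;
FT   DOMAIN      403    489       RRM. {ECO:0000255|PROSITE-
FT                                ProRule:PRU00176}.
FT   DOMAIN     1029   1146       SET. {ECO:0000255|PROSITE-
FT                                ProRule:PRU00190}.
FT   DOMAIN     1154   1170       Post-SET. {ECO:0000255|PROSITE-
FT                                ProRule:PRU00155}.
SQ   SEQUENCE   1170 AA;  133489 MW;  27FC52DEC9755E44 CRC64;
     MANGSDTKPV PKGPRGARAD DTRTPPHPDR RGSDRDQYFS KDRSYGGRTD RDYSDYDRAY
//
"""


def parse_flatfile(text):
    """Group lines by two-letter line code and collect the sequence block."""
    fields = defaultdict(list)
    seq_parts = []
    in_seq = False
    for line in text.splitlines():
        if line.startswith("//"):      # record terminator
            break
        if in_seq:                     # sequence lines carry no line code
            seq_parts.append(line.replace(" ", ""))
            continue
        code, rest = line[:2], line[5:]
        if code == "SQ":               # everything after SQ is sequence
            in_seq = True
        fields[code].append(rest)
    return dict(fields), "".join(seq_parts)


def domains(ft_lines):
    """Return (name, start, end, length) for each FT DOMAIN line."""
    found = []
    for ft in ft_lines:
        m = re.match(r"DOMAIN\s+(\d+)\s+(\d+)\s+([^.]+)\.", ft)
        if m:                          # continuation lines do not match
            start, end = int(m.group(1)), int(m.group(2))
            # Coordinates are 1-based and inclusive.
            found.append((m.group(3), start, end, end - start + 1))
    return found


def declared_length(fields):
    """Read the residue count stated on the SQ header line."""
    m = re.match(r"SEQUENCE\s+(\d+) AA;", fields["SQ"][0])
    return int(m.group(1))


fields, seq = parse_flatfile(SAMPLE)
doms = domains(fields["FT"])
for name, start, end, length in doms:
    print(f"{name}: {start}-{end} ({length} residues)")
print("declared length:", declared_length(fields), "parsed residues:", len(seq))
```

Run against the complete record instead of the excerpt, the parsed residue count can be compared with the value declared on the SQ line as a quick integrity check.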