GenomeNet

Database: UniProt
Entry: P0CO27
LinkDB: P0CO27
Original site: P0CO27 
ID   SET1_CRYNB              Reviewed;        1469 AA.
AC   P0CO27; Q55U33; Q5KIA9;
DT   28-JUN-2011, integrated into UniProtKB/Swiss-Prot.
DT   28-JUN-2011, sequence version 1.
DT   10-APR-2019, entry version 41.
DE   RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4 specific;
DE            EC=2.1.1.43;
DE   AltName: Full=COMPASS component SET1;
DE   AltName: Full=SET domain-containing protein 1;
GN   Name=SET1; OrderedLocusNames=CNBD2720;
OS   Cryptococcus neoformans var. neoformans serotype D (strain B-3501A)
OS   (Filobasidiella neoformans).
OC   Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina;
OC   Tremellomycetes; Tremellales; Cryptococcaceae; Cryptococcus;
OC   Cryptococcus neoformans species complex.
OX   NCBI_TaxID=283643;
RN   [1]
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=B-3501A;
RX   PubMed=15653466; DOI=10.1126/science.1103773;
RA   Loftus B.J., Fung E., Roncaglia P., Rowley D., Amedeo P., Bruno D.,
RA   Vamathevan J., Miranda M., Anderson I.J., Fraser J.A., Allen J.E.,
RA   Bosdet I.E., Brent M.R., Chiu R., Doering T.L., Donlin M.J.,
RA   D'Souza C.A., Fox D.S., Grinberg V., Fu J., Fukushima M., Haas B.J.,
RA   Huang J.C., Janbon G., Jones S.J.M., Koo H.L., Krzywinski M.I.,
RA   Kwon-Chung K.J., Lengeler K.B., Maiti R., Marra M.A., Marra R.E.,
RA   Mathewson C.A., Mitchell T.G., Pertea M., Riggs F.R., Salzberg S.L.,
RA   Schein J.E., Shvartsbeyn A., Shin H., Shumway M., Specht C.A.,
RA   Suh B.B., Tenney A., Utterback T.R., Wickes B.L., Wortman J.R.,
RA   Wye N.H., Kronstad J.W., Lodge J.K., Heitman J., Davis R.W.,
RA   Fraser C.M., Hyman R.W.;
RT   "The genome of the basidiomycetous yeast and human pathogen
RT   Cryptococcus neoformans.";
RL   Science 307:1321-1324(2005).
CC   -!- FUNCTION: Catalytic component of the COMPASS (Set1C) complex that
CC       specifically mono-, di- and trimethylates histone H3 to form
CC       H3K4me1/2/3, which subsequently plays a role in telomere length
CC       maintenance and transcription elongation regulation.
CC       {ECO:0000250}.
CC   -!- CATALYTIC ACTIVITY:
CC       Reaction=L-lysyl-[histone] + S-adenosyl-L-methionine = H(+) +
CC         N(6)-methyl-L-lysyl-[histone] + S-adenosyl-L-homocysteine;
CC         Xref=Rhea:RHEA:10024, Rhea:RHEA-COMP:9845, Rhea:RHEA-COMP:9846,
CC         ChEBI:CHEBI:15378, ChEBI:CHEBI:29969, ChEBI:CHEBI:57856,
CC         ChEBI:CHEBI:59789, ChEBI:CHEBI:61929; EC=2.1.1.43;
CC   -!- SUBUNIT: Component of the COMPASS (Set1C) complex. {ECO:0000250}.
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000305}. Chromosome
CC       {ECO:0000305}.
CC   -!- SIMILARITY: Belongs to the class V-like SAM-binding
CC       methyltransferase superfamily. {ECO:0000255|PROSITE-
CC       ProRule:PRU00190}.
CC   -!- SEQUENCE CAUTION:
CC       Sequence=EAL21216.1; Type=Erroneous gene model prediction; Evidence={ECO:0000305};
DR   EMBL; AAEY01000020; EAL21216.1; ALT_SEQ; Genomic_DNA.
DR   RefSeq; XP_775863.1; XM_770770.1.
DR   SMR; P0CO27; -.
DR   EnsemblFungi; EAL21216; EAL21216; CNBD2720.
DR   GeneID; 4935661; -.
DR   KEGG; cnb:CNBD2720; -.
DR   EuPathDB; FungiDB:CNBD2720; -.
DR   eggNOG; KOG1080; Eukaryota.
DR   eggNOG; COG2940; LUCA.
DR   KO; K11422; -.
DR   Proteomes; UP000001435; Chromosome 4.
DR   Proteomes; UP000001435; Unassembled WGS sequence.
DR   GO; GO:0005694; C:chromosome; IEA:UniProtKB-SubCell.
DR   GO; GO:0048188; C:Set1C/COMPASS complex; IEA:InterPro.
DR   GO; GO:0018024; F:histone-lysine N-methyltransferase activity; IEA:UniProtKB-EC.
DR   GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR   Gene3D; 3.30.70.330; -; 1.
DR   InterPro; IPR024657; COMPASS_Set1_N-SET.
DR   InterPro; IPR012677; Nucleotide-bd_a/b_plait_sf.
DR   InterPro; IPR003616; Post-SET_dom.
DR   InterPro; IPR035979; RBD_domain_sf.
DR   InterPro; IPR017111; Set1.
DR   InterPro; IPR001214; SET_dom.
DR   PANTHER; PTHR22884:SF462; PTHR22884:SF462; 2.
DR   Pfam; PF00856; SET; 1.
DR   SMART; SM01291; N-SET; 1.
DR   SMART; SM00508; PostSET; 1.
DR   SMART; SM00317; SET; 1.
DR   SUPFAM; SSF54928; SSF54928; 1.
DR   PROSITE; PS50868; POST_SET; 1.
DR   PROSITE; PS50280; SET; 1.
PE   3: Inferred from homology;
KW   Chromatin regulator; Chromosome; Complete proteome; Methyltransferase;
KW   Nucleus; S-adenosyl-L-methionine; Transferase.
FT   CHAIN         1   1469       Histone-lysine N-methyltransferase, H3
FT                                lysine-4 specific.
FT                                /FTId=PRO_0000410118.
FT   DOMAIN     1327   1444       SET. {ECO:0000255|PROSITE-
FT                                ProRule:PRU00190}.
FT   DOMAIN     1453   1469       Post-SET. {ECO:0000255|PROSITE-
FT                                ProRule:PRU00155}.
FT   COMPBIAS    317    492       Pro-rich.
SQ   SEQUENCE   1469 AA;  163056 MW;  046DE4E651BCCCB3 CRC64;
     MAPHEKGVNP SESPSGSLKK APPSGPKALR GFASPSAFRN AGIGNGLLQH RGEDERISFA
     FPRKGKESLD RNGERPAPMS LESRLGPPVS RFNRPVGGDS LERGNGKGGW DNRDERTSAS
     SSSIHRINKE KIRPRSDFIE SSANLYSEDD RNRDRGRYHE RDRSRGTNGD RGGGEGHSHR
     EPGRGKEHQN GQGRDRSLYR DHSRERESSR DERDRYSEEY KHQRSKARFS PSPSPERSRL
     KSSLGRHRSP VSVSSSSSSS ARSPPPVQNR EPLRYNGSQS KNGEKELSNG LLENSISRSG
     VSIAVPRKLE TKQLVRPSPP HVNLKSTIQD NPSPPTGKYP PSPPSLDILS KPSCNREPLP
     DQRPPTPPLP ENSSPTSPSL ESSRFDQNQH LLPDELPLPP LSISFPQKPV SSSSLSRLSS
     LSAALSRPNP LDKDGTSMPP SFHFREISNR HQRLSPPNNA ENQLPETSII PPPPSETVPE
     PPWIRPPYIP PPCTKHRPGI GNFFITNLRE KVEDKSGKEE KRVDGMEGGK VVQVTDPRLS
     MTEEQRGRGR GSSKQRAAFY ELTYEWDLYS VTPKPPSPPT AVLITGLGPL TTVDQITKFL
     RPHGRIKEID SKVDRKTDMQ LGICWVKFEG PPLGRPGTAH DVASMAVKVC DGKKISMGGE
     RIRVVLDGRG KRAEQAVKEE MERRYPPKKP SALPSDMKVT PLSGTAMATT SRPPNAPNAS
     TPLIDKQTFD TSAKAPIIRP GVLKPLGQKM YHRPSAPPLV FNNHRFRDES FLNRPFNAGA
     GMISQQQMGY KILPGKPVQQ LASSFTSAPF VRHPRERRED SWTNERGRRL KGESSTRHWR
     ARSLSRSSYS SYSSYSSYSE ESEEERPRHP TKVPYPQRKR LATGPSKEDE YKMEDVREAI
     RENGHPCVFI DAKSLPAARE YESRQSHLGW YILFADDTTA YRVQRVLDTT AVQGHRLSLV
     VHTSSGPRAQ TDASEPVTGG VGESKKGNWR YLTITKKSRP MPAVKKSGKS ATIRRKVYSP
     SVSGSDDDDE QVPVMAQNRK RAPSYASSTS PLSEDDRPFA RSVQREERDI DKEGKFSLIG
     KKADVVSVKA AKGPKSKTIR VDSDEVEENQ GVPLASIGEV TKAEGKQDDS TVVKLETLLS
     ESISELTKGK KRPTKAKGGK ATKKVRLDQE ADDAATKIQI DEDIVPQPPK KKKVVKTEVD
     KLLASGVLMD EEDAYWLGRV LAAQEDGLEP IWSDGEEDLV DEGHPLFHKS GAWRAEGWKK
     VAQVQKSRYL PQRNRAVVNS EDVGGITTGR TARLAGRDQH RQTAAVAANN TVESDLFAFN
     QLRIRKKQLR FARSAIEGYG LYAMETIHAG EMVCEYVGDL VRATVADVRE QRYLKQGIGS
     SYLFRIDNDI VCDATFKGSV SRLINHSCDP SANAKIIKVN GQSKIVIYAE RTLYPGEEIL
     YDYKFPLESD PALRVPCLCG AATCRGWLN
//
DBGET integrated database retrieval system