GenomeNet

Database: UniProt
Entry: P0CO26
LinkDB: P0CO26
Original site: P0CO26 
ID   SET1_CRYNJ              Reviewed;        1469 AA.
AC   P0CO26; Q55U33; Q5KIA9;
DT   28-JUN-2011, integrated into UniProtKB/Swiss-Prot.
DT   28-JUN-2011, sequence version 1.
DT   10-APR-2019, entry version 45.
DE   RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4 specific;
DE            EC=2.1.1.43;
DE   AltName: Full=COMPASS component SET1;
DE   AltName: Full=SET domain-containing protein 1;
GN   Name=SET1; OrderedLocusNames=CND03610;
OS   Cryptococcus neoformans var. neoformans serotype D (strain JEC21 /
OS   ATCC MYA-565) (Filobasidiella neoformans).
OC   Eukaryota; Fungi; Dikarya; Basidiomycota; Agaricomycotina;
OC   Tremellomycetes; Tremellales; Cryptococcaceae; Cryptococcus;
OC   Cryptococcus neoformans species complex.
OX   NCBI_TaxID=214684;
RN   [1]
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=JEC21 / ATCC MYA-565;
RX   PubMed=15653466; DOI=10.1126/science.1103773;
RA   Loftus B.J., Fung E., Roncaglia P., Rowley D., Amedeo P., Bruno D.,
RA   Vamathevan J., Miranda M., Anderson I.J., Fraser J.A., Allen J.E.,
RA   Bosdet I.E., Brent M.R., Chiu R., Doering T.L., Donlin M.J.,
RA   D'Souza C.A., Fox D.S., Grinberg V., Fu J., Fukushima M., Haas B.J.,
RA   Huang J.C., Janbon G., Jones S.J.M., Koo H.L., Krzywinski M.I.,
RA   Kwon-Chung K.J., Lengeler K.B., Maiti R., Marra M.A., Marra R.E.,
RA   Mathewson C.A., Mitchell T.G., Pertea M., Riggs F.R., Salzberg S.L.,
RA   Schein J.E., Shvartsbeyn A., Shin H., Shumway M., Specht C.A.,
RA   Suh B.B., Tenney A., Utterback T.R., Wickes B.L., Wortman J.R.,
RA   Wye N.H., Kronstad J.W., Lodge J.K., Heitman J., Davis R.W.,
RA   Fraser C.M., Hyman R.W.;
RT   "The genome of the basidiomycetous yeast and human pathogen
RT   Cryptococcus neoformans.";
RL   Science 307:1321-1324(2005).
CC   -!- FUNCTION: Catalytic component of the COMPASS (Set1C) complex that
CC       specifically mono-, di- and trimethylates histone H3 to form
CC       H3K4me1/2/3, which subsequently plays a role in telomere length
CC       maintenance and transcription elongation regulation.
CC       {ECO:0000250}.
CC   -!- CATALYTIC ACTIVITY:
CC       Reaction=L-lysyl-[histone] + S-adenosyl-L-methionine = H(+) +
CC         N(6)-methyl-L-lysyl-[histone] + S-adenosyl-L-homocysteine;
CC         Xref=Rhea:RHEA:10024, Rhea:RHEA-COMP:9845, Rhea:RHEA-COMP:9846,
CC         ChEBI:CHEBI:15378, ChEBI:CHEBI:29969, ChEBI:CHEBI:57856,
CC         ChEBI:CHEBI:59789, ChEBI:CHEBI:61929; EC=2.1.1.43;
CC   -!- SUBUNIT: Component of the COMPASS (Set1C) complex. {ECO:0000250}.
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000305}. Chromosome
CC       {ECO:0000305}.
CC   -!- SIMILARITY: Belongs to the class V-like SAM-binding
CC       methyltransferase superfamily. {ECO:0000255|PROSITE-
CC       ProRule:PRU00190}.
DR   EMBL; AE017344; AAW43251.1; -; Genomic_DNA.
DR   RefSeq; XP_570558.1; XM_570558.1.
DR   ProteinModelPortal; P0CO26; -.
DR   SMR; P0CO26; -.
DR   STRING; 5207.AAW43251; -.
DR   PaxDb; P0CO26; -.
DR   EuPathDB; FungiDB:CND03610; -.
DR   eggNOG; KOG1080; Eukaryota.
DR   eggNOG; COG2940; LUCA.
DR   InParanoid; P0CO26; -.
DR   OMA; GIVFIRY; -.
DR   OrthoDB; 67625at2759; -.
DR   Proteomes; UP000002149; Chromosome 4.
DR   GO; GO:0005694; C:chromosome; IEA:UniProtKB-SubCell.
DR   GO; GO:0048188; C:Set1C/COMPASS complex; IEA:InterPro.
DR   GO; GO:0018024; F:histone-lysine N-methyltransferase activity; IEA:UniProtKB-EC.
DR   GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR   Gene3D; 3.30.70.330; -; 1.
DR   InterPro; IPR024657; COMPASS_Set1_N-SET.
DR   InterPro; IPR012677; Nucleotide-bd_a/b_plait_sf.
DR   InterPro; IPR003616; Post-SET_dom.
DR   InterPro; IPR035979; RBD_domain_sf.
DR   InterPro; IPR017111; Set1.
DR   InterPro; IPR001214; SET_dom.
DR   PANTHER; PTHR22884:SF462; PTHR22884:SF462; 2.
DR   Pfam; PF00856; SET; 1.
DR   SMART; SM01291; N-SET; 1.
DR   SMART; SM00508; PostSET; 1.
DR   SMART; SM00317; SET; 1.
DR   SUPFAM; SSF54928; SSF54928; 1.
DR   PROSITE; PS50868; POST_SET; 1.
DR   PROSITE; PS50280; SET; 1.
PE   3: Inferred from homology;
KW   Chromatin regulator; Chromosome; Complete proteome; Methyltransferase;
KW   Nucleus; Reference proteome; S-adenosyl-L-methionine; Transferase.
FT   CHAIN         1   1469       Histone-lysine N-methyltransferase, H3
FT                                lysine-4 specific.
FT                                /FTId=PRO_0000269771.
FT   DOMAIN     1327   1444       SET. {ECO:0000255|PROSITE-
FT                                ProRule:PRU00190}.
FT   DOMAIN     1453   1469       Post-SET. {ECO:0000255|PROSITE-
FT                                ProRule:PRU00155}.
FT   COMPBIAS    317    492       Pro-rich.
SQ   SEQUENCE   1469 AA;  163101 MW;  E54346DDA84F2739 CRC64;
     MAPHEKGVNP SESPSGSLKK APPSGPKALR GFASPSAFRN ADIGNGLLQH RGEDERISFA
     FPRKGKESLD RNGERPAPMS LESRLGPPVS RFNRPVGGDS LERGNGKGGW DNRDERTSAS
     SSSIHRINKE KIRPRSDFIE SSANLYSEDD RNRDRGRYHE RDRSRGTNGD RGGGEGHSHR
     EPGRGKEHQN GQGRDRSLYR DHSRERESSR DERDRYSEEY KHQRSKARFS PSPSPERSRL
     KSSLGRHRSP VSVSSSSSSS ARSPPPVQNR EPLRYNGSQS KNGEKELSNG LLENSISRSG
     VSIAVPRKLE TKQLVRPSPP HVNLKSTIQD NPSPPTGKYP PSPPSLDILS KPSCNREPLP
     DQRPPTPPLP ENSSPTSPSL ESSRFDQNQH LLPDELPLPP LSISFPQKPV SSSSLSRLSS
     LSAALSRPNP LDKDGTSMPP SFHFREISTR HQRLSPPNNA ENQLPETSII PPPPSETVPE
     PPWIRPPYIP PPCTKHRPGI GNFFITNLRE KVEDKSGKEE KRVDGMEGGK VVQVTDPRLS
     MTEEQRGRGR GSSKQRAAFY ELTYEWDLYS VTPKPPSPPT AVLITGLGPL TTVDQITKFL
     RPHGRIKEID SKVDRKTGMQ LGICWVKFEG PPLGRPGTAH DVASMAVKVC DGKKISMGGE
     RIRVVLDGRG KRAEQAVKEE MERRYPPKKP SALPSDMKVT PLSGTAMATT SRPPNAPNAS
     TPLIDKQTFD TSAKAPIIRP GVLKPLGQKM YHRPSAPPLV FNNHRFRDES FLNRPFNAGA
     GMISQQQMGY KILPGKPVQQ LASSFTSAPF VRHPRERRED SWTNERGRRL KGESSTRHWR
     ARSLSRSSYS SYSSYSSYSE ESEEERPRHP TKVPYPQRKR LATGPSKEDE YKMEDVREAI
     RENGHPCVFI DAKSLPAARE YESRQSHLGW YILFADDTTA YRVQRVLDTT AVQGHRLSLV
     VHTSSGPRAQ TDASEPVTGG VGESKKGNWR YLTITKKSRP MPAVKKSGKS ATIRRKVYSP
     SVSGSDDDDE QVPVMAQNRK RAPSYASSTS PLSEDDRPFA RSVQREERDI DKEGKFSLIG
     KKEDVVSVKA AKGPKSKTIR VDSDEVEENQ GVPLASIGEV TKAEGKQDDS TVVKLETLLS
     ESISELTKGK KRPTKAKGGK ATKKVRLDQE ADDAATKIQI DEDIVPQPPK KKKVVKTEVD
     KLLASGVLMD EEDAYWLGRV LAAQEDGLEP IWSDGEEDLV DEGHPLFHKS GAWRAEGWKK
     VAQVQKSRYL PQRNRAVVNS EDVGGITTGR TARLAGRDQH RQTAAVAANN TVESDLFAFN
     QLRIRKKQLR FARSAIEGYG LYAMETIHAG EMVCEYVGDL VRATVADVRE QRYLKQGIGS
     SYLFRIDNDI VCDATFKGSV SRLINHSCDP SANAKIIKVN GQSKIVIYAE RTLYPGEEIL
     YDYKFPLESD PALRVPCLCG AATCRGWLN
//
DBGET integrated database retrieval system