GenomeNet

Database: UniProt
Entry: Q2GWF3
ID   SET1_CHAGB              Reviewed;        1076 AA.
AC   Q2GWF3;
DT   09-JAN-2007, integrated into UniProtKB/Swiss-Prot.
DT   21-MAR-2006, sequence version 1.
DT   10-APR-2019, entry version 87.
DE   RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4 specific;
DE            EC=2.1.1.43;
DE   AltName: Full=COMPASS component SET1;
DE   AltName: Full=SET domain-containing protein 1;
GN   Name=SET1; ORFNames=CHGG_07701;
OS   Chaetomium globosum (strain ATCC 6205 / CBS 148.51 / DSM 1962 / NBRC
OS   6347 / NRRL 1970) (Soil fungus).
OC   Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina;
OC   Sordariomycetes; Sordariomycetidae; Sordariales; Chaetomiaceae;
OC   Chaetomium.
OX   NCBI_TaxID=306901;
RN   [1]
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=ATCC 6205 / CBS 148.51 / DSM 1962 / NBRC 6347 / NRRL 1970;
RX   PubMed=25720678; DOI=10.1128/genomeA.00021-15;
RA   Cuomo C.A., Untereiner W.A., Ma L.-J., Grabherr M., Birren B.W.;
RT   "Draft genome sequence of the cellulolytic fungus Chaetomium
RT   globosum.";
RL   Genome Announc. 3:E0002115-E0002115(2015).
CC   -!- FUNCTION: Catalytic component of the COMPASS (Set1C) complex that
CC       specifically mono-, di- and trimethylates histone H3 to form
CC       H3K4me1/2/3, which subsequently plays a role in telomere length
CC       maintenance and transcription elongation regulation.
CC       {ECO:0000250}.
CC   -!- CATALYTIC ACTIVITY:
CC       Reaction=L-lysyl-[histone] + S-adenosyl-L-methionine = H(+) +
CC         N(6)-methyl-L-lysyl-[histone] + S-adenosyl-L-homocysteine;
CC         Xref=Rhea:RHEA:10024, Rhea:RHEA-COMP:9845, Rhea:RHEA-COMP:9846,
CC         ChEBI:CHEBI:15378, ChEBI:CHEBI:29969, ChEBI:CHEBI:57856,
CC         ChEBI:CHEBI:59789, ChEBI:CHEBI:61929; EC=2.1.1.43;
CC   -!- SUBUNIT: Component of the COMPASS (Set1C) complex. {ECO:0000250}.
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000305}. Chromosome
CC       {ECO:0000305}.
CC   -!- SIMILARITY: Belongs to the class V-like SAM-binding
CC       methyltransferase superfamily. {ECO:0000255|PROSITE-
CC       ProRule:PRU00190}.
DR   EMBL; CH408033; EAQ86448.1; -; Genomic_DNA.
DR   RefSeq; XP_001225357.1; XM_001225356.1.
DR   ProteinModelPortal; Q2GWF3; -.
DR   SMR; Q2GWF3; -.
DR   STRING; 38033.XP_001225357.1; -.
DR   PRIDE; Q2GWF3; -.
DR   EnsemblFungi; EAQ86448; EAQ86448; CHGG_07701.
DR   GeneID; 4393302; -.
DR   eggNOG; KOG1080; Eukaryota.
DR   eggNOG; COG2940; LUCA.
DR   InParanoid; Q2GWF3; -.
DR   OrthoDB; 1017537at2759; -.
DR   Proteomes; UP000001056; Unassembled WGS sequence.
DR   GO; GO:0005694; C:chromosome; IEA:UniProtKB-SubCell.
DR   GO; GO:0048188; C:Set1C/COMPASS complex; IEA:InterPro.
DR   GO; GO:0018024; F:histone-lysine N-methyltransferase activity; IEA:UniProtKB-EC.
DR   GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR   Gene3D; 3.30.70.330; -; 1.
DR   InterPro; IPR024657; COMPASS_Set1_N-SET.
DR   InterPro; IPR012677; Nucleotide-bd_a/b_plait_sf.
DR   InterPro; IPR003616; Post-SET_dom.
DR   InterPro; IPR035979; RBD_domain_sf.
DR   InterPro; IPR017111; Set1.
DR   InterPro; IPR024636; SET_assoc.
DR   InterPro; IPR001214; SET_dom.
DR   PANTHER; PTHR22884:SF462; PTHR22884:SF462; 1.
DR   Pfam; PF11764; N-SET; 1.
DR   Pfam; PF00856; SET; 1.
DR   Pfam; PF11767; SET_assoc; 1.
DR   PIRSF; PIRSF037104; Histone_H3-K4_mtfrase_Set1_fun; 1.
DR   SMART; SM01291; N-SET; 1.
DR   SMART; SM00508; PostSET; 1.
DR   SMART; SM00317; SET; 1.
DR   SUPFAM; SSF54928; SSF54928; 1.
DR   PROSITE; PS50868; POST_SET; 1.
DR   PROSITE; PS51572; SAM_MT43_1; 1.
DR   PROSITE; PS50280; SET; 1.
PE   3: Inferred from homology;
KW   Chromatin regulator; Chromosome; Complete proteome; Methyltransferase;
KW   Nucleus; Reference proteome; S-adenosyl-L-methionine; Transferase.
FT   CHAIN         1   1076       Histone-lysine N-methyltransferase, H3
FT                                lysine-4 specific.
FT                                /FTId=PRO_0000269769.
FT   DOMAIN      934   1051       SET. {ECO:0000255|PROSITE-
FT                                ProRule:PRU00190}.
FT   DOMAIN     1060   1076       Post-SET. {ECO:0000255|PROSITE-
FT                                ProRule:PRU00155}.
SQ   SEQUENCE   1076 AA;  121756 MW;  B10CF6093311D63D CRC64;
     MGTIWLISAA SDDQDDAPPS DPRLAKGGRL NYINVDFHLP KARLRHAPYN LKPYKYDPKT
     SCGPGPPTQV VVTGFNPLIA FSKVTAVFAS FGDIAESSNK MHPDTGSYLG FATFRYRDSK
     PSRSRPISIT GADAAKRAIR AMHGKRIEAN MVRVEYDAEG KKSSRMLVEV LQKGNETTPA
     LGEPRIPTGP KPKEVAPGPP PTAPKGPAAH RGGLMNVQGV WVPKPRPDSI IEVEPVIGHL
     KHDPYIFVGH EHVPVMPTTV AHMKRRLKTY MFEDIRADRT GYYIVFQDSG YGRAEAERCF
     RSADRTAFFT YTMVMVLHLY GTDGKASHAH ASDTRRRTRT PERKHVDEAR PHREHDRSRR
     DEERARRDEQ DRRRREDEAD LEEEKRQRAK NYDPVLEATD VVLRGMKEQL IKIIRTKIAA
     PALFNFLDPV NHLAKRRRLN LEDPHSARLP PIVLDEFEDR SPVSTPNSRA DPIERRTARL
     DVSALPRIRK VKNAGLNTRK HGFNDPFARN RPTARRTAFR SLHYRLRSDS EGESEDEAEN
     RTSLGRDTEE PESRPRSRMS SDDEGDKDDY ASWGPGDDDS MTEASFALGD GPGLAKKRKL
     DLQVETAIKR QKKTDEELFG VTIDRIGTEF PSREDSLEDV LPPGPGGGEE KDIGSSRLPT
     PLLQEGKAKK KAPAKTKRKS KKQLFEEREA LKRQQQEIFE REALQSEDVD EVIPTPEPES
     EPKKSKVEKE KEKEEKVEKP ALDENLYPSQ KVSVLELPHD FRLDVGSLEE LALGPNDQPD
     LDRLRKRFGR GKIDDPELWV WRRDRIRELN STDGSAKTPV RIEGYYVPNP TGCARAEGVK
     KILNSEKSKY LPHHIKVKKA REERQAQNGK NAKDSVLAAA EAARLAAESL VAKGNSRANR
     ANNRRFVADL NDQRKTLGQD SDVLRFNQLK KRKKPVKFAR SAIHNWGLYA MENIPKDDMI
     IEYVGEEVRQ QIAELRENRY LKSGIGSSYL FRIDDNTVID ATKKGGIARF INHSCMPNCT
     AKIIKVEGSK RIVIYALRDI AQNEELTYDY KFERELGSTD RIPCLCGTAA CKGFLN
//
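
The record above follows the line-oriented flat-file layout: each line starts with a two-letter code (ID, AC, DE, ...), sequence lines are indented, and `//` ends the entry. A minimal sketch of reading such a record and checking the residue count declared on the SQ line is given below. The function names `parse_flat_entry` and `declared_length` are hypothetical and not part of any UniProt or GenomeNet tool, and the sketch assumes the input holds exactly one entry with no surrounding page text.

```python
import re


def parse_flat_entry(text):
    """Group the lines of one flat-file entry by their two-letter line code.

    Assumption: `text` contains a single entry and nothing else.
    Returns a dict mapping each line code to a list of line bodies,
    plus a "sequence" key holding the concatenated residues.
    """
    fields = {}
    residues = []
    for line in text.splitlines():
        if line.startswith("//"):
            # Entry terminator: stop reading.
            break
        if line[:2].strip() == "":
            # Indented (or blank) line: treat as part of the sequence block
            # and drop the spaces that separate the 10-residue groups.
            residues.append("".join(line.split()))
        else:
            # Coded line: two-letter code, three spaces, then the body.
            fields.setdefault(line[:2], []).append(line[5:].rstrip())
    fields["sequence"] = "".join(residues)
    return fields


def declared_length(fields):
    """Return the residue count stated on the SQ header line."""
    match = re.search(r"SEQUENCE\s+(\d+)\s+AA", fields["SQ"][0])
    if match is None:
        raise ValueError("SQ line does not declare a length")
    return int(match.group(1))


if __name__ == "__main__":
    # Tiny made-up entry used only to exercise the parser.
    sample = (
        "ID   TEST_ENTRY              Reviewed;        12 AA.\n"
        "AC   X00000;\n"
        "SQ   SEQUENCE   12 AA;  1000 MW;  0000000000000000 CRC64;\n"
        "     MGTIWLISAA SD\n"
        "//\n"
    )
    entry = parse_flat_entry(sample)
    print(entry["AC"], len(entry["sequence"]), declared_length(entry))
```

Applied to the entry above, the same check would compare the length of the parsed sequence with the 1076 AA declared on the ID and SQ lines.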