ID A0A0F4GBY3_9PEZI Unreviewed; 1053 AA.
AC A0A0F4GBY3;
DT 24-JUN-2015, integrated into UniProtKB/TrEMBL.
DT 24-JUN-2015, sequence version 1.
DT 24-JAN-2024, entry version 48.
DE RecName: Full=Histone-lysine N-methyltransferase, H3 lysine-4 specific {ECO:0000256|ARBA:ARBA00015839, ECO:0000256|PIRNR:PIRNR037104};
DE EC=2.1.1.354 {ECO:0000256|ARBA:ARBA00012182, ECO:0000256|PIRNR:PIRNR037104};
GN ORFNames=TI39_contig4268g00005 {ECO:0000313|EMBL:KJX93705.1};
OS Zymoseptoria brevis.
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Dothideomycetes;
OC Dothideomycetidae; Mycosphaerellales; Mycosphaerellaceae; Zymoseptoria.
OX NCBI_TaxID=1047168 {ECO:0000313|EMBL:KJX93705.1, ECO:0000313|Proteomes:UP000033647};
RN [1] {ECO:0000313|EMBL:KJX93705.1, ECO:0000313|Proteomes:UP000033647}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=Zb18110 {ECO:0000313|EMBL:KJX93705.1,
RC ECO:0000313|Proteomes:UP000033647};
RA Grandaubert J., Bhattacharyya A., Stukenbrock E.H.;
RT "RNA-seq based gene annotation and comparative genomics of four
RT Zymoseptoria species reveal species-specific pathogenicity related genes
RT and transposable element activity.";
RL Submitted (MAR-2015) to the EMBL/GenBank/DDBJ databases.
CC -!- FUNCTION: Catalytic component of the COMPASS (Set1C) complex that
CC specifically mono-, di- and trimethylates histone H3 to form
CC H3K4me1/2/3, which subsequently plays a role in telomere length
CC maintenance and transcription elongation regulation.
CC {ECO:0000256|ARBA:ARBA00002789, ECO:0000256|PIRNR:PIRNR037104}.
CC -!- CATALYTIC ACTIVITY:
CC Reaction=L-lysyl(4)-[histone H3] + 3 S-adenosyl-L-methionine = 3 H(+) +
CC N(6),N(6),N(6)-trimethyl-L-lysyl(4)-[histone H3] + 3 S-adenosyl-L-
CC homocysteine; Xref=Rhea:RHEA:60260, Rhea:RHEA-COMP:15537, Rhea:RHEA-
CC COMP:15547, ChEBI:CHEBI:15378, ChEBI:CHEBI:29969, ChEBI:CHEBI:57856,
CC ChEBI:CHEBI:59789, ChEBI:CHEBI:61961; EC=2.1.1.354;
CC Evidence={ECO:0000256|ARBA:ARBA00000944,
CC ECO:0000256|PIRNR:PIRNR037104};
CC -!- SUBUNIT: Component of the COMPASS (Set1C) complex.
CC {ECO:0000256|ARBA:ARBA00011755, ECO:0000256|PIRNR:PIRNR037104}.
CC -!- SUBCELLULAR LOCATION: Chromosome {ECO:0000256|ARBA:ARBA00004286}.
CC Nucleus {ECO:0000256|ARBA:ARBA00004123, ECO:0000256|PIRNR:PIRNR037104}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KJX93705.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LAFY01004227; KJX93705.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A0F4GBY3; -.
DR STRING; 1047168.A0A0F4GBY3; -.
DR OrthoDB; 950362at2759; -.
DR Proteomes; UP000033647; Unassembled WGS sequence.
DR GO; GO:0005694; C:chromosome; IEA:UniProtKB-SubCell.
DR GO; GO:0048188; C:Set1C/COMPASS complex; IEA:InterPro.
DR GO; GO:0140999; F:histone H3K4 trimethyltransferase activity; IEA:UniProtKB-EC.
DR GO; GO:0003723; F:RNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0032259; P:methylation; IEA:UniProtKB-KW.
DR Gene3D; 3.30.70.330; -; 1.
DR Gene3D; 2.170.270.10; SET domain; 1.
DR InterPro; IPR024657; COMPASS_Set1_N-SET.
DR InterPro; IPR012677; Nucleotide-bd_a/b_plait_sf.
DR InterPro; IPR003616; Post-SET_dom.
DR InterPro; IPR035979; RBD_domain_sf.
DR InterPro; IPR000504; RRM_dom.
DR InterPro; IPR044570; Set1-like.
DR InterPro; IPR017111; Set1_fungi.
DR InterPro; IPR024636; SET_assoc.
DR InterPro; IPR001214; SET_dom.
DR InterPro; IPR046341; SET_dom_sf.
DR PANTHER; PTHR45814; HISTONE-LYSINE N-METHYLTRANSFERASE SETD1; 1.
DR PANTHER; PTHR45814:SF2; HISTONE-LYSINE N-METHYLTRANSFERASE SETD1; 1.
DR Pfam; PF11764; N-SET; 1.
DR Pfam; PF00856; SET; 1.
DR Pfam; PF11767; SET_assoc; 1.
DR PIRSF; PIRSF037104; Histone_H3-K4_mtfrase_Set1_fun; 2.
DR SMART; SM01291; N-SET; 1.
DR SMART; SM00508; PostSET; 1.
DR SMART; SM00317; SET; 1.
DR SUPFAM; SSF54928; RNA-binding domain, RBD; 1.
DR SUPFAM; SSF82199; SET domain; 1.
DR PROSITE; PS50868; POST_SET; 1.
DR PROSITE; PS50102; RRM; 1.
DR PROSITE; PS51572; SAM_MT43_1; 1.
DR PROSITE; PS50280; SET; 1.
PE 4: Predicted;
KW Chromatin regulator {ECO:0000256|PIRNR:PIRNR037104};
KW Chromosome {ECO:0000256|PIRNR:PIRNR037104};
KW Methyltransferase {ECO:0000256|ARBA:ARBA00022603,
KW ECO:0000256|PIRNR:PIRNR037104}; Nucleus {ECO:0000256|PIRNR:PIRNR037104};
KW Reference proteome {ECO:0000313|Proteomes:UP000033647};
KW RNA-binding {ECO:0000256|PROSITE-ProRule:PRU00176};
KW S-adenosyl-L-methionine {ECO:0000256|ARBA:ARBA00022691,
KW ECO:0000256|PIRNR:PIRNR037104};
KW Transferase {ECO:0000256|ARBA:ARBA00022679, ECO:0000256|PIRNR:PIRNR037104}.
FT DOMAIN 40..128
FT /note="RRM"
FT /evidence="ECO:0000259|PROSITE:PS50102"
FT DOMAIN 911..1028
FT /note="SET"
FT /evidence="ECO:0000259|PROSITE:PS50280"
FT DOMAIN 1037..1053
FT /note="Post-SET"
FT /evidence="ECO:0000259|PROSITE:PS50868"
FT REGION 141..219
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 404..554
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 630..653
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 151..178
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 445..480
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 507..524
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 539..554
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1053 AA; 118442 MW; E1F1C9CA94E6E496 CRC64;
MSGQCRQKLA GPSKAILRPA PYTLPPYKVD RYSIGPGEPQ QVVAVGFDPF LSESTLKLNF
GTFGSIASVQ NKTDPETGSF LGIALIKYRD SIRDGEEISA VNAAKRAEME FNGARIGLHT
IKVELDREGR RCKRHVDHVL KKAKEERAKY APQPPGPPET PTDAKDSPAP PPNAPKGPSA
KPAGKAAAAK AAGKPVRGAP GVATATPAPE TAANPKTGAA ALVEDEPILS KIKRKPYIHI
PHASVPVLGT TIPHLKKRLK AYDWREVRLD VTGYYVIFED SKRGEDETER CYNECNKQAL
FTYKMGMECQ KYGNPDYERS PSPERAMAEK QKQEEFERLQ KEDAEDLEIE KKSRAENLDP
VLGALDQLRI ELRDRILGDI KTRIAIPIFH DSLDPARHVA KRRKLGLPDP SDKENKGSSL
LFAKAGDTPP DTPKNRRGYP LSHSKKPLRP HDPHNQRGRK GGREREPSNA FVDERRRKPP
PRPTHARGLH FRLQQMYAEE EDSDDERQTS VTGRDTDEQE SRPLSRASRN STPFDSESVM
DTPKHKRRKV ADWEDDEKET FEAFHKEMLG HLLHKEPEDL ATRELELVVN TLPRSSKYAT
RARTELFIRQ RSKADDDLFQ VRLDRKDVDE ESVLASKEQA DTPAVESDAK VVKEKAKRKR
KTKKQLLEEQ EALKAEAKKA KEESKPIAAA ETVTKIEQKE LEIEAVVEAE AQDNNVSLHL
TFDKPRRTVE EDRSIVLDID GWQQFIKDEE DMAFARKALA EQPAYDIGDV KLWAWKQKEI
KALNSGGIPG QAQPESLISG YYVLNPTGCA RTEGVKKILN EEKSKYLPHR IKVQRAREER
QAQATSNPTA AAEAAKIKEA EKIASTATSR SNRVNNRRLV NDINLQKQTL ATANSDADVA
IRFNQLKKRK KLVKFDRSAI HGWGLYAEEN IAVNDLIIEY VGEKVRQKIA DLREIRYEKQ
GVGSSYLFRM IDDEIVDATK KGGIARFINH SCSPNCTAKI IKVEGTPRIV IYALKDIGKN
DELTYDYKFE REMDSTDRIP CLCGSANCKG FLN
//