ID A0A1V6Z076_PENNA Unreviewed; 486 AA.
AC A0A1V6Z076;
DT 07-JUN-2017, integrated into UniProtKB/TrEMBL.
DT 07-JUN-2017, sequence version 1.
DT 27-MAR-2024, entry version 28.
DE RecName: Full=SET domain-containing protein {ECO:0000259|PROSITE:PS50280};
GN ORFNames=PENNAL_c0006G06709 {ECO:0000313|EMBL:OQE93079.1},
GN PNAL_LOCUS7555 {ECO:0000313|EMBL:CAG8203228.1};
OS Penicillium nalgiovense.
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Eurotiomycetes;
OC Eurotiomycetidae; Eurotiales; Aspergillaceae; Penicillium.
OX NCBI_TaxID=60175 {ECO:0000313|EMBL:OQE93079.1, ECO:0000313|Proteomes:UP000191691};
RN [1] {ECO:0000313|EMBL:OQE93079.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=IBT 13039 {ECO:0000313|EMBL:OQE93079.1};
RA Nielsen J.C., Nielsen J.;
RT "Uncovering the secondary metabolism of Penicillium species provides
RT insights into the evolution of 6-MSA pathways.";
RL Submitted (OCT-2016) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Proteomes:UP000191691}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=IBT 13039 {ECO:0000313|Proteomes:UP000191691};
RX PubMed=28368369; DOI=10.1038/nmicrobiol.2017.44;
RA Nielsen J.C., Grijseels S., Prigent S., Ji B., Dainat J., Nielsen K.F.,
RA Frisvad J.C., Workman M., Nielsen J.;
RT "Global analysis of biosynthetic gene clusters reveals vast potential of
RT secondary metabolite production in Penicillium species.";
RL Nat. Microbiol. 2:17044-17044(2017).
RN [3] {ECO:0000313|EMBL:CAG8203228.1}
RP NUCLEOTIDE SEQUENCE.
RA Branca A.L. A.;
RL Submitted (JUL-2021) to the EMBL/GenBank/DDBJ databases.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OQE93079.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CAJVNV010000444; CAG8203228.1; -; Genomic_DNA.
DR EMBL; MOOB01000006; OQE93079.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A1V6Z076; -.
DR STRING; 60175.A0A1V6Z076; -.
DR OMA; RVDWWLE; -.
DR OrthoDB; 51002at2759; -.
DR Proteomes; UP000191691; Unassembled WGS sequence.
DR Proteomes; UP001153461; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-KW.
DR GO; GO:0008168; F:methyltransferase activity; IEA:UniProtKB-KW.
DR GO; GO:0032259; P:methylation; IEA:UniProtKB-KW.
DR Gene3D; 3.90.1420.10; Rubisco LSMT, substrate-binding domain; 1.
DR Gene3D; 3.90.1410.10; set domain protein methyltransferase, domain 1; 1.
DR InterPro; IPR015353; Rubisco_LSMT_subst-bd.
DR InterPro; IPR036464; Rubisco_LSMT_subst-bd_sf.
DR InterPro; IPR001214; SET_dom.
DR InterPro; IPR046341; SET_dom_sf.
DR PANTHER; PTHR13271:SF34; N-LYSINE METHYLTRANSFERASE SETD6; 1.
DR PANTHER; PTHR13271; UNCHARACTERIZED PUTATIVE METHYLTRANSFERASE; 1.
DR Pfam; PF09273; Rubis-subs-bind; 1.
DR Pfam; PF00856; SET; 1.
DR SUPFAM; SSF81822; RuBisCo LSMT C-terminal, substrate-binding domain; 1.
DR SUPFAM; SSF82199; SET domain; 1.
DR PROSITE; PS50280; SET; 1.
PE 4: Predicted;
KW Methyltransferase {ECO:0000256|ARBA:ARBA00022603};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000191691};
KW S-adenosyl-L-methionine {ECO:0000256|ARBA:ARBA00022691};
KW Transferase {ECO:0000256|ARBA:ARBA00022679}.
FT DOMAIN 34..280
FT /note="SET"
FT /evidence="ECO:0000259|PROSITE:PS50280"
FT REGION 209..231
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 465..486
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 209..227
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 472..486
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 486 AA; 53766 MW; 2F35CEAD2D851AD0 CRC64;
MSSTAHFPDA EGFQQQSDNF MSWLQASPGV QLNPKLRLAD LRATGAGRGV VAQSNISEGE
ELFSIPRAMV LTVQNSELRT LLGENLEEQM GPWLSLMLVM VYEYLQGEKS RWAPYFRVLP
SRFDTLMFWS PAELQELQAS TIVEKIGRSG AEESIRNSIA PILAKRPDFF PPPQGLASWE
GAAGDAALIQ LGHIMGSLIM AYAFDIEKSE DDDDEGEAND ESYMTDDEEE QLPKGMVPLA
DLLNADADRN NARLYQEEGA LVMKAIKPIQ PGEEIFNDYG EIPRADLLRR YGYVTDNYAV
YDVLELSLET ICEAAGLPNA DVESQPRLEF LASLDILDDG YVIPRPVNAD PSLQDILPAE
LVVLLATLTL SPEEFKQRVS KDKAPKPVLD ANATAILIKA LQKRQEQYAT SLADDLQFRA
SLSPLPETGD VDEGARRVRM ALQVRIGEKE VLQAILGMLQ PATSNSLKRS ANGDDGESRQ
FKTQRV
//