ID M1W606_CLAP2 Unreviewed; 896 AA.
AC M1W606;
DT 01-MAY-2013, integrated into UniProtKB/TrEMBL.
DT 01-MAY-2013, sequence version 1.
DT 27-MAR-2024, entry version 46.
DE RecName: Full=SET domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=CPUR_04093 {ECO:0000313|EMBL:CCE30245.1};
OS Claviceps purpurea (strain 20.1) (Ergot fungus) (Sphacelia segetum).
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Sordariomycetes;
OC Hypocreomycetidae; Hypocreales; Clavicipitaceae; Claviceps.
OX NCBI_TaxID=1111077 {ECO:0000313|EMBL:CCE30245.1, ECO:0000313|Proteomes:UP000016801};
RN [1] {ECO:0000313|EMBL:CCE30245.1, ECO:0000313|Proteomes:UP000016801}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=20.1 {ECO:0000313|EMBL:CCE30245.1,
RC ECO:0000313|Proteomes:UP000016801};
RX PubMed=23468653; DOI=10.1371/journal.pgen.1003323;
RA Schardl C.L., Young C.A., Hesse U., Amyotte S.G., Andreeva K., Calie P.J.,
RA Fleetwood D.J., Haws D.C., Moore N., Oeser B., Panaccione D.G.,
RA Schweri K.K., Voisey C.R., Farman M.L., Jaromczyk J.W., Roe B.A.,
RA O'Sullivan D.M., Scott B., Tudzynski P., An Z., Arnaoudova E.G.,
RA Bullock C.T., Charlton N.D., Chen L., Cox M., Dinkins R.D., Florea S.,
RA Glenn A.E., Gordon A., Gueldener U., Harris D.R., Hollin W., Jaromczyk J.,
RA Johnson R.D., Khan A.K., Leistner E., Leuchtmann A., Li C., Liu J., Liu J.,
RA Liu M., Mace W., Machado C., Nagabhyru P., Pan J., Schmid J., Sugawara K.,
RA Steiner U., Takach J.E., Tanaka E., Webb J.S., Wilson E.V., Wiseman J.L.,
RA Yoshida R., Zeng Z.;
RT "Plant-symbiotic fungi as chemical engineers: Multi-genome analysis of the
RT Clavicipitaceae reveals dynamics of alkaloid loci.";
RL PLoS Genet. 9:E1003323-E1003323(2013).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:CCE30245.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CAGA01000020; CCE30245.1; -; Genomic_DNA.
DR AlphaFoldDB; M1W606; -.
DR STRING; 1111077.M1W606; -.
DR VEuPathDB; FungiDB:CPUR_04093; -.
DR eggNOG; KOG1844; Eukaryota.
DR HOGENOM; CLU_009510_1_0_1; -.
DR OrthoDB; 124870at2759; -.
DR PhylomeDB; M1W606; -.
DR Proteomes; UP000016801; Unassembled WGS sequence.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0008168; F:methyltransferase activity; IEA:UniProtKB-KW.
DR GO; GO:0006325; P:chromatin organization; IEA:UniProtKB-KW.
DR GO; GO:0032259; P:methylation; IEA:UniProtKB-KW.
DR Gene3D; 2.170.270.10; SET domain; 1.
DR Gene3D; 3.30.40.10; Zinc/RING finger domain, C3HC4 (zinc finger); 1.
DR InterPro; IPR001214; SET_dom.
DR InterPro; IPR046341; SET_dom_sf.
DR InterPro; IPR011011; Znf_FYVE_PHD.
DR InterPro; IPR001965; Znf_PHD.
DR InterPro; IPR019787; Znf_PHD-finger.
DR InterPro; IPR013083; Znf_RING/FYVE/PHD.
DR PANTHER; PTHR46462; UPSET, ISOFORM A; 1.
DR PANTHER; PTHR46462:SF3; UPSET, ISOFORM A; 1.
DR Pfam; PF00856; SET; 1.
DR SMART; SM00249; PHD; 1.
DR SMART; SM00317; SET; 1.
DR SUPFAM; SSF57903; FYVE/PHD zinc finger; 1.
DR SUPFAM; SSF82199; SET domain; 1.
DR PROSITE; PS50280; SET; 1.
DR PROSITE; PS50016; ZF_PHD_2; 1.
PE 4: Predicted;
KW Chromatin regulator {ECO:0000256|ARBA:ARBA00022853};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Methyltransferase {ECO:0000256|ARBA:ARBA00022603};
KW Reference proteome {ECO:0000313|Proteomes:UP000016801};
KW Transferase {ECO:0000256|ARBA:ARBA00022603};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771, ECO:0000256|PROSITE-
KW ProRule:PRU00146}.
FT DOMAIN 49..99
FT /note="PHD-type"
FT /evidence="ECO:0000259|PROSITE:PS50016"
FT DOMAIN 289..417
FT /note="SET"
FT /evidence="ECO:0000259|PROSITE:PS50280"
FT REGION 1..40
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 119..142
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 161..209
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 494..669
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 723..822
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 858..896
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..21
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 179..205
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 522..542
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 555..591
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 615..669
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 723..788
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 797..822
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 860..874
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 896 AA; 98209 MW; 40E3121A023B1301 CRC64;
MIETHVPPST QAVTPGQNAQ AAPVDVNVKD DPSIDPSVEQ AEEEPYTIKC ICNFCDDDGN
TIFCETCETW QHIDCFYPNN QDEALTKDFA HSCADCEPRP LDRQRAIERT LRLRNAETVV
SSQAGSLHKR AKRPPTKSMK KKHKLNELHV NINVGSGKVV GSPPLSMPLA KKVKTSHRSA
HSVTSLPPKR SPSYGHNSRP AAAQLSSPAT ALPDLPDGFQ IHHYSAGFYS LYKEQEVVPN
THNNAFVSLV IPRALSRWLR EPDTMKQEVG RTRAEVLQDE PPNLEGNQPK LEVTDKMRSP
EPGTIIRWRF VKSTAPIEKD VPLIELNGAV GFQKDYCADP ENLWADLSSP LPFVFLHPAL
PLYIDTRKEG SSARYVRRSC KPNAQLDTYL SEGSEYHFWL VSDRYIPANE QITLPWDFRL
EKSVRDRWLH LLGLSDEDSN AISEEPELDP AEYTAISNWI DRILSEYGGC ACDLENDCAF
ARFHRHYLFG KSQSRNSKKK SKKARNHASS PTNSRAASEG QFDDSMDVEH GSRSKHSSRD
RTPLQGSFDQ LGILTEPTDR DKRKVAMVED SFRRMEQQQP SRKKKRVSDG PGASTKSKSR
NGSAGPAASA GPASSGVQYV DSGTWRGISD SPSSAHSPST GHVKTKEGQA STTYPSRSRQ
SSTGPRSNYR DVAVQTDPVQ GQWFSQFCHP LPIRKRVISL SQRLMNSRHL CIADAEVRRK
ASLSSQVSNM SAQDAGSLTS GRKPSNASLG SDAPSPKSQT TDGNMTDNST SMTNEGPVEA
TSTKTPELRV QLPPVPAFDN TTLNSSSVAT PRSASSSTGL SPLSNPFSAG SLSGMAVNPS
PIKKKLSLSD YKSRLNKAAI KASSGASAQS KPGISSPEDA KAEIVAEPQP PDKAEE
//