ID A0A2U1MSI7_ARTAN Unreviewed; 461 AA.
AC A0A2U1MSI7;
DT 18-JUL-2018, integrated into UniProtKB/TrEMBL.
DT 18-JUL-2018, sequence version 1.
DT 24-JAN-2024, entry version 20.
DE SubName: Full=Pre-SET zinc-binding sub-group, histone H3-K9 methyltransferase {ECO:0000313|EMBL:PWA64225.1};
GN ORFNames=CTI12_AA346780 {ECO:0000313|EMBL:PWA64225.1};
OS Artemisia annua (Sweet wormwood).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC asterids; campanulids; Asterales; Asteraceae; Asteroideae; Anthemideae;
OC Artemisiinae; Artemisia.
OX NCBI_TaxID=35608 {ECO:0000313|EMBL:PWA64225.1, ECO:0000313|Proteomes:UP000245207};
RN [1] {ECO:0000313|EMBL:PWA64225.1, ECO:0000313|Proteomes:UP000245207}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Huhao1 {ECO:0000313|Proteomes:UP000245207};
RC TISSUE=Leaf {ECO:0000313|EMBL:PWA64225.1};
RX PubMed=29703587; DOI=10.1016/j.molp.2018.03.015;
RA Shen Q., Zhang L., Liao Z., Wang S., Yan T., Shi P., Liu M., Fu X., Pan Q.,
RA Wang Y., Lv Z., Lu X., Zhang F., Jiang W., Ma Y., Chen M., Hao X., Li L.,
RA Tang Y., Lv G., Zhou Y., Sun X., Brodelius P.E., Rose J.K.C., Tang K.;
RT "The genome of Artemisia annua provides insight into the evolution of
RT Asteraceae family and artemisinin biosynthesis.";
RL Mol. Plant 11:776-788(2018).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PWA64225.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; PKPP01004467; PWA64225.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A2U1MSI7; -.
DR STRING; 35608.A0A2U1MSI7; -.
DR Proteomes; UP000245207; Unassembled WGS sequence.
DR GO; GO:0005694; C:chromosome; IEA:UniProtKB-KW.
DR GO; GO:0005634; C:nucleus; IEA:InterPro.
DR GO; GO:0042054; F:histone methyltransferase activity; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0032259; P:methylation; IEA:UniProtKB-KW.
DR CDD; cd10538; SET_SETDB-like; 1.
DR Gene3D; 1.10.8.850; Histone-lysine N methyltransferase , C-terminal domain-like; 1.
DR Gene3D; 2.170.270.10; SET domain; 1.
DR InterPro; IPR007728; Pre-SET_dom.
DR InterPro; IPR001214; SET_dom.
DR InterPro; IPR046341; SET_dom_sf.
DR InterPro; IPR025776; SUVR4/1/2.
DR InterPro; IPR043017; WIYLD_dom_sf.
DR InterPro; IPR018848; WIYLD_domain.
DR PANTHER; PTHR46450:SF24; HISTONE-LYSINE N-METHYLTRANSFERASE SUVR4; 1.
DR PANTHER; PTHR46450; INACTIVE HISTONE-LYSINE N-METHYLTRANSFERASE SUVR1-RELATED; 1.
DR Pfam; PF00856; SET; 1.
DR Pfam; PF10440; WIYLD; 1.
DR SMART; SM00468; PreSET; 1.
DR SMART; SM00317; SET; 1.
DR SUPFAM; SSF82199; SET domain; 1.
DR PROSITE; PS50867; PRE_SET; 1.
DR PROSITE; PS51580; SAM_MT43_3; 1.
DR PROSITE; PS50280; SET; 1.
PE 4: Predicted;
KW Chromosome {ECO:0000256|ARBA:ARBA00022454};
KW Methyltransferase {ECO:0000313|EMBL:PWA64225.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000245207};
KW Transferase {ECO:0000313|EMBL:PWA64225.1}.
FT DOMAIN 198..298
FT /note="Pre-SET"
FT /evidence="ECO:0000259|PROSITE:PS50867"
FT DOMAIN 301..435
FT /note="SET"
FT /evidence="ECO:0000259|PROSITE:PS50280"
FT REGION 1..22
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 81..143
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 95..110
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 461 AA; 52731 MW; 5227393ED0A6839A CRC64;
MTASKKDSKT SDPRPLNPNG PRIRKAISAM EELGISEDVV KPVLNRLWKL YNREWKLIED
DNYRTLADAI FESVDDKKGK AITMQDKQEP SSKRSHSSTY HASSTQKKRR QYCQLVDDDD
DDDDMDHSSS VNPSASDKKS KISDDITKGQ EKIKIALVNE IGIELPEFVY ITQNTTFQNA
HVPFSLARIS DEDCCKKCNG DCLSSRVPCA CSRETGGEFA YTPKGLLKDK FLDACIANYS
EPQKENLFYC QGCCPLEKAK DAHHPEPCNG HPLKKFIKEC WRKCGCTMAC GNRVVQRGPT
CKLEVFATKG KGWAVRTLEY LPKGSFICEY IGEILTNTEL YERNEQRKKK NERHTYPVLL
DNDWGSEQGL KDEEALCLDA THYGNVARFI NHRCFDSNLI DIPVEVETPD HHYYHIAFFT
KRNVEANEEL TWDYGIDFED KLHPIKAFRC HCGSQYCRDA R
//