ID A0A124SEA6_CYNCS Unreviewed; 1581 AA.
AC A0A124SEA6;
DT 13-APR-2016, integrated into UniProtKB/TrEMBL.
DT 13-APR-2016, sequence version 1.
DT 27-MAR-2024, entry version 40.
DE SubName: Full=AWS-like protein {ECO:0000313|EMBL:KVH99520.1};
GN ORFNames=Ccrd_022245 {ECO:0000313|EMBL:KVH99520.1};
OS Cynara cardunculus var. scolymus (Globe artichoke) (Cynara scolymus).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC asterids; campanulids; Asterales; Asteraceae; Carduoideae; Cardueae;
OC Carduinae; Cynara.
OX NCBI_TaxID=59895 {ECO:0000313|EMBL:KVH99520.1, ECO:0000313|Proteomes:UP000243975};
RN [1] {ECO:0000313|EMBL:KVH99520.1, ECO:0000313|Proteomes:UP000243975}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=2C {ECO:0000313|EMBL:KVH99520.1};
RX PubMed=26786968; DOI=10.1038/srep19427;
RA Scaglione D., Reyes-Chin-Wo S., Acquadro A., Froenicke L., Portis E.,
RA Beitel C., Tirone M., Mauro R., Lo Monaco A., Mauromicale G., Faccioli P.,
RA Cattivelli L., Rieseberg L., Michelmore R., Lanteri S.;
RT "The genome sequence of the outbreeding globe artichoke constructed de novo
RT incorporating a phase-aware low-pass sequencing strategy of F1 progeny.";
RL Sci. Rep. 6:19427-19427(2016).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KVH99520.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LEKV01003460; KVH99520.1; -; Genomic_DNA.
DR STRING; 59895.A0A124SEA6; -.
DR EnsemblPlants; KVH99520; KVH99520; Ccrd_022245.
DR Gramene; KVH99520; KVH99520; Ccrd_022245.
DR OMA; IRENDCK; -.
DR Proteomes; UP000243975; Unassembled WGS sequence.
DR GO; GO:0005694; C:chromosome; IEA:UniProtKB-KW.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0046975; F:histone H3K36 methyltransferase activity; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0032259; P:methylation; IEA:UniProtKB-KW.
DR CDD; cd19172; SET_SETD2; 1.
DR Gene3D; 3.30.40.100; -; 1.
DR Gene3D; 2.170.270.10; SET domain; 1.
DR InterPro; IPR006560; AWS_dom.
DR InterPro; IPR003616; Post-SET_dom.
DR InterPro; IPR001214; SET_dom.
DR InterPro; IPR046341; SET_dom_sf.
DR InterPro; IPR044437; SETD2/Set2_SET.
DR InterPro; IPR011124; Znf_CW.
DR PANTHER; PTHR22884:SF413; HISTONE-LYSINE N-METHYLTRANSFERASE CG1716-RELATED; 1.
DR PANTHER; PTHR22884; SET DOMAIN PROTEINS; 1.
DR Pfam; PF17907; AWS; 1.
DR Pfam; PF00856; SET; 1.
DR Pfam; PF07496; zf-CW; 1.
DR SMART; SM00570; AWS; 1.
DR SMART; SM00317; SET; 1.
DR SUPFAM; SSF82199; SET domain; 1.
DR PROSITE; PS51215; AWS; 1.
DR PROSITE; PS50868; POST_SET; 1.
DR PROSITE; PS50280; SET; 1.
DR PROSITE; PS51050; ZF_CW; 1.
PE 4: Predicted;
KW Chromosome {ECO:0000256|ARBA:ARBA00022454};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Methyltransferase {ECO:0000256|ARBA:ARBA00022603};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000243975};
KW S-adenosyl-L-methionine {ECO:0000256|ARBA:ARBA00022691};
KW Transferase {ECO:0000256|ARBA:ARBA00022679};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771}.
FT DOMAIN 574..628
FT /note="CW-type"
FT /evidence="ECO:0000259|PROSITE:PS51050"
FT DOMAIN 688..738
FT /note="AWS"
FT /evidence="ECO:0000259|PROSITE:PS51215"
FT DOMAIN 740..857
FT /note="SET"
FT /evidence="ECO:0000259|PROSITE:PS50280"
FT DOMAIN 865..881
FT /note="Post-SET"
FT /evidence="ECO:0000259|PROSITE:PS50868"
FT REGION 87..140
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 474..540
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1056..1080
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1295..1315
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1351..1423
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1497..1524
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 87..112
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1063..1078
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1351..1399
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1581 AA; 175575 MW; 6CEB3E51FD624999 CRC64;
MANKDPEILL EEDAQDTPCD FKELVSDFES NCLADTLSVY LESCEPFSVT DPEPSNNLDE
PNISXTDVLV DPLNSIVLTE LSELRDNDGE DSSRIHHACG KKNNEVKRSS TRRSTRKSTL
NQKTDTKLAA RKGSRTSGKR PMLDLLAADV GRRRRSDLVQ RARPSAWGLG GKIDEIFKQY
VEKNVDVNGH IDSRKGRMGR RGGKRNTDFV NQSIQNSQSE SRALNKGLLL KFKLGTKVIQ
NCQFNTIPDM DKDLESYREV KIDTSTLQCD IKENYEKEVP HASLSRASDG NLDKRVAGKA
VSELHDMIGI GESVENRCLD PGTSPDSEVI NVIPETQISG TTVEVLHDLR ISSKDCTAHQ
SGDXGDGSGI PSPEIVSDVI LCDKFEHGEK QIEGSWSSGQ SMSITTGVAS SNASSRDGCP
VEPVASLQET EVGVSMDMLT IESGLKADFS IAGIESVESC FSDKLLSGAK TNGQNLQRSL
KSRGITKSMP RVPDSLNKKR XNCRQKGSQA KSPAKGKLME KSGNDQADXD TESDPITGSH
EISVVRVLEE GCRSSAGALS NSTMVPSEGE NRQGSPRSAW VCCDDCHKWR RISAILADSI
ESTECRWICK DNMDKTFADC SIPQEKSNAD INEELELSDA SCEEDACNAR LNSSQLGQKQ
PIDPPQSSWK LIKTNMFLHR NRKNQTIDEI MVCHCKPLFD GRMGCRDECL NRMLNIECVK
GTCPCGEFCS NQQFQKRKYA KLKCFPCGKK GYGLQLQEDI PKGRFLIEYV GEVLDMPAYE
ARQREYASKG HKHFYFMTLN GSEVIDACAK GNLGRFINHS CEPNCRTEKW MVNGEVCIGL
FATRDIKKGE ELTFDYNYVR VFGAAAKKCV CGSSRCRGVI GGDSQNAETV ILGDSDEEDL
EPVTFYKNSS NKLGLSGTSI YNGAGTQTAE STSKNNGCVI DKFAVASGXV EDVENERSPD
SLLVIDDKND DLAIPTESTE REVPFERSSS AASALESKID NTEEPLPFCA EPLNTSFKVH
DGKRETRFLD ARHHNVAEKD MKGCLSIGXR AKTSLDTSAK LLPNDGDSKK KPKSHTNENK
CAILKPQALS KLPSSAVKRG KAKTHGVNKP PEIDSKPLVM PHKTNRLVED NLTGRFEAVQ
EALNTLLNAD GGISKRRDAS KGYLKLLCLT AHSGNGEGIQ SNRDLSMIMD ALLKTKSRTV
LLDIINKNGL QMLHNLMKRY RKEFSKIPIL RKLLKYLKDV FPLSFLSFKE SILSLTEHTD
KQVHQIARNF RDRWIPWSAR KVNWADRDVE RMENQASPNF KTIPVQHDRD RRPSEVADNC
IQQSFSGVDA RTVEGSAASC LSSCTDGTRT RKRKSRWDEP GDVKPDIESP SNKEHRSKTS
RSEQVNHMVQ EERQVNNDED GDAPPGFSSP INRQLFPSNA PSTSTDTNCC KSVMGYPQER
YISRLPTSFG VPCSVVQRLG TPEGESWVVG PATTFHPFPP LPSYPRKEEQ LWRNNCTSST
NSATDMPFNQ PNFQRFRNSN SNNTLGRRYF KQQKWNNSKS GLPWNHNKYG SASYNRGYGR
NEAAGMMNGS CGNNFYQQPQ Q
//