ID A0A1E7FJP4_9STRA Unreviewed; 674 AA.
AC A0A1E7FJP4;
DT 18-JAN-2017, integrated into UniProtKB/TrEMBL.
DT 18-JAN-2017, sequence version 1.
DT 27-MAR-2024, entry version 17.
DE RecName: Full=SET domain-containing protein {ECO:0000259|PROSITE:PS50280};
GN ORFNames=FRACYDRAFT_236655 {ECO:0000313|EMBL:OEU18378.1};
OS Fragilariopsis cylindrus CCMP1102.
OC Eukaryota; Sar; Stramenopiles; Ochrophyta; Bacillariophyta;
OC Bacillariophyceae; Bacillariophycidae; Bacillariales; Bacillariaceae;
OC Fragilariopsis.
OX NCBI_TaxID=635003 {ECO:0000313|EMBL:OEU18378.1, ECO:0000313|Proteomes:UP000095751};
RN [1] {ECO:0000313|EMBL:OEU18378.1, ECO:0000313|Proteomes:UP000095751}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CCMP1102 {ECO:0000313|EMBL:OEU18378.1,
RC ECO:0000313|Proteomes:UP000095751};
RG DOE Joint Genome Institute;
RA Mock T., Otillar R.P., Strauss J., Dupont C., Frickenhaus S., Maumus F.,
RA Mcmullan M., Sanges R., Schmutz J., Toseland A., Valas R., Veluchamy A.,
RA Ward B.J., Allen A., Barry K., Falciatore A., Ferrante M., Fortunato A.E.,
RA Gloeckner G., Gruber A., Hipkin R., Janech M., Kroth P., Leese F.,
RA Lindquist E., Lyon B.R., Martin J., Mayer C., Parker M., Quesneville H.,
RA Raymond J., Uhlig C., Valentin K.U., Worden A.Z., Armbrust E.V., Bowler C.,
RA Green B., Moulton V., Van Oosterhout C., Grigoriev I.;
RT "Extensive genetic diversity and differential bi-allelic expression allows
RT diatom success in the polar Southern Ocean.";
RL Submitted (SEP-2016) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KV784356; OEU18378.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A1E7FJP4; -.
DR EnsemblProtists; OEU18378; OEU18378; FRACYDRAFT_236655.
DR KEGG; fcy:FRACYDRAFT_236655; -.
DR InParanoid; A0A1E7FJP4; -.
DR OrthoDB; 56553at2759; -.
DR Proteomes; UP000095751; Unassembled WGS sequence.
DR GO; GO:0008168; F:methyltransferase activity; IEA:UniProtKB-KW.
DR GO; GO:0032259; P:methylation; IEA:UniProtKB-KW.
DR CDD; cd20071; SET_SMYD; 1.
DR Gene3D; 2.170.270.10; SET domain; 1.
DR InterPro; IPR001214; SET_dom.
DR InterPro; IPR046341; SET_dom_sf.
DR PANTHER; PTHR46402:SF2; HISTONE-LYSINE N-TRIMETHYLTRANSFERASE SMYD5; 1.
DR PANTHER; PTHR46402; SET AND MYND DOMAIN-CONTAINING PROTEIN 5; 1.
DR Pfam; PF00856; SET; 1.
DR SUPFAM; SSF82199; SET domain; 1.
DR PROSITE; PS50280; SET; 1.
PE 4: Predicted;
KW Methyltransferase {ECO:0000256|ARBA:ARBA00022603};
KW Reference proteome {ECO:0000313|Proteomes:UP000095751};
KW S-adenosyl-L-methionine {ECO:0000256|ARBA:ARBA00022691};
KW Transferase {ECO:0000256|ARBA:ARBA00022679}.
FT DOMAIN 132..555
FT /note="SET"
FT /evidence="ECO:0000259|PROSITE:PS50280"
FT REGION 81..123
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 355..380
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 560..642
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 102..123
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 365..380
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 586..603
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 674 AA; 74820 MW; A51C50F4BA6D5D9F CRC64;
MPTKQNDPSK DHQISSNIKS LLIAPTLREI AAVLQPSTIH KTFKDMTVTT SSTSSIVVVP
PVVAELSRSY KYNDHILVKK NDKKTTTRKR PRKDEHADND SSPSRSSSSL QSQSQSQSSP
LVNNSNTNNC CCCFEVRSCP TKNNQNGIFA TQNIACGDLI IAHEDPIVSS TEQRSSSTFC
HGCATPIGSL KDHILAAAAA NTAIIGTTTT TTTTTKTKEI LFELPDLPCG LLKNEHGPMF
SSNNSSSSTL SYRCNIEECG CKKNVVWCST QCRDKTTGNG GNQQQHAFDY VSPTTAVSTT
EQNNAINNFY KGVTHPMAFQ MAAQSITLIL STILNKFIMK ANQINLNNEN ENEYETTTTK
TIDNDDDVED LDPLLEHDDD DDDDDLRQYY WWNDYGSHPL WWEVGSKSQS KQRKEQTDTF
CHLFELHLLK SIKQRRFAAS SSSLLDNENN GGNNVKSNDE ILVQLEKLLK TAVKALCTID
HIGSILGFDW LADHNVHNSD MNGPVIGSGL YPLLTLANHD CTPNASIEFL QESNRGSMVA
LRDIVVGEEI CLSYIPSNED EVEEGNDTND SQVTRHFKPT RTEIWLGKQN KITESGSTST
QQQRQENDDD DDDDGVTVNQ DSPEANHITG SDGDDDDDDD DIILLDDEFS TAKAERARAI
QEYGFECHCQ RCKQ
//