ID A0A096PBK5_OSTTA Unreviewed; 983 AA.
AC A0A096PBK5;
DT 26-NOV-2014, integrated into UniProtKB/TrEMBL.
DT 26-NOV-2014, sequence version 1.
DT 13-SEP-2023, entry version 27.
DE SubName: Full=SET domain {ECO:0000313|EMBL:CEG02064.1};
GN ORFNames=OT_ostta14g01410 {ECO:0000313|EMBL:CEG02064.1};
OS Ostreococcus tauri.
OC Eukaryota; Viridiplantae; Chlorophyta; Mamiellophyceae; Mamiellales;
OC Bathycoccaceae; Ostreococcus.
OX NCBI_TaxID=70448 {ECO:0000313|EMBL:CEG02064.1, ECO:0000313|Proteomes:UP000009170};
RN [1] {ECO:0000313|Proteomes:UP000009170}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=OTTH0595 {ECO:0000313|Proteomes:UP000009170};
RX PubMed=16868079; DOI=10.1073/pnas.0604795103;
RA Derelle E., Ferraz C., Rombauts S., Rouze P., Worden A.Z., Robbens S.,
RA Partensky F., Degroeve S., Echeynie S., Cooke R., Saeys Y., Wuyts J.,
RA Jabbari K., Bowler C., Panaud O., Piegu B., Ball S.G., Ral J.-P.,
RA Bouget F.-Y., Piganeau G., De Baets B., Picard A., Delseny M., Demaille J.,
RA Van de Peer Y., Moreau H.;
RT "Genome analysis of the smallest free-living eukaryote Ostreococcus tauri
RT unveils many unique features.";
RL Proc. Natl. Acad. Sci. U.S.A. 103:11647-11652(2006).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:CEG02064.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CAID01000014; CEG02064.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A096PBK5; -.
DR STRING; 70448.A0A096PBK5; -.
DR InParanoid; A0A096PBK5; -.
DR OrthoDB; 902834at2759; -.
DR Proteomes; UP000009170; Chromosome 14.
DR GO; GO:0042054; F:histone methyltransferase activity; IEA:InterPro.
DR CDD; cd10519; SET_EZH; 1.
DR Gene3D; 2.170.270.10; SET domain; 1.
DR InterPro; IPR026489; CXC_dom.
DR InterPro; IPR045318; EZH1/2-like.
DR InterPro; IPR001214; SET_dom.
DR InterPro; IPR046341; SET_dom_sf.
DR InterPro; IPR033467; Tesmin/TSO1-like_CXC.
DR PANTHER; PTHR45747; HISTONE-LYSINE N-METHYLTRANSFERASE E(Z); 1.
DR PANTHER; PTHR45747:SF4; HISTONE-LYSINE N-METHYLTRANSFERASE E(Z); 1.
DR Pfam; PF00856; SET; 1.
DR SMART; SM01114; CXC; 1.
DR SMART; SM00317; SET; 1.
DR SUPFAM; SSF82199; SET domain; 1.
DR PROSITE; PS51633; CXC; 1.
DR PROSITE; PS50280; SET; 1.
PE 4: Predicted;
KW Coiled coil {ECO:0000256|SAM:Coils};
KW Reference proteome {ECO:0000313|Proteomes:UP000009170}.
FT DOMAIN 699..804
FT /note="CXC"
FT /evidence="ECO:0000259|PROSITE:PS51633"
FT DOMAIN 829..944
FT /note="SET"
FT /evidence="ECO:0000259|PROSITE:PS50280"
FT REGION 1..23
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 59..85
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 668..697
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 954..983
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COILED 195..222
FT /evidence="ECO:0000256|SAM:Coils"
FT COMPBIAS 680..697
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 954..968
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 969..983
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 983 AA; 110310 MW; FB4F61DB8FBD40A9 CRC64;
MTPKTTTGTT LGFDDDDDDG RATTAMTDAR ANASTVKARV VTPMPTSTSN VEIVNLLSSE
ETSSEGGGTL DEGGEAARGS GKRAEGRAVD VDDVVVLDKE DVNDDAPAPS EVVDELKKFG
ATSAELAFVS ALRAFFEERG EEFAKVMVEK EVEERMRDRA REANALEANT NATNRKAACI
REACLREAKT KEHARAVVRR ALKTYRDELR KWERAVVRAQ VAPASTSDAS PSSSAREDTL
TLPFWRHDRA LVDVPSKTVS VKTQVAVQPF SSHHRNRRSW KMRANSVKTQ KVRCVNLGSS
EAVPPYNYFA YSTHCNSFNE HDDNQSRLLY KDEEGEFVED QADDIEQVKV ANEFTREQEF
VLGALASTFS RFVLFDHVEV NGMQRETVVK HVVEEVSSIL KMEEEIVVDW LNEARTTQST
SRAWCMFLEA IEIVCKLWSP YSKFDNFESL YKLMGGVCET LAQVDTDGIF WQYFAQIVIN
APTMAPLKKP VIIFDTLEEA TEQLSGAFCP RCFVFDCRTH GSLQPKSRGR KHASERKLLW
RERMHKKANV NENDLKCSPA CWYQSSEYKY LATHGMVCVS CDPSERSPLS KDSETEDPFA
KNRRKWRNVL DVEILKKAVE VLSTAGGKPT ACDVTLFFGK RRTCADVGRQ IHSLELISSG
AIVADEDVEE GDEDVNGKNK GKKRKHRSGQ TKKKNPTIAN RLKRSKTDEN ALVTQYTPCD
CEGQCDAATC SCIQKGIFCE RFCNCGPNCD NEFPGCKCET TKKTCRTNTC PCFAAGRECT
PDKCRRCCKT ADALMLPIRQ KYGFVDPAQT AKIPDYPCGN MKLQLRQKEH VCLGKSGVAG
WGAHVLHGAR KDDFIGEYVG ELVTQDEADR RGMVYDRNNC SYLFDLNSEF CIDAQNRGNK
LRFANHSVHP NVRSAVMAVN GDNRLAMFAL RDIAPGEELF FDYRYKDEVA PEWHEKREKV
EKDAKSAKKS KGNRNKHSAK KAY
//