GenomeNet

Database: UniProt
Entry: K0SH37_THAOC
LinkDB: K0SH37_THAOC
Original site: K0SH37_THAOC 
ID   K0SH37_THAOC            Unreviewed;       607 AA.
AC   K0SH37;
DT   28-NOV-2012, integrated into UniProtKB/TrEMBL.
DT   28-NOV-2012, sequence version 1.
DT   27-MAR-2024, entry version 29.
DE   RecName: Full=SET domain-containing protein {ECO:0000259|PROSITE:PS50280};
GN   ORFNames=THAOC_22067 {ECO:0000313|EMBL:EJK57852.1};
OS   Thalassiosira oceanica (Marine diatom).
OC   Eukaryota; Sar; Stramenopiles; Ochrophyta; Bacillariophyta;
OC   Coscinodiscophyceae; Thalassiosirophycidae; Thalassiosirales;
OC   Thalassiosiraceae; Thalassiosira.
OX   NCBI_TaxID=159749 {ECO:0000313|EMBL:EJK57852.1, ECO:0000313|Proteomes:UP000266841};
RN   [1] {ECO:0000313|EMBL:EJK57852.1, ECO:0000313|Proteomes:UP000266841}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=CCMP1005 {ECO:0000313|EMBL:EJK57852.1,
RC   ECO:0000313|Proteomes:UP000266841};
RX   PubMed=22835381; DOI=10.1186/gb-2012-13-7-r66;
RA   Lommer M., Specht M., Roy A.S., Kraemer L., Andreson R., Gutowska M.A.,
RA   Wolf J., Bergner S.V., Schilhabel M.B., Klostermeier U.C., Beiko R.G.,
RA   Rosenstiel P., Hippler M., Laroche J.;
RT   "Genome and low-iron response of an oceanic diatom adapted to chronic iron
RT   limitation.";
RL   Genome Biol. 13:R66-R66(2012).
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:EJK57852.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AGNL01026874; EJK57852.1; -; Genomic_DNA.
DR   AlphaFoldDB; K0SH37; -.
DR   EnsemblProtists; EJK57852; EJK57852; THAOC_22067.
DR   eggNOG; ENOG502TGC5; Eukaryota.
DR   OMA; RPSWDIA; -.
DR   Proteomes; UP000266841; Unassembled WGS sequence.
DR   CDD; cd10527; SET_LSMT; 1.
DR   Gene3D; 3.90.1410.10; set domain protein methyltransferase, domain 1; 1.
DR   InterPro; IPR001214; SET_dom.
DR   InterPro; IPR046341; SET_dom_sf.
DR   PANTHER; PTHR13271:SF155; CYTOCHROME C LYSINE N-METHYLTRANSFERASE 1-RELATED; 1.
DR   PANTHER; PTHR13271; UNCHARACTERIZED PUTATIVE METHYLTRANSFERASE; 1.
DR   Pfam; PF00856; SET; 1.
DR   SUPFAM; SSF82199; SET domain; 1.
DR   PROSITE; PS50280; SET; 1.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000266841}.
FT   DOMAIN          194..334
FT                   /note="SET"
FT                   /evidence="ECO:0000259|PROSITE:PS50280"
SQ   SEQUENCE   607 AA;  66528 MW;  094AAD037006C20D CRC64;
     MRGVFVVALP YYLAARVESF NGVKSNQSAG VHPHKFVWQR TLENSPNSFV HPSVDLAIRP
     PSNGGTGIVA KDVLAEGSVA LSLSLHEIGM IDARSILDGF DAGEDDDAVL SMLSDMWKNE
     VKHAKDKEVS EDGKRLAVLA GCLAHLQLTR FKDKTAWTSN EVNEGHALRE SRRLGPFLDA
     MPLLPTPDGT NPMPTHYLYW SDDEIQMLLA GTFGLTRARE NRAGIGLVLR QWSPSFLKEH
     STLGQTAILN SVFSSFASVM SRSFGDDVGR DLDGKGRMLV PVVDMLNHDT EDPNVHWTWH
     VKDDEDEISQ GKGDILVTAL RDIKAGEELL KCYGWRPSWD IASSYGFVPA LNNERWDCAV
     IPLFPPILDL EPDEISPPTS ACKDETRVDM LLESNYGPLV KGVLAAVNAA SEIRARQDGG
     NEEVAHVGER PDQLRRVEVV SLFRPAPANV DIDFAFPRRQ PCVVIGTKIQ SESCKSDNSL
     HHKEAIDAVI PAFKAAASAM EQLRRNHREG NTEPISAAQM AKAAASLDTS KDWDTLGIEL
     LEAGIRDRID TLVQEGRAAN AWLATSVAGN SKEREMRAGM AQDVRSSELA VLRAVHPSAQ
     SPIRNAS
//
DBGET integrated database retrieval system