GenomeNet

Database: UniProt
Entry: K0SD38_THAOC
LinkDB: K0SD38_THAOC
Original site: K0SD38_THAOC 
ID   K0SD38_THAOC            Unreviewed;       261 AA.
AC   K0SD38;
DT   28-NOV-2012, integrated into UniProtKB/TrEMBL.
DT   28-NOV-2012, sequence version 1.
DT   24-JAN-2024, entry version 29.
DE   RecName: Full=SET domain-containing protein {ECO:0000259|PROSITE:PS50280};
GN   ORFNames=THAOC_16095 {ECO:0000313|EMBL:EJK63260.1};
OS   Thalassiosira oceanica (Marine diatom).
OC   Eukaryota; Sar; Stramenopiles; Ochrophyta; Bacillariophyta;
OC   Coscinodiscophyceae; Thalassiosirophycidae; Thalassiosirales;
OC   Thalassiosiraceae; Thalassiosira.
OX   NCBI_TaxID=159749 {ECO:0000313|EMBL:EJK63260.1, ECO:0000313|Proteomes:UP000266841};
RN   [1] {ECO:0000313|EMBL:EJK63260.1, ECO:0000313|Proteomes:UP000266841}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=CCMP1005 {ECO:0000313|EMBL:EJK63260.1,
RC   ECO:0000313|Proteomes:UP000266841};
RX   PubMed=22835381; DOI=10.1186/gb-2012-13-7-r66;
RA   Lommer M., Specht M., Roy A.S., Kraemer L., Andreson R., Gutowska M.A.,
RA   Wolf J., Bergner S.V., Schilhabel M.B., Klostermeier U.C., Beiko R.G.,
RA   Rosenstiel P., Hippler M., Laroche J.;
RT   "Genome and low-iron response of an oceanic diatom adapted to chronic iron
RT   limitation.";
RL   Genome Biol. 13:R66-R66(2012).
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:EJK63260.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AGNL01018357; EJK63260.1; -; Genomic_DNA.
DR   AlphaFoldDB; K0SD38; -.
DR   EnsemblProtists; EJK63260; EJK63260; THAOC_16095.
DR   eggNOG; ENOG502SB3B; Eukaryota.
DR   OMA; AMANWCT; -.
DR   Proteomes; UP000266841; Unassembled WGS sequence.
DR   Gene3D; 2.170.270.10; SET domain; 1.
DR   InterPro; IPR001214; SET_dom.
DR   InterPro; IPR046341; SET_dom_sf.
DR   Pfam; PF00856; SET; 1.
DR   SMART; SM00317; SET; 1.
DR   SUPFAM; SSF82199; SET domain; 1.
DR   PROSITE; PS50280; SET; 1.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000266841};
KW   Signal {ECO:0000256|SAM:SignalP}.
FT   SIGNAL          1..15
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT   CHAIN           16..261
FT                   /note="SET domain-containing protein"
FT                   /evidence="ECO:0000256|SAM:SignalP"
FT                   /id="PRO_5012474921"
FT   DOMAIN          100..248
FT                   /note="SET"
FT                   /evidence="ECO:0000259|PROSITE:PS50280"
SQ   SEQUENCE   261 AA;  27711 MW;  9FDA1B63E8DC1251 CRC64;
     MMLVTVILAA GVVSSSVVSS FRLSRQVANA GSSVAQLAAI PVSTLPLEAS YIVSPASPSN
     DGSITATTTQ ALGLTSSIIT YVSLLLVYDR PKGSLELDES YLSIRESNVP GAGLGLYAVQ
     TIKEGTVLGT YAGVVRPAQE FYSGKCRQFP GAVSYSWRFT DNQFVIDPTD ERGEIQFVCQ
     GGSDVPLSNI FFSLLGTGKS TALCRINEPP IGAGGCNVGA KENLDKREVV FTTLRDIFAG
     EELFLDYGLD YDRSGYSQSP S
//
DBGET integrated database retrieval system