GenomeNet

Database: UniProt
Entry: K0SMF4_THAOC
LinkDB: K0SMF4_THAOC
Original site: K0SMF4_THAOC 
ID   K0SMF4_THAOC            Unreviewed;       478 AA.
AC   K0SMF4;
DT   28-NOV-2012, integrated into UniProtKB/TrEMBL.
DT   28-NOV-2012, sequence version 1.
DT   28-JUN-2023, entry version 26.
DE   RecName: Full=Sm domain-containing protein {ECO:0000259|SMART:SM00651};
GN   ORFNames=THAOC_11464 {ECO:0000313|EMBL:EJK67493.1};
OS   Thalassiosira oceanica (Marine diatom).
OC   Eukaryota; Sar; Stramenopiles; Ochrophyta; Bacillariophyta;
OC   Coscinodiscophyceae; Thalassiosirophycidae; Thalassiosirales;
OC   Thalassiosiraceae; Thalassiosira.
OX   NCBI_TaxID=159749 {ECO:0000313|EMBL:EJK67493.1, ECO:0000313|Proteomes:UP000266841};
RN   [1] {ECO:0000313|EMBL:EJK67493.1, ECO:0000313|Proteomes:UP000266841}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=CCMP1005 {ECO:0000313|EMBL:EJK67493.1,
RC   ECO:0000313|Proteomes:UP000266841};
RX   PubMed=22835381; DOI=10.1186/gb-2012-13-7-r66;
RA   Lommer M., Specht M., Roy A.S., Kraemer L., Andreson R., Gutowska M.A.,
RA   Wolf J., Bergner S.V., Schilhabel M.B., Klostermeier U.C., Beiko R.G.,
RA   Rosenstiel P., Hippler M., Laroche J.;
RT   "Genome and low-iron response of an oceanic diatom adapted to chronic iron
RT   limitation.";
RL   Genome Biol. 13:R66-R66(2012).
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:EJK67493.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AGNL01013012; EJK67493.1; -; Genomic_DNA.
DR   AlphaFoldDB; K0SMF4; -.
DR   EnsemblProtists; EJK67493; EJK67493; THAOC_11464.
DR   eggNOG; ENOG502QRZV; Eukaryota.
DR   OMA; KQRYFHQ; -.
DR   Proteomes; UP000266841; Unassembled WGS sequence.
DR   GO; GO:0071209; F:U7 snRNA binding; IEA:InterPro.
DR   Gene3D; 2.30.30.100; -; 1.
DR   InterPro; IPR039267; Lsm11.
DR   InterPro; IPR010920; LSM_dom_sf.
DR   InterPro; IPR001163; Sm_dom_euk/arc.
DR   PANTHER; PTHR21415; U7 SNRNA-ASSOCIATED SM-LIKE PROTEIN LSM11; 1.
DR   PANTHER; PTHR21415:SF1; U7 SNRNA-ASSOCIATED SM-LIKE PROTEIN LSM11; 1.
DR   Pfam; PF01423; LSM; 1.
DR   SMART; SM00651; Sm; 1.
DR   SUPFAM; SSF50182; Sm-like ribonucleoproteins; 1.
PE   4: Predicted;
KW   Reference proteome {ECO:0000313|Proteomes:UP000266841}.
FT   DOMAIN          244..357
FT                   /note="Sm"
FT                   /evidence="ECO:0000259|SMART:SM00651"
FT   REGION          1..26
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          92..122
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          194..217
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          360..383
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   478 AA;  51934 MW;  E1F54B18D71532CF CRC64;
     MSSNSAAAPS GEGARKSPPP VSLGKPSLRH RLESYYSLVS PPSIADPAKW RSNFESIYEK
     YGGSVDGETK LARKLAKKYG GRVMLLVAPP HRSTAKRGVG DGGSSLPAGR GEGKRDESHY
     ELDEGRRGSL VLDFASPAFD QIHALRAPEA EVIEANPSSF ASRDGATQRL DNIHKFPALL
     PACDPLHFTP KARPLASHSN VDVPEKKPGP RNADDSVPKK MSLFQSLSSR YESPKSGPLS
     LLYSILATRS RVRVMVRYVD CIRGVLTGQL VAFDKHFNMI IRDADEVYTG RVTRHAESVE
     AAGGLHGNDG GTVVGPWKAG LEARRRGVGG SGGGLRAKQR YFHQMLIRGD NVVMVWRADS
     ERSAQKQPTQ DGKGGDRPGT PGSLFYAKER GGCPDGNRSG SIHQLRTNFM TSASNGASWF
     HRAASKMTPH YFPQSYRFNS SDAQTNSLRD FLGANENRPV GQIHAESVTG RTFFALLH
//
DBGET integrated database retrieval system