ID K0SMG7_THAOC Unreviewed; 222 AA.
AC K0SMG7;
DT 28-NOV-2012, integrated into UniProtKB/TrEMBL.
DT 28-NOV-2012, sequence version 1.
DT 27-MAR-2024, entry version 30.
DE RecName: Full=Small nuclear ribonucleoprotein E {ECO:0000256|RuleBase:RU365053};
DE Short=snRNP-E {ECO:0000256|RuleBase:RU365053};
DE AltName: Full=Sm protein E {ECO:0000256|RuleBase:RU365053};
GN ORFNames=THAOC_11446 {ECO:0000313|EMBL:EJK67508.1};
OS Thalassiosira oceanica (Marine diatom).
OC Eukaryota; Sar; Stramenopiles; Ochrophyta; Bacillariophyta;
OC Coscinodiscophyceae; Thalassiosirophycidae; Thalassiosirales;
OC Thalassiosiraceae; Thalassiosira.
OX NCBI_TaxID=159749 {ECO:0000313|EMBL:EJK67508.1, ECO:0000313|Proteomes:UP000266841};
RN [1] {ECO:0000313|EMBL:EJK67508.1, ECO:0000313|Proteomes:UP000266841}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CCMP1005 {ECO:0000313|EMBL:EJK67508.1,
RC ECO:0000313|Proteomes:UP000266841};
RX PubMed=22835381; DOI=10.1186/gb-2012-13-7-r66;
RA Lommer M., Specht M., Roy A.S., Kraemer L., Andreson R., Gutowska M.A.,
RA Wolf J., Bergner S.V., Schilhabel M.B., Klostermeier U.C., Beiko R.G.,
RA Rosenstiel P., Hippler M., Laroche J.;
RT "Genome and low-iron response of an oceanic diatom adapted to chronic iron
RT limitation.";
RL Genome Biol. 13:R66-R66(2012).
CC -!- FUNCTION: Plays a role in pre-mRNA splicing as a core component of the
CC spliceosomal U1, U2, U4 and U5 small nuclear ribonucleoproteins
CC (snRNPs), the building blocks of the spliceosome.
CC {ECO:0000256|RuleBase:RU365053}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|RuleBase:RU365053}.
CC -!- SIMILARITY: Belongs to the snRNP Sm proteins family.
CC {ECO:0000256|ARBA:ARBA00006850, ECO:0000256|RuleBase:RU365053}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EJK67508.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AGNL01012987; EJK67508.1; -; Genomic_DNA.
DR AlphaFoldDB; K0SMG7; -.
DR EnsemblProtists; EJK67508; EJK67508; THAOC_11446.
DR eggNOG; KOG1774; Eukaryota.
DR Proteomes; UP000266841; Unassembled WGS sequence.
DR GO; GO:0005681; C:spliceosomal complex; IEA:UniProtKB-KW.
DR GO; GO:0005685; C:U1 snRNP; IEA:UniProtKB-UniRule.
DR GO; GO:0005686; C:U2 snRNP; IEA:UniProtKB-UniRule.
DR GO; GO:0005687; C:U4 snRNP; IEA:UniProtKB-UniRule.
DR GO; GO:0046540; C:U4/U6 x U5 tri-snRNP complex; IEA:UniProtKB-UniRule.
DR GO; GO:0005682; C:U5 snRNP; IEA:UniProtKB-UniRule.
DR GO; GO:0003723; F:RNA binding; IEA:UniProtKB-KW.
DR GO; GO:0000387; P:spliceosomal snRNP assembly; IEA:UniProtKB-UniRule.
DR CDD; cd01718; Sm_E; 1.
DR Gene3D; 2.30.30.100; -; 1.
DR InterPro; IPR010920; LSM_dom_sf.
DR InterPro; IPR047575; Sm.
DR InterPro; IPR001163; Sm_dom_euk/arc.
DR InterPro; IPR027078; snRNP-E.
DR PANTHER; PTHR11193; SMALL NUCLEAR RIBONUCLEOPROTEIN E; 1.
DR PANTHER; PTHR11193:SF0; SMALL NUCLEAR RIBONUCLEOPROTEIN E; 1.
DR Pfam; PF01423; LSM; 1.
DR SMART; SM00651; Sm; 1.
DR SUPFAM; SSF50182; Sm-like ribonucleoproteins; 1.
DR PROSITE; PS52002; SM; 1.
PE 3: Inferred from homology;
KW mRNA processing {ECO:0000256|RuleBase:RU365053};
KW mRNA splicing {ECO:0000256|RuleBase:RU365053};
KW Nucleus {ECO:0000256|RuleBase:RU365053};
KW Reference proteome {ECO:0000313|Proteomes:UP000266841};
KW Ribonucleoprotein {ECO:0000256|ARBA:ARBA00023274,
KW ECO:0000256|RuleBase:RU365053};
KW RNA-binding {ECO:0000256|RuleBase:RU365053};
KW Spliceosome {ECO:0000256|RuleBase:RU365053}.
FT DOMAIN 150..222
FT /note="Sm"
FT /evidence="ECO:0000259|PROSITE:PS52002"
SQ SEQUENCE 222 AA; 24503 MW; 917250AD25E8F508 CRC64;
MLMPALSLRA CTQPYPGPQD YKIGKGRESS GLPLLLAVLL PAVSPRTGTG RYGFITAHLL
QDTDGRMTSD HTAGRRTSGT LLVSIEYQSH HAAPKSQKGD DIAHQCHIWS SSVSLERQHM
AACSRRSRAP PNARRGSTGD HLCGPYILIS RIPLAQIRKK TRVKIWLYED TRMSIEGQII
GFDEYMNFVL DSATEVDMKT GKRTDVGRIL LKGDAITLMQ TA
//