ID E4WWG4_OIKDI Unreviewed; 127 AA.
AC E4WWG4;
DT 08-FEB-2011, integrated into UniProtKB/TrEMBL.
DT 08-FEB-2011, sequence version 1.
DT 27-MAR-2024, entry version 41.
DE RecName: Full=Small nuclear ribonucleoprotein Sm D1 {ECO:0000256|RuleBase:RU365054};
DE AltName: Full=snRNP core protein D1 {ECO:0000256|RuleBase:RU365054};
GN ORFNames=GSOID_T00009236001 {ECO:0000313|EMBL:CBY21468.1},
GN GSOID_T00021493001 {ECO:0000313|EMBL:CBY33571.1};
OS Oikopleura dioica (Tunicate).
OC Eukaryota; Metazoa; Chordata; Tunicata; Appendicularia; Copelata;
OC Oikopleuridae; Oikopleura.
OX NCBI_TaxID=34765 {ECO:0000313|EMBL:CBY21468.1};
RN [1] {ECO:0000313|EMBL:CBY21468.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=21097902; DOI=10.1126/science.1194167;
RA Denoeud F., Henriet S., Mungpakdee S., Aury J.M., Da Silva C.,
RA Brinkmann H., Mikhaleva J., Olsen L.C., Jubin C., Canestro C.,
RA Bouquet J.M., Danks G., Poulain J., Campsteijn C., Adamski M., Cross I.,
RA Yadetie F., Muffato M., Louis A., Butcher S., Tsagkogeorga G., Konrad A.,
RA Singh S., Jensen M.F., Cong E.H., Eikeseth-Otteraa H., Noel B.,
RA Anthouard V., Porcel B.M., Kachouri-Lafond R., Nishino A., Ugolini M.,
RA Chourrout P., Nishida H., Aasland R., Huzurbazar S., Westhof E., Delsuc F.,
RA Lehrach H., Reinhardt R., Weissenbach J., Roy S.W., Artiguenave F.,
RA Postlethwait J.H., Manak J.R., Thompson E.M., Jaillon O., Du Pasquier L.,
RA Boudinot P., Liberles D.A., Volff J.N., Philippe H., Lenhard B.,
RA Roest Crollius H., Wincker P., Chourrout D.;
RT "Plasticity of animal genome architecture unmasked by rapid evolution of a
RT pelagic tunicate.";
RL Science 330:1381-1385(2010).
CC -!- FUNCTION: Plays a role in pre-mRNA splicing as a core component of the
CC spliceosomal U1, U2, U4 and U5 small nuclear ribonucleoproteins
CC (snRNPs), the building blocks of the spliceosome.
CC {ECO:0000256|RuleBase:RU365054}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|RuleBase:RU365054}.
CC -!- SIMILARITY: Belongs to the snRNP core protein family.
CC {ECO:0000256|ARBA:ARBA00008146, ECO:0000256|RuleBase:RU365054}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; FN653017; CBY21468.1; -; Genomic_DNA.
DR EMBL; FN654428; CBY33571.1; -; Genomic_DNA.
DR AlphaFoldDB; E4WWG4; -.
DR InParanoid; E4WWG4; -.
DR Proteomes; UP000001307; Unassembled WGS sequence.
DR Proteomes; UP000011014; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:1990904; C:ribonucleoprotein complex; IEA:UniProtKB-KW.
DR GO; GO:0003723; F:RNA binding; IEA:InterPro.
DR GO; GO:0000387; P:spliceosomal snRNP assembly; IEA:UniProtKB-UniRule.
DR CDD; cd01724; Sm_D1; 1.
DR Gene3D; 2.30.30.100; -; 1.
DR InterPro; IPR027141; LSm4/Sm_D1/D3.
DR InterPro; IPR010920; LSM_dom_sf.
DR InterPro; IPR047575; Sm.
DR InterPro; IPR034102; Sm_D1.
DR InterPro; IPR001163; Sm_dom_euk/arc.
DR PANTHER; PTHR23338; SMALL NUCLEAR RIBONUCLEOPROTEIN SM; 1.
DR PANTHER; PTHR23338:SF18; SMALL NUCLEAR RIBONUCLEOPROTEIN SM D1; 1.
DR Pfam; PF01423; LSM; 1.
DR SMART; SM00651; Sm; 1.
DR SUPFAM; SSF50182; Sm-like ribonucleoproteins; 1.
DR PROSITE; PS52002; SM; 1.
PE 3: Inferred from homology;
KW mRNA processing {ECO:0000256|RuleBase:RU365054};
KW mRNA splicing {ECO:0000256|RuleBase:RU365054};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|RuleBase:RU365054};
KW Reference proteome {ECO:0000313|Proteomes:UP000001307};
KW Ribonucleoprotein {ECO:0000256|ARBA:ARBA00023274,
KW ECO:0000256|RuleBase:RU365054}.
FT DOMAIN 2..74
FT /note="Sm"
FT /evidence="ECO:0000259|PROSITE:PS52002"
FT REGION 78..127
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 88..102
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 127 AA; 13831 MW; FEA7A0C7B2BA5969 CRC64;
MKLARFLMKL SHETVTIELK NGTQVVGTIA GVDICMNTHL KNVRLTVKGR EPQQLDALSI
RGNNIRYYIL PDSVPLDTLL IDDGPKARGG GSTRDRGRGR GKPTRGGRGG PRGRGGPRGR
GGPRHRR
//