ID C3YGG5_BRAFL Unreviewed; 219 AA.
AC C3YGG5;
DT 28-JUL-2009, integrated into UniProtKB/TrEMBL.
DT 28-JUL-2009, sequence version 1.
DT 24-JAN-2024, entry version 59.
DE RecName: Full=Small nuclear ribonucleoprotein-associated protein {ECO:0000256|PIRNR:PIRNR037187};
GN ORFNames=BRAFLDRAFT_94837 {ECO:0000313|EMBL:EEN60686.1};
OS Branchiostoma floridae (Florida lancelet) (Amphioxus).
OC Eukaryota; Metazoa; Chordata; Cephalochordata; Leptocardii; Amphioxiformes;
OC Branchiostomatidae; Branchiostoma.
OX NCBI_TaxID=7739;
RN [1] {ECO:0000313|EMBL:EEN60686.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=S238N-H82 {ECO:0000313|EMBL:EEN60686.1};
RC TISSUE=Testes {ECO:0000313|EMBL:EEN60686.1};
RX PubMed=18563158; DOI=10.1038/nature06967;
RG US DOE Joint Genome Institute (JGI-PGF);
RA Putnam N.H., Butts T., Ferrier D.E.K., Furlong R.F., Hellsten U.,
RA Kawashima T., Robinson-Rechavi M., Shoguchi E., Terry A., Yu J.-K.,
RA Benito-Gutierrez E.L., Dubchak I., Garcia-Fernandez J., Gibson-Brown J.J.,
RA Grigoriev I.V., Horton A.C., de Jong P.J., Jurka J., Kapitonov V.V.,
RA Kohara Y., Kuroki Y., Lindquist E., Lucas S., Osoegawa K., Pennacchio L.A.,
RA Salamov A.A., Satou Y., Sauka-Spengler T., Schmutz J., Shin-I T.,
RA Toyoda A., Bronner-Fraser M., Fujiyama A., Holland L.Z., Holland P.W.H.,
RA Satoh N., Rokhsar D.S.;
RT "The amphioxus genome and the evolution of the chordate karyotype.";
RL Nature 453:1064-1071(2008).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|PIRNR:PIRNR037187}.
CC -!- SIMILARITY: Belongs to the snRNP SmB/SmN family.
CC {ECO:0000256|ARBA:ARBA00009123, ECO:0000256|PIRNR:PIRNR037187}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; GG666511; EEN60686.1; -; Genomic_DNA.
DR RefSeq; XP_002604675.1; XM_002604629.1.
DR AlphaFoldDB; C3YGG5; -.
DR STRING; 7739.C3YGG5; -.
DR eggNOG; KOG3168; Eukaryota.
DR InParanoid; C3YGG5; -.
DR GO; GO:0071013; C:catalytic step 2 spliceosome; IBA:GO_Central.
DR GO; GO:0005737; C:cytoplasm; IBA:GO_Central.
DR GO; GO:0005685; C:U1 snRNP; IBA:GO_Central.
DR GO; GO:0005686; C:U2 snRNP; IBA:GO_Central.
DR GO; GO:0071004; C:U2-type prespliceosome; IBA:GO_Central.
DR GO; GO:0005687; C:U4 snRNP; IBA:GO_Central.
DR GO; GO:0046540; C:U4/U6 x U5 tri-snRNP complex; IBA:GO_Central.
DR GO; GO:0005682; C:U5 snRNP; IBA:GO_Central.
DR GO; GO:0003723; F:RNA binding; IEA:UniProtKB-KW.
DR GO; GO:0000398; P:mRNA splicing, via spliceosome; IBA:GO_Central.
DR CDD; cd01717; Sm_B; 1.
DR Gene3D; 2.30.30.100; -; 1.
DR InterPro; IPR010920; LSM_dom_sf.
DR InterPro; IPR047575; Sm.
DR InterPro; IPR001163; Sm_dom_euk/arc.
DR InterPro; IPR017131; snRNP-assoc_SmB/SmN.
DR PANTHER; PTHR10701:SF0; SMALL NUCLEAR RIBONUCLEOPROTEIN-ASSOCIATED PROTEIN B; 1.
DR PANTHER; PTHR10701; SMALL NUCLEAR RIBONUCLEOPROTEIN-ASSOCIATED PROTEIN B AND N; 1.
DR Pfam; PF01423; LSM; 1.
DR PIRSF; PIRSF037187; snRNP_SmB/SmN; 1.
DR SMART; SM00651; Sm; 1.
DR SUPFAM; SSF50182; Sm-like ribonucleoproteins; 1.
DR PROSITE; PS52002; SM; 1.
PE 3: Inferred from homology;
KW Nucleus {ECO:0000256|PIRNR:PIRNR037187};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Ribonucleoprotein {ECO:0000256|ARBA:ARBA00023274,
KW ECO:0000256|PIRNR:PIRNR037187};
KW RNA-binding {ECO:0000256|PIRNR:PIRNR037187}.
FT DOMAIN 4..86
FT /note="Sm"
FT /evidence="ECO:0000259|PROSITE:PS52002"
FT REGION 127..219
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 137..151
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 155..169
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 181..219
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 219 AA; 22824 MW; 942F8CC960778A33 CRC64;
MVVGKSSKMQ AHLNYRMRVT LQDSRVFIGT FKAFDKHMNL ILVDCDEFRK IRPKSQKQEE
REEKRVLGLV LLRGENIVSM TVEGPPPADE GVARVPIPGA GPGPGMGRAA GRGVANVAGP
SAPAGLAGPV RGVGGPSQQV MTPQGRAAVS APPQQYQRPP GPPPGARGPP GGMGGRGMPP
GMPMGMPPPG MRPGMRPGGP PPGMMRGPPP PGMRPPPRQ
//