ID Q3UK67_MOUSE Unreviewed; 639 AA.
AC Q3UK67;
DT 11-OCT-2005, integrated into UniProtKB/TrEMBL.
DT 11-OCT-2005, sequence version 1.
DT 24-JAN-2024, entry version 106.
DE RecName: Full=Splicing factor 1 {ECO:0000256|RuleBase:RU367126};
GN Name=Sf1 {ECO:0000313|MGI:MGI:1095403};
GN Synonyms=Zfp162 {ECO:0000313|MGI:MGI:1095403};
OS Mus musculus (Mouse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; Mammalia;
OC Eutheria; Euarchontoglires; Glires; Rodentia; Myomorpha; Muroidea; Muridae;
OC Murinae; Mus; Mus.
OX NCBI_TaxID=10090 {ECO:0000313|EMBL:BAE26935.1};
RN [1] {ECO:0000313|EMBL:BAE26935.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=BALB/C {ECO:0000313|EMBL:BAE26935.1};
RX PubMed=10349636; DOI=10.1016/S0076-6879(99)03004-9;
RA Carninci P., Hayashizaki Y.;
RT "High-efficiency full-length cDNA cloning.";
RL Methods Enzymol. 303:19-44(1999).
RN [2] {ECO:0000313|EMBL:BAE26935.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=BALB/C {ECO:0000313|EMBL:BAE26935.1};
RX PubMed=11042159; DOI=10.1101/gr.145100;
RA Carninci P., Shibata Y., Hayatsu N., Sugahara Y., Shibata K., Itoh M.,
RA Konno H., Okazaki Y., Muramatsu M., Hayashizaki Y.;
RT "Normalization and subtraction of cap-trapper-selected cDNAs to prepare
RT full-length cDNA libraries for rapid discovery of new genes.";
RL Genome Res. 10:1617-1630(2000).
RN [3] {ECO:0000313|EMBL:BAE26935.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=BALB/C {ECO:0000313|EMBL:BAE26935.1};
RX PubMed=11076861; DOI=10.1101/gr.152600;
RA Shibata K., Itoh M., Aizawa K., Nagaoka S., Sasaki N., Carninci P.,
RA Konno H., Akiyama J., Nishi K., Kitsunai T., Tashiro H., Itoh M., Sumi N.,
RA Ishii Y., Nakamura S., Hazama M., Nishine T., Harada A., Yamamoto R.,
RA Matsumoto H., Sakaguchi S., Ikegami T., Kashiwagi K., Fujiwake S.,
RA Inoue K., Togawa Y., Izawa M., Ohara E., Watahiki M., Yoneda Y.,
RA Ishikawa T., Ozawa K., Tanaka T., Matsuura S., Kawai J., Okazaki Y.,
RA Muramatsu M., Inoue Y., Kira A., Hayashizaki Y.;
RT "RIKEN integrated sequence analysis (RISA) system--384-format sequencing
RT pipeline with 384 multicapillary sequencer.";
RL Genome Res. 10:1757-1771(2000).
RN [4] {ECO:0000313|EMBL:BAE26935.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=BALB/C {ECO:0000313|EMBL:BAE26935.1};
RX PubMed=11217851; DOI=10.1038/35055500;
RG The RIKEN Genome Exploration Research Group Phase II Team and the FANTOM Consortium;
RT "Functional annotation of a full-length mouse cDNA collection.";
RL Nature 409:685-690(2001).
RN [5] {ECO:0000313|EMBL:BAE26935.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=BALB/C {ECO:0000313|EMBL:BAE26935.1};
RX PubMed=12466851; DOI=10.1038/nature01266;
RG The FANTOM Consortium and the RIKEN Genome Exploration Research Group Phase I and II Team;
RT "Analysis of the mouse transcriptome based on functional annotation of
RT 60,770 full-length cDNAs.";
RL Nature 420:563-573(2002).
RN [6] {ECO:0000313|EMBL:BAE26935.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=BALB/C {ECO:0000313|EMBL:BAE26935.1};
RA Arakawa T., Carninci P., Fukuda S., Hashizume W., Hayashida K., Hori F.,
RA Iida J., Imamura K., Imotani K., Itoh M., Kanagawa S., Kawai J., Kojima M.,
RA Konno H., Murata M., Nakamura M., Ninomiya N., Nishiyori H., Nomura K.,
RA Ohno M., Sakazume N., Sano H., Sasaki D., Shibata K., Shiraki T.,
RA Tagami M., Tagami Y., Waki K., Watahiki A., Muramatsu M., Hayashizaki Y.;
RL Submitted (MAR-2004) to the EMBL/GenBank/DDBJ databases.
RN [7] {ECO:0000313|EMBL:BAE26935.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=BALB/C {ECO:0000313|EMBL:BAE26935.1};
RG The FANTOM Consortium;
RG Riken Genome Exploration Research Group and Genome Science Group (Genome Network Project Core Group);
RT "The Transcriptional Landscape of the Mammalian Genome.";
RL Science 309:1559-1563(2005).
RN [8] {ECO:0000313|EMBL:BAE26935.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=BALB/C {ECO:0000313|EMBL:BAE26935.1};
RX PubMed=16141073; DOI=10.1126/science.1112009;
RG RIKEN Genome Exploration Research Group and Genome Science Group (Genome Network Project Core Group) and the FANTOM Consortium;
RT "Antisense Transcription in the Mammalian Transcriptome.";
RL Science 309:1564-1566(2005).
CC -!- FUNCTION: Necessary for the splicing of pre-mRNA. Has a role in the
CC recognition of the branch site (5'-UACUAAC-3'), the pyrimidine tract
CC and the 3'-splice site at the 3'-end of introns.
CC {ECO:0000256|RuleBase:RU367126}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|RuleBase:RU367126}.
CC -!- SIMILARITY: Belongs to the BBP/SF1 family.
CC {ECO:0000256|ARBA:ARBA00010382, ECO:0000256|RuleBase:RU367126}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AK146151; BAE26935.1; -; mRNA.
DR AlphaFoldDB; Q3UK67; -.
DR AGR; MGI:1095403; -.
DR MGI; MGI:1095403; Sf1.
DR ChiTaRS; Sf1; mouse.
DR GO; GO:0005681; C:spliceosomal complex; IEA:UniProtKB-KW.
DR GO; GO:0045131; F:pre-mRNA branch point binding; IEA:UniProtKB-UniRule.
DR GO; GO:0008270; F:zinc ion binding; IEA:UniProtKB-UniRule.
DR GO; GO:0000398; P:mRNA splicing, via spliceosome; IEA:UniProtKB-UniRule.
DR CDD; cd22382; KH-I_SF1; 1.
DR Gene3D; 6.10.140.1790; -; 1.
DR Gene3D; 3.30.1370.10; K Homology domain, type 1; 1.
DR InterPro; IPR045071; BBP-like.
DR InterPro; IPR004087; KH_dom.
DR InterPro; IPR004088; KH_dom_type_1.
DR InterPro; IPR036612; KH_dom_type_1_sf.
DR InterPro; IPR032570; SF1-HH.
DR InterPro; IPR047086; SF1-HH_sf.
DR InterPro; IPR001878; Znf_CCHC.
DR PANTHER; PTHR11208; RNA-BINDING PROTEIN RELATED; 1.
DR PANTHER; PTHR11208:SF45; SPLICING FACTOR 1; 1.
DR Pfam; PF00013; KH_1; 1.
DR Pfam; PF16275; SF1-HH; 1.
DR Pfam; PF00098; zf-CCHC; 1.
DR SMART; SM00322; KH; 1.
DR SMART; SM00343; ZnF_C2HC; 1.
DR SUPFAM; SSF54791; Eukaryotic type KH-domain (KH-domain type I); 1.
DR PROSITE; PS50084; KH_TYPE_1; 1.
DR PROSITE; PS50158; ZF_CCHC; 1.
PE 2: Evidence at transcript level;
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723,
KW ECO:0000256|RuleBase:RU367126};
KW mRNA processing {ECO:0000256|ARBA:ARBA00022664,
KW ECO:0000256|RuleBase:RU367126};
KW mRNA splicing {ECO:0000256|ARBA:ARBA00023187,
KW ECO:0000256|RuleBase:RU367126};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|RuleBase:RU367126};
KW RNA-binding {ECO:0000256|PROSITE-ProRule:PRU00117};
KW Spliceosome {ECO:0000256|RuleBase:RU367126};
KW Zinc {ECO:0000256|ARBA:ARBA00022833, ECO:0000256|RuleBase:RU367126};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771, ECO:0000256|PROSITE-
KW ProRule:PRU00047}.
FT DOMAIN 279..293
FT /note="CCHC-type"
FT /evidence="ECO:0000259|PROSITE:PS50158"
FT REGION 1..42
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 65..94
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 325..639
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 346..361
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 415..445
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 472..518
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 519..561
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 564..610
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 625..639
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 639 AA; 68399 MW; D5884406D947504F CRC64;
MATGANATPL DFPSKKRKRS RWNQDTMEQK TVIPGMPTVI PPGLTREQER AYIVQLQIED
LTRKLRTGDL GIPPNPEDRS PSPEPIYNSE GKRLNTREFR TRKKLEEERH TLITEMVALN
PDFKPPADYK PPATRVSDKV MIPQDEYPEI NFVGLLIGPR GNTLKNIEKE CNAKIMIRGK
GSVKEGKVGR KDGQMLPGED EPLHALVTAN TMENVKKAVE QIRNILKQGI ETPEDQNDLR
KMQLRELARL NGTLREDDNR ILRPWQSSET RSITNTTVCT KCGGAGHIAS DCKFQRPGDP
QSAQDKARMD KEYLSLMAEL GEAPVPASVG STSGPATTPL ASAPRPAAPA SNPPPPSLMS
TTQSRPPWMN SGHSENRPYH GMHGGGPGGP GGGPHSFPHP LPSLTGGHGG HPMQHNPNGP
PPPWMQPPPP PMNQGPHPPG HHGPPPMDQY LGSTPVGSGV YRLHQGKGMM PPPPMGMMPP
PPPPPSGQPP PPPSGPLPPW QQQQQQPPPP PPPSSSMASS TPLPWQQNTT TTTTSAGTGS
IPPWQQQQAA AAASPGIPQM QGNPTMVPLP PGVQPPLPPG APPPPPPPPP GSAGMMYAPP
PPPPPPMDPS NFVTMMGMGV AGMPPFGMPP APPPPPPQN
//