ID SUGP1_MOUSE Reviewed; 643 AA.
AC Q8CH02; Q0VAT9; Q3U0W3; Q8R094;
DT 15-MAR-2005, integrated into UniProtKB/Swiss-Prot.
DT 01-MAR-2003, sequence version 1.
DT 03-APR-2013, entry version 75.
DE RecName: Full=SURP and G-patch domain-containing protein 1;
DE AltName: Full=Splicing factor 4;
GN Name=Sugp1; Synonyms=Sf4;
OS Mus musculus (Mouse).
OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi;
OC Muroidea; Muridae; Murinae; Mus; Mus.
OX NCBI_TaxID=10090;
RN [1]
RP NUCLEOTIDE SEQUENCE [MRNA].
RC STRAIN=C57BL/6J;
RX PubMed=12594045; DOI=10.1016/S0378-1119(02)01230-1;
RA Sampson N.D., Hewitt J.E.;
RT "SF4 and SFRS14, two related putative splicing factors on human
RT chromosome 19p13.11.";
RL Gene 305:91-100(2003).
RN [2]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
RC STRAIN=NOD; TISSUE=Spleen;
RX PubMed=16141072; DOI=10.1126/science.1112014;
RA Carninci P., Kasukawa T., Katayama S., Gough J., Frith M.C., Maeda N.,
RA Oyama R., Ravasi T., Lenhard B., Wells C., Kodzius R., Shimokawa K.,
RA Bajic V.B., Brenner S.E., Batalov S., Forrest A.R., Zavolan M.,
RA Davis M.J., Wilming L.G., Aidinis V., Allen J.E.,
RA Ambesi-Impiombato A., Apweiler R., Aturaliya R.N., Bailey T.L.,
RA Bansal M., Baxter L., Beisel K.W., Bersano T., Bono H., Chalk A.M.,
RA Chiu K.P., Choudhary V., Christoffels A., Clutterbuck D.R.,
RA Crowe M.L., Dalla E., Dalrymple B.P., de Bono B., Della Gatta G.,
RA di Bernardo D., Down T., Engstrom P., Fagiolini M., Faulkner G.,
RA Fletcher C.F., Fukushima T., Furuno M., Futaki S., Gariboldi M.,
RA Georgii-Hemming P., Gingeras T.R., Gojobori T., Green R.E.,
RA Gustincich S., Harbers M., Hayashi Y., Hensch T.K., Hirokawa N.,
RA Hill D., Huminiecki L., Iacono M., Ikeo K., Iwama A., Ishikawa T.,
RA Jakt M., Kanapin A., Katoh M., Kawasawa Y., Kelso J., Kitamura H.,
RA Kitano H., Kollias G., Krishnan S.P., Kruger A., Kummerfeld S.K.,
RA Kurochkin I.V., Lareau L.F., Lazarevic D., Lipovich L., Liu J.,
RA Liuni S., McWilliam S., Madan Babu M., Madera M., Marchionni L.,
RA Matsuda H., Matsuzawa S., Miki H., Mignone F., Miyake S., Morris K.,
RA Mottagui-Tabar S., Mulder N., Nakano N., Nakauchi H., Ng P.,
RA Nilsson R., Nishiguchi S., Nishikawa S., Nori F., Ohara O.,
RA Okazaki Y., Orlando V., Pang K.C., Pavan W.J., Pavesi G., Pesole G.,
RA Petrovsky N., Piazza S., Reed J., Reid J.F., Ring B.Z., Ringwald M.,
RA Rost B., Ruan Y., Salzberg S.L., Sandelin A., Schneider C.,
RA Schoenbach C., Sekiguchi K., Semple C.A., Seno S., Sessa L., Sheng Y.,
RA Shibata Y., Shimada H., Shimada K., Silva D., Sinclair B.,
RA Sperling S., Stupka E., Sugiura K., Sultana R., Takenaka Y., Taki K.,
RA Tammoja K., Tan S.L., Tang S., Taylor M.S., Tegner J., Teichmann S.A.,
RA Ueda H.R., van Nimwegen E., Verardo R., Wei C.L., Yagi K.,
RA Yamanishi H., Zabarovsky E., Zhu S., Zimmer A., Hide W., Bult C.,
RA Grimmond S.M., Teasdale R.D., Liu E.T., Brusic V., Quackenbush J.,
RA Wahlestedt C., Mattick J.S., Hume D.A., Kai C., Sasaki D., Tomaru Y.,
RA Fukuda S., Kanamori-Katayama M., Suzuki M., Aoki J., Arakawa T.,
RA Iida J., Imamura K., Itoh M., Kato T., Kawaji H., Kawagashira N.,
RA Kawashima T., Kojima M., Kondo S., Konno H., Nakano K., Ninomiya N.,
RA Nishio T., Okada M., Plessy C., Shibata K., Shiraki T., Suzuki S.,
RA Tagami M., Waki K., Watahiki A., Okamura-Oho Y., Suzuki H., Kawai J.,
RA Hayashizaki Y.;
RT "The transcriptional landscape of the mammalian genome.";
RL Science 309:1559-1563(2005).
RN [3]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
RX PubMed=15489334; DOI=10.1101/gr.2596504;
RG The MGC Project Team;
RT "The status, quality, and expansion of the NIH full-length cDNA
RT project: the Mammalian Gene Collection (MGC).";
RL Genome Res. 14:2121-2127(2004).
RN [4]
RP PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-483, AND MASS
RP SPECTROMETRY.
RC TISSUE=Liver;
RX PubMed=17242355; DOI=10.1073/pnas.0609836104;
RA Villen J., Beausoleil S.A., Gerber S.A., Gygi S.P.;
RT "Large-scale phosphorylation analysis of mouse liver.";
RL Proc. Natl. Acad. Sci. U.S.A. 104:1488-1493(2007).
RN [5]
RP STRUCTURE BY NMR OF 165-239.
RG RIKEN structural genomics initiative (RSGI);
RT "Solution structure of SURP domain in BAB30904.";
RL Submitted (AUG-2004) to the PDB data bank.
RN [6]
RP STRUCTURE BY NMR OF 250-314.
RG RIKEN structural genomics initiative (RSGI);
RT "Solution structure of SURP domain in splicing factor 4.";
RL Submitted (NOV-2005) to the PDB data bank.
CC -!- FUNCTION: Plays a role in pre-mRNA splicing (By similarity).
CC -!- SUBUNIT: Component of the spliceosome (By similarity).
CC -!- SUBCELLULAR LOCATION: Nucleus (Probable).
CC -!- SIMILARITY: Contains 1 G-patch domain.
CC -!- SIMILARITY: Contains 2 SURP motif repeats.
CC -----------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution-NoDerivs License
CC -----------------------------------------------------------------------
DR EMBL; AF521129; AAN77124.1; -; mRNA.
DR EMBL; AK156508; BAE33738.1; -; mRNA.
DR EMBL; BC120919; AAI20920.1; -; mRNA.
DR EMBL; BC120920; AAI20921.1; -; mRNA.
DR IPI; IPI00454015; -.
DR RefSeq; NP_081757.1; NM_027481.2.
DR UniGene; Mm.17665; -.
DR PDB; 1UG0; NMR; -; A=165-239.
DR PDB; 1X4O; NMR; -; A=250-314.
DR PDBsum; 1UG0; -.
DR PDBsum; 1X4O; -.
DR ProteinModelPortal; Q8CH02; -.
DR SMR; Q8CH02; 165-241, 250-319.
DR PhosphoSite; Q8CH02; -.
DR PaxDb; Q8CH02; -.
DR PRIDE; Q8CH02; -.
DR Ensembl; ENSMUST00000011450; ENSMUSP00000011450; ENSMUSG00000011306.
DR GeneID; 70616; -.
DR KEGG; mmu:70616; -.
DR UCSC; uc009lyn.1; mouse.
DR CTD; 57794; -.
DR MGI; MGI:1917866; Sugp1.
DR eggNOG; NOG299701; -.
DR GeneTree; ENSGT00410000025695; -.
DR HOVERGEN; HBG079172; -.
DR InParanoid; Q8CH02; -.
DR KO; K13096; -.
DR OMA; EKVAMEN; -.
DR OrthoDB; EOG4VX24Z; -.
DR EvolutionaryTrace; Q8CH02; -.
DR NextBio; 331984; -.
DR Bgee; Q8CH02; -.
DR CleanEx; MM_SF4; -.
DR Genevestigator; Q8CH02; -.
DR GermOnline; ENSMUSG00000011306; Mus musculus.
DR GO; GO:0005681; C:spliceosomal complex; IEA:UniProtKB-KW.
DR GO; GO:0003723; F:RNA binding; IEA:InterPro.
DR GO; GO:0006397; P:mRNA processing; IEA:UniProtKB-KW.
DR GO; GO:0008380; P:RNA splicing; IEA:UniProtKB-KW.
DR InterPro; IPR000467; G_patch_dom.
DR InterPro; IPR000061; Surp.
DR Pfam; PF01585; G-patch; 1.
DR Pfam; PF01805; Surp; 2.
DR SMART; SM00443; G_patch; 1.
DR SMART; SM00648; SWAP; 2.
DR SUPFAM; SSF109905; SSF109905; 2.
DR PROSITE; PS50174; G_PATCH; 1.
DR PROSITE; PS50128; SURP; 2.
PE 1: Evidence at protein level;
KW 3D-structure; Complete proteome; mRNA processing; mRNA splicing;
KW Nucleus; Phosphoprotein; Reference proteome; Repeat; Spliceosome.
FT CHAIN 1 643 SURP and G-patch domain-containing
FT protein 1.
FT /FTId=PRO_0000097702.
FT REPEAT 187 229 SURP motif 1.
FT REPEAT 262 305 SURP motif 2.
FT DOMAIN 560 607 G-patch.
FT MOTIF 378 384 Nuclear localization signal (Potential).
FT COMPBIAS 324 371 Pro-rich.
FT COMPBIAS 439 478 Gln/Met-rich.
FT MOD_RES 407 407 Phosphoserine (By similarity).
FT MOD_RES 409 409 Phosphoserine (By similarity).
FT MOD_RES 412 412 Phosphoserine (By similarity).
FT MOD_RES 483 483 Phosphoserine.
FT CONFLICT 326 326 P -> L (in Ref. 3; AAI20920/AAI20921).
FT HELIX 167 172
FT HELIX 179 181
FT HELIX 182 198
FT HELIX 200 209
FT TURN 210 212
FT STRAND 214 216
FT HELIX 217 220
FT HELIX 225 239
FT HELIX 257 272
FT HELIX 277 287
FT TURN 291 295
FT STRAND 297 300
FT HELIX 301 313
SQ SEQUENCE 643 AA; 72649 MW; A1FCD38A26998E78 CRC64;
MSLKMDNRDV AGKANRWFGM AQPKSGKMNM NILHQEELIA QKKREIEARM EQKARQSHVP
SPQPPHPGEI ADAHNSCISN KFANDGSFLQ QFLKLQKAQT STDSAPRAPP SMPTPSSLKK
PLVLSKRTGL GLSSPTGPVK NYSHAKQLPV AHRPSVFQSP DDDEEEDYEQ WLEIKVSPPE
GAETRRVIEK LARFVAEGGP ELEKVAMEDY KDNPAFTFLH DKNSREFLYY RRKVAEIRKE
AQKPQAATQK VSPPEDEEAK NLAEKLARFI ADGGPEVETI ALQNNRENQA FSFLYDPNSQ
GYRYYRQKLD EFRKAKAGST GSFPAPAPNP SLRRKSAPEA LSGAVPPITA CPTPVAPAPA
VNPTPSIPGK PTATAAVKRK RKSRWGPEED KVELPPAELA QRDIDASPSP LSVQDLKGLG
YEKGKPVGLV GVTELSDAQK KQLKEQQEMQ QMYDMIMQHK RAMQDMQLLW EKALQQHQHG
YDSDEEVDSE LGTWEHQLRR MEMDKTREWA EQLTQMGRGK HFIGDFLPPD ELEKFMETFK
ALKEGREPDY SEYKEFKLTV ENIGYQMLMK MGWKEGEGLG TEGQGIKNPV NKGATTIDGA
GFGIDRPAEL SKEDDEYEAF RKRMMLAYRF RPNPLNNPRR PYY
//