GenomeNet

Database: UniProt
Entry: Q8CH02
LinkDB: Q8CH02
Original site: Q8CH02 
ID   SUGP1_MOUSE             Reviewed;         643 AA.
AC   Q8CH02; Q0VAT9; Q3U0W3; Q8R094;
DT   15-MAR-2005, integrated into UniProtKB/Swiss-Prot.
DT   01-MAR-2003, sequence version 1.
DT   16-APR-2014, entry version 83.
DE   RecName: Full=SURP and G-patch domain-containing protein 1;
DE   AltName: Full=Splicing factor 4;
GN   Name=Sugp1; Synonyms=Sf4;
OS   Mus musculus (Mouse).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Mammalia; Eutheria; Euarchontoglires; Glires; Rodentia; Sciurognathi;
OC   Muroidea; Muridae; Murinae; Mus; Mus.
OX   NCBI_TaxID=10090;
RN   [1]
RP   NUCLEOTIDE SEQUENCE [MRNA].
RC   STRAIN=C57BL/6J;
RX   PubMed=12594045; DOI=10.1016/S0378-1119(02)01230-1;
RA   Sampson N.D., Hewitt J.E.;
RT   "SF4 and SFRS14, two related putative splicing factors on human
RT   chromosome 19p13.11.";
RL   Gene 305:91-100(2003).
RN   [2]
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
RC   STRAIN=NOD; TISSUE=Spleen;
RX   PubMed=16141072; DOI=10.1126/science.1112014;
RA   Carninci P., Kasukawa T., Katayama S., Gough J., Frith M.C., Maeda N.,
RA   Oyama R., Ravasi T., Lenhard B., Wells C., Kodzius R., Shimokawa K.,
RA   Bajic V.B., Brenner S.E., Batalov S., Forrest A.R., Zavolan M.,
RA   Davis M.J., Wilming L.G., Aidinis V., Allen J.E.,
RA   Ambesi-Impiombato A., Apweiler R., Aturaliya R.N., Bailey T.L.,
RA   Bansal M., Baxter L., Beisel K.W., Bersano T., Bono H., Chalk A.M.,
RA   Chiu K.P., Choudhary V., Christoffels A., Clutterbuck D.R.,
RA   Crowe M.L., Dalla E., Dalrymple B.P., de Bono B., Della Gatta G.,
RA   di Bernardo D., Down T., Engstrom P., Fagiolini M., Faulkner G.,
RA   Fletcher C.F., Fukushima T., Furuno M., Futaki S., Gariboldi M.,
RA   Georgii-Hemming P., Gingeras T.R., Gojobori T., Green R.E.,
RA   Gustincich S., Harbers M., Hayashi Y., Hensch T.K., Hirokawa N.,
RA   Hill D., Huminiecki L., Iacono M., Ikeo K., Iwama A., Ishikawa T.,
RA   Jakt M., Kanapin A., Katoh M., Kawasawa Y., Kelso J., Kitamura H.,
RA   Kitano H., Kollias G., Krishnan S.P., Kruger A., Kummerfeld S.K.,
RA   Kurochkin I.V., Lareau L.F., Lazarevic D., Lipovich L., Liu J.,
RA   Liuni S., McWilliam S., Madan Babu M., Madera M., Marchionni L.,
RA   Matsuda H., Matsuzawa S., Miki H., Mignone F., Miyake S., Morris K.,
RA   Mottagui-Tabar S., Mulder N., Nakano N., Nakauchi H., Ng P.,
RA   Nilsson R., Nishiguchi S., Nishikawa S., Nori F., Ohara O.,
RA   Okazaki Y., Orlando V., Pang K.C., Pavan W.J., Pavesi G., Pesole G.,
RA   Petrovsky N., Piazza S., Reed J., Reid J.F., Ring B.Z., Ringwald M.,
RA   Rost B., Ruan Y., Salzberg S.L., Sandelin A., Schneider C.,
RA   Schoenbach C., Sekiguchi K., Semple C.A., Seno S., Sessa L., Sheng Y.,
RA   Shibata Y., Shimada H., Shimada K., Silva D., Sinclair B.,
RA   Sperling S., Stupka E., Sugiura K., Sultana R., Takenaka Y., Taki K.,
RA   Tammoja K., Tan S.L., Tang S., Taylor M.S., Tegner J., Teichmann S.A.,
RA   Ueda H.R., van Nimwegen E., Verardo R., Wei C.L., Yagi K.,
RA   Yamanishi H., Zabarovsky E., Zhu S., Zimmer A., Hide W., Bult C.,
RA   Grimmond S.M., Teasdale R.D., Liu E.T., Brusic V., Quackenbush J.,
RA   Wahlestedt C., Mattick J.S., Hume D.A., Kai C., Sasaki D., Tomaru Y.,
RA   Fukuda S., Kanamori-Katayama M., Suzuki M., Aoki J., Arakawa T.,
RA   Iida J., Imamura K., Itoh M., Kato T., Kawaji H., Kawagashira N.,
RA   Kawashima T., Kojima M., Kondo S., Konno H., Nakano K., Ninomiya N.,
RA   Nishio T., Okada M., Plessy C., Shibata K., Shiraki T., Suzuki S.,
RA   Tagami M., Waki K., Watahiki A., Okamura-Oho Y., Suzuki H., Kawai J.,
RA   Hayashizaki Y.;
RT   "The transcriptional landscape of the mammalian genome.";
RL   Science 309:1559-1563(2005).
RN   [3]
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
RX   PubMed=15489334; DOI=10.1101/gr.2596504;
RG   The MGC Project Team;
RT   "The status, quality, and expansion of the NIH full-length cDNA
RT   project: the Mammalian Gene Collection (MGC).";
RL   Genome Res. 14:2121-2127(2004).
RN   [4]
RP   IDENTIFICATION BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
RC   TISSUE=Liver;
RX   PubMed=17242355; DOI=10.1073/pnas.0609836104;
RA   Villen J., Beausoleil S.A., Gerber S.A., Gygi S.P.;
RT   "Large-scale phosphorylation analysis of mouse liver.";
RL   Proc. Natl. Acad. Sci. U.S.A. 104:1488-1493(2007).
RN   [5]
RP   PHOSPHORYLATION [LARGE SCALE ANALYSIS] AT SER-483, AND IDENTIFICATION
RP   BY MASS SPECTROMETRY [LARGE SCALE ANALYSIS].
RX   PubMed=19144319; DOI=10.1016/j.immuni.2008.11.006;
RA   Trost M., English L., Lemieux S., Courcelles M., Desjardins M.,
RA   Thibault P.;
RT   "The phagosomal proteome in interferon-gamma-activated macrophages.";
RL   Immunity 30:143-154(2009).
RN   [6]
RP   STRUCTURE BY NMR OF 165-239.
RG   RIKEN structural genomics initiative (RSGI);
RT   "Solution structure of SURP domain in BAB30904.";
RL   Submitted (AUG-2004) to the PDB data bank.
RN   [7]
RP   STRUCTURE BY NMR OF 250-314.
RG   RIKEN structural genomics initiative (RSGI);
RT   "Solution structure of SURP domain in splicing factor 4.";
RL   Submitted (NOV-2005) to the PDB data bank.
CC   -!- FUNCTION: Plays a role in pre-mRNA splicing (By similarity).
CC   -!- SUBUNIT: Component of the spliceosome (By similarity).
CC   -!- SUBCELLULAR LOCATION: Nucleus (Probable).
CC   -!- SIMILARITY: Contains 1 G-patch domain.
CC   -!- SIMILARITY: Contains 2 SURP motif repeats.
CC   -----------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution-NoDerivs License
CC   -----------------------------------------------------------------------
DR   EMBL; AF521129; AAN77124.1; -; mRNA.
DR   EMBL; AK156508; BAE33738.1; -; mRNA.
DR   EMBL; BC120919; AAI20920.1; -; mRNA.
DR   EMBL; BC120920; AAI20921.1; -; mRNA.
DR   RefSeq; NP_081757.1; NM_027481.2.
DR   RefSeq; XP_006509801.1; XM_006509738.1.
DR   UniGene; Mm.17665; -.
DR   PDB; 1UG0; NMR; -; A=165-239.
DR   PDB; 1X4O; NMR; -; A=250-314.
DR   PDBsum; 1UG0; -.
DR   PDBsum; 1X4O; -.
DR   ProteinModelPortal; Q8CH02; -.
DR   SMR; Q8CH02; 165-241, 250-319.
DR   BioGrid; 214168; 1.
DR   IntAct; Q8CH02; 1.
DR   MINT; MINT-4115847; -.
DR   PhosphoSite; Q8CH02; -.
DR   PaxDb; Q8CH02; -.
DR   PRIDE; Q8CH02; -.
DR   Ensembl; ENSMUST00000011450; ENSMUSP00000011450; ENSMUSG00000011306.
DR   GeneID; 70616; -.
DR   KEGG; mmu:70616; -.
DR   UCSC; uc009lyn.1; mouse.
DR   CTD; 57794; -.
DR   MGI; MGI:1917866; Sugp1.
DR   eggNOG; NOG299701; -.
DR   GeneTree; ENSGT00410000025695; -.
DR   HOVERGEN; HBG079172; -.
DR   InParanoid; Q8CH02; -.
DR   KO; K13096; -.
DR   OMA; EKVAMEN; -.
DR   OrthoDB; EOG7VX8VN; -.
DR   PhylomeDB; Q8CH02; -.
DR   TreeFam; TF326321; -.
DR   EvolutionaryTrace; Q8CH02; -.
DR   NextBio; 331984; -.
DR   PRO; PR:Q8CH02; -.
DR   Bgee; Q8CH02; -.
DR   CleanEx; MM_SF4; -.
DR   Genevestigator; Q8CH02; -.
DR   GO; GO:0005681; C:spliceosomal complex; IEA:UniProtKB-KW.
DR   GO; GO:0003723; F:RNA binding; IEA:InterPro.
DR   GO; GO:0006397; P:mRNA processing; IEA:UniProtKB-KW.
DR   GO; GO:0008380; P:RNA splicing; IEA:UniProtKB-KW.
DR   InterPro; IPR000467; G_patch_dom.
DR   InterPro; IPR000061; Surp.
DR   Pfam; PF01585; G-patch; 1.
DR   Pfam; PF01805; Surp; 2.
DR   SMART; SM00443; G_patch; 1.
DR   SMART; SM00648; SWAP; 2.
DR   SUPFAM; SSF109905; SSF109905; 2.
DR   PROSITE; PS50174; G_PATCH; 1.
DR   PROSITE; PS50128; SURP; 2.
PE   1: Evidence at protein level;
KW   3D-structure; Complete proteome; mRNA processing; mRNA splicing;
KW   Nucleus; Phosphoprotein; Reference proteome; Repeat; Spliceosome.
FT   CHAIN         1    643       SURP and G-patch domain-containing
FT                                protein 1.
FT                                /FTId=PRO_0000097702.
FT   REPEAT      187    229       SURP motif 1.
FT   REPEAT      262    305       SURP motif 2.
FT   DOMAIN      560    607       G-patch.
FT   MOTIF       378    384       Nuclear localization signal (Potential).
FT   COMPBIAS    324    371       Pro-rich.
FT   COMPBIAS    439    478       Gln/Met-rich.
FT   MOD_RES     407    407       Phosphoserine (By similarity).
FT   MOD_RES     409    409       Phosphoserine (By similarity).
FT   MOD_RES     412    412       Phosphoserine (By similarity).
FT   MOD_RES     483    483       Phosphoserine.
FT   CONFLICT    326    326       P -> L (in Ref. 3; AAI20920/AAI20921).
FT   HELIX       167    172
FT   HELIX       179    181
FT   HELIX       182    198
FT   HELIX       200    209
FT   TURN        210    212
FT   STRAND      214    216
FT   HELIX       217    220
FT   HELIX       225    239
FT   HELIX       257    272
FT   HELIX       277    287
FT   TURN        291    295
FT   STRAND      297    300
FT   HELIX       301    313
SQ   SEQUENCE   643 AA;  72649 MW;  A1FCD38A26998E78 CRC64;
     MSLKMDNRDV AGKANRWFGM AQPKSGKMNM NILHQEELIA QKKREIEARM EQKARQSHVP
     SPQPPHPGEI ADAHNSCISN KFANDGSFLQ QFLKLQKAQT STDSAPRAPP SMPTPSSLKK
     PLVLSKRTGL GLSSPTGPVK NYSHAKQLPV AHRPSVFQSP DDDEEEDYEQ WLEIKVSPPE
     GAETRRVIEK LARFVAEGGP ELEKVAMEDY KDNPAFTFLH DKNSREFLYY RRKVAEIRKE
     AQKPQAATQK VSPPEDEEAK NLAEKLARFI ADGGPEVETI ALQNNRENQA FSFLYDPNSQ
     GYRYYRQKLD EFRKAKAGST GSFPAPAPNP SLRRKSAPEA LSGAVPPITA CPTPVAPAPA
     VNPTPSIPGK PTATAAVKRK RKSRWGPEED KVELPPAELA QRDIDASPSP LSVQDLKGLG
     YEKGKPVGLV GVTELSDAQK KQLKEQQEMQ QMYDMIMQHK RAMQDMQLLW EKALQQHQHG
     YDSDEEVDSE LGTWEHQLRR MEMDKTREWA EQLTQMGRGK HFIGDFLPPD ELEKFMETFK
     ALKEGREPDY SEYKEFKLTV ENIGYQMLMK MGWKEGEGLG TEGQGIKNPV NKGATTIDGA
     GFGIDRPAEL SKEDDEYEAF RKRMMLAYRF RPNPLNNPRR PYY
//
DBGET integrated database retrieval system