ID SF3A1_ARATH Reviewed; 785 AA.
AC Q8RXF1; Q0WWB5; Q9MA20;
DT 30-AUG-2005, integrated into UniProtKB/Swiss-Prot.
DT 12-JUN-2007, sequence version 2.
DT 01-MAY-2013, entry version 83.
DE RecName: Full=Probable splicing factor 3A subunit 1;
GN OrderedLocusNames=At1g14650; ORFNames=T5E21.13;
OS Arabidopsis thaliana (Mouse-ear cress).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliophyta; eudicotyledons; core eudicotyledons;
OC rosids; malvids; Brassicales; Brassicaceae; Camelineae; Arabidopsis.
OX NCBI_TaxID=3702;
RN [1]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Columbia;
RX PubMed=11130712; DOI=10.1038/35048500;
RA Theologis A., Ecker J.R., Palm C.J., Federspiel N.A., Kaul S.,
RA White O., Alonso J., Altafi H., Araujo R., Bowman C.L., Brooks S.Y.,
RA Buehler E., Chan A., Chao Q., Chen H., Cheuk R.F., Chin C.W.,
RA Chung M.K., Conn L., Conway A.B., Conway A.R., Creasy T.H., Dewar K.,
RA Dunn P., Etgu P., Feldblyum T.V., Feng J.-D., Fong B., Fujii C.Y.,
RA Gill J.E., Goldsmith A.D., Haas B., Hansen N.F., Hughes B., Huizar L.,
RA Hunter J.L., Jenkins J., Johnson-Hopson C., Khan S., Khaykin E.,
RA Kim C.J., Koo H.L., Kremenetskaia I., Kurtz D.B., Kwan A., Lam B.,
RA Langin-Hooper S., Lee A., Lee J.M., Lenz C.A., Li J.H., Li Y.-P.,
RA Lin X., Liu S.X., Liu Z.A., Luros J.S., Maiti R., Marziali A.,
RA Militscher J., Miranda M., Nguyen M., Nierman W.C., Osborne B.I.,
RA Pai G., Peterson J., Pham P.K., Rizzo M., Rooney T., Rowley D.,
RA Sakano H., Salzberg S.L., Schwartz J.R., Shinn P., Southwick A.M.,
RA Sun H., Tallon L.J., Tambunga G., Toriumi M.J., Town C.D.,
RA Utterback T., Van Aken S., Vaysberg M., Vysotskaia V.S., Walker M.,
RA Wu D., Yu G., Fraser C.M., Venter J.C., Davis R.W.;
RT "Sequence and analysis of chromosome 1 of the plant Arabidopsis
RT thaliana.";
RL Nature 408:816-820(2000).
RN [2]
RP GENOME REANNOTATION.
RC STRAIN=cv. Columbia;
RG The Arabidopsis Information Resource (TAIR);
RL Submitted (APR-2011) to the EMBL/GenBank/DDBJ databases.
RN [3]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
RC STRAIN=cv. Columbia;
RX PubMed=14593172; DOI=10.1126/science.1088305;
RA Yamada K., Lim J., Dale J.M., Chen H., Shinn P., Palm C.J.,
RA Southwick A.M., Wu H.C., Kim C.J., Nguyen M., Pham P.K., Cheuk R.F.,
RA Karlin-Newmann G., Liu S.X., Lam B., Sakano H., Wu T., Yu G.,
RA Miranda M., Quach H.L., Tripp M., Chang C.H., Lee J.M., Toriumi M.J.,
RA Chan M.M., Tang C.C., Onodera C.S., Deng J.M., Akiyama K., Ansari Y.,
RA Arakawa T., Banh J., Banno F., Bowser L., Brooks S.Y., Carninci P.,
RA Chao Q., Choy N., Enju A., Goldsmith A.D., Gurjal M., Hansen N.F.,
RA Hayashizaki Y., Johnson-Hopson C., Hsuan V.W., Iida K., Karnes M.,
RA Khan S., Koesema E., Ishida J., Jiang P.X., Jones T., Kawai J.,
RA Kamiya A., Meyers C., Nakajima M., Narusaka M., Seki M., Sakurai T.,
RA Satou M., Tamse R., Vaysberg M., Wallender E.K., Wong C., Yamamura Y.,
RA Yuan S., Shinozaki K., Davis R.W., Theologis A., Ecker J.R.;
RT "Empirical analysis of transcriptional activity in the Arabidopsis
RT genome.";
RL Science 302:842-846(2003).
RN [4]
RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA].
RC STRAIN=cv. Columbia;
RA Totoki Y., Seki M., Ishida J., Nakajima M., Enju A., Kamiya A.,
RA Narusaka M., Shin-i T., Nakagawa M., Sakamoto N., Oishi K., Kohara Y.,
RA Kobayashi M., Toyoda A., Sakaki Y., Sakurai T., Iida K., Akiyama K.,
RA Satou M., Toyoda T., Konagaya A., Carninci P., Kawai J.,
RA Hayashizaki Y., Shinozaki K.;
RT "Large-scale analysis of RIKEN Arabidopsis full-length (RAFL) cDNAs.";
RL Submitted (JUL-2006) to the EMBL/GenBank/DDBJ databases.
RN [5]
RP STRUCTURE BY NMR OF 683-780.
RG RIKEN structural genomics initiative (RSGI);
RT "Solution structure of ubiquitin-like domain in splicing factor
RT AAL91182.";
RL Submitted (NOV-2004) to the PDB data bank.
CC -!- SUBUNIT: Component of splicing factor SF3A which is composed of
CC three subunits (By similarity).
CC -!- SUBCELLULAR LOCATION: Nucleus (By similarity).
CC -!- SIMILARITY: Contains 2 SURP motif repeats.
CC -!- SIMILARITY: Contains 1 ubiquitin-like domain.
CC -!- SEQUENCE CAUTION:
CC Sequence=AAF63169.1; Type=Erroneous gene model prediction;
CC -----------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution-NoDerivs License
CC -----------------------------------------------------------------------
DR EMBL; AC010657; AAF63169.1; ALT_SEQ; Genomic_DNA.
DR EMBL; CP002684; AEE29196.1; -; Genomic_DNA.
DR EMBL; CP002684; AEE29197.1; -; Genomic_DNA.
DR EMBL; AY081293; AAL91182.1; -; mRNA.
DR EMBL; AK226440; BAE98583.1; -; mRNA.
DR IPI; IPI00538761; -.
DR PIR; G86280; G86280.
DR RefSeq; NP_001117289.1; NM_001123817.1.
DR RefSeq; NP_172917.1; NM_101332.3.
DR UniGene; At.22146; -.
DR PDB; 1WE6; NMR; -; A=683-780.
DR PDBsum; 1WE6; -.
DR ProteinModelPortal; Q8RXF1; -.
DR SMR; Q8RXF1; 67-121, 173-314, 683-782.
DR PaxDb; Q8RXF1; -.
DR PRIDE; Q8RXF1; -.
DR EnsemblPlants; AT1G14650.1; AT1G14650.1; AT1G14650.
DR EnsemblPlants; AT1G14650.2; AT1G14650.2; AT1G14650.
DR GeneID; 838027; -.
DR KEGG; ath:AT1G14650; -.
DR TAIR; At1g14650; -.
DR eggNOG; NOG300902; -.
DR HOGENOM; HOG000238941; -.
DR InParanoid; Q8RXF1; -.
DR KO; K12825; -.
DR OMA; ILHAPRI; -.
DR PhylomeDB; Q8RXF1; -.
DR ProtClustDB; CLSN2682969; -.
DR EvolutionaryTrace; Q8RXF1; -.
DR ArrayExpress; Q8RXF1; -.
DR Genevestigator; Q8RXF1; -.
DR GermOnline; AT1G14650; Arabidopsis thaliana.
DR GO; GO:0005684; C:U2-type spliceosomal complex; ISS:UniProtKB.
DR GO; GO:0003723; F:RNA binding; ISS:UniProtKB.
DR GO; GO:0000398; P:mRNA splicing, via spliceosome; ISS:UniProtKB.
DR InterPro; IPR022030; PRP21-like.
DR InterPro; IPR000061; Surp.
DR InterPro; IPR000626; Ubiquitin.
DR InterPro; IPR019955; Ubiquitin_supergroup.
DR Pfam; PF12230; PRP21_like_P; 1.
DR Pfam; PF01805; Surp; 2.
DR Pfam; PF00240; ubiquitin; 1.
DR SMART; SM00648; SWAP; 2.
DR SMART; SM00213; UBQ; 1.
DR SUPFAM; SSF109905; SSF109905; 2.
DR PROSITE; PS50128; SURP; 2.
DR PROSITE; PS00299; UBIQUITIN_1; FALSE_NEG.
DR PROSITE; PS50053; UBIQUITIN_2; 1.
PE 1: Evidence at protein level;
KW 3D-structure; Complete proteome; mRNA processing; mRNA splicing;
KW Nucleus; Reference proteome; Repeat; Spliceosome.
FT CHAIN 1 785 Probable splicing factor 3A subunit 1.
FT /FTId=PRO_0000114919.
FT REPEAT 71 113 SURP motif 1.
FT REPEAT 193 235 SURP motif 2.
FT DOMAIN 707 782 Ubiquitin-like.
FT COMPBIAS 287 386 Glu-rich.
FT COMPBIAS 536 677 Pro-rich.
FT COMPBIAS 588 660 Met-rich.
FT CONFLICT 289 289 D -> G (in Ref. 3; AAL91182).
FT HELIX 686 688
FT HELIX 692 698
FT STRAND 703 707
FT STRAND 713 715
FT STRAND 718 723
FT STRAND 725 728
FT HELIX 729 739
FT TURN 744 746
FT STRAND 747 750
FT STRAND 752 755
FT TURN 762 766
FT STRAND 768 770
FT STRAND 772 776
SQ SEQUENCE 785 AA; 87594 MW; 917E388B77472F8D CRC64;
MFSSMQILPL EAPPTDGKLG PLPPSQLTDQ EVEERELQAE QNNSNLAPPA AVATHTRTIG
IIHPPPDIRT IVEKTAQFVS KNGLEFEKRI IVSNEKNAKF NFLKSSDPYH AFYQHKLTEY
RAQNKDGAQG TDDSDGTTDP QLDTGAADES EAGDTQPDLQ AQFRIPSKPL EAPEPEKYTV
RLPEGITGEE LDIIKLTAQF VARNGKSFLT GLSNRENNNP QFHFMKPTHS MFTFFTSLVD
AYSEVLMPPK DLKEKLRKSA ADLTTVLERC LHRLEWDRSQ EQQKKKEEDE KELERVQMAM
IDWHDFVVVE SIDFADEEDE ELPPPMTLDE VIRRSKASAM EEDEIVEPGK EVEMEMDEEE
VKLVAEGMRA ANLEENVKIE NVHDEEAPMR IVKNWKRPED RIPTERDPTK VVISPITGEL
IPINEMSEHM RISLIDPKFK EQKDRMFAKI RETTLAQDDE IAKNIVGLAR LRPDIFGTTE
EEVSNAVKAE IEKKKDEQPK QVIWDGHTGS IGRTANQALS QNANGEEQGD GVYGDPNSFP
GPAALPPPRP GVPIVRPLPP PPNLALNLPR PPPSAQYPGA PRPLGVPMMQ PMHQQHQLTM
PGPPGHPQMM MNRPPQMQPG MHVPPPPGSQ FAHHMQIPRP YGQLPPSAMG MMQPPPMPGM
APPPPPEEAP PPLPEEPEAK RQKFDESALV PEDQFLAQHP GPATIRVSKP NENDGQFMEI
TVQSLSENVG SLKEKIAGEI QIPANKQKLS GKAGFLKDNM SLAHYNVGAG EILTLSLRER
GGRKR
//