Database: UniProt
Entry: A0A2P6TTX4_CHLSO
ID   A0A2P6TTX4_CHLSO        Unreviewed;      1670 AA.
AC   A0A2P6TTX4;
DT   23-MAY-2018, integrated into UniProtKB/TrEMBL.
DT   23-MAY-2018, sequence version 1.
DT   22-FEB-2023, entry version 20.
DE   SubName: Full=Putative splicing factor 3A subunit 1 isoform B {ECO:0000313|EMBL:PRW57504.1};
GN   ORFNames=C2E21_3972 {ECO:0000313|EMBL:PRW57504.1};
OS   Chlorella sorokiniana (Freshwater green alga).
OC   Eukaryota; Viridiplantae; Chlorophyta; core chlorophytes; Trebouxiophyceae;
OC   Chlorellales; Chlorellaceae; Chlorella clade; Chlorella.
OX   NCBI_TaxID=3076 {ECO:0000313|EMBL:PRW57504.1, ECO:0000313|Proteomes:UP000239899};
RN   [1] {ECO:0000313|EMBL:PRW57504.1, ECO:0000313|Proteomes:UP000239899}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=UTEX 1602 {ECO:0000313|Proteomes:UP000239899};
RX   PubMed=29178410; DOI=10.1111/tpj.13789;
RA   Arriola M.B., Velmurugan N., Zhang Y., Plunkett M.H., Hondzo H.,
RA   Barney B.M.;
RT   "Genome sequences of Chlorella sorokiniana UTEX 1602 and Micractinium
RT   conductrix SAG 241.80: implications to maltose excretion by a green alga.";
RL   Plant J. 93:566-586(2018).
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC   -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC       whole genome shotgun (WGS) entry which is preliminary data.
CC       {ECO:0000313|EMBL:PRW57504.1}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; LHPG02000007; PRW57504.1; -; Genomic_DNA.
DR   STRING; 3076.A0A2P6TTX4; -.
DR   OrthoDB; 168687at2759; -.
DR   Proteomes; UP000239899; Unassembled WGS sequence.
DR   GO; GO:0005681; C:spliceosomal complex; IEA:UniProtKB-KW.
DR   GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR   GO; GO:0003723; F:RNA binding; IEA:InterPro.
DR   GO; GO:0045292; P:mRNA cis splicing, via spliceosome; IEA:InterPro.
DR   CDD; cd01800; Ubl_SF3a120; 1.
DR   Gene3D; 6.10.140.2220; -; 1.
DR   Gene3D; 1.10.10.790; Surp module; 2.
DR   InterPro; IPR045146; SF3A1.
DR   InterPro; IPR022030; SF3A1_dom.
DR   InterPro; IPR035563; SF3As1_ubi.
DR   InterPro; IPR000061; Surp.
DR   InterPro; IPR035967; SWAP/Surp_sf.
DR   InterPro; IPR000626; Ubiquitin-like_dom.
DR   InterPro; IPR029071; Ubiquitin-like_domsf.
DR   InterPro; IPR002893; Znf_MYND.
DR   PANTHER; PTHR15316; SPLICEOSOME ASSOCIATED PROTEIN 114/SWAP SPLICING FACTOR-RELATED; 1.
DR   PANTHER; PTHR15316:SF1; SPLICING FACTOR 3A SUBUNIT 1; 1.
DR   Pfam; PF12230; PRP21_like_P; 1.
DR   Pfam; PF01805; Surp; 2.
DR   Pfam; PF01753; zf-MYND; 1.
DR   SMART; SM00648; SWAP; 2.
DR   SUPFAM; SSF144232; HIT/MYND zinc finger-like; 1.
DR   SUPFAM; SSF109905; Surp module (SWAP domain); 2.
DR   SUPFAM; SSF54236; Ubiquitin-like; 1.
DR   PROSITE; PS50128; SURP; 2.
DR   PROSITE; PS50053; UBIQUITIN_2; 1.
DR   PROSITE; PS50865; ZF_MYND_2; 1.
PE   4: Predicted;
KW   Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW   mRNA processing {ECO:0000256|ARBA:ARBA00022664};
KW   mRNA splicing {ECO:0000256|ARBA:ARBA00023187};
KW   Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW   Reference proteome {ECO:0000313|Proteomes:UP000239899};
KW   Repeat {ECO:0000256|ARBA:ARBA00022737};
KW   Spliceosome {ECO:0000256|ARBA:ARBA00022728};
KW   Zinc {ECO:0000256|ARBA:ARBA00022833};
KW   Zinc-finger {ECO:0000256|ARBA:ARBA00022771, ECO:0000256|PROSITE-
KW   ProRule:PRU00134}.
FT   DOMAIN          40..82
FT                   /note="SURP motif"
FT                   /evidence="ECO:0000259|PROSITE:PS50128"
FT   DOMAIN          159..201
FT                   /note="SURP motif"
FT                   /evidence="ECO:0000259|PROSITE:PS50128"
FT   DOMAIN          625..680
FT                   /note="Ubiquitin-like"
FT                   /evidence="ECO:0000259|PROSITE:PS50053"
FT   DOMAIN          1618..1657
FT                   /note="MYND-type"
FT                   /evidence="ECO:0000259|PROSITE:PS50865"
FT   REGION          117..136
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          302..384
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          494..569
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          707..744
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        121..135
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        500..562
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1670 AA;  179095 MW;  997AE4E6E91AE177 CRC64;
     MENGGAIVAV GVAKSALPDQ LVTQTKAIGV ILPPPDIRAI VDKTAQFVAR NGTDFEKRIL
     ANEAQNQKFN FLKDGDPYNA YYRKRVAEFT AEEKGEAPAE GGAAAAAPAA AAATAAPAPA
     PVVEKPKPAL PPTKPLEAPE EEQYTVHIPE GLTVLDLDVI KLTAQFVARN GKSFLTGLAS
     REHANPQFNF LKPTHSLFGF FTALCDAYSR VLMPPKELRG RLDKDAGDRS VILERALKRL
     EWERVQEKEA KEKAAREEAE REAMMSVDWH EFVVVETIEF YDDELDELPE PMTVKDVIRL
     SKAAAQPEEP EPEAAAAAAA AAQQQDMEMD DEEAALVAQG AAAGRPSAAP GGEADMDMEE
     SDDEAAAAKA AAPPVEEEEE EEGPMRVVKN YQRPAARQQQ QQYDPTKFVV SPITGELIPT
     GEMAEHMRIS LIDPKYREQK EAMMAKIRDT TKASDDEISR NLVGLTRTLP HIFGTTAEEV
     SALVGKQIQE QRAREAAAAR LPPPPMPRPP PPMPRPPMPM GVPTAPPSLG PRAVPTAPGS
     GAPPLPPMPF GGAPPLPPHE APPLPDEEPD AKRARLDTFV LTLEEEFADA HPGQAKVYVQ
     CPKVDGSEAL TGQLLAVEMP SLLATVSELK GRLSEVLSLP PGKQQLSREH VGIMKNELSL
     AFYNVGQKVH LQLGIKERGG RKNLDNMADK QGTVEGDWQI VDKPQAPPEE HATAEPAPQV
     PAVKAPAAPE SPPKPKVRQR REPQPPGLLE EIAAKGLFCL LCVVEAAVKM ASHATQMVRR
     AGMAGARLAH KLVAATRPAL ESAQAAGSAC AAVLRRGDYR GAAKLAGQHA SEAAARLANA
     VAPVLAAIRD TAHNTAVRWR TAAATCCCLP TAAQLREQQQ RLLARAQSSA AEAGKWTSAK
     WSTVDPVGRL PAAWKRARSC RSSTYVLVVA GLGCASLAVG TLLLICSPLM QHFVAQFDAA
     FSMGRLLLDG AGEARPNLNW RDKEQAIAFL LNVSFHQLAS ETLVTEPSHF GPLATALHRL
     LMDVERASLT TSPGLRLMAV LGHIGNAEIR CRTGLPEEAQ LQLMSGCLQD LRGLLADQVW
     LAGAADHAEH HQRAAGLASP MSADNLQLLT LLLYYRCTRN LNHVSIGRGM VVPQQLKMEG
     AINDALLTLE DSGPMRPCLL RRRGLNLDEC VHHIQGLPLF GPPPQPGAAA AAYEAALVAC
     ETSRASLVEA IAALNLAHCL MKGGGGPLWS LARVRSLLGR ALGALTRCKP WLPTPLWHMR
     KGEHFVAHMD ASFALGRLLL DASGQARHNL SERDKEQAIV YLLNISFHQL QSESLFAALN
     HNRELSGPLH GLLIEVERAS LTTSPGLHLM AVLGHIADAE IRSRSGPMPE QLLVPLLTST
     LRDLQGMLAD RGRLASAAGE AEERQLAAGL ATPLSAADLQ RLALLLRFRC ARHMNQSTLR
     LGMAAPELLQ LEGAINDELI ALEGDGPMRP CFLRRRGISI SERLQHSKGL PLFGPPLQPG
     AAAAAYEAAL VACESSPASL VEAIAALNLA QCLLAGGGGP RWSAARVLAL LDRALAALSR
     CKPWLLTAAR QNLLQMHKSL TASLESKAAS CVGDSLPAIS MAKLNETPVP RQLHLPECAA
     CGKHALQLMK CGACKTVAYC SKACQTQHWR AGHKHECAAL KAARSSKPAA
//