ID A0A2P6TTX4_CHLSO Unreviewed; 1670 AA.
AC A0A2P6TTX4;
DT 23-MAY-2018, integrated into UniProtKB/TrEMBL.
DT 23-MAY-2018, sequence version 1.
DT 22-FEB-2023, entry version 20.
DE SubName: Full=Putative splicing factor 3A subunit 1 isoform B {ECO:0000313|EMBL:PRW57504.1};
GN ORFNames=C2E21_3972 {ECO:0000313|EMBL:PRW57504.1};
OS Chlorella sorokiniana (Freshwater green alga).
OC Eukaryota; Viridiplantae; Chlorophyta; core chlorophytes; Trebouxiophyceae;
OC Chlorellales; Chlorellaceae; Chlorella clade; Chlorella.
OX NCBI_TaxID=3076 {ECO:0000313|EMBL:PRW57504.1, ECO:0000313|Proteomes:UP000239899};
RN [1] {ECO:0000313|EMBL:PRW57504.1, ECO:0000313|Proteomes:UP000239899}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=UTEX 1602 {ECO:0000313|Proteomes:UP000239899};
RX PubMed=29178410; DOI=10.1111/tpj.13789;
RA Arriola M.B., Velmurugan N., Zhang Y., Plunkett M.H., Hondzo H.,
RA Barney B.M.;
RT "Genome sequences of Chlorella sorokiniana UTEX 1602 and Micractinium
RT conductrix SAG 241.80: implications to maltose excretion by a green alga.";
RL Plant J. 93:566-586(2018).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PRW57504.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LHPG02000007; PRW57504.1; -; Genomic_DNA.
DR STRING; 3076.A0A2P6TTX4; -.
DR OrthoDB; 168687at2759; -.
DR Proteomes; UP000239899; Unassembled WGS sequence.
DR GO; GO:0005681; C:spliceosomal complex; IEA:UniProtKB-KW.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0003723; F:RNA binding; IEA:InterPro.
DR GO; GO:0045292; P:mRNA cis splicing, via spliceosome; IEA:InterPro.
DR CDD; cd01800; Ubl_SF3a120; 1.
DR Gene3D; 6.10.140.2220; -; 1.
DR Gene3D; 1.10.10.790; Surp module; 2.
DR InterPro; IPR045146; SF3A1.
DR InterPro; IPR022030; SF3A1_dom.
DR InterPro; IPR035563; SF3As1_ubi.
DR InterPro; IPR000061; Surp.
DR InterPro; IPR035967; SWAP/Surp_sf.
DR InterPro; IPR000626; Ubiquitin-like_dom.
DR InterPro; IPR029071; Ubiquitin-like_domsf.
DR InterPro; IPR002893; Znf_MYND.
DR PANTHER; PTHR15316; SPLICEOSOME ASSOCIATED PROTEIN 114/SWAP SPLICING FACTOR-RELATED; 1.
DR PANTHER; PTHR15316:SF1; SPLICING FACTOR 3A SUBUNIT 1; 1.
DR Pfam; PF12230; PRP21_like_P; 1.
DR Pfam; PF01805; Surp; 2.
DR Pfam; PF01753; zf-MYND; 1.
DR SMART; SM00648; SWAP; 2.
DR SUPFAM; SSF144232; HIT/MYND zinc finger-like; 1.
DR SUPFAM; SSF109905; Surp module (SWAP domain); 2.
DR SUPFAM; SSF54236; Ubiquitin-like; 1.
DR PROSITE; PS50128; SURP; 2.
DR PROSITE; PS50053; UBIQUITIN_2; 1.
DR PROSITE; PS50865; ZF_MYND_2; 1.
PE 4: Predicted;
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW mRNA processing {ECO:0000256|ARBA:ARBA00022664};
KW mRNA splicing {ECO:0000256|ARBA:ARBA00023187};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000239899};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Spliceosome {ECO:0000256|ARBA:ARBA00022728};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771, ECO:0000256|PROSITE-
KW ProRule:PRU00134}.
FT DOMAIN 40..82
FT /note="SURP motif"
FT /evidence="ECO:0000259|PROSITE:PS50128"
FT DOMAIN 159..201
FT /note="SURP motif"
FT /evidence="ECO:0000259|PROSITE:PS50128"
FT DOMAIN 625..680
FT /note="Ubiquitin-like"
FT /evidence="ECO:0000259|PROSITE:PS50053"
FT DOMAIN 1618..1657
FT /note="MYND-type"
FT /evidence="ECO:0000259|PROSITE:PS50865"
FT REGION 117..136
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 302..384
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 494..569
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 707..744
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 121..135
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 500..562
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1670 AA; 179095 MW; 997AE4E6E91AE177 CRC64;
MENGGAIVAV GVAKSALPDQ LVTQTKAIGV ILPPPDIRAI VDKTAQFVAR NGTDFEKRIL
ANEAQNQKFN FLKDGDPYNA YYRKRVAEFT AEEKGEAPAE GGAAAAAPAA AAATAAPAPA
PVVEKPKPAL PPTKPLEAPE EEQYTVHIPE GLTVLDLDVI KLTAQFVARN GKSFLTGLAS
REHANPQFNF LKPTHSLFGF FTALCDAYSR VLMPPKELRG RLDKDAGDRS VILERALKRL
EWERVQEKEA KEKAAREEAE REAMMSVDWH EFVVVETIEF YDDELDELPE PMTVKDVIRL
SKAAAQPEEP EPEAAAAAAA AAQQQDMEMD DEEAALVAQG AAAGRPSAAP GGEADMDMEE
SDDEAAAAKA AAPPVEEEEE EEGPMRVVKN YQRPAARQQQ QQYDPTKFVV SPITGELIPT
GEMAEHMRIS LIDPKYREQK EAMMAKIRDT TKASDDEISR NLVGLTRTLP HIFGTTAEEV
SALVGKQIQE QRAREAAAAR LPPPPMPRPP PPMPRPPMPM GVPTAPPSLG PRAVPTAPGS
GAPPLPPMPF GGAPPLPPHE APPLPDEEPD AKRARLDTFV LTLEEEFADA HPGQAKVYVQ
CPKVDGSEAL TGQLLAVEMP SLLATVSELK GRLSEVLSLP PGKQQLSREH VGIMKNELSL
AFYNVGQKVH LQLGIKERGG RKNLDNMADK QGTVEGDWQI VDKPQAPPEE HATAEPAPQV
PAVKAPAAPE SPPKPKVRQR REPQPPGLLE EIAAKGLFCL LCVVEAAVKM ASHATQMVRR
AGMAGARLAH KLVAATRPAL ESAQAAGSAC AAVLRRGDYR GAAKLAGQHA SEAAARLANA
VAPVLAAIRD TAHNTAVRWR TAAATCCCLP TAAQLREQQQ RLLARAQSSA AEAGKWTSAK
WSTVDPVGRL PAAWKRARSC RSSTYVLVVA GLGCASLAVG TLLLICSPLM QHFVAQFDAA
FSMGRLLLDG AGEARPNLNW RDKEQAIAFL LNVSFHQLAS ETLVTEPSHF GPLATALHRL
LMDVERASLT TSPGLRLMAV LGHIGNAEIR CRTGLPEEAQ LQLMSGCLQD LRGLLADQVW
LAGAADHAEH HQRAAGLASP MSADNLQLLT LLLYYRCTRN LNHVSIGRGM VVPQQLKMEG
AINDALLTLE DSGPMRPCLL RRRGLNLDEC VHHIQGLPLF GPPPQPGAAA AAYEAALVAC
ETSRASLVEA IAALNLAHCL MKGGGGPLWS LARVRSLLGR ALGALTRCKP WLPTPLWHMR
KGEHFVAHMD ASFALGRLLL DASGQARHNL SERDKEQAIV YLLNISFHQL QSESLFAALN
HNRELSGPLH GLLIEVERAS LTTSPGLHLM AVLGHIADAE IRSRSGPMPE QLLVPLLTST
LRDLQGMLAD RGRLASAAGE AEERQLAAGL ATPLSAADLQ RLALLLRFRC ARHMNQSTLR
LGMAAPELLQ LEGAINDELI ALEGDGPMRP CFLRRRGISI SERLQHSKGL PLFGPPLQPG
AAAAAYEAAL VACESSPASL VEAIAALNLA QCLLAGGGGP RWSAARVLAL LDRALAALSR
CKPWLLTAAR QNLLQMHKSL TASLESKAAS CVGDSLPAIS MAKLNETPVP RQLHLPECAA
CGKHALQLMK CGACKTVAYC SKACQTQHWR AGHKHECAAL KAARSSKPAA
//