ID A0A0G4H1Y8_VITBC Unreviewed; 917 AA.
AC A0A0G4H1Y8;
DT 16-SEP-2015, integrated into UniProtKB/TrEMBL.
DT 16-SEP-2015, sequence version 1.
DT 13-SEP-2023, entry version 33.
DE RecName: Full=SURP motif domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=Vbra_19254 {ECO:0000313|EMBL:CEM37417.1};
OS Vitrella brassicaformis (strain CCMP3155).
OC Eukaryota; Sar; Alveolata; Colpodellida; Vitrellaceae; Vitrella.
OX NCBI_TaxID=1169540 {ECO:0000313|EMBL:CEM37417.1, ECO:0000313|Proteomes:UP000041254};
RN [1] {ECO:0000313|EMBL:CEM37417.1, ECO:0000313|Proteomes:UP000041254}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Zhu J., Qi W., Song R.;
RL Submitted (NOV-2014) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CDMY01000938; CEM37417.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A0G4H1Y8; -.
DR STRING; 1169540.A0A0G4H1Y8; -.
DR VEuPathDB; CryptoDB:Vbra_19254; -.
DR InParanoid; A0A0G4H1Y8; -.
DR OMA; GQFAERR; -.
DR OrthoDB; 168687at2759; -.
DR PhylomeDB; A0A0G4H1Y8; -.
DR Proteomes; UP000041254; Unassembled WGS sequence.
DR GO; GO:0005681; C:spliceosomal complex; IEA:UniProtKB-KW.
DR GO; GO:0003723; F:RNA binding; IEA:InterPro.
DR GO; GO:0045292; P:mRNA cis splicing, via spliceosome; IEA:InterPro.
DR CDD; cd01800; Ubl_SF3a120; 1.
DR Gene3D; 1.10.10.790; Surp module; 2.
DR InterPro; IPR045146; SF3A1.
DR InterPro; IPR022030; SF3A1_dom.
DR InterPro; IPR035563; SF3As1_ubi.
DR InterPro; IPR000061; Surp.
DR InterPro; IPR035967; SWAP/Surp_sf.
DR InterPro; IPR000626; Ubiquitin-like_dom.
DR InterPro; IPR029071; Ubiquitin-like_domsf.
DR PANTHER; PTHR15316; SPLICEOSOME ASSOCIATED PROTEIN 114/SWAP SPLICING FACTOR-RELATED; 1.
DR PANTHER; PTHR15316:SF1; SPLICING FACTOR 3A SUBUNIT 1; 1.
DR Pfam; PF12230; PRP21_like_P; 1.
DR Pfam; PF01805; Surp; 2.
DR SMART; SM00648; SWAP; 2.
DR SUPFAM; SSF109905; Surp module (SWAP domain); 2.
DR SUPFAM; SSF54236; Ubiquitin-like; 1.
DR PROSITE; PS50128; SURP; 2.
DR PROSITE; PS50053; UBIQUITIN_2; 1.
PE 4: Predicted;
KW mRNA processing {ECO:0000256|ARBA:ARBA00022664};
KW mRNA splicing {ECO:0000256|ARBA:ARBA00023187};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000041254};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Spliceosome {ECO:0000256|ARBA:ARBA00022728}.
FT DOMAIN 73..121
FT /note="SURP motif"
FT /evidence="ECO:0000259|PROSITE:PS50128"
FT DOMAIN 196..238
FT /note="SURP motif"
FT /evidence="ECO:0000259|PROSITE:PS50128"
FT DOMAIN 827..915
FT /note="Ubiquitin-like"
FT /evidence="ECO:0000259|PROSITE:PS50053"
FT REGION 1..61
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 139..160
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 286..306
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 422..463
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 578..610
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 785..812
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 27..61
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 142..160
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 917 AA; 98549 MW; 7F750D5A35326505 CRC64;
MATVVPPTPP PEAEMRDLSL DNGVPPNGVP PGQPPAPAAP PPVIPGAAGP PPVRPPASAP
LGVIYPPQKE REIIDKTAAF VAKNGVEFEQ HLFRQHQDAA ASGQQQKFAF LFQNNPYRAY
YDHKVKELLA GNTADLRPAK PQAILDQERK EEEKKRKKEQ LKMLTMGEDR KRELKPPEPD
EYTVQHPYIA PVDVDIIKMT AQFVARNGKT FLETLCKKEV DSAQFEFLKP THYLFNYFTS
LVDAYTRCLV PKKPSLEKLK RDATDGGPDT APMKVLERCT LRHEWESQEE KKRKEKEQRD
EDEKNQMQSI EWHDFIVVET IQFTEDDDRI QLAPPVTADQ FEGHQPVGPV MAVDLTNGQL
ERDMARAGLL APVPLEYQLP GEEVDSEVPA GTIVGSAIVD KTKRDEELAK KQLTRIVADE
EEMEIDTDER PNEPPPPAAV APAAAAAAAA GGGEEGGAAK EELPVQKETE NITVVKDYVR
QKRGRRGVPE GMQRCPITGQ LVQSDDMAEH LRVLLLDPKW KEQKDRLVAK AKKESAFAPI
ADVEANIANF VARRPDLFGT VEDQIEYAGD PLGIEVPPAP APAQVPPPPP PPKLPALPLT
PAGGVGVPGP PLPPSAAVAA PTVEKAPSPT NAAVPMVAPP QPHMQPQLQP QMPIPASPPP
AAAAAAAPVP APPAAEAMKP TEGEVVSAPP VMKEPKPAGE APKVTTAAAA AGVAVVQPPV
GPSPHIISTG APPPQLGPPG APRPPFAPMG MPPFAPMPMA GGYRLPMMMP GQMPMMGMMP
PGMSMPAQAQ PGQEAAGEVE EPAAKRARTD DAAVPEEQFI SIHKGAIAYT VECIANPTHN
LQAGTLQVEA MVKDTVESLK EKVGQMVSLP SSRLKLKVVS PGPSQGVTLK DAPTLGYYNL
GPETRLELSI KERGGKK
//