ID R7Z100_CONA1 Unreviewed; 514 AA.
AC R7Z100;
DT 24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT 24-JUL-2013, sequence version 1.
DT 27-MAR-2024, entry version 39.
DE RecName: Full=SURP motif domain-containing protein {ECO:0000259|PROSITE:PS50128};
GN ORFNames=W97_06852 {ECO:0000313|EMBL:EON67709.1};
OS Coniosporium apollinis (strain CBS 100218) (Rock-inhabiting black yeast).
OC Eukaryota; Fungi; Dikarya; Ascomycota; Pezizomycotina; Dothideomycetes;
OC Dothideomycetes incertae sedis; Coniosporium.
OX NCBI_TaxID=1168221 {ECO:0000313|EMBL:EON67709.1, ECO:0000313|Proteomes:UP000016924};
RN [1] {ECO:0000313|Proteomes:UP000016924}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CBS 100218 {ECO:0000313|Proteomes:UP000016924};
RG The Broad Institute Genome Sequencing Platform;
RA Cuomo C., Gorbushina A., Noack S., Walker B., Young S.K., Zeng Q.,
RA Gargeya S., Fitzgerald M., Haas B., Abouelleil A., Alvarado L.,
RA Arachchi H.M., Berlin A.M., Chapman S.B., Goldberg J., Griggs A., Gujja S.,
RA Hansen M., Howarth C., Imamovic A., Larimer J., McCowan C., Montmayeur A.,
RA Murphy C., Neiman D., Pearson M., Priest M., Roberts A., Saif S., Shea T.,
RA Sisk P., Sykes S., Wortman J., Nusbaum C., Birren B.;
RT "The genome sequence of Coniosporium apollinis CBS 100218.";
RL Submitted (JUN-2012) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JH767589; EON67709.1; -; Genomic_DNA.
DR RefSeq; XP_007783026.1; XM_007784836.1.
DR AlphaFoldDB; R7Z100; -.
DR STRING; 1168221.R7Z100; -.
DR GeneID; 19904163; -.
DR eggNOG; KOG0007; Eukaryota.
DR HOGENOM; CLU_013259_3_1_1; -.
DR OMA; VKYQEQQ; -.
DR OrthoDB; 168687at2759; -.
DR Proteomes; UP000016924; Unassembled WGS sequence.
DR GO; GO:0005681; C:spliceosomal complex; IEA:UniProtKB-KW.
DR GO; GO:0003723; F:RNA binding; IEA:InterPro.
DR GO; GO:0045292; P:mRNA cis splicing, via spliceosome; IEA:InterPro.
DR Gene3D; 1.10.10.790; Surp module; 2.
DR InterPro; IPR045146; SF3A1.
DR InterPro; IPR022030; SF3A1_dom.
DR InterPro; IPR000061; Surp.
DR InterPro; IPR035967; SWAP/Surp_sf.
DR PANTHER; PTHR15316; SPLICEOSOME ASSOCIATED PROTEIN 114/SWAP SPLICING FACTOR-RELATED; 1.
DR PANTHER; PTHR15316:SF1; SPLICING FACTOR 3A SUBUNIT 1; 1.
DR Pfam; PF12230; PRP21_like_P; 1.
DR Pfam; PF01805; Surp; 2.
DR SMART; SM00648; SWAP; 2.
DR SUPFAM; SSF109905; Surp module (SWAP domain); 2.
DR PROSITE; PS50128; SURP; 2.
PE 4: Predicted;
KW mRNA processing {ECO:0000256|ARBA:ARBA00022664};
KW mRNA splicing {ECO:0000256|ARBA:ARBA00023187};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000016924};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Spliceosome {ECO:0000256|ARBA:ARBA00022728}.
FT DOMAIN 34..76
FT /note="SURP motif"
FT /evidence="ECO:0000259|PROSITE:PS50128"
FT DOMAIN 131..173
FT /note="SURP motif"
FT /evidence="ECO:0000259|PROSITE:PS50128"
FT REGION 1..23
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 89..113
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 466..499
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 96..110
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 485..499
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 514 AA; 58643 MW; 206CAB83E649EA96 CRC64;
MAPAAIIDTP MTSLDEAHRP PPNVVLPPKD IRSIVEKTAG YVARNGPAFE ERIRAKEQSN
PKFCFLNPSD AYSPFYLWRL SEIREGRGTA VSAGRAEEAA REEEQPQGPE EPPEFHFSAR
MPMMNAQDLE VVKLTALFVA KNGRSFMTAL SQRETGNYQF DFLRPQHSMY QFFSRLVDQY
TELINGGSID GGRPERERLA QLERNVHDRF HMLERARRRA EWVKHQEQQK AKHEEEAEAE
KIAYAQVDWH DFVVVETVVF NEADDQTDLP PPTSLNDLQS ASLEQKAMMS LQPHNMRIEE
AMPTDEPSYY DQVPQPVQMP APQTAYAPPP IHPSRMDYVM HQSAQDDEEE RLIRERTEAR
ERAQQAQAAA KGGTGPMRIR NDYVPRAQAK RQNVSMALCP NCNQQVPYDE LEQHMKVELL
DPEWRKQKAK SDARYATTNL STADVANNLK RLASQRSDVF DGVTGLPVSE EEAARRKKAA
TSYDGNPQPA DTHRMQSLNI EEQIRQIKEK YGGQ
//