ID F2UNQ1_SALR5 Unreviewed; 553 AA.
AC F2UNQ1;
DT 31-MAY-2011, integrated into UniProtKB/TrEMBL.
DT 31-MAY-2011, sequence version 1.
DT 27-MAR-2024, entry version 39.
DE RecName: Full=Pre-mRNA-splicing factor 38 {ECO:0000256|RuleBase:RU367025};
GN ORFNames=PTSG_09980 {ECO:0000313|EMBL:EGD79256.1};
OS Salpingoeca rosetta (strain ATCC 50818 / BSB-021).
OC Eukaryota; Choanoflagellata; Craspedida; Salpingoecidae; Salpingoeca.
OX NCBI_TaxID=946362 {ECO:0000313|Proteomes:UP000007799};
RN [1] {ECO:0000313|Proteomes:UP000007799}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=ATCC 50818 {ECO:0000313|Proteomes:UP000007799};
RA Russ C., Cuomo C., Burger G., Gray M.W., Holland P.W.H., King N.,
RA Lang F.B.F., Roger A.J., Ruiz-Trillo I., Young S.K., Zeng Q., Gargeya S.,
RA Alvarado L., Berlin A., Chapman S.B., Chen Z., Freedman E., Gellesch M.,
RA Goldberg J., Griggs A., Gujja S., Heilman E., Heiman D., Howarth C.,
RA Mehta T., Neiman D., Pearson M., Roberts A., Saif S., Shea T., Shenoy N.,
RA Sisk P., Stolte C., Sykes S., White J., Yandava C., Haas B., Nusbaum C.,
RA Birren B.;
RT "Annotation of Salpingoeca rosetta.";
RL Submitted (AUG-2009) to the EMBL/GenBank/DDBJ databases.
CC -!- FUNCTION: Required for pre-mRNA splicing.
CC {ECO:0000256|RuleBase:RU367025}.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|RuleBase:RU367025}.
CC -!- SIMILARITY: Belongs to the PRP38 family.
CC {ECO:0000256|ARBA:ARBA00006164, ECO:0000256|RuleBase:RU367025}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; GL832984; EGD79256.1; -; Genomic_DNA.
DR RefSeq; XP_004989341.1; XM_004989284.1.
DR AlphaFoldDB; F2UNQ1; -.
DR STRING; 946362.F2UNQ1; -.
DR EnsemblProtists; EGD79256; EGD79256; PTSG_09980.
DR GeneID; 16069885; -.
DR KEGG; sre:PTSG_09980; -.
DR eggNOG; KOG2889; Eukaryota.
DR InParanoid; F2UNQ1; -.
DR OMA; QNRMGAY; -.
DR OrthoDB; 5485563at2759; -.
DR Proteomes; UP000007799; Unassembled WGS sequence.
DR GO; GO:0005681; C:spliceosomal complex; IEA:UniProtKB-KW.
DR GO; GO:0000398; P:mRNA splicing, via spliceosome; IEA:UniProtKB-UniRule.
DR InterPro; IPR005037; PRP38.
DR PANTHER; PTHR23142:SF1; PRE-MRNA-SPLICING FACTOR 38A; 1.
DR PANTHER; PTHR23142; UNCHARACTERIZED; 1.
DR Pfam; PF03371; PRP38; 1.
PE 3: Inferred from homology;
KW mRNA processing {ECO:0000256|ARBA:ARBA00022664,
KW ECO:0000256|RuleBase:RU367025};
KW mRNA splicing {ECO:0000256|ARBA:ARBA00023187,
KW ECO:0000256|RuleBase:RU367025};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|RuleBase:RU367025};
KW Reference proteome {ECO:0000313|Proteomes:UP000007799};
KW Spliceosome {ECO:0000256|ARBA:ARBA00022728, ECO:0000256|RuleBase:RU367025}.
FT REGION 199..553
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 230..244
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 258..288
FT /note="Basic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 289..331
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 344..403
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 417..475
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 476..499
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 515..541
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 553 AA; 64982 MW; 07A652A170978E7C CRC64;
MANRTVKDAR SVHGTNPQFI VDKIIRSRIY ETLYWKEQCF ALTAETVIEK AVELTYVGGV
YGGNIRPTPF LCLTLKLLQL QPDKDIIIEY IKNEDFKYLR ALGAFYLRLV GTSMDCYRYI
EPLLNDYRKL KRMSRNGVME LTHMDEFVDE LLREDRVCDI GLPRIQKRYA LEVNDELEPR
RSLLEDELDD FEGELQAAAA AGTGGGGEED TTGSGGGDGG DAQKRKAEED EEEGEEDEEE
GRTGGDGSGD ESDRHHHRRR SPSRSPRRSR SRSHSRSRER GGRHRSRSSS RGRDRDRDRD
RDRDRDRDRD RDRDRDRDRD RDRDRDRSGR SDSRTRRRHR SRSRSHDHDD YDDRDRYSRR
YHRDRDRDRR SRRHDDDDYD DRRRHRRHRD SRDRSISPSD RTRTRRRSRS RSRSRDRGRR
YGRERYSRGE RDGSRERESE RRRSRSPSPS RRDGGDDERE RRVVESERRT RDEETAPSSS
SSARKPAESS ASSGGSGSHR LRFKGQKDSK KKKKKKRDKD SEAEGSKDTA KTPEDEIAEM
NALRAKLGMA PLK
//