ID A0A2P6TS70_CHLSO Unreviewed; 631 AA.
AC A0A2P6TS70;
DT 23-MAY-2018, integrated into UniProtKB/TrEMBL.
DT 23-MAY-2018, sequence version 1.
DT 27-MAR-2024, entry version 23.
DE SubName: Full=U5 small nuclear ribonucleo 40 kDa {ECO:0000313|EMBL:PRW56898.1};
GN ORFNames=C2E21_3814 {ECO:0000313|EMBL:PRW56898.1};
OS Chlorella sorokiniana (Freshwater green alga).
OC Eukaryota; Viridiplantae; Chlorophyta; core chlorophytes; Trebouxiophyceae;
OC Chlorellales; Chlorellaceae; Chlorella clade; Chlorella.
OX NCBI_TaxID=3076 {ECO:0000313|EMBL:PRW56898.1, ECO:0000313|Proteomes:UP000239899};
RN [1] {ECO:0000313|EMBL:PRW56898.1, ECO:0000313|Proteomes:UP000239899}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=UTEX 1602 {ECO:0000313|Proteomes:UP000239899};
RX PubMed=29178410; DOI=10.1111/tpj.13789;
RA Arriola M.B., Velmurugan N., Zhang Y., Plunkett M.H., Hondzo H.,
RA Barney B.M.;
RT "Genome sequences of Chlorella sorokiniana UTEX 1602 and Micractinium
RT conductrix SAG 241.80: implications to maltose excretion by a green alga.";
RL Plant J. 93:566-586(2018).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- SIMILARITY: Belongs to the ISY1 family.
CC {ECO:0000256|ARBA:ARBA00007002}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PRW56898.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LHPG02000007; PRW56898.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A2P6TS70; -.
DR STRING; 3076.A0A2P6TS70; -.
DR OrthoDB; 5476798at2759; -.
DR Proteomes; UP000239899; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0000350; P:generation of catalytic spliceosome for second transesterification step; IEA:InterPro.
DR CDD; cd00200; WD40; 1.
DR Gene3D; 1.10.287.660; Helix hairpin bin; 1.
DR Gene3D; 2.130.10.10; YVTN repeat-like/Quinoprotein amine dehydrogenase; 1.
DR InterPro; IPR020472; G-protein_beta_WD-40_rep.
DR InterPro; IPR029012; Helix_hairpin_bin_sf.
DR InterPro; IPR009360; Isy1.
DR InterPro; IPR037200; Isy1_sf.
DR InterPro; IPR015943; WD40/YVTN_repeat-like_dom_sf.
DR InterPro; IPR036322; WD40_repeat_dom_sf.
DR InterPro; IPR001680; WD40_rpt.
DR PANTHER; PTHR44006; U5 SMALL NUCLEAR RIBONUCLEOPROTEIN 40 KDA PROTEIN; 1.
DR PANTHER; PTHR44006:SF1; U5 SMALL NUCLEAR RIBONUCLEOPROTEIN 40 KDA PROTEIN; 1.
DR Pfam; PF06246; Isy1; 1.
DR Pfam; PF00400; WD40; 7.
DR PRINTS; PR00320; GPROTEINBRPT.
DR SMART; SM00320; WD40; 7.
DR SUPFAM; SSF140102; ISY1 domain-like; 1.
DR SUPFAM; SSF50978; WD40 repeat-like; 1.
DR PROSITE; PS50082; WD_REPEATS_2; 7.
DR PROSITE; PS50294; WD_REPEATS_REGION; 4.
PE 3: Inferred from homology;
KW mRNA processing {ECO:0000256|ARBA:ARBA00023187};
KW mRNA splicing {ECO:0000256|ARBA:ARBA00023187};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000239899};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW WD repeat {ECO:0000256|ARBA:ARBA00022574, ECO:0000256|PROSITE-
KW ProRule:PRU00221}.
FT REPEAT 335..366
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT REPEAT 378..419
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT REPEAT 439..462
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT REPEAT 462..503
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT REPEAT 504..538
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT REPEAT 566..595
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT REPEAT 596..631
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT REGION 200..219
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 261..293
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 631 AA; 69809 MW; EAA41B52F83F9CB4 CRC64;
MARNEEKAQS MLNRWLAGKQ AELKPERQKR PYLASECHDL NEADKWRQQI LREIGKKVME
IQNAGLGEHR IRDLNDEINK LIREKGHWER RIVELGGPDY SKVGPKVTDS EGRAVGEAGG
RGPGYRYFGA AKQLPGVREL FEKEAPRQVR RTRAEMYRAI DADYYGFRDE EDGILEKVEA
EAEGPMRRAA IEEWQEREAE RQAALASARG GLADGGAGDA AAAAAGDADA AAAPQFVAYV
PLPDQKEIEA KVLESKKAAL LAQQGGRGQQ QRRRAMADGG AQKRPLDDPE DANGGALVAV
KKARQEDSLV VASKRPDIKA GPDRTSALQA PIMLLTGHGD AVFTMRFSPE GDVIASGSHD
KHIFLWRTYG ECENYMMLKG HKNAVLEVHW TPDGERLVSC SPDKTVRVWD AVTGEQVKKM
GEHKDIVNSC CPLRRGPPLV VSGGDDCEAK LWDLRQKRSV KTLGEKYQIL TVAFSEGGDQ
IYTAGIENVV NVWDLRREEI SMSLAGHSDS ITGMRLSPDG THLLTNSMDN TLRVWDMRPY
APTNRCTKVF AGHVHTFEKN LLRCDWSPDG AKVAAGSGDR MVYIWNAHTR NLMYKLPGHS
GSVNEVVFHP KEPIVGSASS DKTIYLGELA Q
//