ID A0A0M0K4R2_9EUKA Unreviewed; 512 AA.
AC A0A0M0K4R2;
DT 11-NOV-2015, integrated into UniProtKB/TrEMBL.
DT 11-NOV-2015, sequence version 1.
DT 27-MAR-2024, entry version 31.
DE RecName: Full=WD40 repeat-containing protein SMU1 {ECO:0000256|ARBA:ARBA00026184};
GN ORFNames=Ctob_006961 {ECO:0000313|EMBL:KOO33810.1};
OS Chrysochromulina tobinii.
OC Eukaryota; Haptista; Haptophyta; Prymnesiophyceae; Prymnesiales;
OC Chrysochromulinaceae; Chrysochromulina.
OX NCBI_TaxID=1460289 {ECO:0000313|EMBL:KOO33810.1, ECO:0000313|Proteomes:UP000037460};
RN [1] {ECO:0000313|Proteomes:UP000037460}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=CCMP291 {ECO:0000313|Proteomes:UP000037460};
RX PubMed=26397803; DOI=10.1371/journal.pgen.1005469;
RA Hovde B.T., Deodato C.R., Hunsperger H.M., Ryken S.A., Yost W., Jha R.K.,
RA Patterson J., Monnat R.J. Jr., Barlow S.B., Starkenburg S.R.,
RA Cattolico R.A.;
RT "Genome Sequence and Transcriptome Analyses of Chrysochromulina tobin:
RT Metabolic Tools for Enhanced Algal Fitness in the Prominent Order
RT Prymnesiales (Haptophyceae).";
RL PLoS Genet. 11:e1005469-e1005469(2015).
CC -!- SUBCELLULAR LOCATION: Nucleus speckle {ECO:0000256|ARBA:ARBA00004324}.
CC -!- SIMILARITY: Belongs to the WD repeat SMU1 family.
CC {ECO:0000256|ARBA:ARBA00025801}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KOO33810.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JWZX01001417; KOO33810.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A0M0K4R2; -.
DR OrthoDB; 3893433at2759; -.
DR Proteomes; UP000037460; Unassembled WGS sequence.
DR GO; GO:0016607; C:nuclear speck; IEA:UniProtKB-SubCell.
DR GO; GO:0000398; P:mRNA splicing, via spliceosome; IEA:InterPro.
DR CDD; cd00200; WD40; 1.
DR Gene3D; 2.130.10.10; YVTN repeat-like/Quinoprotein amine dehydrogenase; 1.
DR InterPro; IPR006595; CTLH_C.
DR InterPro; IPR020472; G-protein_beta_WD-40_rep.
DR InterPro; IPR006594; LisH.
DR InterPro; IPR045184; SMU1.
DR InterPro; IPR015943; WD40/YVTN_repeat-like_dom_sf.
DR InterPro; IPR019775; WD40_repeat_CS.
DR InterPro; IPR036322; WD40_repeat_dom_sf.
DR InterPro; IPR001680; WD40_rpt.
DR PANTHER; PTHR22848; WD40 REPEAT PROTEIN; 1.
DR PANTHER; PTHR22848:SF0; WD40 REPEAT-CONTAINING PROTEIN SMU1; 1.
DR Pfam; PF17814; LisH_TPL; 1.
DR Pfam; PF00400; WD40; 4.
DR PRINTS; PR00320; GPROTEINBRPT.
DR SMART; SM00667; LisH; 1.
DR SMART; SM00320; WD40; 7.
DR SUPFAM; SSF50978; WD40 repeat-like; 1.
DR PROSITE; PS50897; CTLH; 1.
DR PROSITE; PS50896; LISH; 1.
DR PROSITE; PS00678; WD_REPEATS_1; 1.
DR PROSITE; PS50082; WD_REPEATS_2; 5.
DR PROSITE; PS50294; WD_REPEATS_REGION; 4.
PE 3: Inferred from homology;
KW mRNA processing {ECO:0000256|ARBA:ARBA00022664};
KW mRNA splicing {ECO:0000256|ARBA:ARBA00023187};
KW Reference proteome {ECO:0000313|Proteomes:UP000037460};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW WD repeat {ECO:0000256|ARBA:ARBA00022574, ECO:0000256|PROSITE-
KW ProRule:PRU00221}.
FT DOMAIN 38..90
FT /note="CTLH"
FT /evidence="ECO:0000259|PROSITE:PS50897"
FT REPEAT 208..249
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT REPEAT 258..299
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT REPEAT 301..342
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT REPEAT 343..384
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT REPEAT 478..512
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
SQ SEQUENCE 512 AA; 56404 MW; B84522921F5923D1 CRC64;
MQVEAADVVK IVLQFLKENS LTRSMQVLQE ETHITLNTVD NVDAFVADVQ AGRWDAVMSV
AATLKLPAPL LDDLYEQLVL EMIEMRELDT ARQVLRATPA MAKLREKQPD RYAKLESLAG
RPFFDPRDAY AEGSSKERRR SQIAEALRAE VEVVPPSRLL ALLGQALKWQ QHYGLLPADQ
KFDLFRGGAA QRVVEQESHV DAPGPVIKFG KKSHAECAAF SPDGQFFVSG SMDGFLEVWD
YERGKVRTDL AYQEKDDYMM HDEPVLALAF SRDSELLASG SQDGHVKVWR VRTGQCVRRF
PQAHAQGVTC LAFAKDGTQL ASGSFDAVGR VHGLKSGKTL KELRGHTSYI NALVYAPDGA
RLVTGSSDGS VRVWDAKSCD CLTSFRPPTP ASGVELSINS LAFLPTNPDH LVVCNRSPTL
YIMSLGGEVV NTLSGTEREA SDFVMCAISS QGGWVHCVGE DSRLYSFDLA SAKLAHALRA
HDKDVIGLAI HPHRNLVATW ADEPTLRLWK KG
//