ID A0A2P6V6F0_9CHLO Unreviewed; 2031 AA.
AC A0A2P6V6F0;
DT 23-MAY-2018, integrated into UniProtKB/TrEMBL.
DT 23-MAY-2018, sequence version 1.
DT 27-MAR-2024, entry version 24.
DE RecName: Full=WD40 repeat-containing protein SMU1 {ECO:0000256|ARBA:ARBA00026184};
GN ORFNames=C2E20_6834 {ECO:0000313|EMBL:PSC69662.1};
OS Micractinium conductrix.
OC Eukaryota; Viridiplantae; Chlorophyta; core chlorophytes; Trebouxiophyceae;
OC Chlorellales; Chlorellaceae; Chlorella clade; Micractinium.
OX NCBI_TaxID=554055 {ECO:0000313|EMBL:PSC69662.1, ECO:0000313|Proteomes:UP000239649};
RN [1] {ECO:0000313|EMBL:PSC69662.1, ECO:0000313|Proteomes:UP000239649}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=SAG 241.80 {ECO:0000313|EMBL:PSC69662.1,
RC ECO:0000313|Proteomes:UP000239649};
RX PubMed=29178410; DOI=10.1111/tpj.13789;
RA Arriola M.B., Velmurugan N., Zhang Y., Plunkett M.H., Hondzo H.,
RA Barney B.M.;
RT "Genome sequences of Chlorella sorokiniana UTEX 1602 and Micractinium
RT conductrix SAG 241.80: implications to maltose excretion by a green alga.";
RL Plant J. 93:566-586(2018).
CC -!- SUBCELLULAR LOCATION: Nucleus speckle {ECO:0000256|ARBA:ARBA00004324}.
CC -!- SIMILARITY: Belongs to the WD repeat SMU1 family.
CC {ECO:0000256|ARBA:ARBA00025801}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PSC69662.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LHPF02000025; PSC69662.1; -; Genomic_DNA.
DR STRING; 554055.A0A2P6V6F0; -.
DR Proteomes; UP000239649; Unassembled WGS sequence.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-KW.
DR GO; GO:0016607; C:nuclear speck; IEA:UniProtKB-SubCell.
DR GO; GO:0140359; F:ABC-type transporter activity; IEA:InterPro.
DR GO; GO:0005524; F:ATP binding; IEA:UniProtKB-KW.
DR GO; GO:0016887; F:ATP hydrolysis activity; IEA:InterPro.
DR GO; GO:0000398; P:mRNA splicing, via spliceosome; IEA:InterPro.
DR CDD; cd18557; ABC_6TM_TAP_ABCB8_10_like; 1.
DR CDD; cd00293; USP_Like; 1.
DR CDD; cd00200; WD40; 1.
DR Gene3D; 1.20.1560.10; ABC transporter type 1, transmembrane domain; 2.
DR Gene3D; 3.40.50.300; P-loop containing nucleotide triphosphate hydrolases; 1.
DR Gene3D; 2.130.10.10; YVTN repeat-like/Quinoprotein amine dehydrogenase; 1.
DR InterPro; IPR003593; AAA+_ATPase.
DR InterPro; IPR011527; ABC1_TM_dom.
DR InterPro; IPR036640; ABC1_TM_sf.
DR InterPro; IPR003439; ABC_transporter-like_ATP-bd.
DR InterPro; IPR017871; ABC_transporter-like_CS.
DR InterPro; IPR006595; CTLH_C.
DR InterPro; IPR006594; LisH.
DR InterPro; IPR027417; P-loop_NTPase.
DR InterPro; IPR045184; SMU1.
DR InterPro; IPR006016; UspA.
DR InterPro; IPR015943; WD40/YVTN_repeat-like_dom_sf.
DR InterPro; IPR019775; WD40_repeat_CS.
DR InterPro; IPR036322; WD40_repeat_dom_sf.
DR InterPro; IPR001680; WD40_rpt.
DR PANTHER; PTHR22848; WD40 REPEAT PROTEIN; 1.
DR PANTHER; PTHR22848:SF0; WD40 REPEAT-CONTAINING PROTEIN SMU1; 1.
DR Pfam; PF00664; ABC_membrane; 1.
DR Pfam; PF00005; ABC_tran; 1.
DR Pfam; PF17814; LisH_TPL; 1.
DR Pfam; PF00582; Usp; 1.
DR Pfam; PF00400; WD40; 4.
DR SMART; SM00382; AAA; 1.
DR SMART; SM00668; CTLH; 1.
DR SMART; SM00320; WD40; 6.
DR SUPFAM; SSF90123; ABC transporter transmembrane region; 1.
DR SUPFAM; SSF52402; Adenine nucleotide alpha hydrolases-like; 1.
DR SUPFAM; SSF52540; P-loop containing nucleoside triphosphate hydrolases; 1.
DR SUPFAM; SSF50978; WD40 repeat-like; 1.
DR PROSITE; PS50929; ABC_TM1F; 1.
DR PROSITE; PS00211; ABC_TRANSPORTER_1; 1.
DR PROSITE; PS50893; ABC_TRANSPORTER_2; 1.
DR PROSITE; PS50897; CTLH; 1.
DR PROSITE; PS50896; LISH; 1.
DR PROSITE; PS00678; WD_REPEATS_1; 1.
DR PROSITE; PS50082; WD_REPEATS_2; 4.
DR PROSITE; PS50294; WD_REPEATS_REGION; 3.
PE 3: Inferred from homology;
KW ATP-binding {ECO:0000256|ARBA:ARBA00022840};
KW Membrane {ECO:0000256|ARBA:ARBA00023136, ECO:0000256|SAM:Phobius};
KW mRNA processing {ECO:0000256|ARBA:ARBA00023187};
KW mRNA splicing {ECO:0000256|ARBA:ARBA00023187};
KW Nucleotide-binding {ECO:0000256|ARBA:ARBA00022741};
KW Reference proteome {ECO:0000313|Proteomes:UP000239649};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Transmembrane {ECO:0000256|ARBA:ARBA00022692, ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|ARBA:ARBA00022989,
KW ECO:0000256|SAM:Phobius};
KW WD repeat {ECO:0000256|ARBA:ARBA00022574, ECO:0000256|PROSITE-
KW ProRule:PRU00221}.
FT TRANSMEM 99..119
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 139..159
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 220..237
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 243..259
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 352..376
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 101..384
FT /note="ABC transmembrane type-1"
FT /evidence="ECO:0000259|PROSITE:PS50929"
FT DOMAIN 484..721
FT /note="ABC transporter"
FT /evidence="ECO:0000259|PROSITE:PS50893"
FT DOMAIN 1558..1610
FT /note="CTLH"
FT /evidence="ECO:0000259|PROSITE:PS50897"
FT REPEAT 1727..1759
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT REPEAT 1777..1818
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT REPEAT 1820..1861
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT REPEAT 1862..1897
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT REGION 1..33
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 817..1002
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1029..1093
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1501..1524
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 19..33
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 876..895
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1034..1053
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1079..1093
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1507..1524
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2031 AA; 216479 MW; 5934CA8F3C4B88A1 CRC64;
MSSQLGAPRR RQRQELVAAA VPRNSSAQRC TSGSTSVVAT SAAAAAAADA GGPPQADAAT
DASASAGAPL TASSSWQQRE APLGRAVMRR LLWQQRRHLT LAGIALLLCV SMNLASPVLQ
GMLFDVLVRG QPFQEYSRLF LGLLFIYVAE PLLSQVYILN ACDAGEKVQA SLRSEAFRCL
LSQRVEFFDR HSSSQLTQLL SRDLDSIRSF VFSNTARDRG LRAALEALGT VCVLFFLSWR
LGPILAGVIV ASVCAAFLYR RQTRPLEASN AAAQAALAEV ASSSFTNMRT VRIFAGEALE
QARFGAQVAQ SYASGLGFAR AKAMVEGVNR SSTHLSLLAL YAFGGHLVNS QLLPVGVLVT
AIGFTFSLVF ATQGMLQTFV DLRSMLASVQ RVRATLSQLP LDESMVQSLP PLPDTPWLAA
AAAPGAPAAN GSAAVPFQGL QGEQRATAAA AGSGSGANGS VHSAGGAAAA EANPAVEAAV
SGDIRLEGVS FAYPVRPGAP VLRDLCLTLP RGKVTAVVGR SGAGKSTMAA LLERLYSPDA
GSITLGGRDI HSFTRTQYCA ALAAVSQEPV LFPASIAYNI GYGLSMRCTQ AEIEAAAKAA
NAHEFIATLP EGYDTVVGEA GSLLSGGQRQ RITLARALLK DAPILILDEA TSSLDAENER
LVQAAIEKLM EGRTVMVIAH RLSTVQRADQ IVVLEGGQVV EQGSHKQLLR SGGRYAALIE
AAKAACKTRF PQAQAVEVVV AHSGSSDEIG AVILRRAAEL GASLVALAPH SRTATQRVLQ
GSVTDYCCRH ARLPVLVVQG VEPQAEQGVF SNNDALDEPA AAEPEPAAEA EQAAEEPEQA
AEPVTATEPE PEAEAGPAEP EPEAEAAVEQ PPAEQAAPAE PSPPAPSPPA EPKVEKPAPT
EPKEEPPVPA AYASKKRPAE EAEEQPAAKA AKVAAPAAPG QRRPRKNQNH PAAVAARKAA
RKGQAEATVL FPVGQSPADI FYPQRSRSET PRHARNSRGA WARLKQAAHG TLPLGVKRSK
SVFWSGAVEE AAAASSSEDD EEEEEESSEY SDSDDDSEAP RRGRKPGRAA GRGSGAGHQR
NAAQGNAGQG YNAQNMDPAI AAQLQNVPAH MRPMVMQQLM QQRQQQMGRG APAPVATFEQ
LAPQQQQAVS QQLVASLPHN MQVQFAQLAP QQQRTYIGQL YAQRVAQQQA QQQAQQQAQQ
QQAAAAAAPA AAQASAAQMQ QQQAMNQQLQ ALYATLPPAV QQQLLGLPPA QQQQALLQIF
QQQQHQRAQQ EQQMRIQQQQ AHMAAAQQAA AGGQQAPQQP VDVTHMQPPS LQLRCQWPVT
LLVVLAALAG AGLTFVAAEP SPLVVRRRPA CLHEGLGVWA EQKSGDASLG PAELDYSAIR
LAAPGSERLA RLGGNGWPVV LGGAAAQRCA CCRLCSAQNE LPTAGQPRCR RWSHRARDGA
CRLWGPVGDT TTPVFRQNAV VADQVWFSGS SEGYVPLLAA EPKGQCVVLM YGGMATDGMG
GSVQSLTNLP PPPPPKRPQR SPPPPRAVIK IVLQFCKENS LVESFNAIQN ECQVSLNTVD
SIESFVSDIN NGRWDAVLPQ VASLKLPRAK LEDLYEQVVL ELVELRETDT ARAMLRQTQV
FARMKQDDSE RYLRLEHLAN RAYLDARELY GGMPREKRRA QLAHALSTEV STVPPSRLMA
LVGQALKWQQ QQGLLPPGAA FDLFRGTAAG ARDEVEAFPT DLDRQIKFGS KSHAECAAFS
PDGQLLVTGS VDGFVEVWDQ LTGRLKMDLP YQAQEQFMLH DSAVLALAFS RDNELLATGD
QDGRVKVWRV RTGQCLRRFD TAHTQGITSL AFSRDGTHVL SASYDSLIRV HGIKSGKMLK
EFRGHGSFVN AAVYSADGAQ VISAGADATV RVWDVKSCDL TFVFRPPQAA PAGEAPVIGL
ALNPQNVDQV FVAPRGPTLY LMTLQGQVVK SFQSGKTAGG DFLAFCVSPR GEWAYCLGED
GALYCFSVGG SGKLEHLMQV AEKDPIGLAH HPHRNLLATY AADGSLKTWK P
//