ID E1ZMP9_CHLVA Unreviewed; 935 AA.
AC E1ZMP9;
DT 30-NOV-2010, integrated into UniProtKB/TrEMBL.
DT 30-NOV-2010, sequence version 1.
DT 24-JAN-2024, entry version 56.
DE SubName: Full=Uncharacterized protein {ECO:0000313|EMBL:EFN52732.1};
GN ORFNames=CHLNCDRAFT_138281 {ECO:0000313|EMBL:EFN52732.1};
OS Chlorella variabilis (Green alga).
OC Eukaryota; Viridiplantae; Chlorophyta; core chlorophytes; Trebouxiophyceae;
OC Chlorellales; Chlorellaceae; Chlorella clade; Chlorella.
OX NCBI_TaxID=554065 {ECO:0000313|Proteomes:UP000008141};
RN [1] {ECO:0000313|EMBL:EFN52732.1, ECO:0000313|Proteomes:UP000008141}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=NC64A {ECO:0000313|EMBL:EFN52732.1,
RC ECO:0000313|Proteomes:UP000008141};
RX PubMed=20852019; DOI=10.1105/tpc.110.076406;
RA Blanc G., Duncan G., Agarkova I., Borodovsky M., Gurnon J., Kuo A.,
RA Lindquist E., Lucas S., Pangilinan J., Polle J., Salamov A., Terry A.,
RA Yamada T., Dunigan D.D., Grigoriev I.V., Claverie J.M., Van Etten J.L.;
RT "The Chlorella variabilis NC64A genome reveals adaptation to
RT photosymbiosis, coevolution with viruses, and cryptic sex.";
RL Plant Cell 22:2943-2955(2010).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; GL433854; EFN52732.1; -; Genomic_DNA.
DR RefSeq; XP_005844834.1; XM_005844772.1.
DR AlphaFoldDB; E1ZMP9; -.
DR STRING; 554065.E1ZMP9; -.
DR GeneID; 17352189; -.
DR KEGG; cvr:CHLNCDRAFT_138281; -.
DR eggNOG; KOG1274; Eukaryota.
DR InParanoid; E1ZMP9; -.
DR OMA; NAWFPIC; -.
DR OrthoDB; 3686044at2759; -.
DR Proteomes; UP000008141; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-KW.
DR Gene3D; 2.130.10.10; YVTN repeat-like/Quinoprotein amine dehydrogenase; 2.
DR InterPro; IPR048591; Ctf4-like_C.
DR InterPro; IPR022100; Mcl1_mid.
DR InterPro; IPR015943; WD40/YVTN_repeat-like_dom_sf.
DR InterPro; IPR019775; WD40_repeat_CS.
DR InterPro; IPR036322; WD40_repeat_dom_sf.
DR InterPro; IPR001680; WD40_rpt.
DR PANTHER; PTHR19932; WD REPEAT AND HMG-BOX DNA BINDING PROTEIN; 1.
DR PANTHER; PTHR19932:SF10; WD REPEAT AND HMG-BOX DNA-BINDING PROTEIN 1; 1.
DR Pfam; PF20946; Ctf4_C; 1.
DR Pfam; PF12341; Mcl1_mid; 1.
DR Pfam; PF00400; WD40; 3.
DR SMART; SM00320; WD40; 7.
DR SUPFAM; SSF50978; WD40 repeat-like; 1.
DR PROSITE; PS00678; WD_REPEATS_1; 2.
DR PROSITE; PS50082; WD_REPEATS_2; 2.
DR PROSITE; PS50294; WD_REPEATS_REGION; 2.
PE 4: Predicted;
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000008141};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW WD repeat {ECO:0000256|ARBA:ARBA00022574, ECO:0000256|PROSITE-
KW ProRule:PRU00221}.
FT REPEAT 151..183
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT REPEAT 235..276
FT /note="WD"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00221"
FT DOMAIN 412..697
FT /note="Minichromosome loss protein Mcl1 middle region"
FT /evidence="ECO:0000259|Pfam:PF12341"
FT DOMAIN 708..809
FT /note="DNA polymerase alpha-binding protein Ctf4-like C-
FT terminal"
FT /evidence="ECO:0000259|Pfam:PF20946"
FT REGION 358..409
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 824..935
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 370..390
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 848..862
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 935 AA; 97143 MW; 4308445319620BDE CRC64;
MAGAKQLSFS EAHKPGLCDV AYLPAAGGAA TLITAGADGR VCYRSAEAPA EAAKEIENSN
NGASAPVHCV AAAQGRAVVT GDDQNFVKCY SHPAGELQGV ATRFTLPVRA LAFSPSGLNL
AAGGDDEGIK LVDITTSKVF RQLKSQAYTR SLAYDPEGEY LASVNADGTL NVWEIQTGKQ
PLCRRKACPK LDLASTSRSQ AAWHPDGGSL LAAPGTDHDV VLYERMSWEA AFSLGGQHTA
EVSLLAFSKN GLYLVSAGQD QAVVVWDVNE RACLEKRLLP GAATGLAWHP TKNELAVITE
DGQLAVWSGV VPAKLPGPTV DLDALNGVKK KGEGEAGAAG GAGGAAGMAD GVSVLEGGGG
GGVGATEAGA SDEYDREDSF LADSGPKRGA RRRGRGGYAG FSSLDLPQPQ EAIQPGATEM
GDSGRRYLAF TGLGCIMLRQ EEDHNVVEVS FHDSSRQRKR IPLLNDFFGF SLGSLGDKGA
LYASRSNSES ASTVVYRPFE AWAPNSDWSL SLPKGEEAEC AAAGGSFCAV ATSKRHLRLF
SQAGTQTHLL TLPGAPVGLA AAGHQLAAVW HGAAPTAGGD QCLQYALYDV AEQRQVHGGP
LPLSPAASLA WLGFAEEGLL AAYDTEGELR LRSADFGGSW VTAFSAAAER KSTEQYWVVG
LGAKELQCIV CANTTEPAVP SGMQRPVVTA VALRPPVVAQ DAALAPFEAD LIRHGAVLSH
LGAAAGDARE AEDGGELEEA LHRAQLEADR ISLRLIQKLL AADRQARALE AACTLHNVPA
LQGALKLANH HRATALAERI SAVLEQRVAM EEAALELEEA EQYQQYQGEG VPHSAQRQHG
DITPVPGPTF TENHQASPAA AATAGNMSRA PSPAVVSRPH GGTGAAASSN PFARKPAAEQ
PENSPGNAAA VAAAAKRKAP ASGNPFARRS KAAKA
//