ID C1MUS6_MICPC Unreviewed; 712 AA.
AC C1MUS6;
DT 26-MAY-2009, integrated into UniProtKB/TrEMBL.
DT 26-MAY-2009, sequence version 1.
DT 27-MAR-2024, entry version 66.
DE SubName: Full=Predicted protein {ECO:0000313|EMBL:EEH56682.1};
GN ORFNames=MICPUCDRAFT_69658 {ECO:0000313|EMBL:EEH56682.1};
OS Micromonas pusilla (strain CCMP1545) (Picoplanktonic green alga).
OC Eukaryota; Viridiplantae; Chlorophyta; Mamiellophyceae; Mamiellales;
OC Mamiellaceae; Micromonas.
OX NCBI_TaxID=564608 {ECO:0000313|Proteomes:UP000001876};
RN [1] {ECO:0000313|EMBL:EEH56682.1, ECO:0000313|Proteomes:UP000001876}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CCMP1545 {ECO:0000313|EMBL:EEH56682.1,
RC ECO:0000313|Proteomes:UP000001876};
RX PubMed=19359590; DOI=10.1126/science.1167222;
RA Worden A.Z., Lee J.H., Mock T., Rouze P., Simmons M.P., Aerts A.L.,
RA Allen A.E., Cuvelier M.L., Derelle E., Everett M.V., Foulon E.,
RA Grimwood J., Gundlach H., Henrissat B., Napoli C., McDonald S.M.,
RA Parker M.S., Rombauts S., Salamov A., Von Dassow P., Badger J.H.,
RA Coutinho P.M., Demir E., Dubchak I., Gentemann C., Eikrem W., Gready J.E.,
RA John U., Lanier W., Lindquist E.A., Lucas S., Mayer K.F., Moreau H.,
RA Not F., Otillar R., Panaud O., Pangilinan J., Paulsen I., Piegu B.,
RA Poliakov A., Robbens S., Schmutz J., Toulza E., Wyss T., Zelensky A.,
RA Zhou K., Armbrust E.V., Bhattacharya D., Goodenough U.W., Van de Peer Y.,
RA Grigoriev I.V.;
RT "Green evolution and dynamic adaptations revealed by genomes of the marine
RT picoeukaryotes Micromonas.";
RL Science 324:268-272(2009).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; GG663740; EEH56682.1; -; Genomic_DNA.
DR RefSeq; XP_003059550.1; XM_003059504.1.
DR AlphaFoldDB; C1MUS6; -.
DR STRING; 564608.C1MUS6; -.
DR GeneID; 9684990; -.
DR KEGG; mpp:MICPUCDRAFT_69658; -.
DR eggNOG; KOG1343; Eukaryota.
DR OMA; LHINIPR; -.
DR OrthoDB; 69755at2759; -.
DR Proteomes; UP000001876; Unassembled WGS sequence.
DR GO; GO:0004407; F:histone deacetylase activity; IEA:UniProtKB-EC.
DR Gene3D; 1.25.40.20; Ankyrin repeat-containing domain; 1.
DR Gene3D; 3.40.800.20; Histone deacetylase domain; 1.
DR InterPro; IPR002110; Ankyrin_rpt.
DR InterPro; IPR036770; Ankyrin_rpt-contain_sf.
DR InterPro; IPR023801; His_deacetylse_dom.
DR InterPro; IPR037138; His_deacetylse_dom_sf.
DR InterPro; IPR023696; Ureohydrolase_dom_sf.
DR PANTHER; PTHR10625:SF4; HISTONE DEACETYLASE 6, ISOFORM G; 1.
DR PANTHER; PTHR10625; HISTONE DEACETYLASE HDAC1-RELATED; 1.
DR Pfam; PF12796; Ank_2; 1.
DR Pfam; PF00850; Hist_deacetyl; 1.
DR PRINTS; PR01415; ANKYRIN.
DR SMART; SM00248; ANK; 4.
DR SUPFAM; SSF48403; Ankyrin repeat; 1.
DR SUPFAM; SSF52768; Arginase/deacetylase; 1.
DR PROSITE; PS50297; ANK_REP_REGION; 1.
DR PROSITE; PS50088; ANK_REPEAT; 1.
PE 4: Predicted;
KW ANK repeat {ECO:0000256|ARBA:ARBA00023043, ECO:0000256|PROSITE-
KW ProRule:PRU00023}; Reference proteome {ECO:0000313|Proteomes:UP000001876}.
FT REPEAT 367..399
FT /note="ANK"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00023"
FT DOMAIN 454..614
FT /note="Histone deacetylase"
FT /evidence="ECO:0000259|Pfam:PF00850"
FT REGION 41..118
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 131..159
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 196..292
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 712 AA; 73852 MW; C686B73B36C942C9 CRC64;
MTPVADDVAT ADVLSPRAAK KQKMRTTSQL SELCGLDDVL DAPHAPRESS AAAAASTYET
PVRYPASAAP AGAPIRDVDT DDKPHQPSLH RASGGSSDVS TLVSPAGGAA PPGIVRHGSL
PKNLHINIPR ASVGSGASED TNADAADADA DAAASERVPP VTPATAFTAV PLHAAVAAGR
VEDVRLWLEE HAPVDSRDVD DGVGESDRNG NGNDDVVAST PEWPPRPVRD ILGPESVSRE
PPESSDGKLP ARPVVRGIPG RRAPRASVRV RDSDRSSGGD LGARGGGPVD ARNEHGNAAL
AVAAALSDAA VSATLTRLLL THGASPLVLS SGWTPLHWAA QEGNVESLEA MAAWEGGRAV
DARCEAFGRT PLTTAASAGR GACVEALIRA GADALALDVD GAGVLSHVAT KVSKGVRSKV
RAATRSALLA AAPKLKVLLL HHEDCGKHVS FKPHQESPER IAAVLASLAK GAASGALAED
EVSVSSEFEP ATATHLARAH GEEYIGMITD LAERVANTPV AFTPYHQEFK GMPQAKQKKP
EFSDTFFSPG TMTAALRAAG GVVHAVEKVL RGERRTAFVC VRPPGHHAGV NGATAGAPSA
GFSILNNAMI GASRRSRFFS SFRHASRASL VFVRSAPRGG CPRVAVRNFV AGASATTRTS
PRGTFRPPKC TSVSDDWRRG DHVALGQIRA AHPNARSIAR YSILPLERSV DV
//