ID I0Z8J9_COCSC Unreviewed; 707 AA.
AC I0Z8J9;
DT 13-JUN-2012, integrated into UniProtKB/TrEMBL.
DT 13-JUN-2012, sequence version 1.
DT 24-JAN-2024, entry version 37.
DE RecName: Full=Zinc-finger domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=COCSUDRAFT_59460 {ECO:0000313|EMBL:EIE26968.1};
OS Coccomyxa subellipsoidea (strain C-169) (Green microalga).
OC Eukaryota; Viridiplantae; Chlorophyta; core chlorophytes; Trebouxiophyceae;
OC Trebouxiophyceae incertae sedis; Elliptochloris clade; Coccomyxa;
OC Coccomyxa subellipsoidea.
OX NCBI_TaxID=574566 {ECO:0000313|EMBL:EIE26968.1, ECO:0000313|Proteomes:UP000007264};
RN [1] {ECO:0000313|EMBL:EIE26968.1, ECO:0000313|Proteomes:UP000007264}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=C-169 {ECO:0000313|EMBL:EIE26968.1,
RC ECO:0000313|Proteomes:UP000007264};
RX PubMed=22630137; DOI=10.1186/gb-2012-13-5-r39;
RA Blanc G., Agarkova I., Grimwood J., Kuo A., Brueggeman A., Dunigan D.,
RA Gurnon J., Ladunga I., Lindquist E., Lucas S., Pangilinan J., Proschold T.,
RA Salamov A., Schmutz J., Weeks D., Yamada T., Claverie J.M., Grigoriev I.,
RA Van Etten J., Lomsadze A., Borodovsky M.;
RT "The genome of the polar eukaryotic microalga coccomyxa subellipsoidea
RT reveals traits of cold adaptation.";
RL Genome Biol. 13:R39-R39(2012).
CC -!- SUBCELLULAR LOCATION: Cytoplasm {ECO:0000256|ARBA:ARBA00004496}.
CC Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EIE26968.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AGSI01000002; EIE26968.1; -; Genomic_DNA.
DR RefSeq; XP_005651512.1; XM_005651455.1.
DR AlphaFoldDB; I0Z8J9; -.
DR STRING; 574566.I0Z8J9; -.
DR GeneID; 17044977; -.
DR KEGG; csl:COCSUDRAFT_59460; -.
DR eggNOG; ENOG502QU1W; Eukaryota.
DR OrthoDB; 2912062at2759; -.
DR Proteomes; UP000007264; Unassembled WGS sequence.
DR GO; GO:0005737; C:cytoplasm; IEA:UniProtKB-SubCell.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0006355; P:regulation of DNA-templated transcription; IEA:InterPro.
DR InterPro; IPR040221; CDCA7/CDA7L.
DR InterPro; IPR028942; WHIM1_dom.
DR InterPro; IPR028941; WHIM2_dom.
DR InterPro; IPR018866; Znf-4CXXC_R1.
DR PANTHER; PTHR31169; OS05G0300700 PROTEIN; 1.
DR PANTHER; PTHR31169:SF8; ZINC-FINGER DOMAIN OF MONOAMINE-OXIDASE A REPRESSOR R1 PROTEIN; 1.
DR Pfam; PF15612; WHIM1; 1.
DR Pfam; PF15613; WSD; 1.
DR Pfam; PF10497; zf-4CXXC_R1; 1.
PE 4: Predicted;
KW Cytoplasm {ECO:0000256|ARBA:ARBA00022490};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Phosphoprotein {ECO:0000256|ARBA:ARBA00022553};
KW Reference proteome {ECO:0000313|Proteomes:UP000007264};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT DOMAIN 2..67
FT /note="Zinc-finger"
FT /evidence="ECO:0000259|Pfam:PF10497"
FT DOMAIN 316..348
FT /note="WHIM1"
FT /evidence="ECO:0000259|Pfam:PF15612"
FT DOMAIN 437..511
FT /note="WHIM2"
FT /evidence="ECO:0000259|Pfam:PF15613"
FT REGION 67..132
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 619..707
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 96..132
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 707 AA; 72834 MW; D706F82B43790469 CRC64;
MFFCPRCLLN RYGEEVEKVN QLAQWKCPKC RDICNCSNCR KKRGLEATGI LANMSKAAGF
TSVSDLLQKN PNAKRPPSAS LQAPGAKAPK QAAGEAKPKA QKKKEPKPEA APDLIDDELL
GARPHSMPER EAVRMHRVSD EVSDMRWPPD LAHTVDLPPD CSAGDLAEVL EFLKVFGEQT
GTAGISLSFA AVELLQPATE AGHSAYTREE RSVAGHVHMR LLDLVRSDWG IRDSVGIHGW
QPVMAQYLRQ DRLHGLAALD RLSGASSAGC AAAAAAQLPN DHEVDAAVAS PGAVAAAVGG
DQESEGEDSL LMFPAGGYWA LQPGTRLRML RSLCFDALET RAIRQCMEDA MGAAAEEDKG
RREELAAAKR EAREAWQKQR DKQIALLLAS GSGSGLTIEE QRSLMDRSWA KVDAEAAATA
APQLPGSGAA MAFPAAVRIL PLGRDRSGAL FWKLACCPVL AGSADAAVLM ASEFRAADSD
EWSVISDAAT LAKSLDGRGR REAGLLRALE AGYDLGATKS GPAAGGVKSG APPAAAAKIG
NGKNVAAATT GTVDTAGTGK AAAGKAGAGK KAAAKAGAAE SLAATKPAAM ATKEAASVSA
GQIGAADVAP AKAGAQKAKV TPAEKASQDA AAAKAKPGRK RAAAVLTAKP ENGAAGAAQT
AAAEPAEDAG NCDNAAAKAK PMKRQRKGAA ENAAGPGKRL TRSQAAA
//