ID I0YWU6_COCSC Unreviewed; 1086 AA.
AC I0YWU6;
DT 13-JUN-2012, integrated into UniProtKB/TrEMBL.
DT 13-JUN-2012, sequence version 1.
DT 24-JAN-2024, entry version 53.
DE RecName: Full=PHD-type domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=COCSUDRAFT_63988 {ECO:0000313|EMBL:EIE22865.1};
OS Coccomyxa subellipsoidea (strain C-169) (Green microalga).
OC Eukaryota; Viridiplantae; Chlorophyta; core chlorophytes; Trebouxiophyceae;
OC Trebouxiophyceae incertae sedis; Elliptochloris clade; Coccomyxa;
OC Coccomyxa subellipsoidea.
OX NCBI_TaxID=574566 {ECO:0000313|EMBL:EIE22865.1, ECO:0000313|Proteomes:UP000007264};
RN [1] {ECO:0000313|EMBL:EIE22865.1, ECO:0000313|Proteomes:UP000007264}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=C-169 {ECO:0000313|EMBL:EIE22865.1,
RC ECO:0000313|Proteomes:UP000007264};
RX PubMed=22630137; DOI=10.1186/gb-2012-13-5-r39;
RA Blanc G., Agarkova I., Grimwood J., Kuo A., Brueggeman A., Dunigan D.,
RA Gurnon J., Ladunga I., Lindquist E., Lucas S., Pangilinan J., Proschold T.,
RA Salamov A., Schmutz J., Weeks D., Yamada T., Claverie J.M., Grigoriev I.,
RA Van Etten J., Lomsadze A., Borodovsky M.;
RT "The genome of the polar eukaryotic microalga coccomyxa subellipsoidea
RT reveals traits of cold adaptation.";
RL Genome Biol. 13:R39-R39(2012).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EIE22865.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AGSI01000009; EIE22865.1; -; Genomic_DNA.
DR RefSeq; XP_005647409.1; XM_005647352.1.
DR AlphaFoldDB; I0YWU6; -.
DR STRING; 574566.I0YWU6; -.
DR GeneID; 17040852; -.
DR KEGG; csl:COCSUDRAFT_63988; -.
DR eggNOG; KOG4299; Eukaryota.
DR OrthoDB; 473783at2759; -.
DR Proteomes; UP000007264; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR CDD; cd15504; PHD_PRHA_like; 1.
DR Gene3D; 3.30.40.10; Zinc/RING finger domain, C3HC4 (zinc finger); 1.
DR InterPro; IPR016177; DNA-bd_dom_sf.
DR InterPro; IPR001739; Methyl_CpG_DNA-bd.
DR InterPro; IPR045876; PRHA-like_PHD-finger.
DR InterPro; IPR019786; Zinc_finger_PHD-type_CS.
DR InterPro; IPR011011; Znf_FYVE_PHD.
DR InterPro; IPR001965; Znf_PHD.
DR InterPro; IPR019787; Znf_PHD-finger.
DR InterPro; IPR013083; Znf_RING/FYVE/PHD.
DR PANTHER; PTHR12628:SF10; C. ELEGANS HOMEOBOX; 1.
DR PANTHER; PTHR12628; POLYCOMB-LIKE TRANSCRIPTION FACTOR; 1.
DR Pfam; PF01429; MBD; 1.
DR Pfam; PF00628; PHD; 1.
DR SMART; SM00391; MBD; 1.
DR SMART; SM00249; PHD; 1.
DR SUPFAM; SSF54171; DNA-binding domain; 2.
DR SUPFAM; SSF57903; FYVE/PHD zinc finger; 1.
DR PROSITE; PS50982; MBD; 1.
DR PROSITE; PS01359; ZF_PHD_1; 1.
DR PROSITE; PS50016; ZF_PHD_2; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000007264};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771, ECO:0000256|PROSITE-
KW ProRule:PRU00146}.
FT DOMAIN 148..224
FT /note="MBD"
FT /evidence="ECO:0000259|PROSITE:PS50982"
FT DOMAIN 803..859
FT /note="PHD-type"
FT /evidence="ECO:0000259|PROSITE:PS50016"
FT REGION 83..114
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 300..320
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 402..618
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 630..710
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 917..1031
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1061..1086
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 304..318
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 575..616
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 651..670
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 695..710
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 917..946
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 978..1013
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1015..1030
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1086 AA; 114238 MW; B522938B13EE6CFE CRC64;
MPSKLPSYTG PVEEQQAATG RAALQSLREY LADRGAALPE GFEASVHLRK EGSPGAVDIY
YFDHLHTRYR SKLQVARALG VADSRDRDEG ASLPSSPRVM GVSPSSPAAA DAAGQQLAAH
HLQLPAHADQ GGDFVGSEEE VKVMEALSKY LAKLGGQLEP GWRCQVRKRA GPGNRVDTYF
NSPNGEKFRS KTDVARHLGL AVPAARRPAG GAASPGVDVN MAEEDVAAML MALPHGKPIT
AREAARARKV SGAVAGEWAG RAKGPGTPDN AAGRSGNGLH SIAAALEEAS PFQVKQEGAA
YADDEQMFDS DDEDTDTRAD EKAELAARQM RRLRAAAHAQ GLTLAPGWRV DVSIRKNGTS
RGSTDYYYRL QGGGTFRSIT AAMKHLLAAE HGAAPAVATD APMLSPASEP AADPASPQLP
QQERPPSPAS AARAAPGSPA DNGGPAGPSA KPKPKKFGSL TALSHPGRSG AAVARPTSPS
DSVAKGKNPA PEEPSSAARP TTKAALAALG RDATASGHAP EKQEAAEADR AGLEAGAADA
AAESNMPGPS GMGSVDEAGA AVEGNASEPD QAGHNESNAR MRQRKPNRNA MAPKDEPAEK
TGPAPAARKR RTVNRGRVDY ATVAAMADLQ YAISNEDRRA TLRKRNAAKS GTPEATATNP
PSTGVDKDAA QSTAEGSEPA KGRSQGASGA AGKGEEGEQK PKKAERKRER VVTPYTKLVN
RVRGLISRIR VEEAYLDAYE GEGWRGANRE KLRPTAELEQ ARLQIEKCKL GMREAVKECE
ESGGDRAIPP EHIDEDGEID EAHIFCGHCH DPESYEDNDI ILCDGNCHRA YHEKCLVPPL
DSSTLPEDEG WLCPACDAKA DILTMINEEY GFEYEQETPW HAVFAEAPMS PRAAADGAAA
APVTFKTLAD FLAHADLPDS DADDSSFKDE SGSDGEEGEE AHGGGSGSDG ESESGSEADS
VDSEASEGSS VESMASEERK GLQAEEAAAG EGRPQRKLRS AGKRARGNAD EVADVDADMG
ADVDDDDEEE SLVLSGKRAR TTVDYRKLNM EMFGDFESYE GETLSDEDFT PREAEAEAAA
ASPASP
//