ID R1EQD2_EMIHU Unreviewed; 610 AA.
AC R1EQD2;
DT 26-JUN-2013, integrated into UniProtKB/TrEMBL.
DT 26-JUN-2013, sequence version 1.
DT 27-MAR-2024, entry version 53.
DE RecName: Full=Chromo domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=EMIHUDRAFT_114025 {ECO:0000313|EMBL:EOD28840.1};
OS Emiliania huxleyi (Coccolithophore) (Pontosphaera huxleyi).
OC Eukaryota; Haptista; Haptophyta; Prymnesiophyceae; Isochrysidales;
OC Noelaerhabdaceae; Emiliania.
OX NCBI_TaxID=2903 {ECO:0000313|EMBL:EOD28840.1};
RN [1] {ECO:0000313|EMBL:EOD28840.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=CCMP1516 {ECO:0000313|EMBL:EOD28840.1};
RG DOE Joint Genome Institute;
RA Read B., Kegel J., Klute M., Kuo A., Lefebvre S.C., Maumus F., Mayer C.,
RA Miller J., Allen A., Bidle K., Borodovsky M., Bowler C., Brownlee C.,
RA Claverie J.-M., Cock M., De Vargas C., Elias M., Frickenhaus S.,
RA Gladyshev V.N., Gonzalez K., Guda C., Hadaegh A., Herman E.,
RA Iglesias-Rodriguez D., Jones B., Lawson T., Leese F., Lin Y.-C.,
RA Lindquist E., Lobanov A., Lucas S., Malik S.-H.B., Marsh M.E., Mock T.,
RA Monier A., Moreau H., Mueller-Roeber B., Napier J., Ogata H., Parker M.,
RA Probert I., Quesneville H., Raines C., Rensing S., Riano-Pachon D.M.,
RA Richier S., Rokitta S., Salamov A., Sarno A.F., Schmutz J., Schroeder D.,
RA Shiraiwa Y., Soanes D.M., Valentin K., Van Der Giezen M., Van Der Peer Y.,
RA Vardi A., Verret F., Von Dassow P., Wheeler G., Williams B., Wilson W.,
RA Wolfe G., Wurch L.L., Young J., Dacks J.B., Delwiche C.F., Dyhrman S.,
RA Glockner G., John U., Richards T., Worden A.Z., Zhang X., Grigoriev I.V.;
RT "Genome variability drives Emilianias global distribution.";
RL Submitted (JUL-2012) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Proteomes:UP000013827}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=CCMP1516 {ECO:0000313|Proteomes:UP000013827};
RX PubMed=23760476; DOI=10.1038/nature12221;
RA Read B.A., Kegel J., Klute M.J., Kuo A., Lefebvre S.C., Maumus F.,
RA Mayer C., Miller J., Monier A., Salamov A., Young J., Aguilar M.,
RA Claverie J.M., Frickenhaus S., Gonzalez K., Herman E.K., Lin Y.C.,
RA Napier J., Ogata H., Sarno A.F., Shmutz J., Schroeder D., de Vargas C.,
RA Verret F., von Dassow P., Valentin K., Van de Peer Y., Wheeler G.,
RA Dacks J.B., Delwiche C.F., Dyhrman S.T., Glockner G., John U., Richards T.,
RA Worden A.Z., Zhang X., Grigoriev I.V., Allen A.E., Bidle K., Borodovsky M.,
RA Bowler C., Brownlee C., Cock J.M., Elias M., Gladyshev V.N., Groth M.,
RA Guda C., Hadaegh A., Iglesias-Rodriguez M.D., Jenkins J., Jones B.M.,
RA Lawson T., Leese F., Lindquist E., Lobanov A., Lomsadze A., Malik S.B.,
RA Marsh M.E., Mackinder L., Mock T., Mueller-Roeber B., Pagarete A.,
RA Parker M., Probert I., Quesneville H., Raines C., Rensing S.A.,
RA Riano-Pachon D.M., Richier S., Rokitta S., Shiraiwa Y., Soanes D.M.,
RA van der Giezen M., Wahlund T.M., Williams B., Wilson W., Wolfe G.,
RA Wurch L.L.;
RT "Pan genome of the phytoplankton Emiliania underpins its global
RT distribution.";
RL Nature 499:209-213(2013).
RN [3] {ECO:0000313|EnsemblProtists:EOD28840}
RP IDENTIFICATION.
RG EnsemblProtists;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KB864886; EOD28840.1; -; Genomic_DNA.
DR RefSeq; XP_005781269.1; XM_005781212.1.
DR STRING; 2903.R1EQD2; -.
DR PaxDb; 2903-EOD28840; -.
DR EnsemblProtists; EOD28840; EOD28840; EMIHUDRAFT_114025.
DR GeneID; 17274386; -.
DR KEGG; ehx:EMIHUDRAFT_114025; -.
DR HOGENOM; CLU_447948_0_0_1; -.
DR Proteomes; UP000013827; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-KW.
DR GO; GO:0003677; F:DNA binding; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR CDD; cd00024; CD_CSD; 1.
DR Gene3D; 2.40.50.40; -; 1.
DR Gene3D; 1.10.720.30; SAP domain; 1.
DR Gene3D; 3.30.1740.10; Zinc finger, PARP-type; 1.
DR InterPro; IPR016197; Chromo-like_dom_sf.
DR InterPro; IPR000953; Chromo/chromo_shadow_dom.
DR InterPro; IPR023780; Chromo_domain.
DR InterPro; IPR023779; Chromodomain_CS.
DR InterPro; IPR003034; SAP_dom.
DR InterPro; IPR036361; SAP_dom_sf.
DR InterPro; IPR001510; Znf_PARP.
DR InterPro; IPR036957; Znf_PARP_sf.
DR PANTHER; PTHR22812; CHROMOBOX PROTEIN; 1.
DR PANTHER; PTHR22812:SF112; FI06908P; 1.
DR Pfam; PF00385; Chromo; 1.
DR Pfam; PF02037; SAP; 1.
DR Pfam; PF00645; zf-PARP; 1.
DR SMART; SM00298; CHROMO; 1.
DR SMART; SM00513; SAP; 1.
DR SMART; SM01336; zf-PARP; 1.
DR SUPFAM; SSF54160; Chromo domain-like; 1.
DR SUPFAM; SSF57716; Glucocorticoid receptor-like (DNA-binding domain); 1.
DR SUPFAM; SSF68906; SAP domain; 1.
DR PROSITE; PS00598; CHROMO_1; 1.
DR PROSITE; PS50013; CHROMO_2; 1.
DR PROSITE; PS50800; SAP; 1.
DR PROSITE; PS50064; ZF_PARP_2; 1.
PE 4: Predicted;
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000013827};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771}.
FT DOMAIN 228..278
FT /note="PARP-type"
FT /evidence="ECO:0000259|PROSITE:PS50064"
FT DOMAIN 470..508
FT /note="Chromo"
FT /evidence="ECO:0000259|PROSITE:PS50013"
FT DOMAIN 573..607
FT /note="SAP"
FT /evidence="ECO:0000259|PROSITE:PS50800"
FT REGION 45..78
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 117..146
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 329..350
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 408..472
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 524..555
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 448..472
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 610 AA; 61313 MW; 1F8C2C8C1203FBBC CRC64;
MGWFDDNHWA GEAYDFGAGY MAQAGFTDAY GGPSSSMQRL AASLHRGAPR GGGGSRGGGA
SRGGASRGGG GGGGGGGGGA PRGGGGGGFC ACGSSAAKAC TLQMCGNCCT GCPRHPQSKG
KAAASSAPAP KLPVSSNPPP STNAAMTGRP AAAVLAVIAP APSAAGASMS TPRFAASSVA
PSQFAVAATA RAPAPVRRAP AAAPKAPKAA VAGGGCGPFL AQWETRPTVV EYAKSGRSTC
KACHLPIDKG EVRIGTEGLG ETAYGGAFMM TSWKHVGCQP RSRNGGNTLY GKAALRSGDR
AYVDAWARGD QSAMDAHLAR ARRAAADAEA AAERAREEER AAKERARAEK AAAKAAEKEV
ARAAKAAERE AAKAAKAAAK EAEKEAAKGA KAAAKEAAKV AEVAAKAAGT AAGKEAAKAA
RAAVALDAAK PAKRPRDADA GDGVAGDASC AEEEEEEEEM EEEEEEEEEF EVEALLEAKG
SGRGLKYLVK WRGYDGPDDN TWEPAKQLPA GMVAEFEAAR AAPPSAKRAR VRAAPPQAAA
VAGGGDEEDD EEMTVGETAA RAELKAEADA AAVGKLTVPK LKEALLQRGL EATGLKAVLA
ARLLEALGSE
//