ID R1CED5_EMIHU Unreviewed; 739 AA.
AC R1CED5;
DT 26-JUN-2013, integrated into UniProtKB/TrEMBL.
DT 26-JUN-2013, sequence version 1.
DT 27-MAR-2024, entry version 58.
DE RecName: Full=CW-type domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=EMIHUDRAFT_101766 {ECO:0000313|EMBL:EOD21073.1};
OS Emiliania huxleyi (Coccolithophore) (Pontosphaera huxleyi).
OC Eukaryota; Haptista; Haptophyta; Prymnesiophyceae; Isochrysidales;
OC Noelaerhabdaceae; Emiliania.
OX NCBI_TaxID=2903 {ECO:0000313|EMBL:EOD21073.1};
RN [1] {ECO:0000313|EMBL:EOD21073.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=CCMP1516 {ECO:0000313|EMBL:EOD21073.1};
RG DOE Joint Genome Institute;
RA Read B., Kegel J., Klute M., Kuo A., Lefebvre S.C., Maumus F., Mayer C.,
RA Miller J., Allen A., Bidle K., Borodovsky M., Bowler C., Brownlee C.,
RA Claverie J.-M., Cock M., De Vargas C., Elias M., Frickenhaus S.,
RA Gladyshev V.N., Gonzalez K., Guda C., Hadaegh A., Herman E.,
RA Iglesias-Rodriguez D., Jones B., Lawson T., Leese F., Lin Y.-C.,
RA Lindquist E., Lobanov A., Lucas S., Malik S.-H.B., Marsh M.E., Mock T.,
RA Monier A., Moreau H., Mueller-Roeber B., Napier J., Ogata H., Parker M.,
RA Probert I., Quesneville H., Raines C., Rensing S., Riano-Pachon D.M.,
RA Richier S., Rokitta S., Salamov A., Sarno A.F., Schmutz J., Schroeder D.,
RA Shiraiwa Y., Soanes D.M., Valentin K., Van Der Giezen M., Van Der Peer Y.,
RA Vardi A., Verret F., Von Dassow P., Wheeler G., Williams B., Wilson W.,
RA Wolfe G., Wurch L.L., Young J., Dacks J.B., Delwiche C.F., Dyhrman S.,
RA Glockner G., John U., Richards T., Worden A.Z., Zhang X., Grigoriev I.V.;
RT "Genome variability drives Emilianias global distribution.";
RL Submitted (JUL-2012) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Proteomes:UP000013827}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=CCMP1516 {ECO:0000313|Proteomes:UP000013827};
RX PubMed=23760476; DOI=10.1038/nature12221;
RA Read B.A., Kegel J., Klute M.J., Kuo A., Lefebvre S.C., Maumus F.,
RA Mayer C., Miller J., Monier A., Salamov A., Young J., Aguilar M.,
RA Claverie J.M., Frickenhaus S., Gonzalez K., Herman E.K., Lin Y.C.,
RA Napier J., Ogata H., Sarno A.F., Shmutz J., Schroeder D., de Vargas C.,
RA Verret F., von Dassow P., Valentin K., Van de Peer Y., Wheeler G.,
RA Dacks J.B., Delwiche C.F., Dyhrman S.T., Glockner G., John U., Richards T.,
RA Worden A.Z., Zhang X., Grigoriev I.V., Allen A.E., Bidle K., Borodovsky M.,
RA Bowler C., Brownlee C., Cock J.M., Elias M., Gladyshev V.N., Groth M.,
RA Guda C., Hadaegh A., Iglesias-Rodriguez M.D., Jenkins J., Jones B.M.,
RA Lawson T., Leese F., Lindquist E., Lobanov A., Lomsadze A., Malik S.B.,
RA Marsh M.E., Mackinder L., Mock T., Mueller-Roeber B., Pagarete A.,
RA Parker M., Probert I., Quesneville H., Raines C., Rensing S.A.,
RA Riano-Pachon D.M., Richier S., Rokitta S., Shiraiwa Y., Soanes D.M.,
RA van der Giezen M., Wahlund T.M., Williams B., Wilson W., Wolfe G.,
RA Wurch L.L.;
RT "Pan genome of the phytoplankton Emiliania underpins its global
RT distribution.";
RL Nature 499:209-213(2013).
RN [3] {ECO:0000313|EnsemblProtists:EOD21073}
RP IDENTIFICATION.
RG EnsemblProtists;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KB865995; EOD21073.1; -; Genomic_DNA.
DR RefSeq; XP_005773502.1; XM_005773445.1.
DR PaxDb; 2903-EOD21073; -.
DR EnsemblProtists; EOD21073; EOD21073; EMIHUDRAFT_101766.
DR GeneID; 17266620; -.
DR KEGG; ehx:EMIHUDRAFT_101766; -.
DR HOGENOM; CLU_382404_0_0_1; -.
DR Proteomes; UP000013827; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR Gene3D; 3.30.40.100; -; 2.
DR Gene3D; 3.30.730.10; AP2/ERF domain; 4.
DR InterPro; IPR001471; AP2/ERF_dom.
DR InterPro; IPR036955; AP2/ERF_dom_sf.
DR InterPro; IPR016177; DNA-bd_dom_sf.
DR InterPro; IPR011124; Znf_CW.
DR PANTHER; PTHR31677; AP2 DOMAIN CLASS TRANSCRIPTION FACTOR; 1.
DR PANTHER; PTHR31677:SF127; ETHYLENE-RESPONSIVE TRANSCRIPTION FACTOR ERF105; 1.
DR Pfam; PF07496; zf-CW; 2.
DR SMART; SM00380; AP2; 2.
DR SUPFAM; SSF54171; DNA-binding domain; 4.
DR PROSITE; PS51032; AP2_ERF; 1.
DR PROSITE; PS51050; ZF_CW; 2.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000013827};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771}.
FT DOMAIN 317..372
FT /note="AP2/ERF"
FT /evidence="ECO:0000259|PROSITE:PS51032"
FT DOMAIN 468..519
FT /note="CW-type"
FT /evidence="ECO:0000259|PROSITE:PS51050"
FT DOMAIN 699..739
FT /note="CW-type"
FT /evidence="ECO:0000259|PROSITE:PS51050"
FT REGION 1..23
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 258..316
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 439..461
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 669..695
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 739 AA; 76362 MW; A8E54C5C7552912E CRC64;
MPISHLGGTH DAPTEDPAGE AGSARTELMP LAPRTEALAA SEPGRAASEL EGAAVTTEAE
GLRLHLSNSN STGYRGVHKH SGRFRAQHRV NGRIVYLGYF DTAVEAAVAY ARSVGEYQPP
TQPPPSVAAE AEGVRLHLSS SSSTGYLGVW EEASGRFRAQ HRVDGMHAGL GSFHTAVEAA
VAVARAISEA SAVATSAPES APPSELADET ICARTQERGS GAIGCVLPAG HAGPHSFTLQ
GKRARCVKCP FDPTDVEAEE SGECASTQAG DDSESEVEAT AGMAAPAGSE ATSSQEAPVA
EAESLRLHPS SCSSSTGYKG VFKQASGRFL AERRLDGRRV SLGTFDSAVE AAVAYARAVG
EYQPPTVAAE AEGMRLHLSS SNNTGYRGVC ELAYGRFRAQ HRVDGRQDVL GYFDTAVEAA
ASYARAVVEA AVPGEAAAAA GEASDSGEGE PGGAGGEAAA AESSGTGEAA SCLWVACDRC
DKWRRLPSSM PGQLPAEWLC WMHPDPACRV CEASEEELGL GRAQAERQAA AVVAEAEGLR
LHLSSSSTGY KGVYEQSGRF QAKHRVDGRL AHLGTFDTAV EAAVAYARAV GEYQPPTVAA
AEAEGLQLHL SSSNATGYKG VFKDGARFKA QHRVDGRMVF DTAVEAAVAY ARAVGQAAAA
GVAGSSAAAA GEASGSGEGE QGGAGGEAAD AEGSSDEEEA ASCLWVACDR CDKWRRLPSG
MPGSLTAEWF CWMHPPQSA
//