ID R1EZ98_EMIHU Unreviewed; 1402 AA.
AC R1EZ98;
DT 26-JUN-2013, integrated into UniProtKB/TrEMBL.
DT 26-JUN-2013, sequence version 1.
DT 27-MAR-2024, entry version 61.
DE RecName: Full=PWWP domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=EMIHUDRAFT_114210 {ECO:0000313|EMBL:EOD28286.1};
OS Emiliania huxleyi (Coccolithophore) (Pontosphaera huxleyi).
OC Eukaryota; Haptista; Haptophyta; Prymnesiophyceae; Isochrysidales;
OC Noelaerhabdaceae; Emiliania.
OX NCBI_TaxID=2903 {ECO:0000313|EMBL:EOD28286.1};
RN [1] {ECO:0000313|EMBL:EOD28286.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=CCMP1516 {ECO:0000313|EMBL:EOD28286.1};
RG DOE Joint Genome Institute;
RA Read B., Kegel J., Klute M., Kuo A., Lefebvre S.C., Maumus F., Mayer C.,
RA Miller J., Allen A., Bidle K., Borodovsky M., Bowler C., Brownlee C.,
RA Claverie J.-M., Cock M., De Vargas C., Elias M., Frickenhaus S.,
RA Gladyshev V.N., Gonzalez K., Guda C., Hadaegh A., Herman E.,
RA Iglesias-Rodriguez D., Jones B., Lawson T., Leese F., Lin Y.-C.,
RA Lindquist E., Lobanov A., Lucas S., Malik S.-H.B., Marsh M.E., Mock T.,
RA Monier A., Moreau H., Mueller-Roeber B., Napier J., Ogata H., Parker M.,
RA Probert I., Quesneville H., Raines C., Rensing S., Riano-Pachon D.M.,
RA Richier S., Rokitta S., Salamov A., Sarno A.F., Schmutz J., Schroeder D.,
RA Shiraiwa Y., Soanes D.M., Valentin K., Van Der Giezen M., Van Der Peer Y.,
RA Vardi A., Verret F., Von Dassow P., Wheeler G., Williams B., Wilson W.,
RA Wolfe G., Wurch L.L., Young J., Dacks J.B., Delwiche C.F., Dyhrman S.,
RA Glockner G., John U., Richards T., Worden A.Z., Zhang X., Grigoriev I.V.;
RT "Genome variability drives Emiliania's global distribution.";
RL Submitted (JUL-2012) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Proteomes:UP000013827}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=CCMP1516 {ECO:0000313|Proteomes:UP000013827};
RX PubMed=23760476; DOI=10.1038/nature12221;
RA Read B.A., Kegel J., Klute M.J., Kuo A., Lefebvre S.C., Maumus F.,
RA Mayer C., Miller J., Monier A., Salamov A., Young J., Aguilar M.,
RA Claverie J.M., Frickenhaus S., Gonzalez K., Herman E.K., Lin Y.C.,
RA Napier J., Ogata H., Sarno A.F., Shmutz J., Schroeder D., de Vargas C.,
RA Verret F., von Dassow P., Valentin K., Van de Peer Y., Wheeler G.,
RA Dacks J.B., Delwiche C.F., Dyhrman S.T., Glockner G., John U., Richards T.,
RA Worden A.Z., Zhang X., Grigoriev I.V., Allen A.E., Bidle K., Borodovsky M.,
RA Bowler C., Brownlee C., Cock J.M., Elias M., Gladyshev V.N., Groth M.,
RA Guda C., Hadaegh A., Iglesias-Rodriguez M.D., Jenkins J., Jones B.M.,
RA Lawson T., Leese F., Lindquist E., Lobanov A., Lomsadze A., Malik S.B.,
RA Marsh M.E., Mackinder L., Mock T., Mueller-Roeber B., Pagarete A.,
RA Parker M., Probert I., Quesneville H., Raines C., Rensing S.A.,
RA Riano-Pachon D.M., Richier S., Rokitta S., Shiraiwa Y., Soanes D.M.,
RA van der Giezen M., Wahlund T.M., Williams B., Wilson W., Wolfe G.,
RA Wurch L.L.;
RT "Pan genome of the phytoplankton Emiliania underpins its global
RT distribution.";
RL Nature 499:209-213(2013).
RN [3] {ECO:0000313|EnsemblProtists:EOD28286}
RP IDENTIFICATION.
RG EnsemblProtists;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KB864932; EOD28286.1; -; Genomic_DNA.
DR RefSeq; XP_005780715.1; XM_005780658.1.
DR PaxDb; 2903-EOD28286; -.
DR EnsemblProtists; EOD28286; EOD28286; EMIHUDRAFT_114210.
DR GeneID; 17273831; -.
DR KEGG; ehx:EMIHUDRAFT_114210; -.
DR HOGENOM; CLU_254261_0_0_1; -.
DR Proteomes; UP000013827; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0003700; F:DNA-binding transcription factor activity; IEA:InterPro.
DR CDD; cd05162; PWWP; 1.
DR Gene3D; 2.30.30.140; -; 2.
DR Gene3D; 3.30.730.10; AP2/ERF domain; 9.
DR InterPro; IPR001471; AP2/ERF_dom.
DR InterPro; IPR036955; AP2/ERF_dom_sf.
DR InterPro; IPR016177; DNA-bd_dom_sf.
DR InterPro; IPR000313; PWWP_dom.
DR InterPro; IPR002999; Tudor.
DR PANTHER; PTHR31677; AP2 DOMAIN CLASS TRANSCRIPTION FACTOR; 1.
DR PANTHER; PTHR31677:SF127; ETHYLENE-RESPONSIVE TRANSCRIPTION FACTOR ERF105; 1.
DR Pfam; PF00855; PWWP; 1.
DR SMART; SM00380; AP2; 4.
DR SMART; SM00333; TUDOR; 2.
DR SUPFAM; SSF54171; DNA-binding domain; 9.
DR SUPFAM; SSF63748; Tudor/PWWP/MBT; 1.
DR PROSITE; PS51032; AP2_ERF; 3.
DR PROSITE; PS50812; PWWP; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242};
KW Reference proteome {ECO:0000313|Proteomes:UP000013827};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015}.
FT DOMAIN 126..184
FT /note="AP2/ERF"
FT /evidence="ECO:0000259|PROSITE:PS51032"
FT DOMAIN 352..408
FT /note="AP2/ERF"
FT /evidence="ECO:0000259|PROSITE:PS51032"
FT DOMAIN 706..761
FT /note="AP2/ERF"
FT /evidence="ECO:0000259|PROSITE:PS51032"
FT DOMAIN 1109..1152
FT /note="PWWP"
FT /evidence="ECO:0000259|PROSITE:PS50812"
FT REGION 176..231
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 283..329
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 483..539
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 625..680
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 836..894
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1044..1090
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1202..1236
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1300..1402
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 186..221
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 283..300
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 314..328
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 486..509
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 643..668
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 854..883
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1050..1087
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1219..1233
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1300..1320
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1402 AA; 143849 MW; A15D8480E9ABE011 CRC64;
MVYLGSFATA VEAAVAYARA VGEYQPPAPP TVATEAEGLR LHLSSSSNTG YTGVFQHASG
HRSRFQAQHR VGGRKVSLGY FDSAVEAAVA YARAARGYQP PAQPPPTVAT EAEGLRLHLS
SSNATGYIGV NKHRSSCRFW AQHRAGGRQV SLGSFDTAVE AAVAYERAVG QAEAAGAAAA
VSAEEGDEAG GEEEGGEEEG GEEEGGEEEG GEEEGGEESG WSGGGEEESH FQVGGRVAVQ
YSDGALYPGE IVGFDGASSL YSVQCDDGEL LEDVSLHEML REAGEGEWEE QERASGEGME
AEEAEAAAGS AGSTASPPPP PPPTHREAAP AAPLVTEAEG LRLHLSSSSS TGYRGVYEHS
GRYQAKRWVD GKEVYLGSFA TAVEAAVAYA RAVGEYQPPP APPTVAAEAE GLRLHLSSSN
SIGYKNVREN GARFKAEARV DGKRVCFGTF ATAVEAAVAY ARAVGQAEVA GAGGPARAAA
LAGEAEAGET EEEEAAEARE MEAMEAEEAE EATAAAAAAA EGAEGAPEEE EAAEAAGTAA
VSAAAPLATE AEGLRLHLSS SNATGYRGVC ADRSRYQAKH RAGERMVYLG TFATAVEAAV
AYARAVGEAE AAGAGGPARA AAAAGDAEAG EAMEAAEARE MEAMEAEEAE EAAAAAAAEE
EEGAEGAPEE EEAAKAAGTA AVSAAAPLAT EAEGLRLHLS SSNATGYRGV REHSGRYQAK
HRAGERMVYL GTFATAVEAA VAYARAAGEY QPPAPPTVAT EAEGLRLHVS SITSTGYKGV
HADRSRYQAT HSVGGKQVYL GSFATAVEAA VAYARAVGEA EAAGAGGPAR AAAAAGDAEA
GEAMEAAEAR EMEAMEAEEA EEAAAAAEEE EEEGAEGAPE EEEAAEAAGT AAVSAAAPLA
TEAEGLRLHL SSSNATGYRG VREHSGRYQA KRGGDCKEVY LGNFATAVEA AVAYARAVGE
YQPPAPPTVA TEAEGLRLHV SSITSTGYKG VHADRSRFRA EARVGGKQVY LGTFATAVEA
AVAYARAVGQ AEAAGAGGPA RAAALAGEAE AGETEEEEAA EAREMEAMEA EEAEEAAAAA
AEEEEDVPPQ AEGLRLHLSG CTGSAGLAAG SPCWAKLRDS PWWPAVIRAE EEGESVRVHF
YGTGNVASLE RASCIKPLRE DERLFDATRW GKEFGRPKWK HLRAPFASAV AQALEVEAAL
PHAPPGALSR GPAGSSADNA RRCDERPPKR RRTSAAQEAV QMVADLETAD VELDAPLVRR
CAVTLDRCLP SRANRGAAPP RLGIGDGWAA SAASSWVSLA PPPPPSTLTP PPPPPPCAAL
AASEPWRLPS ARAPRDGGAA PAPLEQQRHR SRLRVQASVG PLSGAAQGGR AAAAPRPLRH
RGGGGGVRAG GRGGRLQWVH PA
//