ID R1DL96_EMIHU Unreviewed; 1576 AA.
AC R1DL96;
DT 26-JUN-2013, integrated into UniProtKB/TrEMBL.
DT 26-JUN-2013, sequence version 1.
DT 27-MAR-2024, entry version 61.
DE RecName: Full=CW-type domain-containing protein {ECO:0008006|Google:ProtNLM};
GN ORFNames=EMIHUDRAFT_96740 {ECO:0000313|EMBL:EOD09469.1};
OS Emiliania huxleyi (Coccolithophore) (Pontosphaera huxleyi).
OC Eukaryota; Haptista; Haptophyta; Prymnesiophyceae; Isochrysidales;
OC Noelaerhabdaceae; Emiliania.
OX NCBI_TaxID=2903 {ECO:0000313|EMBL:EOD09469.1};
RN [1] {ECO:0000313|EMBL:EOD09469.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=CCMP1516 {ECO:0000313|EMBL:EOD09469.1};
RG DOE Joint Genome Institute;
RA Read B., Kegel J., Klute M., Kuo A., Lefebvre S.C., Maumus F., Mayer C.,
RA Miller J., Allen A., Bidle K., Borodovsky M., Bowler C., Brownlee C.,
RA Claverie J.-M., Cock M., De Vargas C., Elias M., Frickenhaus S.,
RA Gladyshev V.N., Gonzalez K., Guda C., Hadaegh A., Herman E.,
RA Iglesias-Rodriguez D., Jones B., Lawson T., Leese F., Lin Y.-C.,
RA Lindquist E., Lobanov A., Lucas S., Malik S.-H.B., Marsh M.E., Mock T.,
RA Monier A., Moreau H., Mueller-Roeber B., Napier J., Ogata H., Parker M.,
RA Probert I., Quesneville H., Raines C., Rensing S., Riano-Pachon D.M.,
RA Richier S., Rokitta S., Salamov A., Sarno A.F., Schmutz J., Schroeder D.,
RA Shiraiwa Y., Soanes D.M., Valentin K., Van Der Giezen M., Van Der Peer Y.,
RA Vardi A., Verret F., Von Dassow P., Wheeler G., Williams B., Wilson W.,
RA Wolfe G., Wurch L.L., Young J., Dacks J.B., Delwiche C.F., Dyhrman S.,
RA Glockner G., John U., Richards T., Worden A.Z., Zhang X., Grigoriev I.V.;
RT "Genome variability drives Emilianias global distribution.";
RL Submitted (JUL-2012) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Proteomes:UP000013827}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=CCMP1516 {ECO:0000313|Proteomes:UP000013827};
RX PubMed=23760476; DOI=10.1038/nature12221;
RA Read B.A., Kegel J., Klute M.J., Kuo A., Lefebvre S.C., Maumus F.,
RA Mayer C., Miller J., Monier A., Salamov A., Young J., Aguilar M.,
RA Claverie J.M., Frickenhaus S., Gonzalez K., Herman E.K., Lin Y.C.,
RA Napier J., Ogata H., Sarno A.F., Shmutz J., Schroeder D., de Vargas C.,
RA Verret F., von Dassow P., Valentin K., Van de Peer Y., Wheeler G.,
RA Dacks J.B., Delwiche C.F., Dyhrman S.T., Glockner G., John U., Richards T.,
RA Worden A.Z., Zhang X., Grigoriev I.V., Allen A.E., Bidle K., Borodovsky M.,
RA Bowler C., Brownlee C., Cock J.M., Elias M., Gladyshev V.N., Groth M.,
RA Guda C., Hadaegh A., Iglesias-Rodriguez M.D., Jenkins J., Jones B.M.,
RA Lawson T., Leese F., Lindquist E., Lobanov A., Lomsadze A., Malik S.B.,
RA Marsh M.E., Mackinder L., Mock T., Mueller-Roeber B., Pagarete A.,
RA Parker M., Probert I., Quesneville H., Raines C., Rensing S.A.,
RA Riano-Pachon D.M., Richier S., Rokitta S., Shiraiwa Y., Soanes D.M.,
RA van der Giezen M., Wahlund T.M., Williams B., Wilson W., Wolfe G.,
RA Wurch L.L.;
RT "Pan genome of the phytoplankton Emiliania underpins its global
RT distribution.";
RL Nature 499:209-213(2013).
RN [3] {ECO:0000313|EnsemblProtists:EOD09469}
RP IDENTIFICATION.
RG EnsemblProtists;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|PROSITE-ProRule:PRU00108,
CC ECO:0000256|RuleBase:RU000682}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KB868684; EOD09469.1; -; Genomic_DNA.
DR RefSeq; XP_005761898.1; XM_005761841.1.
DR STRING; 2903.R1DL96; -.
DR PaxDb; 2903-EOD09469; -.
DR EnsemblProtists; EOD09469; EOD09469; EMIHUDRAFT_96740.
DR GeneID; 17255584; -.
DR KEGG; ehx:EMIHUDRAFT_96740; -.
DR eggNOG; ENOG502SBSD; Eukaryota.
DR HOGENOM; CLU_245314_0_0_1; -.
DR Proteomes; UP000013827; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR CDD; cd00086; homeodomain; 1.
DR Gene3D; 3.30.40.100; -; 1.
DR Gene3D; 3.30.730.10; AP2/ERF domain; 1.
DR Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR InterPro; IPR036955; AP2/ERF_dom_sf.
DR InterPro; IPR016177; DNA-bd_dom_sf.
DR InterPro; IPR009057; Homeobox-like_sf.
DR InterPro; IPR017970; Homeobox_CS.
DR InterPro; IPR001356; Homeobox_dom.
DR InterPro; IPR011124; Znf_CW.
DR InterPro; IPR011011; Znf_FYVE_PHD.
DR PANTHER; PTHR31677; AP2 DOMAIN CLASS TRANSCRIPTION FACTOR; 1.
DR PANTHER; PTHR31677:SF127; ETHYLENE-RESPONSIVE TRANSCRIPTION FACTOR ERF105; 1.
DR Pfam; PF00046; Homeodomain; 1.
DR Pfam; PF07496; zf-CW; 1.
DR SMART; SM00389; HOX; 1.
DR SUPFAM; SSF54171; DNA-binding domain; 1.
DR SUPFAM; SSF57903; FYVE/PHD zinc finger; 1.
DR SUPFAM; SSF46689; Homeodomain-like; 1.
DR PROSITE; PS00027; HOMEOBOX_1; 1.
DR PROSITE; PS50071; HOMEOBOX_2; 1.
DR PROSITE; PS51050; ZF_CW; 1.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW ProRule:PRU00108};
KW Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000013827};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771}.
FT DOMAIN 365..425
FT /note="Homeobox"
FT /evidence="ECO:0000259|PROSITE:PS50071"
FT DOMAIN 947..998
FT /note="CW-type"
FT /evidence="ECO:0000259|PROSITE:PS51050"
FT DNA_BIND 367..426
FT /note="Homeobox"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT REGION 57..76
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 148..201
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 922..946
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1012..1032
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1116..1158
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1435..1466
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 152..170
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1116..1130
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1137..1153
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1576 AA; 160349 MW; EB60A1AABA7C334E CRC64;
MPCTCGGCIA AHEMKPERGG ATKKGLVMVW DETKQTCKLD ADQLSAVVAA LAPAGSAAAS
SEEPPPPVGS TAAEVVGPPV PEETAAAVAT ATAGQAAEEE SAAAAVRMAE EEAAAEATAA
ARMDEEAAAE AAAAAAAAAL LETAWEVASG EGEGEGEGEG EGEGEGEGEG EATFDASPVQ
EEAEGGPASQ MDSSRKNKPQ RCSVCGGIGH KSPSCQQTTT YGTSTGAPAG YMPGRQPQPQ
LPSVEAKFGG PRVRKQICRA RICPYKAAGT AATATAAAAE VERLAAAAAA AAAAAAAAAE
VERRAAAAAS EVEAAAAALC CDGVEALGGG GGSVEATGGG MAGAEDMARL GPASAQTGLM
VTATNVDKST KWIVQPAALA VLEQVFAMDR FPTRQLRASL ATDLAVNPRQ VEVWFQNRRQ
KMKKEEARAA AEWSTAVAEW STAAAAAAAT TAAAASPPAA ARPTLEIPEG VLSGQKDRHR
CARQQMWRGG PVVDGAFFDV TVPADLPASR KVHVAMPTEQ PAEQAALSQA ARFTSRIRDV
AALVLARHGP LSKEQLLGCL QQFVQQAGEE LKPLATSLRE LLDSAEPSSR LGNLLCNNRA
YQGDEATQPA DASRRTPWFK SSRVGSGVSW SFEPLGLDRA RPCDSAGILL EARQRVGPEC
ERVLAGISPP SLLARAPART PAPLVTEAEG LRLHLSSSNS TGYKGVYKDG SRFGTQHRVD
GKSRHIGTFG TAVEAAVAYA RAVGQAPAQG PECERVLAGI SPPAVPVRAP ALVTEAEGLR
LHLSSSNATG YKGVYKDGSR FGTQHRVDGK SRHIGTFGTA VEAAVAYARA VGQAPAQGPE
CERVLAGISP PAVPVRAPAL VTEAEGLRLH LSSSNATGYK GVYKDGSRFG TQHRVDGKSR
HIGTFGTAVE AAVAYARAVG EAPAQGSSAA GSEALDSDDG DGEDEEGEAR CLWVACDRCD
RWRRLPAETP RPLPAEWFCW MHPDPARRQC EAPEDEWEED SVAADAAEDG LEEGAGAAEA
AAEPMDEEDE AGGEAYWVDA EVDEEATDEE ATDEEAEAAD GDVVAAEAAG AAEAEGAEGA
AGAAAAAGAA AGAAAAAAAG AAGVADAPHK RVRMAKRPYE PPEGPPPKRR SDAARGPPSS
ASLSSSSSSA AAAAAESAPP HAPAEAAVLL CGNTHECNGG AFGCILPAGH GGPHDFTLPA
KRARCEKRPF DPADVMAAAA LGSQGRLQSP ELGAAEAALL AKLLAVAARL KDEGSPEATV
HVDGQWAALS DSSVPPLAPP VQPAPPPAPP AFQTAAHAAH ATHAAPAAPA APAATCCECG
TLLDDEAGEV MAPSVGCDAG GCSRWACLAC AGFRSEAEAG RKTWFCTLHA PAVAVVAEAE
GLRLLLSSSS STGYRGVSKN SGGFRAQRKV DGKTVSLGTF RTAVEAAVAY ARAVGEEGPA
PPPRPAKRPR ESAAAPALEP TAPSSAPAAA AAAASSSFAP PAPSLPPSRL FELLPDGRTV
HKEAPPDNGG MVAVAEALGR IGLQSYTAAF DSEGFDDIEF LLGLDSAERA AVATATGMDA
KPGHAGKWVK FGFGKP
//