GenomeNet

Database: UniProt
Entry: R1DL96_EMIHU
ID   R1DL96_EMIHU            Unreviewed;      1576 AA.
AC   R1DL96;
DT   26-JUN-2013, integrated into UniProtKB/TrEMBL.
DT   26-JUN-2013, sequence version 1.
DT   27-MAR-2024, entry version 61.
DE   RecName: Full=CW-type domain-containing protein {ECO:0008006|Google:ProtNLM};
GN   ORFNames=EMIHUDRAFT_96740 {ECO:0000313|EMBL:EOD09469.1};
OS   Emiliania huxleyi (Coccolithophore) (Pontosphaera huxleyi).
OC   Eukaryota; Haptista; Haptophyta; Prymnesiophyceae; Isochrysidales;
OC   Noelaerhabdaceae; Emiliania.
OX   NCBI_TaxID=2903 {ECO:0000313|EMBL:EOD09469.1};
RN   [1] {ECO:0000313|EMBL:EOD09469.1}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=CCMP1516 {ECO:0000313|EMBL:EOD09469.1};
RG   DOE Joint Genome Institute;
RA   Read B., Kegel J., Klute M., Kuo A., Lefebvre S.C., Maumus F., Mayer C.,
RA   Miller J., Allen A., Bidle K., Borodovsky M., Bowler C., Brownlee C.,
RA   Claverie J.-M., Cock M., De Vargas C., Elias M., Frickenhaus S.,
RA   Gladyshev V.N., Gonzalez K., Guda C., Hadaegh A., Herman E.,
RA   Iglesias-Rodriguez D., Jones B., Lawson T., Leese F., Lin Y.-C.,
RA   Lindquist E., Lobanov A., Lucas S., Malik S.-H.B., Marsh M.E., Mock T.,
RA   Monier A., Moreau H., Mueller-Roeber B., Napier J., Ogata H., Parker M.,
RA   Probert I., Quesneville H., Raines C., Rensing S., Riano-Pachon D.M.,
RA   Richier S., Rokitta S., Salamov A., Sarno A.F., Schmutz J., Schroeder D.,
RA   Shiraiwa Y., Soanes D.M., Valentin K., Van Der Giezen M., Van Der Peer Y.,
RA   Vardi A., Verret F., Von Dassow P., Wheeler G., Williams B., Wilson W.,
RA   Wolfe G., Wurch L.L., Young J., Dacks J.B., Delwiche C.F., Dyhrman S.,
RA   Glockner G., John U., Richards T., Worden A.Z., Zhang X., Grigoriev I.V.;
RT   "Genome variability drives Emiliania's global distribution.";
RL   Submitted (JUL-2012) to the EMBL/GenBank/DDBJ databases.
RN   [2] {ECO:0000313|Proteomes:UP000013827}
RP   NUCLEOTIDE SEQUENCE.
RC   STRAIN=CCMP1516 {ECO:0000313|Proteomes:UP000013827};
RX   PubMed=23760476; DOI=10.1038/nature12221;
RA   Read B.A., Kegel J., Klute M.J., Kuo A., Lefebvre S.C., Maumus F.,
RA   Mayer C., Miller J., Monier A., Salamov A., Young J., Aguilar M.,
RA   Claverie J.M., Frickenhaus S., Gonzalez K., Herman E.K., Lin Y.C.,
RA   Napier J., Ogata H., Sarno A.F., Shmutz J., Schroeder D., de Vargas C.,
RA   Verret F., von Dassow P., Valentin K., Van de Peer Y., Wheeler G.,
RA   Dacks J.B., Delwiche C.F., Dyhrman S.T., Glockner G., John U., Richards T.,
RA   Worden A.Z., Zhang X., Grigoriev I.V., Allen A.E., Bidle K., Borodovsky M.,
RA   Bowler C., Brownlee C., Cock J.M., Elias M., Gladyshev V.N., Groth M.,
RA   Guda C., Hadaegh A., Iglesias-Rodriguez M.D., Jenkins J., Jones B.M.,
RA   Lawson T., Leese F., Lindquist E., Lobanov A., Lomsadze A., Malik S.B.,
RA   Marsh M.E., Mackinder L., Mock T., Mueller-Roeber B., Pagarete A.,
RA   Parker M., Probert I., Quesneville H., Raines C., Rensing S.A.,
RA   Riano-Pachon D.M., Richier S., Rokitta S., Shiraiwa Y., Soanes D.M.,
RA   van der Giezen M., Wahlund T.M., Williams B., Wilson W., Wolfe G.,
RA   Wurch L.L.;
RT   "Pan genome of the phytoplankton Emiliania underpins its global
RT   distribution.";
RL   Nature 499:209-213(2013).
RN   [3] {ECO:0000313|EnsemblProtists:EOD09469}
RP   IDENTIFICATION.
RG   EnsemblProtists;
RL   Submitted (NOV-2023) to UniProtKB.
CC   -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|PROSITE-ProRule:PRU00108,
CC       ECO:0000256|RuleBase:RU000682}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; KB868684; EOD09469.1; -; Genomic_DNA.
DR   RefSeq; XP_005761898.1; XM_005761841.1.
DR   STRING; 2903.R1DL96; -.
DR   PaxDb; 2903-EOD09469; -.
DR   EnsemblProtists; EOD09469; EOD09469; EMIHUDRAFT_96740.
DR   GeneID; 17255584; -.
DR   KEGG; ehx:EMIHUDRAFT_96740; -.
DR   eggNOG; ENOG502SBSD; Eukaryota.
DR   HOGENOM; CLU_245314_0_0_1; -.
DR   Proteomes; UP000013827; Unassembled WGS sequence.
DR   GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR   GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR   GO; GO:0000981; F:DNA-binding transcription factor activity, RNA polymerase II-specific; IEA:InterPro.
DR   GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR   CDD; cd00086; homeodomain; 1.
DR   Gene3D; 3.30.40.100; -; 1.
DR   Gene3D; 3.30.730.10; AP2/ERF domain; 1.
DR   Gene3D; 1.10.10.60; Homeodomain-like; 1.
DR   InterPro; IPR036955; AP2/ERF_dom_sf.
DR   InterPro; IPR016177; DNA-bd_dom_sf.
DR   InterPro; IPR009057; Homeobox-like_sf.
DR   InterPro; IPR017970; Homeobox_CS.
DR   InterPro; IPR001356; Homeobox_dom.
DR   InterPro; IPR011124; Znf_CW.
DR   InterPro; IPR011011; Znf_FYVE_PHD.
DR   PANTHER; PTHR31677; AP2 DOMAIN CLASS TRANSCRIPTION FACTOR; 1.
DR   PANTHER; PTHR31677:SF127; ETHYLENE-RESPONSIVE TRANSCRIPTION FACTOR ERF105; 1.
DR   Pfam; PF00046; Homeodomain; 1.
DR   Pfam; PF07496; zf-CW; 1.
DR   SMART; SM00389; HOX; 1.
DR   SUPFAM; SSF54171; DNA-binding domain; 1.
DR   SUPFAM; SSF57903; FYVE/PHD zinc finger; 1.
DR   SUPFAM; SSF46689; Homeodomain-like; 1.
DR   PROSITE; PS00027; HOMEOBOX_1; 1.
DR   PROSITE; PS50071; HOMEOBOX_2; 1.
DR   PROSITE; PS51050; ZF_CW; 1.
PE   4: Predicted;
KW   DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|PROSITE-
KW   ProRule:PRU00108};
KW   Homeobox {ECO:0000256|ARBA:ARBA00023155, ECO:0000256|PROSITE-
KW   ProRule:PRU00108}; Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW   Nucleus {ECO:0000256|ARBA:ARBA00023242, ECO:0000256|PROSITE-
KW   ProRule:PRU00108}; Reference proteome {ECO:0000313|Proteomes:UP000013827};
KW   Zinc {ECO:0000256|ARBA:ARBA00022833};
KW   Zinc-finger {ECO:0000256|ARBA:ARBA00022771}.
FT   DOMAIN          365..425
FT                   /note="Homeobox"
FT                   /evidence="ECO:0000259|PROSITE:PS50071"
FT   DOMAIN          947..998
FT                   /note="CW-type"
FT                   /evidence="ECO:0000259|PROSITE:PS51050"
FT   DNA_BIND        367..426
FT                   /note="Homeobox"
FT                   /evidence="ECO:0000256|PROSITE-ProRule:PRU00108"
FT   REGION          57..76
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          148..201
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          922..946
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1012..1032
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1116..1158
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1435..1466
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        152..170
FT                   /note="Acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1116..1130
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        1137..1153
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   1576 AA;  160349 MW;  EB60A1AABA7C334E CRC64;
     MPCTCGGCIA AHEMKPERGG ATKKGLVMVW DETKQTCKLD ADQLSAVVAA LAPAGSAAAS
     SEEPPPPVGS TAAEVVGPPV PEETAAAVAT ATAGQAAEEE SAAAAVRMAE EEAAAEATAA
     ARMDEEAAAE AAAAAAAAAL LETAWEVASG EGEGEGEGEG EGEGEGEGEG EATFDASPVQ
     EEAEGGPASQ MDSSRKNKPQ RCSVCGGIGH KSPSCQQTTT YGTSTGAPAG YMPGRQPQPQ
     LPSVEAKFGG PRVRKQICRA RICPYKAAGT AATATAAAAE VERLAAAAAA AAAAAAAAAE
     VERRAAAAAS EVEAAAAALC CDGVEALGGG GGSVEATGGG MAGAEDMARL GPASAQTGLM
     VTATNVDKST KWIVQPAALA VLEQVFAMDR FPTRQLRASL ATDLAVNPRQ VEVWFQNRRQ
     KMKKEEARAA AEWSTAVAEW STAAAAAAAT TAAAASPPAA ARPTLEIPEG VLSGQKDRHR
     CARQQMWRGG PVVDGAFFDV TVPADLPASR KVHVAMPTEQ PAEQAALSQA ARFTSRIRDV
     AALVLARHGP LSKEQLLGCL QQFVQQAGEE LKPLATSLRE LLDSAEPSSR LGNLLCNNRA
     YQGDEATQPA DASRRTPWFK SSRVGSGVSW SFEPLGLDRA RPCDSAGILL EARQRVGPEC
     ERVLAGISPP SLLARAPART PAPLVTEAEG LRLHLSSSNS TGYKGVYKDG SRFGTQHRVD
     GKSRHIGTFG TAVEAAVAYA RAVGQAPAQG PECERVLAGI SPPAVPVRAP ALVTEAEGLR
     LHLSSSNATG YKGVYKDGSR FGTQHRVDGK SRHIGTFGTA VEAAVAYARA VGQAPAQGPE
     CERVLAGISP PAVPVRAPAL VTEAEGLRLH LSSSNATGYK GVYKDGSRFG TQHRVDGKSR
     HIGTFGTAVE AAVAYARAVG EAPAQGSSAA GSEALDSDDG DGEDEEGEAR CLWVACDRCD
     RWRRLPAETP RPLPAEWFCW MHPDPARRQC EAPEDEWEED SVAADAAEDG LEEGAGAAEA
     AAEPMDEEDE AGGEAYWVDA EVDEEATDEE ATDEEAEAAD GDVVAAEAAG AAEAEGAEGA
     AGAAAAAGAA AGAAAAAAAG AAGVADAPHK RVRMAKRPYE PPEGPPPKRR SDAARGPPSS
     ASLSSSSSSA AAAAAESAPP HAPAEAAVLL CGNTHECNGG AFGCILPAGH GGPHDFTLPA
     KRARCEKRPF DPADVMAAAA LGSQGRLQSP ELGAAEAALL AKLLAVAARL KDEGSPEATV
     HVDGQWAALS DSSVPPLAPP VQPAPPPAPP AFQTAAHAAH ATHAAPAAPA APAATCCECG
     TLLDDEAGEV MAPSVGCDAG GCSRWACLAC AGFRSEAEAG RKTWFCTLHA PAVAVVAEAE
     GLRLLLSSSS STGYRGVSKN SGGFRAQRKV DGKTVSLGTF RTAVEAAVAY ARAVGEEGPA
     PPPRPAKRPR ESAAAPALEP TAPSSAPAAA AAAASSSFAP PAPSLPPSRL FELLPDGRTV
     HKEAPPDNGG MVAVAEALGR IGLQSYTAAF DSEGFDDIEF LLGLDSAERA AVATATGMDA
     KPGHAGKWVK FGFGKP
//