ID R1FQ42_EMIHU Unreviewed; 626 AA.
AC R1FQ42;
DT 26-JUN-2013, integrated into UniProtKB/TrEMBL.
DT 26-JUN-2013, sequence version 1.
DT 27-MAR-2024, entry version 54.
DE SubName: Full=Putative endonuclease III {ECO:0000313|EMBL:EOD37766.1};
GN ORFNames=EMIHUDRAFT_466941 {ECO:0000313|EMBL:EOD37766.1};
OS Emiliania huxleyi (Coccolithophore) (Pontosphaera huxleyi).
OC Eukaryota; Haptista; Haptophyta; Prymnesiophyceae; Isochrysidales;
OC Noelaerhabdaceae; Emiliania.
OX NCBI_TaxID=2903 {ECO:0000313|EMBL:EOD37766.1};
RN [1] {ECO:0000313|EMBL:EOD37766.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=CCMP1516 {ECO:0000313|EMBL:EOD37766.1};
RG DOE Joint Genome Institute;
RA Read B., Kegel J., Klute M., Kuo A., Lefebvre S.C., Maumus F., Mayer C.,
RA Miller J., Allen A., Bidle K., Borodovsky M., Bowler C., Brownlee C.,
RA Claverie J.-M., Cock M., De Vargas C., Elias M., Frickenhaus S.,
RA Gladyshev V.N., Gonzalez K., Guda C., Hadaegh A., Herman E.,
RA Iglesias-Rodriguez D., Jones B., Lawson T., Leese F., Lin Y.-C.,
RA Lindquist E., Lobanov A., Lucas S., Malik S.-H.B., Marsh M.E., Mock T.,
RA Monier A., Moreau H., Mueller-Roeber B., Napier J., Ogata H., Parker M.,
RA Probert I., Quesneville H., Raines C., Rensing S., Riano-Pachon D.M.,
RA Richier S., Rokitta S., Salamov A., Sarno A.F., Schmutz J., Schroeder D.,
RA Shiraiwa Y., Soanes D.M., Valentin K., Van Der Giezen M., Van Der Peer Y.,
RA Vardi A., Verret F., Von Dassow P., Wheeler G., Williams B., Wilson W.,
RA Wolfe G., Wurch L.L., Young J., Dacks J.B., Delwiche C.F., Dyhrman S.,
RA Glockner G., John U., Richards T., Worden A.Z., Zhang X., Grigoriev I.V.;
RT "Genome variability drives Emilianias global distribution.";
RL Submitted (JUL-2012) to the EMBL/GenBank/DDBJ databases.
RN [2] {ECO:0000313|Proteomes:UP000013827}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=CCMP1516 {ECO:0000313|Proteomes:UP000013827};
RX PubMed=23760476; DOI=10.1038/nature12221;
RA Read B.A., Kegel J., Klute M.J., Kuo A., Lefebvre S.C., Maumus F.,
RA Mayer C., Miller J., Monier A., Salamov A., Young J., Aguilar M.,
RA Claverie J.M., Frickenhaus S., Gonzalez K., Herman E.K., Lin Y.C.,
RA Napier J., Ogata H., Sarno A.F., Shmutz J., Schroeder D., de Vargas C.,
RA Verret F., von Dassow P., Valentin K., Van de Peer Y., Wheeler G.,
RA Dacks J.B., Delwiche C.F., Dyhrman S.T., Glockner G., John U., Richards T.,
RA Worden A.Z., Zhang X., Grigoriev I.V., Allen A.E., Bidle K., Borodovsky M.,
RA Bowler C., Brownlee C., Cock J.M., Elias M., Gladyshev V.N., Groth M.,
RA Guda C., Hadaegh A., Iglesias-Rodriguez M.D., Jenkins J., Jones B.M.,
RA Lawson T., Leese F., Lindquist E., Lobanov A., Lomsadze A., Malik S.B.,
RA Marsh M.E., Mackinder L., Mock T., Mueller-Roeber B., Pagarete A.,
RA Parker M., Probert I., Quesneville H., Raines C., Rensing S.A.,
RA Riano-Pachon D.M., Richier S., Rokitta S., Shiraiwa Y., Soanes D.M.,
RA van der Giezen M., Wahlund T.M., Williams B., Wilson W., Wolfe G.,
RA Wurch L.L.;
RT "Pan genome of the phytoplankton Emiliania underpins its global
RT distribution.";
RL Nature 499:209-213(2013).
RN [3] {ECO:0000313|EnsemblProtists:EOD37766}
RP IDENTIFICATION.
RG EnsemblProtists;
RL Submitted (NOV-2023) to UniProtKB.
CC -!- SIMILARITY: Belongs to the Nth/MutY family.
CC {ECO:0000256|ARBA:ARBA00008343}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; KB864057; EOD37766.1; -; Genomic_DNA.
DR RefSeq; XP_005790195.1; XM_005790138.1.
DR STRING; 2903.R1FQ42; -.
DR PaxDb; 2903-EOD37766; -.
DR EnsemblProtists; EOD37766; EOD37766; EMIHUDRAFT_466941.
DR GeneID; 17283036; -.
DR KEGG; ehx:EMIHUDRAFT_466941; -.
DR eggNOG; KOG1921; Eukaryota.
DR HOGENOM; CLU_437114_0_0_1; -.
DR Proteomes; UP000013827; Unassembled WGS sequence.
DR GO; GO:0003677; F:DNA binding; IEA:InterPro.
DR GO; GO:0004519; F:endonuclease activity; IEA:UniProtKB-KW.
DR GO; GO:0016798; F:hydrolase activity, acting on glycosyl bonds; IEA:UniProtKB-KW.
DR GO; GO:0016829; F:lyase activity; IEA:UniProtKB-KW.
DR GO; GO:0006284; P:base-excision repair; IEA:InterPro.
DR CDD; cd00056; ENDO3c; 1.
DR Gene3D; 1.10.1670.10; Helix-hairpin-Helix base-excision DNA repair enzymes (C-terminal); 1.
DR InterPro; IPR011257; DNA_glycosylase.
DR InterPro; IPR004036; Endonuclease-III-like_CS2.
DR InterPro; IPR003265; HhH-GPD_domain.
DR InterPro; IPR023170; HhH_base_excis_C.
DR InterPro; IPR000445; HhH_motif.
DR PANTHER; PTHR43286; ENDONUCLEASE III-LIKE PROTEIN 1; 1.
DR PANTHER; PTHR43286:SF1; ENDONUCLEASE III-LIKE PROTEIN 1; 1.
DR Pfam; PF00633; HHH; 1.
DR Pfam; PF00730; HhH-GPD; 1.
DR SMART; SM00478; ENDO3c; 1.
DR SUPFAM; SSF48150; DNA-glycosylase; 1.
DR PROSITE; PS01155; ENDONUCLEASE_III_2; 1.
PE 3: Inferred from homology;
KW DNA damage {ECO:0000256|ARBA:ARBA00022763};
KW DNA repair {ECO:0000256|ARBA:ARBA00023204};
KW Endonuclease {ECO:0000313|EMBL:EOD37766.1};
KW Glycosidase {ECO:0000256|ARBA:ARBA00023295};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Lyase {ECO:0000256|ARBA:ARBA00023239};
KW Nuclease {ECO:0000313|EMBL:EOD37766.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000013827};
KW Signal {ECO:0000256|SAM:SignalP}.
FT SIGNAL 1..15
FT /evidence="ECO:0000256|SAM:SignalP"
FT CHAIN 16..626
FT /evidence="ECO:0000256|SAM:SignalP"
FT /id="PRO_5014593089"
FT DOMAIN 400..577
FT /note="HhH-GPD"
FT /evidence="ECO:0000259|SMART:SM00478"
FT REGION 282..349
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 626 AA; 67019 MW; 1B6A54C73E44C019 CRC64;
MRAFATLAWA ALASALTWVP NTARPALHKR WIATPVAVEI DDMMGERECF KDDECELERW
MFGGDGSPPS AAETAPPTEE EQARLARLRE DGRTAVRARE ERRTALRRAA PPRMLTVRTP
LEAGLALTQP TQAEAEAMGV RDWPPTIITQ GGRGAFDADV EDGSLRYVLE GSGTVQRDTT
GEPLAVSPNT LVRVVGEGGA LRWQLDEGVD ELVLLTPEYK GPPLLPVAGA FLAACAALSH
RMRASRSITP ACAVLSTAAA SVQPLALSPL APAAPAARAM ATAAVQTDAH KKKRSSKSRA
AAPLTAEEPA KQPGAPSSTT EQKKRGRRAS ASTAAAASPK RSKATDVRLE PPANWRATWE
LIVELRADRT AVVDSMGTEA IAGESTKEER DFDALVSLML SSQTKDTVNA ATMKKLRAHG
LSPRRLLETR PARGWTTLPR PDTPDERLDE LIYACGFHNN KATTPALEAS TRPVKYLKAT
SRILLEQHGG RVPDTMDGLL ALPGVGPKMA LILLRVAFGK VEGISVDTHV HRICNQLGWA
GPGGTKTPEQ TRRAIEAWMP ADIWAEVNLV LVGLGQEVQT EKSKLLRKCL ACSDPAAALR
LASVCGVKVE PEAKKAGLVL PPGLAL
//