ID R9LDG4_9FIRM Unreviewed; 284 AA.
AC R9LDG4;
DT 24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT 24-JUL-2013, sequence version 1.
DT 27-MAR-2024, entry version 29.
DE RecName: Full=DNA-(apurinic or apyrimidinic site) lyase {ECO:0000256|ARBA:ARBA00012720};
DE EC=4.2.99.18 {ECO:0000256|ARBA:ARBA00012720};
GN ORFNames=C814_02631 {ECO:0000313|EMBL:EOS56760.1};
OS Anaerotruncus sp. G3(2012).
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Oscillospiraceae;
OC Anaerotruncus.
OX NCBI_TaxID=1235835 {ECO:0000313|EMBL:EOS56760.1, ECO:0000313|Proteomes:UP000014129};
RN [1] {ECO:0000313|EMBL:EOS56760.1, ECO:0000313|Proteomes:UP000014129}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=G3(2012) {ECO:0000313|Proteomes:UP000014129};
RG The Broad Institute Genomics Platform;
RG The Broad Institute Genome Sequencing Center for Infectious Disease;
RA Earl A., Xavier R., Elson C., Duck W., Walker B., Young S., Zeng Q.,
RA Gargeya S., Fitzgerald M., Haas B., Abouelleil A., Allen A.W., Alvarado L.,
RA Arachchi H.M., Berlin A.M., Chapman S.B., Gainer-Dewar J., Goldberg J.,
RA Griggs A., Gujja S., Hansen M., Howarth C., Imamovic A., Ireland A.,
RA Larimer J., McCowan C., Murphy C., Pearson M., Poon T.W., Priest M.,
RA Roberts A., Saif S., Shea T., Sisk P., Sykes S., Wortman J., Nusbaum C.,
RA Birren B.;
RT "The Genome Sequence of Anaerotruncus bacterium G3.";
RL Submitted (APR-2013) to the EMBL/GenBank/DDBJ databases.
CC -!- CATALYTIC ACTIVITY:
CC Reaction=2'-deoxyribonucleotide-(2'-deoxyribose 5'-phosphate)-2'-
CC deoxyribonucleotide-DNA = a 3'-end 2'-deoxyribonucleotide-(2,3-
CC dehydro-2,3-deoxyribose 5'-phosphate)-DNA + a 5'-end 5'-monophospho-
CC 2'-deoxyribonucleoside-DNA + H(+); Xref=Rhea:RHEA:66592, Rhea:RHEA-
CC COMP:13180, Rhea:RHEA-COMP:16897, Rhea:RHEA-COMP:17067,
CC ChEBI:CHEBI:15378, ChEBI:CHEBI:136412, ChEBI:CHEBI:157695,
CC ChEBI:CHEBI:167181; EC=4.2.99.18;
CC Evidence={ECO:0000256|ARBA:ARBA00024490};
CC -!- SIMILARITY: Belongs to the type-1 OGG1 family.
CC {ECO:0000256|ARBA:ARBA00010679}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:EOS56760.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; ASTA01000075; EOS56760.1; -; Genomic_DNA.
DR AlphaFoldDB; R9LDG4; -.
DR STRING; 1235835.C814_02631; -.
DR PATRIC; fig|1235835.3.peg.2772; -.
DR eggNOG; COG0122; Bacteria.
DR HOGENOM; CLU_027543_3_0_9; -.
DR Proteomes; UP000014129; Unassembled WGS sequence.
DR GO; GO:0003684; F:damaged DNA binding; IEA:InterPro.
DR GO; GO:0008534; F:oxidized purine nucleobase lesion DNA N-glycosylase activity; IEA:InterPro.
DR GO; GO:0006284; P:base-excision repair; IEA:InterPro.
DR GO; GO:0006289; P:nucleotide-excision repair; IEA:InterPro.
DR CDD; cd00056; ENDO3c; 1.
DR Gene3D; 3.30.310.260; -; 1.
DR Gene3D; 1.10.1670.10; Helix-hairpin-Helix base-excision DNA repair enzymes (C-terminal); 1.
DR InterPro; IPR011257; DNA_glycosylase.
DR InterPro; IPR003265; HhH-GPD_domain.
DR InterPro; IPR023170; HhH_base_excis_C.
DR InterPro; IPR012904; OGG_N.
DR PANTHER; PTHR10242; 8-OXOGUANINE DNA GLYCOSYLASE; 1.
DR PANTHER; PTHR10242:SF2; N-GLYCOSYLASE_DNA LYASE; 1.
DR Pfam; PF00730; HhH-GPD; 1.
DR Pfam; PF07934; OGG_N; 1.
DR SMART; SM00478; ENDO3c; 1.
DR SUPFAM; SSF48150; DNA-glycosylase; 1.
DR SUPFAM; SSF55945; TATA-box binding protein-like; 1.
PE 3: Inferred from homology;
KW DNA damage {ECO:0000256|ARBA:ARBA00022763};
KW DNA repair {ECO:0000256|ARBA:ARBA00023204};
KW Reference proteome {ECO:0000313|Proteomes:UP000014129}.
FT DOMAIN 128..277
FT /note="HhH-GPD"
FT /evidence="ECO:0000259|SMART:SM00478"
SQ SEQUENCE 284 AA; 31430 MW; 3DCF10ECC5EDDB84 CRC64;
MLKSALQIPA TFDESGVLLP DMPDFDLAQT LDCGQAFRWE EQMDGSFIGI AHKKRCRISR
EGDAVRLWGI SRASFEQVWH PYFDLGRDYA ALKRRFSSDP ALARAVEYAP GIRVLRQEPW
EALCSFIISQ NNHVKRIKGI VSRFCELLGE PAEGGGFAFP SPEAVASCSP ADLAPLRAGF
RARYLVDAAQ KVVSGQVDLE ACCSLPLPEA RAMLTRITGV GVKVADCALL YGCGRVECFP
IDVWMRRVME LLYPDGLPDC ARGYEGIAQQ YLFHYARTTG LGTV
//