ID R5XLV6_9FIRM Unreviewed; 273 AA.
AC R5XLV6;
DT 24-JUL-2013, integrated into UniProtKB/TrEMBL.
DT 24-JUL-2013, sequence version 1.
DT 27-MAR-2024, entry version 24.
DE RecName: Full=DNA-(apurinic or apyrimidinic site) lyase {ECO:0000256|ARBA:ARBA00012720};
DE EC=4.2.99.18 {ECO:0000256|ARBA:ARBA00012720};
GN ORFNames=BN695_00132 {ECO:0000313|EMBL:CDA13494.1};
OS Anaerotruncus sp. CAG:528.
OC Bacteria; Bacillota; Clostridia; Eubacteriales; Oscillospiraceae;
OC Anaerotruncus.
OX NCBI_TaxID=1262700 {ECO:0000313|EMBL:CDA13494.1, ECO:0000313|Proteomes:UP000018024};
RN [1] {ECO:0000313|EMBL:CDA13494.1, ECO:0000313|Proteomes:UP000018024}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=MGS:528 {ECO:0000313|Proteomes:UP000018024};
RA Nielsen H.B., Almeida M., Juncker A.S., Rasmussen S., Li J., Sunagawa S.,
RA Plichta D., Gautier L., Le Chatelier E., Peletier E., Bonde I., Nielsen T.,
RA Manichanh C., Arumugam M., Batto J., Santos M.B.Q.D., Blom N., Borruel N.,
RA Burgdorf K.S., Boumezbeur F., Casellas F., Dore J., Guarner F., Hansen T.,
RA Hildebrand F., Kaas R.S., Kennedy S., Kristiansen K., Kultima J.R.,
RA Leonard P., Levenez F., Lund O., Moumen B., Le Paslier D., Pons N.,
RA Pedersen O., Prifti E., Qin J., Raes J., Tap J., Tims S., Ussery D.W.,
RA Yamada T., MetaHit consortium, Renault P., Sicheritz-Ponten T., Bork P.,
RA Wang J., Brunak S., Ehrlich S.D.;
RT "Dependencies among metagenomic species, viruses, plasmids and units of
RT genetic variation.";
RL Submitted (NOV-2012) to the EMBL/GenBank/DDBJ databases.
CC -!- CATALYTIC ACTIVITY:
CC Reaction=2'-deoxyribonucleotide-(2'-deoxyribose 5'-phosphate)-2'-
CC deoxyribonucleotide-DNA = a 3'-end 2'-deoxyribonucleotide-(2,3-
CC dehydro-2,3-deoxyribose 5'-phosphate)-DNA + a 5'-end 5'-monophospho-
CC 2'-deoxyribonucleoside-DNA + H(+); Xref=Rhea:RHEA:66592, Rhea:RHEA-
CC COMP:13180, Rhea:RHEA-COMP:16897, Rhea:RHEA-COMP:17067,
CC ChEBI:CHEBI:15378, ChEBI:CHEBI:136412, ChEBI:CHEBI:157695,
CC ChEBI:CHEBI:167181; EC=4.2.99.18;
CC Evidence={ECO:0000256|ARBA:ARBA00024490};
CC -!- SIMILARITY: Belongs to the type-1 OGG1 family.
CC {ECO:0000256|ARBA:ARBA00010679}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:CDA13494.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CBBE010000063; CDA13494.1; -; Genomic_DNA.
DR AlphaFoldDB; R5XLV6; -.
DR STRING; 1262700.BN695_00132; -.
DR Proteomes; UP000018024; Unassembled WGS sequence.
DR GO; GO:0003684; F:damaged DNA binding; IEA:InterPro.
DR GO; GO:0008534; F:oxidized purine nucleobase lesion DNA N-glycosylase activity; IEA:InterPro.
DR GO; GO:0006284; P:base-excision repair; IEA:InterPro.
DR GO; GO:0006289; P:nucleotide-excision repair; IEA:InterPro.
DR CDD; cd00056; ENDO3c; 1.
DR Gene3D; 3.30.310.260; -; 1.
DR Gene3D; 1.10.1670.10; Helix-hairpin-Helix base-excision DNA repair enzymes (C-terminal); 1.
DR InterPro; IPR011257; DNA_glycosylase.
DR InterPro; IPR003265; HhH-GPD_domain.
DR InterPro; IPR023170; HhH_base_excis_C.
DR InterPro; IPR012904; OGG_N.
DR PANTHER; PTHR10242; 8-OXOGUANINE DNA GLYCOSYLASE; 1.
DR PANTHER; PTHR10242:SF2; N-GLYCOSYLASE_DNA LYASE; 1.
DR Pfam; PF00730; HhH-GPD; 1.
DR Pfam; PF07934; OGG_N; 1.
DR SMART; SM00478; ENDO3c; 1.
DR SUPFAM; SSF48150; DNA-glycosylase; 1.
DR SUPFAM; SSF55945; TATA-box binding protein-like; 1.
PE 3: Inferred from homology;
KW DNA damage {ECO:0000256|ARBA:ARBA00022763};
KW DNA repair {ECO:0000256|ARBA:ARBA00023204};
KW Reference proteome {ECO:0000313|Proteomes:UP000018024}.
FT DOMAIN 121..269
FT /note="HhH-GPD"
FT /evidence="ECO:0000259|SMART:SM00478"
SQ SEQUENCE 273 AA; 31058 MW; 895FE46446517596 CRC64;
MKVRCENENV ILSEVRCLSL PLTLDCGEAF RWQCEEDGSW SGAAYGKFLN IKEENGEFVL
KNTSLEDFEC VWRNYFDLDR DYAAICDRLK EDSLLSETID EYYGIRILNQ DPWEALVSFV
ISQQNNIKRI KGIIKRLCDT YGTPICEGWN AFPSAEVLAD CSEADFEALG LGYRAKYVKR
LADDVACGAI NLAEIKAMDL ESAKKALLSI YGVGEKVANC ALLFGFQFVR CFPIDVWMKR
ALQYYPNGLP ECFAGCEGIA QQYLFHWARN NLK
//