ID M9LQC4_PSEA3 Unreviewed; 536 AA.
AC M9LQC4;
DT 29-MAY-2013, integrated into UniProtKB/TrEMBL.
DT 29-MAY-2013, sequence version 1.
DT 27-MAR-2024, entry version 30.
DE RecName: Full=DNA-(apurinic or apyrimidinic site) lyase {ECO:0000256|ARBA:ARBA00012720};
DE EC=4.2.99.18 {ECO:0000256|ARBA:ARBA00012720};
GN ORFNames=PANT_12c00063 {ECO:0000313|EMBL:GAC74626.1};
OS Pseudozyma antarctica (strain T-34) (Yeast) (Candida antarctica).
OC Eukaryota; Fungi; Dikarya; Basidiomycota; Ustilaginomycotina;
OC Ustilaginomycetes; Ustilaginales; Ustilaginaceae; Moesziomyces.
OX NCBI_TaxID=1151754 {ECO:0000313|EMBL:GAC74626.1, ECO:0000313|Proteomes:UP000011976};
RN [1] {ECO:0000313|Proteomes:UP000011976}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=T-34 {ECO:0000313|Proteomes:UP000011976};
RX PubMed=23558529; DOI=10.1128/genomea.00064-13;
RA Morita T., Koike H., Koyama Y., Hagiwara H., Ito E., Fukuoka T., Imura T.,
RA Machida M., Kitamoto D.;
RT "Genome sequence of the basidiomycetous yeast Pseudozyma antarctica T-34, a
RT producer of the glycolipid biosurfactants mannosylerythritol lipids.";
RL Genome Announc. 1:E0006413-E0006413(2013).
CC -!- CATALYTIC ACTIVITY:
CC Reaction=2'-deoxyribonucleotide-(2'-deoxyribose 5'-phosphate)-2'-
CC deoxyribonucleotide-DNA = a 3'-end 2'-deoxyribonucleotide-(2,3-
CC dehydro-2,3-deoxyribose 5'-phosphate)-DNA + a 5'-end 5'-monophospho-
CC 2'-deoxyribonucleoside-DNA + H(+); Xref=Rhea:RHEA:66592, Rhea:RHEA-
CC COMP:13180, Rhea:RHEA-COMP:16897, Rhea:RHEA-COMP:17067,
CC ChEBI:CHEBI:15378, ChEBI:CHEBI:136412, ChEBI:CHEBI:157695,
CC ChEBI:CHEBI:167181; EC=4.2.99.18;
CC Evidence={ECO:0000256|ARBA:ARBA00024490};
CC -!- SIMILARITY: Belongs to the type-1 OGG1 family.
CC {ECO:0000256|ARBA:ARBA00010679}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; DF196778; GAC74626.1; -; Genomic_DNA.
DR AlphaFoldDB; M9LQC4; -.
DR STRING; 1151754.M9LQC4; -.
DR OrthoDB; 118473at2759; -.
DR Proteomes; UP000011976; Unassembled WGS sequence.
DR GO; GO:0140078; F:class I DNA-(apurinic or apyrimidinic site) endonuclease activity; IEA:UniProtKB-EC.
DR GO; GO:0003684; F:damaged DNA binding; IEA:InterPro.
DR GO; GO:0008534; F:oxidized purine nucleobase lesion DNA N-glycosylase activity; IEA:InterPro.
DR GO; GO:0006285; P:base-excision repair, AP site formation; IEA:UniProt.
DR GO; GO:0006289; P:nucleotide-excision repair; IEA:InterPro.
DR CDD; cd00056; ENDO3c; 1.
DR Gene3D; 3.30.310.40; -; 1.
DR Gene3D; 1.10.1670.10; Helix-hairpin-Helix base-excision DNA repair enzymes (C-terminal); 1.
DR InterPro; IPR011257; DNA_glycosylase.
DR InterPro; IPR003265; HhH-GPD_domain.
DR InterPro; IPR023170; HhH_base_excis_C.
DR InterPro; IPR012904; OGG_N.
DR PANTHER; PTHR10242; 8-OXOGUANINE DNA GLYCOSYLASE; 1.
DR PANTHER; PTHR10242:SF2; N-GLYCOSYLASE_DNA LYASE; 1.
DR Pfam; PF00730; HhH-GPD; 1.
DR Pfam; PF07934; OGG_N; 1.
DR SMART; SM00478; ENDO3c; 1.
DR SUPFAM; SSF48150; DNA-glycosylase; 1.
DR SUPFAM; SSF55945; TATA-box binding protein-like; 1.
PE 3: Inferred from homology;
KW DNA damage {ECO:0000256|ARBA:ARBA00022763};
KW DNA repair {ECO:0000256|ARBA:ARBA00023204};
KW Reference proteome {ECO:0000313|Proteomes:UP000011976}.
FT DOMAIN 180..447
FT /note="HhH-GPD"
FT /evidence="ECO:0000259|SMART:SM00478"
FT REGION 1..23
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 300..349
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 473..536
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..21
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 330..345
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 536 AA; 59980 MW; E8939C55A9F5CE29 CRC64;
MGSNGKLDKL DKHDDPEHAD FFGPRPPAGY AALRAAPSQI LLPLTVSNKC GQAFRWRANK
VWVPTASAPS SGGWDEQIEW SLCLADRVVL LRQDEHRGFL YHKTLLPSTS SRPVDGANDA
QTIRETERWL KDYLNLDVPL EALYAEWEAK DAVFARFATR FSGLRMLRQD PWECLCAFVC
SSNNNIARIG QMVQNLCTHF SPALLEHVYA APPPTVAGEV EQGEVKIVYH PFPSPEALAK
PGVEEKLREL GFGYRAKYLA KTAQMLCEQH AQASATKAYT TYKADHLDES HWLYGIKPEE
DISTPATPDS ASGSKDTTEV GRAVKRARRS SAPNASNSST PSAASEGEEK RFESVRSYLQ
HLRTLSYHSA RQELIQFPGV GPKVADCILL MSLDQASSIP VDRHVFQFAE KWYRLRTKRY
EEIADYFRAL WGEYAGWAHS VLFTADLRSF ATYNAAKKEE KIEDADKAFW GVPQVTEGYP
TPPSSQVVKK EEEKLQAVHS PPRVAIPQLV PQSMEGEGAT LAERIKARPK RKSRST
//