ID A0A176W9G0_MARPO Unreviewed; 737 AA.
AC A0A176W9G0;
DT 07-SEP-2016, integrated into UniProtKB/TrEMBL.
DT 07-SEP-2016, sequence version 1.
DT 27-MAR-2024, entry version 30.
DE RecName: Full=Exonuclease 1 {ECO:0000256|RuleBase:RU910737};
DE EC=3.1.-.- {ECO:0000256|RuleBase:RU910737};
GN ORFNames=AXG93_3884s1460 {ECO:0000313|EMBL:OAE29750.1};
OS Marchantia polymorpha subsp. ruderalis.
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Marchantiophyta;
OC Marchantiopsida; Marchantiidae; Marchantiales; Marchantiaceae; Marchantia.
OX NCBI_TaxID=1480154 {ECO:0000313|EMBL:OAE29750.1, ECO:0000313|Proteomes:UP000077202};
RN [1] {ECO:0000313|EMBL:OAE29750.1, ECO:0000313|Proteomes:UP000077202}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Tak-1 and cv. Tak-2 {ECO:0000313|Proteomes:UP000077202};
RC TISSUE=Whole gametophyte {ECO:0000313|EMBL:OAE29750.1};
RA Honkanen S., Jones V.A., Morieri G., Champion C., Hetherington A.J.,
RA Kelly S., Saint-Marcoux D., Proust H., Prescott H., Dolan L.;
RT "Mechanisms controlling the formation of the plant cell surface in tip-
RT growing cells are functionally conserved among land plants.";
RL Submitted (MAR-2016) to the EMBL/GenBank/DDBJ databases.
CC -!- FUNCTION: 5'->3' double-stranded DNA exonuclease which may also possess
CC a cryptic 3'->5' double-stranded DNA exonuclease activity. Functions in
CC DNA mismatch repair. {ECO:0000256|RuleBase:RU910737}.
CC -!- COFACTOR:
CC Name=Mg(2+); Xref=ChEBI:CHEBI:18420;
CC Evidence={ECO:0000256|RuleBase:RU910737};
CC Note=Binds 2 magnesium ions per subunit. They probably participate in
CC the reaction catalyzed by the enzyme. May bind an additional third
CC magnesium ion after substrate binding. {ECO:0000256|RuleBase:RU910737};
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|RuleBase:RU910737}.
CC -!- SIMILARITY: Belongs to the XPG/RAD2 endonuclease family. EXO1
CC subfamily. {ECO:0000256|ARBA:ARBA00010563,
CC ECO:0000256|RuleBase:RU910737}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:OAE29750.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LVLJ01001430; OAE29750.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A176W9G0; -.
DR OrthoDB; 126305at2759; -.
DR Proteomes; UP000077202; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0035312; F:5'-3' DNA exonuclease activity; IEA:InterPro.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0006281; P:DNA repair; IEA:UniProtKB-KW.
DR CDD; cd09908; H3TH_EXO1; 1.
DR CDD; cd09857; PIN_EXO1; 1.
DR Gene3D; 1.10.150.20; 5' to 3' exonuclease, C-terminal subdomain; 1.
DR Gene3D; 3.40.50.1010; 5'-nuclease; 1.
DR InterPro; IPR036279; 5-3_exonuclease_C_sf.
DR InterPro; IPR037315; EXO1_H3TH.
DR InterPro; IPR008918; HhH2.
DR InterPro; IPR029060; PIN-like_dom_sf.
DR InterPro; IPR044752; PIN-like_EXO1.
DR InterPro; IPR006086; XPG-I_dom.
DR InterPro; IPR006084; XPG/Rad2.
DR InterPro; IPR019974; XPG_CS.
DR InterPro; IPR006085; XPG_DNA_repair_N.
DR PANTHER; PTHR11081:SF8; EXONUCLEASE 1; 1.
DR PANTHER; PTHR11081; FLAP ENDONUCLEASE FAMILY MEMBER; 1.
DR Pfam; PF00867; XPG_I; 1.
DR Pfam; PF00752; XPG_N; 1.
DR PRINTS; PR00853; XPGRADSUPER.
DR SMART; SM00279; HhH2; 1.
DR SMART; SM00484; XPGI; 1.
DR SMART; SM00485; XPGN; 1.
DR SUPFAM; SSF47807; 5' to 3' exonuclease, C-terminal subdomain; 1.
DR SUPFAM; SSF88723; PIN domain-like; 1.
DR PROSITE; PS00842; XPG_2; 1.
PE 3: Inferred from homology;
KW DNA damage {ECO:0000256|ARBA:ARBA00022763, ECO:0000256|RuleBase:RU910737};
KW DNA excision {ECO:0000256|RuleBase:RU910737};
KW DNA repair {ECO:0000256|ARBA:ARBA00023204, ECO:0000256|RuleBase:RU910737};
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|RuleBase:RU910737};
KW Excision nuclease {ECO:0000256|ARBA:ARBA00022881,
KW ECO:0000256|RuleBase:RU910737};
KW Exonuclease {ECO:0000256|RuleBase:RU910737};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801, ECO:0000256|RuleBase:RU910737};
KW Magnesium {ECO:0000256|ARBA:ARBA00022842, ECO:0000256|RuleBase:RU910737};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723,
KW ECO:0000256|RuleBase:RU910737};
KW Nuclease {ECO:0000256|ARBA:ARBA00022722, ECO:0000256|RuleBase:RU910737};
KW Nucleus {ECO:0000256|RuleBase:RU910737};
KW Reference proteome {ECO:0000313|Proteomes:UP000077202}.
FT DOMAIN 1..99
FT /note="XPG N-terminal"
FT /evidence="ECO:0000259|SMART:SM00485"
FT DOMAIN 138..206
FT /note="XPG-I"
FT /evidence="ECO:0000259|SMART:SM00484"
FT REGION 361..381
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 619..648
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 710..737
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 633..647
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 721..737
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 737 AA; 82584 MW; 9F7B29806B04F425 CRC64;
MGISGLLPTL KSIITPIHVR EYAGLKVAID TYSWLHKAAF SCSKDLCEGR PTDKIVQYCM
HRVNLLRHHG VQPVLVFDGG VLPMKMDQEQ IRARSRKENL QRAQEHERSG NHTAAFQCYQ
KAVDITPVIA LSLIKVLQQE GVEYIVAPYE ADAQMAYLAL NDLVQVVITE DSDLIAYGCP
RVLFKMDKSG HGEEFQYSDL VRNKDLNFTN FTKQMVLEMC IMSGCDYLPS LPGMGVKRAH
GLMRRFKSYE KVIRHLRFSG ITIPSDYEEL FGRAILTFRH QRVYDPISEE MTLMKAIAVG
KVDPCTKLPF QEDIEHTGVQ RSTYKSFALQ RNNKKSPELL VQKNVMTNYF FSPSKEAKKQ
FKAPRSAPEV EMGTGEKHYV ETSTSEKEEF FRSLKADFSY PDLSTVFGSP LPDEDSDDEV
GRDLGSTHQL YGESAILEHR NKADETLGAS KDVPNLQALG LNLKIEDAPS ISSLLACHVW
APGRDVDVPS TELKAVIGIR GQNDSLARVA RNAEGAIECA PNQHKSGLPA SAGERPSVKE
RTHIVAPQLK VSGYFSNKRM VQSSVVRRDT EEYATDRSEV SEEDADGCCE KVMKKSRIAL
CEIVRDKVEK SSLLSSSRCQ DGSELAHARS ESWNPISREE EQKQASPFSR FAHEGVSSKV
EGFASNVAHV GHYARIAEKS MERRTTKGLE AVEATLNQFS YNLSSFKPPR LGQSRWPPKL
VTSHDVRRQD PVCPDCP
//