ID A0A2P6TT06_CHLSO Unreviewed; 866 AA.
AC A0A2P6TT06;
DT 23-MAY-2018, integrated into UniProtKB/TrEMBL.
DT 23-MAY-2018, sequence version 1.
DT 27-MAR-2024, entry version 24.
DE RecName: Full=Exonuclease 1 {ECO:0000256|RuleBase:RU910737};
DE EC=3.1.-.- {ECO:0000256|RuleBase:RU910737};
GN ORFNames=C2E21_3835 {ECO:0000313|EMBL:PRW57196.1};
OS Chlorella sorokiniana (Freshwater green alga).
OC Eukaryota; Viridiplantae; Chlorophyta; core chlorophytes; Trebouxiophyceae;
OC Chlorellales; Chlorellaceae; Chlorella clade; Chlorella.
OX NCBI_TaxID=3076 {ECO:0000313|EMBL:PRW57196.1, ECO:0000313|Proteomes:UP000239899};
RN [1] {ECO:0000313|EMBL:PRW57196.1, ECO:0000313|Proteomes:UP000239899}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=UTEX 1602 {ECO:0000313|Proteomes:UP000239899};
RX PubMed=29178410; DOI=10.1111/tpj.13789;
RA Arriola M.B., Velmurugan N., Zhang Y., Plunkett M.H., Hondzo H.,
RA Barney B.M.;
RT "Genome sequences of Chlorella sorokiniana UTEX 1602 and Micractinium
RT conductrix SAG 241.80: implications to maltose excretion by a green alga.";
RL Plant J. 93:566-586(2018).
CC -!- FUNCTION: 5'->3' double-stranded DNA exonuclease which may also possess
CC a cryptic 3'->5' double-stranded DNA exonuclease activity. Functions in
CC DNA mismatch repair. {ECO:0000256|RuleBase:RU910737}.
CC -!- COFACTOR:
CC Name=Mg(2+); Xref=ChEBI:CHEBI:18420;
CC Evidence={ECO:0000256|RuleBase:RU910737};
CC Note=Binds 2 magnesium ions per subunit. They probably participate in
CC the reaction catalyzed by the enzyme. May bind an additional third
CC magnesium ion after substrate binding. {ECO:0000256|RuleBase:RU910737};
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123,
CC ECO:0000256|RuleBase:RU910737}.
CC -!- SIMILARITY: Belongs to the XPG/RAD2 endonuclease family. EXO1
CC subfamily. {ECO:0000256|ARBA:ARBA00010563,
CC ECO:0000256|RuleBase:RU910737}.
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PRW57196.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LHPG02000007; PRW57196.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A2P6TT06; -.
DR STRING; 3076.A0A2P6TT06; -.
DR OrthoDB; 126305at2759; -.
DR Proteomes; UP000239899; Unassembled WGS sequence.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0035312; F:5'-3' DNA exonuclease activity; IEA:InterPro.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0006281; P:DNA repair; IEA:UniProtKB-KW.
DR CDD; cd09908; H3TH_EXO1; 1.
DR CDD; cd09857; PIN_EXO1; 1.
DR Gene3D; 1.10.150.20; 5' to 3' exonuclease, C-terminal subdomain; 1.
DR Gene3D; 3.40.50.1010; 5'-nuclease; 1.
DR InterPro; IPR036279; 5-3_exonuclease_C_sf.
DR InterPro; IPR037315; EXO1_H3TH.
DR InterPro; IPR008918; HhH2.
DR InterPro; IPR029060; PIN-like_dom_sf.
DR InterPro; IPR044752; PIN-like_EXO1.
DR InterPro; IPR006086; XPG-I_dom.
DR InterPro; IPR006084; XPG/Rad2.
DR InterPro; IPR019974; XPG_CS.
DR InterPro; IPR006085; XPG_DNA_repair_N.
DR PANTHER; PTHR11081:SF8; EXONUCLEASE 1; 1.
DR PANTHER; PTHR11081; FLAP ENDONUCLEASE FAMILY MEMBER; 1.
DR Pfam; PF00867; XPG_I; 1.
DR Pfam; PF00752; XPG_N; 1.
DR PRINTS; PR00853; XPGRADSUPER.
DR SMART; SM00279; HhH2; 1.
DR SMART; SM00484; XPGI; 1.
DR SMART; SM00485; XPGN; 1.
DR SUPFAM; SSF47807; 5' to 3' exonuclease, C-terminal subdomain; 1.
DR SUPFAM; SSF88723; PIN domain-like; 1.
DR PROSITE; PS00842; XPG_2; 1.
PE 3: Inferred from homology;
KW DNA damage {ECO:0000256|ARBA:ARBA00022763, ECO:0000256|RuleBase:RU910737};
KW DNA excision {ECO:0000256|RuleBase:RU910737};
KW DNA repair {ECO:0000256|ARBA:ARBA00023204, ECO:0000256|RuleBase:RU910737};
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125, ECO:0000256|RuleBase:RU910737};
KW Excision nuclease {ECO:0000256|ARBA:ARBA00022881,
KW ECO:0000256|RuleBase:RU910737};
KW Exonuclease {ECO:0000256|RuleBase:RU910737, ECO:0000313|EMBL:PRW57196.1};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801, ECO:0000256|RuleBase:RU910737};
KW Magnesium {ECO:0000256|ARBA:ARBA00022842, ECO:0000256|RuleBase:RU910737};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723,
KW ECO:0000256|RuleBase:RU910737};
KW Nuclease {ECO:0000256|ARBA:ARBA00022722, ECO:0000256|RuleBase:RU910737};
KW Nucleus {ECO:0000256|RuleBase:RU910737};
KW Reference proteome {ECO:0000313|Proteomes:UP000239899}.
FT DOMAIN 1..99
FT /note="XPG N-terminal"
FT /evidence="ECO:0000259|SMART:SM00485"
FT DOMAIN 138..208
FT /note="XPG-I"
FT /evidence="ECO:0000259|SMART:SM00484"
FT REGION 373..403
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 417..755
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 785..866
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 375..397
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 421..460
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 535..565
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 572..586
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 628..645
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 653..671
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 711..727
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 866 AA; 92269 MW; D1D05A15AA10E7F9 CRC64;
MGIAGLLPQL RSITRRAHVS KYRGQTMAVD AYCLLHRGSY SCARELVEGE PTDRHVQYCM
GRVELLLAAG VRPMVVFDGD RLPNKADEER SRERSREENK ARARALWAQG NKAAAMECYQ
KAVDIQPAHA KQFVEALKQR NIPYIVAPYE ADAQMAYLAL NNLVDAVLTE DSDLLCYGCP
TVFFKMDKGG EGEEVRLADL PTAKELAFQG FGHELFQEMC VLAGCDFVAS LPGIGIKKAH
QHLRRTRCFL KVVRALRFDG VKIPEGYERR VQRALWTFKH QRVYCPRRRA AVHLHEVVGG
ELAAGARVPA AAQLEEGEPD FLGPPLPHDV AEAIAVGDLD PITKLPFGNE LQPCTAPLAA
DAAASGCGAA ASQRAAPSGR QQKASGRGKS GGQQGRPAAS SGGPILKYVL RGLTKPSTSA
DFKAPRVQQQ SGSQGSQEQQ SIAGASAPRQ QQQQQQQPQQ RRQTEEEEED VQSPAGGLMA
SLRAALQDEP SIEQLLSEQA VAADYDPTAD SGDEDCSGPA LQQRHGGGRP RFVVPHRQQQ
QQQQQRQQRQ QQQQQQVSAL QALGLPSSPT IDELLDGSSA NTGQDEELEL PSLRCSPVVE
GPAASQRPSQ RAWLPQRAAA MQQAPAAIEE QTAAQRRQQA QQDLLDLISP GGPAFLQQQQ
QPEQQQQPDE GGGGITPGFY GGGSEAGSGA RLLTSLPRRP PAATPGSASP VFQGFYRQPQ
LGSAAQRRRS SAAEEEGGGG GDASPSFSGG DDPIASLSHV AAFQPKAEAA MERAAAAALA
ATAQKAFPPL PPPPRHLLHG SAAGPAAGSD DEAGQPASTP AAGKRPLLGL GVEKENRLLQ
AARSKRPKGG GSSTAPGTNP FAGFAL
//