ID B8C060_THAPS Unreviewed; 929 AA.
AC B8C060;
DT 03-MAR-2009, integrated into UniProtKB/TrEMBL.
DT 03-MAR-2009, sequence version 1.
DT 27-MAR-2024, entry version 67.
DE RecName: Full=Exonuclease 1 {ECO:0008006|Google:ProtNLM};
GN ORFNames=THAPSDRAFT_4742 {ECO:0000313|EMBL:EED93464.1};
OS Thalassiosira pseudonana (Marine diatom) (Cyclotella nana).
OC Eukaryota; Sar; Stramenopiles; Ochrophyta; Bacillariophyta;
OC Coscinodiscophyceae; Thalassiosirophycidae; Thalassiosirales;
OC Thalassiosiraceae; Thalassiosira.
OX NCBI_TaxID=35128 {ECO:0000313|EMBL:EED93464.1, ECO:0000313|Proteomes:UP000001449};
RN [1] {ECO:0000313|EMBL:EED93464.1, ECO:0000313|Proteomes:UP000001449}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CCMP1335 {ECO:0000313|EMBL:EED93464.1};
RX PubMed=15459382; DOI=10.1126/science.1101156;
RA Armbrust E.V., Berges J.A., Bowler C., Green B.R., Martinez D.,
RA Putnam N.H., Zhou S., Allen A.E., Apt K.E., Bechner M., Brzezinski M.A.,
RA Chaal B.K., Chiovitti A., Davis A.K., Demarest M.S., Detter J.C.,
RA Glavina T., Goodstein D., Hadi M.Z., Hellsten U., Hildebrand M.,
RA Jenkins B.D., Jurka J., Kapitonov V.V., Kroger N., Lau W.W., Lane T.W.,
RA Larimer F.W., Lippmeier J.C., Lucas S., Medina M., Montsant A., Obornik M.,
RA Parker M.S., Palenik B., Pazour G.J., Richardson P.M., Rynearson T.A.,
RA Saito M.A., Schwartz D.C., Thamatrakoln K., Valentin K., Vardi A.,
RA Wilkerson F.P., Rokhsar D.S.;
RT "The genome of the diatom Thalassiosira pseudonana: ecology, evolution, and
RT metabolism.";
RL Science 306:79-86(2004).
RN [2] {ECO:0000313|EMBL:EED93464.1, ECO:0000313|Proteomes:UP000001449}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=CCMP1335 {ECO:0000313|EMBL:EED93464.1};
RX PubMed=18923393; DOI=10.1038/nature07410;
RA Bowler C., Allen A.E., Badger J.H., Grimwood J., Jabbari K., Kuo A.,
RA Maheswari U., Martens C., Maumus F., Otillar R.P., Rayko E., Salamov A.,
RA Vandepoele K., Beszteri B., Gruber A., Heijde M., Katinka M., Mock T.,
RA Valentin K., Verret F., Berges J.A., Brownlee C., Cadoret J.P.,
RA Chiovitti A., Choi C.J., Coesel S., De Martino A., Detter J.C., Durkin C.,
RA Falciatore A., Fournet J., Haruta M., Huysman M.J., Jenkins B.D.,
RA Jiroutova K., Jorgensen R.E., Joubert Y., Kaplan A., Kroger N., Kroth P.G.,
RA La Roche J., Lindquist E., Lommer M., Martin-Jezequel V., Lopez P.J.,
RA Lucas S., Mangogna M., McGinnis K., Medlin L.K., Montsant A.,
RA Oudot-Le Secq M.P., Napoli C., Obornik M., Parker M.S., Petit J.L.,
RA Porcel B.M., Poulsen N., Robison M., Rychlewski L., Rynearson T.A.,
RA Schmutz J., Shapiro H., Siaut M., Stanley M., Sussman M.R., Taylor A.R.,
RA Vardi A., von Dassow P., Vyverman W., Willis A., Wyrwicz L.S.,
RA Rokhsar D.S., Weissenbach J., Armbrust E.V., Green B.R., Van de Peer Y.,
RA Grigoriev I.V.;
RT "The Phaeodactylum genome reveals the evolutionary history of diatom
RT genomes.";
RL Nature 456:239-244(2008).
CC -!- COFACTOR:
CC Name=Mg(2+); Xref=ChEBI:CHEBI:18420;
CC Evidence={ECO:0000256|ARBA:ARBA00001946};
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM000641; EED93464.1; -; Genomic_DNA.
DR RefSeq; XP_002289927.1; XM_002289891.1.
DR AlphaFoldDB; B8C060; -.
DR STRING; 35128.B8C060; -.
DR PaxDb; 35128-Thaps4742; -.
DR EnsemblProtists; EED93464; EED93464; THAPSDRAFT_4742.
DR GeneID; 7443754; -.
DR KEGG; tps:THAPSDRAFT_4742; -.
DR eggNOG; KOG2518; Eukaryota.
DR HOGENOM; CLU_314870_0_0_1; -.
DR InParanoid; B8C060; -.
DR Proteomes; UP000001449; Chromosome 4.
DR GO; GO:0005739; C:mitochondrion; IEA:UniProtKB-KW.
DR GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0035312; F:5'-3' DNA exonuclease activity; IEA:UniProtKB-UniRule.
DR GO; GO:0017108; F:5'-flap endonuclease activity; IBA:GO_Central.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-UniRule.
DR GO; GO:0006281; P:DNA repair; IEA:UniProtKB-UniRule.
DR Gene3D; 1.10.150.20; 5' to 3' exonuclease, C-terminal subdomain; 1.
DR Gene3D; 3.40.50.1010; 5'-nuclease; 1.
DR InterPro; IPR036279; 5-3_exonuclease_C_sf.
DR InterPro; IPR008918; HhH2.
DR InterPro; IPR029060; PIN-like_dom_sf.
DR InterPro; IPR006086; XPG-I_dom.
DR InterPro; IPR006084; XPG/Rad2.
DR InterPro; IPR006085; XPG_DNA_repair_N.
DR PANTHER; PTHR11081:SF72; 5'-3' EXONUCLEASE FAMILY PROTEIN; 1.
DR PANTHER; PTHR11081; FLAP ENDONUCLEASE FAMILY MEMBER; 1.
DR Pfam; PF00867; XPG_I; 1.
DR Pfam; PF00752; XPG_N; 1.
DR PRINTS; PR00853; XPGRADSUPER.
DR SMART; SM00279; HhH2; 1.
DR SMART; SM00484; XPGI; 1.
DR SMART; SM00485; XPGN; 1.
DR SUPFAM; SSF47807; 5' to 3' exonuclease, C-terminal subdomain; 1.
DR SUPFAM; SSF88723; PIN domain-like; 1.
PE 4: Predicted;
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Magnesium {ECO:0000256|ARBA:ARBA00022842};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Mitochondrion {ECO:0000256|ARBA:ARBA00023128};
KW Nuclease {ECO:0000256|ARBA:ARBA00022722};
KW Phosphoprotein {ECO:0000256|ARBA:ARBA00022553};
KW Reference proteome {ECO:0000313|Proteomes:UP000001449}.
FT DOMAIN 26..133
FT /note="XPG N-terminal"
FT /evidence="ECO:0000259|SMART:SM00485"
FT DOMAIN 192..265
FT /note="XPG-I"
FT /evidence="ECO:0000259|SMART:SM00484"
FT REGION 15..34
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 252..298
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 322..346
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 425..456
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 657..886
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 900..929
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 258..275
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 276..298
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 323..345
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 439..454
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 687..702
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 753..767
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 788..806
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 847..880
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 929 AA; 101875 MW; C2922F0726D1D72C CRC64;
MGITGLLRGL HPLLVPPPTH HTNRNLGKDD NSNQYTPKIQ HNILQFKNKS LAIDASSWLF
KSAYTCADRL VEATERGIRD PIAETKYSQY MISRCTHLLQ YAQVSSIYLV FDGIRVPLKS
GTNASRESKR QQNIVEARRL MSAGRRNEAL DKYKSCVKGT EEMARVVCAA VEKEFGKDGK
LGVGKKWGVG RVKCVFSPYE ADAQLAKLCA DGYCHGVVTE DSDVLVYSAA CRRPFPMTMD
WLLNPNFLPP VRNGKRSKRY RNGSDSDDDE NENIIRHSNN NSSIDIDDAK SNNNARAEQH
CSEVDYGDDD NMMYAPIRRQ LPPLSVPSTS QNKRGRRTKG YNNDSNGAGG GVALLTSLRA
FASKEAAQPG AGVRLFVQAC VLSGCDYVPN RLSKVGPVTA FKLVKETSHR DPSVRFERVM
KSLPNGSKLL KEPAEDDSGN GDDNNDDDED DFLSAWKSDS DVTEKEKYLE LLSKSEAVFY
YHLTKDLANN SIVPLVPHKA SGSPSRGDES NRIFSPSLDV FKSDPSLSFI GSAAEALKVQ
STPLLPCSQN NSRAIAAHNN NNGGWMASKK YTGPVTNAYS KANNNRQSTK AAQVQAPKST
ILTNFLNGSA KTNAINPSTN RVYVPSYARS STTTKRPSLH SETALSSATN PFADFTHDPA
NPLFSPDPAK KPANEKKLKR SPMLSPMLTP SRPSSETTFD YGAESGVKSK HFGGSIGAID
NDKDAESDND SSTKSEGAAL AYDGENICDM SEELKQQPPT ESDVSNNVVE EDSFDYEIIP
ESPPIMPTTR DSAKSKFFSS PRRVSTSPPA HFKDNRLNGG TSPDDAIEID EGEEEMWGKK
ETSQTDSQTD KTSVNKRPFK SPYPATTNRS TTTASKNPSR PRPPASAILA GFARQKEICT
GSSAKIKSDR ITKRPQKKPK GIKDYMKPC
//