GenomeNet

Database: UniProt
Entry: B8C060_THAPS
LinkDB: B8C060_THAPS
Original site: B8C060_THAPS 
ID   B8C060_THAPS            Unreviewed;       929 AA.
AC   B8C060;
DT   03-MAR-2009, integrated into UniProtKB/TrEMBL.
DT   03-MAR-2009, sequence version 1.
DT   27-MAR-2024, entry version 67.
DE   RecName: Full=Exonuclease 1 {ECO:0008006|Google:ProtNLM};
GN   ORFNames=THAPSDRAFT_4742 {ECO:0000313|EMBL:EED93464.1};
OS   Thalassiosira pseudonana (Marine diatom) (Cyclotella nana).
OC   Eukaryota; Sar; Stramenopiles; Ochrophyta; Bacillariophyta;
OC   Coscinodiscophyceae; Thalassiosirophycidae; Thalassiosirales;
OC   Thalassiosiraceae; Thalassiosira.
OX   NCBI_TaxID=35128 {ECO:0000313|EMBL:EED93464.1, ECO:0000313|Proteomes:UP000001449};
RN   [1] {ECO:0000313|EMBL:EED93464.1, ECO:0000313|Proteomes:UP000001449}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=CCMP1335 {ECO:0000313|EMBL:EED93464.1};
RX   PubMed=15459382; DOI=10.1126/science.1101156;
RA   Armbrust E.V., Berges J.A., Bowler C., Green B.R., Martinez D.,
RA   Putnam N.H., Zhou S., Allen A.E., Apt K.E., Bechner M., Brzezinski M.A.,
RA   Chaal B.K., Chiovitti A., Davis A.K., Demarest M.S., Detter J.C.,
RA   Glavina T., Goodstein D., Hadi M.Z., Hellsten U., Hildebrand M.,
RA   Jenkins B.D., Jurka J., Kapitonov V.V., Kroger N., Lau W.W., Lane T.W.,
RA   Larimer F.W., Lippmeier J.C., Lucas S., Medina M., Montsant A., Obornik M.,
RA   Parker M.S., Palenik B., Pazour G.J., Richardson P.M., Rynearson T.A.,
RA   Saito M.A., Schwartz D.C., Thamatrakoln K., Valentin K., Vardi A.,
RA   Wilkerson F.P., Rokhsar D.S.;
RT   "The genome of the diatom Thalassiosira pseudonana: ecology, evolution, and
RT   metabolism.";
RL   Science 306:79-86(2004).
RN   [2] {ECO:0000313|EMBL:EED93464.1, ECO:0000313|Proteomes:UP000001449}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=CCMP1335 {ECO:0000313|EMBL:EED93464.1};
RX   PubMed=18923393; DOI=10.1038/nature07410;
RA   Bowler C., Allen A.E., Badger J.H., Grimwood J., Jabbari K., Kuo A.,
RA   Maheswari U., Martens C., Maumus F., Otillar R.P., Rayko E., Salamov A.,
RA   Vandepoele K., Beszteri B., Gruber A., Heijde M., Katinka M., Mock T.,
RA   Valentin K., Verret F., Berges J.A., Brownlee C., Cadoret J.P.,
RA   Chiovitti A., Choi C.J., Coesel S., De Martino A., Detter J.C., Durkin C.,
RA   Falciatore A., Fournet J., Haruta M., Huysman M.J., Jenkins B.D.,
RA   Jiroutova K., Jorgensen R.E., Joubert Y., Kaplan A., Kroger N., Kroth P.G.,
RA   La Roche J., Lindquist E., Lommer M., Martin-Jezequel V., Lopez P.J.,
RA   Lucas S., Mangogna M., McGinnis K., Medlin L.K., Montsant A.,
RA   Oudot-Le Secq M.P., Napoli C., Obornik M., Parker M.S., Petit J.L.,
RA   Porcel B.M., Poulsen N., Robison M., Rychlewski L., Rynearson T.A.,
RA   Schmutz J., Shapiro H., Siaut M., Stanley M., Sussman M.R., Taylor A.R.,
RA   Vardi A., von Dassow P., Vyverman W., Willis A., Wyrwicz L.S.,
RA   Rokhsar D.S., Weissenbach J., Armbrust E.V., Green B.R., Van de Peer Y.,
RA   Grigoriev I.V.;
RT   "The Phaeodactylum genome reveals the evolutionary history of diatom
RT   genomes.";
RL   Nature 456:239-244(2008).
CC   -!- COFACTOR:
CC       Name=Mg(2+); Xref=ChEBI:CHEBI:18420;
CC         Evidence={ECO:0000256|ARBA:ARBA00001946};
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; CM000641; EED93464.1; -; Genomic_DNA.
DR   RefSeq; XP_002289927.1; XM_002289891.1.
DR   AlphaFoldDB; B8C060; -.
DR   STRING; 35128.B8C060; -.
DR   PaxDb; 35128-Thaps4742; -.
DR   EnsemblProtists; EED93464; EED93464; THAPSDRAFT_4742.
DR   GeneID; 7443754; -.
DR   KEGG; tps:THAPSDRAFT_4742; -.
DR   eggNOG; KOG2518; Eukaryota.
DR   HOGENOM; CLU_314870_0_0_1; -.
DR   InParanoid; B8C060; -.
DR   Proteomes; UP000001449; Chromosome 4.
DR   GO; GO:0005739; C:mitochondrion; IEA:UniProtKB-KW.
DR   GO; GO:0005634; C:nucleus; IEA:UniProtKB-SubCell.
DR   GO; GO:0035312; F:5'-3' DNA exonuclease activity; IEA:UniProtKB-UniRule.
DR   GO; GO:0017108; F:5'-flap endonuclease activity; IBA:GO_Central.
DR   GO; GO:0003677; F:DNA binding; IEA:UniProtKB-UniRule.
DR   GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-UniRule.
DR   GO; GO:0006281; P:DNA repair; IEA:UniProtKB-UniRule.
DR   Gene3D; 1.10.150.20; 5' to 3' exonuclease, C-terminal subdomain; 1.
DR   Gene3D; 3.40.50.1010; 5'-nuclease; 1.
DR   InterPro; IPR036279; 5-3_exonuclease_C_sf.
DR   InterPro; IPR008918; HhH2.
DR   InterPro; IPR029060; PIN-like_dom_sf.
DR   InterPro; IPR006086; XPG-I_dom.
DR   InterPro; IPR006084; XPG/Rad2.
DR   InterPro; IPR006085; XPG_DNA_repair_N.
DR   PANTHER; PTHR11081:SF72; 5'-3' EXONUCLEASE FAMILY PROTEIN; 1.
DR   PANTHER; PTHR11081; FLAP ENDONUCLEASE FAMILY MEMBER; 1.
DR   Pfam; PF00867; XPG_I; 1.
DR   Pfam; PF00752; XPG_N; 1.
DR   PRINTS; PR00853; XPGRADSUPER.
DR   SMART; SM00279; HhH2; 1.
DR   SMART; SM00484; XPGI; 1.
DR   SMART; SM00485; XPGN; 1.
DR   SUPFAM; SSF47807; 5' to 3' exonuclease, C-terminal subdomain; 1.
DR   SUPFAM; SSF88723; PIN domain-like; 1.
PE   4: Predicted;
KW   Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW   Magnesium {ECO:0000256|ARBA:ARBA00022842};
KW   Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW   Mitochondrion {ECO:0000256|ARBA:ARBA00023128};
KW   Nuclease {ECO:0000256|ARBA:ARBA00022722};
KW   Phosphoprotein {ECO:0000256|ARBA:ARBA00022553};
KW   Reference proteome {ECO:0000313|Proteomes:UP000001449}.
FT   DOMAIN          26..133
FT                   /note="XPG N-terminal"
FT                   /evidence="ECO:0000259|SMART:SM00485"
FT   DOMAIN          192..265
FT                   /note="XPG-I"
FT                   /evidence="ECO:0000259|SMART:SM00484"
FT   REGION          15..34
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          252..298
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          322..346
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          425..456
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          657..886
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          900..929
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        258..275
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        276..298
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        323..345
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        439..454
FT                   /note="Acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        687..702
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        753..767
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        788..806
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        847..880
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   929 AA;  101875 MW;  C2922F0726D1D72C CRC64;
     MGITGLLRGL HPLLVPPPTH HTNRNLGKDD NSNQYTPKIQ HNILQFKNKS LAIDASSWLF
     KSAYTCADRL VEATERGIRD PIAETKYSQY MISRCTHLLQ YAQVSSIYLV FDGIRVPLKS
     GTNASRESKR QQNIVEARRL MSAGRRNEAL DKYKSCVKGT EEMARVVCAA VEKEFGKDGK
     LGVGKKWGVG RVKCVFSPYE ADAQLAKLCA DGYCHGVVTE DSDVLVYSAA CRRPFPMTMD
     WLLNPNFLPP VRNGKRSKRY RNGSDSDDDE NENIIRHSNN NSSIDIDDAK SNNNARAEQH
     CSEVDYGDDD NMMYAPIRRQ LPPLSVPSTS QNKRGRRTKG YNNDSNGAGG GVALLTSLRA
     FASKEAAQPG AGVRLFVQAC VLSGCDYVPN RLSKVGPVTA FKLVKETSHR DPSVRFERVM
     KSLPNGSKLL KEPAEDDSGN GDDNNDDDED DFLSAWKSDS DVTEKEKYLE LLSKSEAVFY
     YHLTKDLANN SIVPLVPHKA SGSPSRGDES NRIFSPSLDV FKSDPSLSFI GSAAEALKVQ
     STPLLPCSQN NSRAIAAHNN NNGGWMASKK YTGPVTNAYS KANNNRQSTK AAQVQAPKST
     ILTNFLNGSA KTNAINPSTN RVYVPSYARS STTTKRPSLH SETALSSATN PFADFTHDPA
     NPLFSPDPAK KPANEKKLKR SPMLSPMLTP SRPSSETTFD YGAESGVKSK HFGGSIGAID
     NDKDAESDND SSTKSEGAAL AYDGENICDM SEELKQQPPT ESDVSNNVVE EDSFDYEIIP
     ESPPIMPTTR DSAKSKFFSS PRRVSTSPPA HFKDNRLNGG TSPDDAIEID EGEEEMWGKK
     ETSQTDSQTD KTSVNKRPFK SPYPATTNRS TTTASKNPSR PRPPASAILA GFARQKEICT
     GSSAKIKSDR ITKRPQKKPK GIKDYMKPC
//
DBGET integrated database retrieval system