ID A0A118K4U3_CYNCS Unreviewed; 624 AA.
AC A0A118K4U3;
DT 13-APR-2016, integrated into UniProtKB/TrEMBL.
DT 13-APR-2016, sequence version 1.
DT 22-FEB-2023, entry version 22.
DE SubName: Full=5'-3' exonuclease, alpha-helical arch, N-terminal {ECO:0000313|EMBL:KVI08057.1};
GN ORFNames=Ccrd_013576 {ECO:0000313|EMBL:KVI08057.1};
OS Cynara cardunculus var. scolymus (Globe artichoke) (Cynara scolymus).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC asterids; campanulids; Asterales; Asteraceae; Carduoideae; Cardueae;
OC Carduinae; Cynara.
OX NCBI_TaxID=59895 {ECO:0000313|EMBL:KVI08057.1, ECO:0000313|Proteomes:UP000243975};
RN [1] {ECO:0000313|EMBL:KVI08057.1, ECO:0000313|Proteomes:UP000243975}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=2C {ECO:0000313|EMBL:KVI08057.1};
RX PubMed=26786968; DOI=10.1038/srep19427;
RA Scaglione D., Reyes-Chin-Wo S., Acquadro A., Froenicke L., Portis E.,
RA Beitel C., Tirone M., Mauro R., Lo Monaco A., Mauromicale G., Faccioli P.,
RA Cattivelli L., Rieseberg L., Michelmore R., Lanteri S.;
RT "The genome sequence of the outbreeding globe artichoke constructed de novo
RT incorporating a phase-aware low-pass sequencing strategy of F1 progeny.";
RL Sci. Rep. 6:19427-19427(2016).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:KVI08057.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; LEKV01001375; KVI08057.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A118K4U3; -.
DR STRING; 59895.A0A118K4U3; -.
DR EnsemblPlants; KVI08057; KVI08057; Ccrd_013576.
DR Gramene; KVI08057; KVI08057; Ccrd_013576.
DR Proteomes; UP000243975; Unassembled WGS sequence.
DR GO; GO:0017108; F:5'-flap endonuclease activity; IEA:InterPro.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0003887; F:DNA-directed DNA polymerase activity; IEA:UniProtKB-EC.
DR GO; GO:0004527; F:exonuclease activity; IEA:UniProtKB-KW.
DR GO; GO:0033567; P:DNA replication, Okazaki fragment processing; IEA:InterPro.
DR CDD; cd09898; H3TH_53EXO; 1.
DR CDD; cd09859; PIN_53EXO; 1.
DR Gene3D; 1.10.150.20; 5' to 3' exonuclease, C-terminal subdomain; 1.
DR Gene3D; 3.40.50.1010; 5'-nuclease; 2.
DR InterPro; IPR020046; 5-3_exonucl_a-hlix_arch_N.
DR InterPro; IPR002421; 5-3_exonuclease.
DR InterPro; IPR036279; 5-3_exonuclease_C_sf.
DR InterPro; IPR020045; DNA_polI_H3TH.
DR InterPro; IPR038969; FEN.
DR InterPro; IPR008918; HhH2.
DR InterPro; IPR029060; PIN-like_dom_sf.
DR PANTHER; PTHR42646:SF2; DNA-DIRECTED DNA POLYMERASE; 1.
DR PANTHER; PTHR42646; FLAP ENDONUCLEASE XNI; 1.
DR Pfam; PF01367; 5_3_exonuc; 1.
DR Pfam; PF02739; 5_3_exonuc_N; 1.
DR SMART; SM00475; 53EXOc; 1.
DR SMART; SM00279; HhH2; 1.
DR SUPFAM; SSF47807; 5' to 3' exonuclease, C-terminal subdomain; 1.
DR SUPFAM; SSF88723; PIN domain-like; 2.
PE 4: Predicted;
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW Exonuclease {ECO:0000313|EMBL:KVI08057.1};
KW Hydrolase {ECO:0000313|EMBL:KVI08057.1};
KW Nuclease {ECO:0000313|EMBL:KVI08057.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000243975}.
FT DOMAIN 143..582
FT /note="5'-3' exonuclease"
FT /evidence="ECO:0000259|SMART:SM00475"
SQ SEQUENCE 624 AA; 69513 MW; CBB56369A95CB9E3 CRC64;
MPNQQNNPPF DGPQGGFLTA GSASSLPLNY SNTNAYELKM GCCQFSNHLL NNFCRTLYCC
RGSFSTNHWA RTPHHLVSSS RIRFAKGYHK VSNSLRSELS GTAHAVSLKD LGTVEGEATQ
QESTSFDSSQ KTENLVNIDS SNGRVMLIDG TSIIYRSYYK LLAKLHHGYL SNADGNGDWV
LTISTALSLI IDVLEFTPSH VAVVFDHDVS VLRIPSFMLS ARMGGRVLDG RGFAMVICLS
RPDKNLWQKV CLFIPSVFSS QLHVMFALSW EYPRKQRGQW RSCQTFRHTL YPSYKSNRPP
TPDTIVQGLQ YLKASIKAMS IKVIEVPGVE ADDVIGTLAM RSVEAGFKLP HICPMLCGDE
RGLLETCPVA AIKATKAELP QLNAFHCPVA VIKATKAELP ELNAFHLIGY ISQRRTILTR
FKQRITPEKT TSSFLSSSPM VVKVRVVSPD KDFFQILSPS LRLLRIAPRG FEMVSFGMED
FAKKYGAIEP SQFVDVMALV GDRSDNIPGV DGIGDVHAVQ LISRFGTLEN LLQHVDQVEE
ERIRKALIAN KEQALLSKEL ALLRSDLPHY MVPYSISDLA FKKPEDNGEK FTNLLTAIGA
YAEGFSLDSV IRRAFYLWKK LEKS
//