ID A0A2U1MJT0_ARTAN Unreviewed; 1961 AA.
AC A0A2U1MJT0;
DT 18-JUL-2018, integrated into UniProtKB/TrEMBL.
DT 18-JUL-2018, sequence version 1.
DT 24-JAN-2024, entry version 16.
DE SubName: Full=DNA polymerase A {ECO:0000313|EMBL:PWA61533.1};
GN ORFNames=CTI12_AA372140 {ECO:0000313|EMBL:PWA61533.1};
OS Artemisia annua (Sweet wormwood).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC asterids; campanulids; Asterales; Asteraceae; Asteroideae; Anthemideae;
OC Artemisiinae; Artemisia.
OX NCBI_TaxID=35608 {ECO:0000313|EMBL:PWA61533.1, ECO:0000313|Proteomes:UP000245207};
RN [1] {ECO:0000313|EMBL:PWA61533.1, ECO:0000313|Proteomes:UP000245207}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Huhao1 {ECO:0000313|Proteomes:UP000245207};
RC TISSUE=Leaf {ECO:0000313|EMBL:PWA61533.1};
RX PubMed=29703587; DOI=10.1016/j.molp.2018.03.015;
RA Shen Q., Zhang L., Liao Z., Wang S., Yan T., Shi P., Liu M., Fu X., Pan Q.,
RA Wang Y., Lv Z., Lu X., Zhang F., Jiang W., Ma Y., Chen M., Hao X., Li L.,
RA Tang Y., Lv G., Zhou Y., Sun X., Brodelius P.E., Rose J.K.C., Tang K.;
RT "The genome of Artemisia annua provides insight into the evolution of
RT Asteraceae family and artemisinin biosynthesis.";
RL Mol. Plant 11:776-788(2018).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PWA61533.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; PKPP01005080; PWA61533.1; -; Genomic_DNA.
DR STRING; 35608.A0A2U1MJT0; -.
DR Proteomes; UP000245207; Unassembled WGS sequence.
DR GO; GO:0005524; F:ATP binding; IEA:UniProtKB-KW.
DR GO; GO:0003677; F:DNA binding; IEA:InterPro.
DR GO; GO:0003887; F:DNA-directed DNA polymerase activity; IEA:InterPro.
DR GO; GO:0006261; P:DNA-templated DNA replication; IEA:InterPro.
DR GO; GO:0050896; P:response to stimulus; IEA:UniProt.
DR CDD; cd18026; DEXHc_POLQ-like; 1.
DR CDD; cd08638; DNA_pol_A_theta; 1.
DR CDD; cd18795; SF2_C_Ski2; 1.
DR Gene3D; 1.10.3380.20; -; 1.
DR Gene3D; 1.10.3380.30; -; 1.
DR Gene3D; 3.30.70.370; -; 1.
DR Gene3D; 1.10.150.20; 5' to 3' exonuclease, C-terminal subdomain; 1.
DR Gene3D; 3.40.50.300; P-loop containing nucleotide triphosphate hydrolases; 2.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR InterPro; IPR011545; DEAD/DEAH_box_helicase_dom.
DR InterPro; IPR001098; DNA-dir_DNA_pol_A_palm_dom.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR002298; DNA_polymerase_A.
DR InterPro; IPR014001; Helicase_ATP-bd.
DR InterPro; IPR001650; Helicase_C.
DR InterPro; IPR046931; HTH_61.
DR InterPro; IPR027417; P-loop_NTPase.
DR InterPro; IPR048960; POLQ-like_helical.
DR InterPro; IPR036397; RNaseH_sf.
DR PANTHER; PTHR10133; DNA POLYMERASE I; 1.
DR PANTHER; PTHR10133:SF62; DNA POLYMERASE THETA; 1.
DR Pfam; PF00270; DEAD; 1.
DR Pfam; PF00476; DNA_pol_A; 1.
DR Pfam; PF00271; Helicase_C; 1.
DR Pfam; PF20470; HTH_61; 1.
DR Pfam; PF21099; POLQ_helical; 1.
DR PRINTS; PR00868; DNAPOLI.
DR SMART; SM00487; DEXDc; 1.
DR SMART; SM00490; HELICc; 1.
DR SMART; SM00482; POLAc; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF52540; P-loop containing nucleoside triphosphate hydrolases; 1.
DR SUPFAM; SSF158702; Sec63 N-terminal domain-like; 1.
DR PROSITE; PS51192; HELICASE_ATP_BIND_1; 1.
DR PROSITE; PS51194; HELICASE_CTER; 1.
PE 4: Predicted;
KW ATP-binding {ECO:0000256|ARBA:ARBA00022840};
KW Nucleotide-binding {ECO:0000256|ARBA:ARBA00022741};
KW Reference proteome {ECO:0000313|Proteomes:UP000245207}.
FT DOMAIN 368..560
FT /note="Helicase ATP-binding"
FT /evidence="ECO:0000259|PROSITE:PS51192"
FT DOMAIN 600..800
FT /note="Helicase C-terminal"
FT /evidence="ECO:0000259|PROSITE:PS51194"
FT REGION 297..316
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1154..1225
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1243..1268
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1168..1190
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1205..1224
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1243..1259
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1961 AA; 218190 MW; 5F0DBCFA77F0506F CRC64;
MASDGSSRSR VDQFFSSKKN KFVSPLTKPA IVEKQAKLSH EVSPSAKGSL ANYLVSSQNE
DRSLSTLPTA CGPSDGQDQV RRNLTSDIES SVHGVENPEL QQFATNFLSY YCSEVPSTNV
NTNKRPSSPS VIDLEDQSSK KRHLTSVYNQ SRNELQMSLR KCNTTSNVIN LEDVDMTLCN
TPGVATVNVG TEVTPQSMRG SSLFSPGDTF WREAMQVADG MAHHGKDLSQ KVLEESQGAD
KEVSPLPVKH IDFSCEDKNM AVNNMITPCN PVQSLSRLQE TTSSCVITKR RADVLNQPND
NRSFTTPGTD IKSSNMNGFK SFENTPSSSV PPNDRLDISN WLPSEICITY KKRGISKLYP
WQVDCLQVDG VLQKRNLVYC ASTSAGKSFV AEILMLRRVL STRKMALLVL PYVSICTEKA
EHLEALLEPL NKHVRSYYGS QGGGTLPKDT SVAVCTIEKA NSLLNRLLEE GRLSEVGIIV
IDELHMVGDQ HRGYLLELML TKLRYGAGEG RVEFSSGESS GTSSGKTDPT SGLQIVGMSA
TLPNVGAVAD WLQAALYQTD FRPVPLEEYI KVGNAIYNKK MELVRTIPKR AELGGKDPDQ
IVELCNEVVQ EGHSVLIFCS SRKGCESTAK HVAKYLKQFS LNAPNSENEF PDIASAIDGL
QKSPAGLDPT LAETLPAGVA YHHAGLTVEE REIVENCYRK GLVRVLTATS TLAAGVNLPA
RRVIFRQPRV GRDFIDGTRY RQMSGRAGRT GIDTKGESVL VCKPEEVKRI VALIGDSCPP
LYSCLSEDKN GMTHAILEVV AGGIVQTAND IHRYVRCTLL NSTQPFEDVV KSAKDSLKWL
CHRKFLEWSE DTKLYSTTPL GRASFGSSLN PEESLIVLDD LSRAREGFVL ASDLHLVYLV
TPTNVDVQPD WELYYERFMQ LPKLDQSVGN RVGVQEPFLM RMAHGAPIRN TERSRHDMKG
LGVSTNGVLT DDQMLRVCKR FYVALILSRL VQEVPVAEVC ESFNVARGMV QSLQENAGRF
ASMVSVFCER LGWHDLESLV SKFQNRVSFG VRAEIAELTT IPYIKGSRAR ALYKAGLRTP
QAIAEASIPE IAKTIFESSS WDAQENTAQR RIQLGVAKKI KNGARKIVLE KAEEARDAAF
SAFKALGVDV PQFPTNITKK EPVTSSREDT TSSEGRTSSF VHVAQNNLKS KTIEVKEDKT
SDVGSGTSPE HNSDDTTLSI SGAGPSVGII SMKENTLDKI VPQKEEEKEG LRVGNKERNS
EKGPMSAVNS PGGFDTFLNM WDTMNEFFFD IHYNKRSELH TIAPFELQGI AICWEDSPVY
YISVPKDLYW SEDNKNKSPI GNNNFRAQQD QLEMANKRWC RICMILSKNH VRKFGWNLKI
QNQVLKHPAV SIQRFGSIKD SVKTMGVELI ENSYYMFSPV HLKDVVDLSV VVWILWPDEE
RNSNPNLEKE VKKRLSGEVA AAANQKGRWK NQMRRAAHNG CCRRAAQTRA LSSVLWKLLT
SENLQQPLST IEMPLVNVLA DMELSGIGVD MDGCIQSRYV LGKKLRYLEN EAYRLAGTRF
SLYTAADIAD VLYTRLKLPI PEGYQGKNHP STDKHCLEML RLEHPIIPVI KEHRTLAKLL
NCTLGSICSL AKLSMKTQRY TLHGHWLQTS TATGRLSMED PNLQCVEHMV EFKIDRDEKT
DGDSETEYYK VNPRDFFIPT QENWLLVTAD YSQIELRLMA HFSKDPSLVE LLTKPLGDVF
NMITARWTGK AESSVDPKER EQTKRLVYGI LYGMGANSLA DQLECSPEDA GEKIQSFKSY
FSGVASWLKE AVAGCRKKGY VETLMGRKRF LAKIKYGNSE EKSKAQRQSV NSICQGSAAD
IIKVAMINIH SVIAEGVEKS ESSVKFAERF HMLKGRCRVL LQVHDELILE ADPSVVNEAG
LLLKLSMESA ASLIVPLIVK LKCGRTWGSL EPLQPLQESN M
//