ID A0A2U1NID1_ARTAN Unreviewed; 1172 AA.
AC A0A2U1NID1;
DT 18-JUL-2018, integrated into UniProtKB/TrEMBL.
DT 18-JUL-2018, sequence version 1.
DT 27-MAR-2024, entry version 10.
DE SubName: Full=RNA-directed DNA polymerase, eukaryota {ECO:0000313|EMBL:PWA73265.1};
GN ORFNames=CTI12_AA262640 {ECO:0000313|EMBL:PWA73265.1};
OS Artemisia annua (Sweet wormwood).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC asterids; campanulids; Asterales; Asteraceae; Asteroideae; Anthemideae;
OC Artemisiinae; Artemisia.
OX NCBI_TaxID=35608 {ECO:0000313|EMBL:PWA73265.1, ECO:0000313|Proteomes:UP000245207};
RN [1] {ECO:0000313|EMBL:PWA73265.1, ECO:0000313|Proteomes:UP000245207}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Huhao1 {ECO:0000313|Proteomes:UP000245207};
RC TISSUE=Leaf {ECO:0000313|EMBL:PWA73265.1};
RX PubMed=29703587; DOI=10.1016/j.molp.2018.03.015;
RA Shen Q., Zhang L., Liao Z., Wang S., Yan T., Shi P., Liu M., Fu X., Pan Q.,
RA Wang Y., Lv Z., Lu X., Zhang F., Jiang W., Ma Y., Chen M., Hao X., Li L.,
RA Tang Y., Lv G., Zhou Y., Sun X., Brodelius P.E., Rose J.K.C., Tang K.;
RT "The genome of Artemisia annua provides insight into the evolution of
RT Asteraceae family and artemisinin biosynthesis.";
RL Mol. Plant 11:776-788(2018).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PWA73265.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; PKPP01002763; PWA73265.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A2U1NID1; -.
DR STRING; 35608.A0A2U1NID1; -.
DR Proteomes; UP000245207; Unassembled WGS sequence.
DR GO; GO:0003964; F:RNA-directed DNA polymerase activity; IEA:UniProtKB-KW.
DR CDD; cd01650; RT_nLTR_like; 1.
DR Gene3D; 3.60.10.10; Endonuclease/exonuclease/phosphatase; 1.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR036691; Endo/exonu/phosph_ase_sf.
DR InterPro; IPR005135; Endo/exonuclease/phosphatase.
DR InterPro; IPR000477; RT_dom.
DR InterPro; IPR026960; RVT-Znf.
DR PANTHER; PTHR46890:SF1; FGR14P; 1.
DR PANTHER; PTHR46890; NON-LTR RETROLELEMENT REVERSE TRANSCRIPTASE-LIKE PROTEIN-RELATED; 1.
DR Pfam; PF14529; Exo_endo_phos_2; 1.
DR Pfam; PF00078; RVT_1; 1.
DR Pfam; PF13966; zf-RVT; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF56219; DNase I-like; 1.
DR PROSITE; PS50878; RT_POL; 1.
PE 4: Predicted;
KW Nucleotidyltransferase {ECO:0000313|EMBL:PWA73265.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000245207};
KW RNA-directed DNA polymerase {ECO:0000313|EMBL:PWA73265.1};
KW Transferase {ECO:0000313|EMBL:PWA73265.1}.
FT DOMAIN 440..717
FT /note="Reverse transcriptase"
FT /evidence="ECO:0000259|PROSITE:PS50878"
SQ SEQUENCE 1172 AA; 135139 MW; 40E4DE54DFD22B4A CRC64;
MEFPAVRNCW GNLTFNYIHS NSVGNSGGIL CVWDLNSFTK VSHTLSDYFV IIRGKWLKND
TDIMFVAVYG PHDPRDKRMV WEYLTHVINQ WNGNVVVMGD FNEVRFKSDR FGSIFNVQGA
DDFNSFIEDA GLEEVPLGGS AFTWCHKSAT KMSKLDRFFI SNSLLNSCPH VSAITLDRFL
SDHRPILLCE TNFDYGPIPF RFFHHWIKLD GFSTFVSDVW NSAPVNKDNG MRNLAGKLKF
LKGKLREWIK SSRVKGKSDS SNLKEELRVL DETIDKGDRS DGLIQRRMEI MNDIQYLDQL
HAMELAQKSK IKWAIEGDEN TRFFHGVLNK KRNQMAIRGV LVDGKWIDQP SDVKMEFFNH
FRDRFSKPVE NRITLELAFP KQISRDQQNE LERMVTKEEL KMAVWDCGTD KSPGPDGFSF
GFFRHFWSIL EKDVFEAVSH FFVHGDIPPG CNPSFITLIP KVPAANMVKD FRPISLIGCL
YKIIAKILAN RLVGVLEDIV HEVQSAFIAN RQILDGPFII NEVHQWCKSK KKQSMLFKVD
FEKAYDSVRW DFLDDVLNKF GFGNKWRNWI HCCLKSSRGS ILVNGSPTEE FQFYKGLKQG
DPLSPFLFIL IMESLHISFQ RVVESNMFKG IKLSDSLCIS HMFYADDAVF VGNWSDENIN
TLTHVLDVFY RASGLKINMC KSKILGINVD VHKVNQAANK LGCLILSCPF SYLGSKVGVS
MSRIEDWREV IDKVKCRLSK WKMKCLSIGG RLTLLKSVLG AMPIFNMSIF KVPAKVLKML
ESIRGRFFNG HESDSRKAYW IKWDKVLANK DKGGLGVSSF FALNRGLMFK WVWRFLNNES
SLWKKVIKAI HGDGGNIHNA TRFGINSCWS SIVKEAQNMK SRGINIFDYL KLRLGKGNSI
KFWDDDWYQG GILKDICPRM YALETRKDVT VCEKMKDPSI CFSFRRGIRG GSEQDQFNHL
EAVSNSITLG PREDRWVWSL EGSGEFSVAS LRRKIDDIHL PNVGTKTRWT KLVPIKVNVL
AWKVMVDALP TRWNLSRRGI NIPSMLCPIC GIGVESSSHL FFRCEVSRHI GQSLAKWWDI
PVQDVVSYDD WKGWLISIRL GSKLKGVLEG VWITMWWYIW WYRNKMLFDD NPPKKACLFD
RIVTSSFQWC RSRCKSSLDW NEWLKTPYLI SL
//