ID A0A2U1N466_ARTAN Unreviewed; 488 AA.
AC A0A2U1N466;
DT 18-JUL-2018, integrated into UniProtKB/TrEMBL.
DT 18-JUL-2018, sequence version 1.
DT 27-MAR-2024, entry version 10.
DE SubName: Full=Reverse transcriptase domain-containing protein {ECO:0000313|EMBL:PWA68269.1};
GN ORFNames=CTI12_AA310560 {ECO:0000313|EMBL:PWA68269.1}, CTI12_AA418720
GN {ECO:0000313|EMBL:PWA56672.1};
OS Artemisia annua (Sweet wormwood).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC asterids; campanulids; Asterales; Asteraceae; Asteroideae; Anthemideae;
OC Artemisiinae; Artemisia.
OX NCBI_TaxID=35608 {ECO:0000313|EMBL:PWA68269.1, ECO:0000313|Proteomes:UP000245207};
RN [1] {ECO:0000313|EMBL:PWA68269.1, ECO:0000313|Proteomes:UP000245207}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Huhao1 {ECO:0000313|Proteomes:UP000245207};
RC TISSUE=Leaf {ECO:0000313|EMBL:PWA68269.1};
RX PubMed=29703587; DOI=10.1016/j.molp.2018.03.015;
RA Shen Q., Zhang L., Liao Z., Wang S., Yan T., Shi P., Liu M., Fu X., Pan Q.,
RA Wang Y., Lv Z., Lu X., Zhang F., Jiang W., Ma Y., Chen M., Hao X., Li L.,
RA Tang Y., Lv G., Zhou Y., Sun X., Brodelius P.E., Rose J.K.C., Tang K.;
RT "The genome of Artemisia annua provides insight into the evolution of
RT Asteraceae family and artemisinin biosynthesis.";
RL Mol. Plant 11:776-788(2018).
CC -!- CAUTION: The sequence shown here is derived from an EMBL/GenBank/DDBJ
CC whole genome shotgun (WGS) entry which is preliminary data.
CC {ECO:0000313|EMBL:PWA68269.1}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; PKPP01006387; PWA56672.1; -; Genomic_DNA.
DR EMBL; PKPP01003673; PWA68269.1; -; Genomic_DNA.
DR AlphaFoldDB; A0A2U1N466; -.
DR Proteomes; UP000245207; Unassembled WGS sequence.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0003964; F:RNA-directed DNA polymerase activity; IEA:UniProtKB-KW.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR Gene3D; 4.10.60.10; Zinc finger, CCHC-type; 1.
DR InterPro; IPR005162; Retrotrans_gag_dom.
DR InterPro; IPR001878; Znf_CCHC.
DR InterPro; IPR036875; Znf_CCHC_sf.
DR PANTHER; PTHR35046:SF9; CCHC-TYPE DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR35046; ZINC KNUCKLE (CCHC-TYPE) FAMILY PROTEIN; 1.
DR Pfam; PF03732; Retrotrans_gag; 1.
DR Pfam; PF00098; zf-CCHC; 1.
DR SMART; SM00343; ZnF_C2HC; 1.
DR SUPFAM; SSF57756; Retrovirus zinc finger-like domains; 1.
DR PROSITE; PS50158; ZF_CCHC; 1.
PE 4: Predicted;
KW Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00047};
KW Nucleotidyltransferase {ECO:0000313|EMBL:PWA68269.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000245207};
KW RNA-directed DNA polymerase {ECO:0000313|EMBL:PWA68269.1};
KW Transferase {ECO:0000313|EMBL:PWA68269.1};
KW Zinc {ECO:0000256|PROSITE-ProRule:PRU00047};
KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00047}.
FT DOMAIN 313..329
FT /note="CCHC-type"
FT /evidence="ECO:0000259|PROSITE:PS50158"
FT REGION 41..90
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 260..306
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 41..61
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 72..90
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 264..306
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 488 AA; 55960 MW; 9D96303FFE0B67CB CRC64;
MPPRRNRNLN DIHEQEFEQR VMARMEERMG QFVDQLTDHM NDLMNQRRPR NRNRRESEDE
ESENPFGEGD GSSSDEQERR PRRNEREDNR RWESGLRVNI PDFDGDTLNP EGFIDWLAAV
EEVFEFKDIP ENKRVPLIAT KLRGRASAWW QQLKLTRERV GKPKVTTWQK MKKCMRANFI
PHNYQRLMYQ RLQNLKQGAK SVEDYTTEFY QLIARNDIQE TEEQLVSRYI GGLRFQIMDS
VNMFDPVTLS DAHQRALAFE KQNRRVGGSS SSAITGGSSG SGNVTSRFVP NQAKQGSSNT
GPVSKGVGSS TLKCFNCGEP GHRQSECKKA GKRHLFADPD GLHEVHKTVH DNLVRANSKY
KQDADQKRRH VDFEVGDFVW AVLTKDRFSV GEYNKLSAKK IGPLEIVEKI NSNAYRLKLP
SHIRCSDVFN VKHLLPYHGD SDDDLAVNSR ANFVYPGGND GGPSVEERAI MFLEAQDRVT
KGASLKWA
//