ID A0A251S917_HELAN Unreviewed; 1525 AA.
AC A0A251S917;
DT 22-NOV-2017, integrated into UniProtKB/TrEMBL.
DT 22-NOV-2017, sequence version 1.
DT 27-MAR-2024, entry version 21.
DE RecName: Full=RNA-directed DNA polymerase {ECO:0000256|ARBA:ARBA00012493};
DE EC=2.7.7.49 {ECO:0000256|ARBA:ARBA00012493};
GN ORFNames=HannXRQ_Chr15g0465871 {ECO:0000313|EMBL:OTF93860.1};
OS Helianthus annuus (Common sunflower).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC asterids; campanulids; Asterales; Asteraceae; Asteroideae;
OC Heliantheae alliance; Heliantheae; Helianthus.
OX NCBI_TaxID=4232 {ECO:0000313|EMBL:OTF93860.1, ECO:0000313|Proteomes:UP000215914};
RN [1] {ECO:0000313|Proteomes:UP000215914}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. SF193 {ECO:0000313|Proteomes:UP000215914};
RX PubMed=28538728; DOI=10.1038/nature22380;
RA Badouin H., Gouzy J., Grassa C.J., Murat F., Staton S.E., Cottret L.,
RA Lelandais-Briere C., Owens G.L., Carrere S., Mayjonade B., Legrand L.,
RA Gill N., Kane N.C., Bowers J.E., Hubner S., Bellec A., Berard A.,
RA Berges H., Blanchet N., Boniface M.C., Brunel D., Catrice O., Chaidir N.,
RA Claudel C., Donnadieu C., Faraut T., Fievet G., Helmstetter N., King M.,
RA Knapp S.J., Lai Z., Le Paslier M.C., Lippi Y., Lorenzon L., Mandel J.R.,
RA Marage G., Marchand G., Marquand E., Bret-Mestries E., Morien E.,
RA Nambeesan S., Nguyen T., Pegot-Espagnet P., Pouilly N., Raftis F.,
RA Sallet E., Schiex T., Thomas J., Vandecasteele C., Vares D., Vear F.,
RA Vautrin S., Crespi M., Mangin B., Burke J.M., Salse J., Munos S.,
RA Vincourt P., Rieseberg L.H., Langlade N.B.;
RT "The sunflower genome provides insights into oil metabolism, flowering and
RT Asterid evolution.";
RL Nature 546:148-152(2017).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM007904; OTF93860.1; -; Genomic_DNA.
DR InParanoid; A0A251S917; -.
DR Proteomes; UP000215914; Chromosome 15.
DR GO; GO:0043227; C:membrane-bounded organelle; IEA:UniProt.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0003964; F:RNA-directed DNA polymerase activity; IEA:UniProtKB-KW.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR CDD; cd00303; retropepsin_like; 1.
DR CDD; cd09274; RNase_HI_RT_Ty3; 1.
DR CDD; cd01647; RT_LTR; 1.
DR Gene3D; 1.10.340.70; -; 1.
DR Gene3D; 3.30.70.270; -; 2.
DR Gene3D; 2.40.70.10; Acid Proteases; 1.
DR Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR Gene3D; 4.10.60.10; Zinc finger, CCHC-type; 1.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR041588; Integrase_H2C2.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR000477; RT_dom.
DR InterPro; IPR041373; RT_RNaseH.
DR InterPro; IPR045358; Ty3_capsid.
DR InterPro; IPR001878; Znf_CCHC.
DR InterPro; IPR036875; Znf_CCHC_sf.
DR PANTHER; PTHR45835:SF105; IPP TRANSFERASE; 1.
DR PANTHER; PTHR45835; YALI0A06105P; 1.
DR Pfam; PF17921; Integrase_H2C2; 1.
DR Pfam; PF17917; RT_RNaseH; 1.
DR Pfam; PF08284; RVP_2; 1.
DR Pfam; PF00078; RVT_1; 1.
DR Pfam; PF19259; Ty3_capsid; 1.
DR SMART; SM00343; ZnF_C2HC; 2.
DR SUPFAM; SSF50630; Acid proteases; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF57756; Retrovirus zinc finger-like domains; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
DR PROSITE; PS50878; RT_POL; 1.
DR PROSITE; PS50158; ZF_CCHC; 1.
PE 4: Predicted;
KW Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00047};
KW Nucleotidyltransferase {ECO:0000313|EMBL:OTF93860.1};
KW Reference proteome {ECO:0000313|Proteomes:UP000215914};
KW RNA-directed DNA polymerase {ECO:0000313|EMBL:OTF93860.1};
KW Transferase {ECO:0000313|EMBL:OTF93860.1};
KW Zinc {ECO:0000256|PROSITE-ProRule:PRU00047};
KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00047}.
FT DOMAIN 379..394
FT /note="CCHC-type"
FT /evidence="ECO:0000259|PROSITE:PS50158"
FT DOMAIN 645..824
FT /note="Reverse transcriptase"
FT /evidence="ECO:0000259|PROSITE:PS50878"
FT DOMAIN 1167..1330
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT REGION 1..21
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 41..97
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 286..332
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..20
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 70..86
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 286..308
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 317..332
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1525 AA; 174213 MW; 7D4F032604960BD2 CRC64;
MAKSEETNSH SGERSDNAKI HVTGAELQAL IDNAVAKAMD RQFKESSGTQ SRTRSVTHVK
AKTHSGAHSK PPSKKSEPKK AEDDNHSSNQ SSVPKQEIKL KHDTYSRSCT YKYFVSCKPR
DFTGEKGAVD CMTWIDEMDT VVDISGCADR DVVKYVSQSF KGDALAWWKS LLQAAGKATL
YGLSWEQFVA LIKENFCPQH EVERIESDFV SLVMKNLDCQ AYLTTFNTLS RLVPYLVTPE
PRRIARFIGG LAPEIKASVK ASRPTTFRSV ADLSLSLTQD VVRLRAMKSS EENKRKREDD
TSRRSEKRHR GNNDHRKGSG SRKSDHQSGE KPRCKICRRH HFGRCRLETK SQSSEKRCGI
CKSTDHKAVD CKKMKDATCF GCNEKGHIRP NCPKFAKKAE EGKKTNARVF RMDAKEAVLD
DNVITGTFLV NDVFARVLFD SGADKSFVDD KFCKLLNLPV KTLSVKYEVE LADGTLETAS
TVLDGCVISI RNHSFPLSLL PFKLAGFDIV IGMDWLSSNQ AQILCNRKQV IVKTPSGESL
TIQGDTQHGL PEQVSMLKAS RCMQKGCVIY MAQVTIDEPK PKIEDIPVIS EYPEVFPEEL
PGLPPDRQVE FRIDIIPGAA PVARAPYRLA PTEMKELRTQ LDELLAKGFI RPSSSPWGAP
ILFVKKKDGS MRLCIDYREL NKVTIKNRYP LPRIDDLFDQ LQGASYFSKI DLRSGYHQLK
VKDEDVHKTA FRTRYGHYEF LVMPFGLTNA PAAFMDLMNR VCKPYLDKFV IVFIDDILIY
SKSQADHEKH LRCILKLLYQ EKLYAKFSKC EFWLREVQFL GHVVSERGIQ VDPAKVEAVM
NWQEPKTPTE IRSFLGLAGY YRRFIENFSR IAAPLTSLTR KKIEFDWGPK QQESFDILKQ
KLSNAPVLTL PDGIEEFVVY CDASHTGMGC VLMQKGKVIA YASRQLKVHE KNYTTHDLEL
GAVVFALKLW RHYLYGTKCI IYSDHKSLQH LFNQKELNMR QRRWMETLND YDCEIRYHPG
KANVVADALS RKERVKPIRI NAKRIEIRNN LNERVLAAQK EAVLEANYPA EKLGVTEEQL
SHDKDGMLRL NGRIWVPVYG GLRDVILQEA HSSKYSVHPG ADKMYQDLKS NYWWIGLKKS
VAEHVAKCLT CAQVKAEHQK PSGLLQQPEI PKWKWEMVTM DFITKLPKTK KGNDTIWVIV
DRLTKSAHFL PIKETYSSDM LAQLYVDKIV ALHGIPVSII SDRDTRYTSH FWKSFQQSLG
TRLNFSTAYH PQTDGQSERT IQTLEDMLRA CAIDLGGSWD KNLPLIEFSY NNSYHSSIKA
APFEALYGRK CRSPICWAEV GDVQLSGPDI VFETTDKIVQ IRDRLKAARD RQKSYADPKR
KDFHFDVGEK VLLKVSPWKG VMRFGKKGKL SPRYIGPFEI IERVGAVAYK LKLPEELSAI
HNVFHICNLK KCFADDSLVI PHTDIHIDES LKFVEKPLSI EDRQVKKLRR KHVPIVKVKW
DARRGPEYTW EVESTMKEKY PHLFE
//