ID Q7XBQ6_ORYSJ Unreviewed; 1661 AA.
AC Q7XBQ6;
DT 01-OCT-2003, integrated into UniProtKB/TrEMBL.
DT 01-OCT-2003, sequence version 1.
DT 27-MAR-2024, entry version 128.
DE RecName: Full=RNA-directed DNA polymerase {ECO:0000256|ARBA:ARBA00012493};
DE EC=2.7.7.49 {ECO:0000256|ARBA:ARBA00012493};
GN OrderedLocusNames=LOC_Os10g09440 {ECO:0000313|EMBL:ABB46918.1};
OS Oryza sativa subsp. japonica (Rice).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; Liliopsida; Poales; Poaceae; BOP clade;
OC Oryzoideae; Oryzeae; Oryzinae; Oryza; Oryza sativa.
OX NCBI_TaxID=39947 {ECO:0000313|EMBL:ABB46918.1};
RN [1] {ECO:0000313|EMBL:ABB46918.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=12791992; DOI=10.1126/science.1083523;
RG Rice Chromosome 10 Sequencing Consortium;
RT "In-depth view of structure, activity, and evolution of rice chromosome
RT 10.";
RL Science 300:1566-1569(2003).
RN [2] {ECO:0000313|EMBL:ABB46918.1}
RP NUCLEOTIDE SEQUENCE.
RA Buell C.R., Wing R.A., McCombie W.R., Messing J., Yuan Q., Ouyang S.;
RL Submitted (MAY-2003) to the EMBL/GenBank/DDBJ databases.
RN [3] {ECO:0000313|EMBL:ABB46918.1}
RP NUCLEOTIDE SEQUENCE.
RA Buell R.;
RL Submitted (JUL-2006) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; DP000086; ABB46918.1; -; Genomic_DNA.
DR GO; GO:0004190; F:aspartic-type endopeptidase activity; IEA:InterPro.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0003964; F:RNA-directed DNA polymerase activity; IEA:UniProtKB-KW.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR GO; GO:0006508; P:proteolysis; IEA:InterPro.
DR CDD; cd00303; retropepsin_like; 1.
DR CDD; cd09274; RNase_HI_RT_Ty3; 1.
DR CDD; cd01647; RT_LTR; 1.
DR Gene3D; 1.10.340.70; -; 1.
DR Gene3D; 3.30.70.270; -; 2.
DR Gene3D; 2.40.70.10; Acid Proteases; 1.
DR Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 2.
DR Gene3D; 4.10.60.10; Zinc finger, CCHC-type; 1.
DR InterPro; IPR001969; Aspartic_peptidase_AS.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR041588; Integrase_H2C2.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR InterPro; IPR005162; Retrotrans_gag_dom.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR000477; RT_dom.
DR InterPro; IPR041373; RT_RNaseH.
DR InterPro; IPR001878; Znf_CCHC.
DR InterPro; IPR036875; Znf_CCHC_sf.
DR PANTHER; PTHR24559:SF434; RNA-DIRECTED DNA POLYMERASE HOMOLOG; 1.
DR PANTHER; PTHR24559; TRANSPOSON TY3-I GAG-POL POLYPROTEIN; 1.
DR Pfam; PF17921; Integrase_H2C2; 1.
DR Pfam; PF03732; Retrotrans_gag; 1.
DR Pfam; PF17917; RT_RNaseH; 1.
DR Pfam; PF08284; RVP_2; 1.
DR Pfam; PF00078; RVT_1; 1.
DR Pfam; PF00098; zf-CCHC; 1.
DR SMART; SM00343; ZnF_C2HC; 1.
DR SUPFAM; SSF50630; Acid proteases; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF57756; Retrovirus zinc finger-like domains; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS00141; ASP_PROTEASE; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
DR PROSITE; PS50878; RT_POL; 1.
DR PROSITE; PS50158; ZF_CCHC; 1.
PE 4: Predicted;
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00047};
KW Nucleotidyltransferase {ECO:0000256|ARBA:ARBA00022695};
KW RNA-directed DNA polymerase {ECO:0000256|ARBA:ARBA00022918};
KW Transferase {ECO:0000256|ARBA:ARBA00022679};
KW Zinc {ECO:0000256|PROSITE-ProRule:PRU00047};
KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00047}.
FT DOMAIN 632..647
FT /note="CCHC-type"
FT /evidence="ECO:0000259|PROSITE:PS50158"
FT DOMAIN 880..1059
FT /note="Reverse transcriptase"
FT /evidence="ECO:0000259|PROSITE:PS50878"
FT DOMAIN 1424..1587
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT REGION 268..301
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 544..606
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 273..292
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 555..604
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1661 AA; 189611 MW; B01BCFD3CE5E20E6 CRC64;
MVPGLEWLVP GLEVIFVQLG LVVQMWLGLC NMEIPGVTSW YQSHLDLRIS QMMPRHSPRN
LIEVLGFVRE LRLMAFEVGY EVAPEYRQIP HDPDEEKCRV RVTLASDSED LPSFKFEAGG
RSYCHACQEV ALVAIGELRQ HFEEELDSSA FQYHPHKPHG QDYGTYICPD GEESATLMHA
VHMLSAMDAV SVERDKAAHD RENWNRGKIC KLESKVYRLQ KELAELKGEM PPPTPKLRLV
ARKRTCPPPR LQLASKIRVM GEAVPDRAEP VIDIISDEEE EEDPEEREPA TPEEEDSSLR
SDASCCLHHG WDMVNTRTGT GSGSGSGANN NEGEPTLAQI LAQQTQLINL LVQQAQNQQG
NNQNQNPPPP PQNKLADFLR VRPPTFSSTT NPVEAGDWLH AVEKKLDLIQ CTEQEKVSFA
SHQLHGPAAE WWDHFRQGRA GGEPITWQEF TAAFKKTHIP SGVVALKKRE FRALNQGSRS
VTEYLHDFNR LARYAPEDVR TDEERQEKFL EGLNDKLSYA LMSTDFQDFQ QLVDKAIRQE
DKYNRMEQKK RRAAQFKAQQ GSNQRPRLVT GPQVPSYPQG GSSSVVRPQR QFYNNNTGNR
GNDNRNVVAR PAATPVQNQP VRREQGSKPV ICFNCGDPGH YADKCPKPRR VKNAPAPNNF
NVPAPKARVN HVAAAEAQNA PDVVLGTFPV NSIPATVLFD SGATHSFLCK SFAIKHGMEV
VSLGRPLLVN TPGNQAFSTR YCPSVTIEIE EVPFPSSLIL LESKDLDVIL GMDWLSRHRG
VIDCANRKVT LTSSNGETVS FFASSPKSHG EVLNQVALQE IPIVQDYPDV FPEDLPGMPP
KRDIEFRIDL VPGTNPIHKR PYRMAANELA EVKKQVDDLI QKGYIRPSTS PWGAPVIFVE
KKDHTQRMCV DYRALNEVTI KNKYPLPRID DLFDQLEGAT VFSKIDLRSG YHQLRIREED
IPKTAFTTRY GLFECTVMSF GLTNAPAFFM NLMNKVFMEY LDKFVVVFID DILIYSKTKE
EHEEHLRLAL EKLREHQLYA KFSKCEFWLS EVKFLGHVIS SGGVAVDPSN VESVLSWKQP
KTVSEIRSFL GLAGYYRRFV ENFSKIARPM TRLLQKDVKY KWTEDCEQSF QELKKMLVTA
PVLILPDSRK GFQVYCDASR HGLGCVLMQE GKVVAYASRQ LRPHENNYPT HDLELAAVVH
ALKIWRHYLF GNRTEIYTDH KSLKYIFTQP DLNMRQRRWL ELIKDYDMEI HYHPGKANVV
ADALSRKSYC NMSEARCLPW ELCQEFERLN LGIVSKGFVA TLEAKPTLFD QIREAQMNDP
DIQEIKKNMR RGKAIGFVED EQGTVWLGER ICVPENKELK NTIMKEAHET LYSIHPGSTK
MYQDLKLQFW WASMRREIAE YVALCDVCQR VKAEHQKPAG LLQPLKIPEW KWEEIGMDFI
TGLPRTSAGH DSIWVVVDRL TKVAHFIPIK TTYTGHKLAE LYMARVVCLQ GVPKKIVSDR
GSQFTSMFWQ KLQSELGTRL NFSTAYHPQT DGQTERVNQI LEDMLRACVL DFGGSWDKNL
PYAEFSYNNS YQASLQMSPN EALYGRKCCT PLLWDQTGER QVFGTDILRE AEEKVKIIQE
RLRVAQSRQK SYADNRRRDL AFEEGDYVYL RVTPLRGVHR F
//