ID Q7XNG6_ORYSJ Unreviewed; 1946 AA.
AC Q7XNG6;
DT 01-OCT-2003, integrated into UniProtKB/TrEMBL.
DT 15-FEB-2005, sequence version 3.
DT 27-MAR-2024, entry version 95.
DE RecName: Full=RNA-directed DNA polymerase {ECO:0000256|ARBA:ARBA00012493};
DE EC=2.7.7.49 {ECO:0000256|ARBA:ARBA00012493};
GN ORFNames=OSJNBa0096F01.7 {ECO:0000313|EMBL:CAE04098.3};
OS Oryza sativa subsp. japonica (Rice).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; Liliopsida; Poales; Poaceae; BOP clade;
OC Oryzoideae; Oryzeae; Oryzinae; Oryza; Oryza sativa.
OX NCBI_TaxID=39947 {ECO:0000313|EMBL:CAE04098.3, ECO:0000313|Proteomes:UP000000763};
RN [1] {ECO:0000313|Proteomes:UP000000763}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Nipponbare {ECO:0000313|Proteomes:UP000000763};
RX PubMed=16100779; DOI=10.1038/nature03895;
RG International rice genome sequencing project (IRGSP);
RA Matsumoto T., Wu J., Kanamori H., Katayose Y., Fujisawa M., Namiki N.,
RA Mizuno H., Yamamoto K., Antonio B.A., Baba T., Sakata K., Nagamura Y.,
RA Aoki H., Arikawa K., Arita K., Bito T., Chiden Y., Fujitsuka N.,
RA Fukunaka R., Hamada M., Harada C., Hayashi A., Hijishita S., Honda M.,
RA Hosokawa S., Ichikawa Y., Idonuma A., Iijima M., Ikeda M., Ikeno M.,
RA Ito K., Ito S., Ito T., Ito Y., Ito Y., Iwabuchi A., Kamiya K.,
RA Karasawa W., Kurita K., Katagiri S., Kikuta A., Kobayashi H., Kobayashi N.,
RA Machita K., Maehara T., Masukawa M., Mizubayashi T., Mukai Y., Nagasaki H.,
RA Nagata Y., Naito S., Nakashima M., Nakama Y., Nakamichi Y., Nakamura M.,
RA Meguro A., Negishi M., Ohta I., Ohta T., Okamoto M., Ono N., Saji S.,
RA Sakaguchi M., Sakai K., Shibata M., Shimokawa T., Song J., Takazaki Y.,
RA Terasawa K., Tsugane M., Tsuji K., Ueda S., Waki K., Yamagata H.,
RA Yamamoto M., Yamamoto S., Yamane H., Yoshiki S., Yoshihara R., Yukawa K.,
RA Zhong H., Yano M., Yuan Q., Ouyang S., Liu J., Jones K.M., Gansberger K.,
RA Moffat K., Hill J., Bera J., Fadrosh D., Jin S., Johri S., Kim M.,
RA Overton L., Reardon M., Tsitrin T., Vuong H., Weaver B., Ciecko A.,
RA Tallon L., Jackson J., Pai G., Aken S.V., Utterback T., Reidmuller S.,
RA Feldblyum T., Hsiao J., Zismann V., Iobst S., de Vazeille A.R., Buell C.R.,
RA Ying K., Li Y., Lu T., Huang Y., Zhao Q., Feng Q., Zhang L., Zhu J.,
RA Weng Q., Mu J., Lu Y., Fan D., Liu Y., Guan J., Zhang Y., Yu S., Liu X.,
RA Zhang Y., Hong G., Han B., Choisne N., Demange N., Orjeda G., Samain S.,
RA Cattolico L., Pelletier E., Couloux A., Segurens B., Wincker P., D'Hont A.,
RA Scarpelli C., Weissenbach J., Salanoubat M., Quetier F., Yu Y., Kim H.R.,
RA Rambo T., Currie J., Collura K., Luo M., Yang T., Ammiraju J.S.S.,
RA Engler F., Soderlund C., Wing R.A., Palmer L.E., de la Bastide M.,
RA Spiegel L., Nascimento L., Zutavern T., O'Shaughnessy A., Dike S.,
RA Dedhia N., Preston R., Balija V., McCombie W.R., Chow T., Chen H.,
RA Chung M., Chen C., Shaw J., Wu H., Hsiao K., Chao Y., Chu M., Cheng C.,
RA Hour A., Lee P., Lin S., Lin Y., Liou J., Liu S., Hsing Y., Raghuvanshi S.,
RA Mohanty A., Bharti A.K., Gaur A., Gupta V., Kumar D., Ravi V., Vij S.,
RA Kapur A., Khurana P., Khurana P., Khurana J.P., Tyagi A.K., Gaikwad K.,
RA Singh A., Dalal V., Srivastava S., Dixit A., Pal A.K., Ghazi I.A.,
RA Yadav M., Pandit A., Bhargava A., Sureshbabu K., Batra K., Sharma T.R.,
RA Mohapatra T., Singh N.K., Messing J., Nelson A.B., Fuks G., Kavchok S.,
RA Keizer G., Linton E., Llaca V., Song R., Tanyolac B., Young S., Ho-Il K.,
RA Hahn J.H., Sangsakoo G., Vanavichit A., de Mattos Luiz.A.T., Zimmer P.D.,
RA Malone G., Dellagostin O., de Oliveira A.C., Bevan M., Bancroft I.,
RA Minx P., Cordum H., Wilson R., Cheng Z., Jin W., Jiang J., Leong S.A.,
RA Iwama H., Gojobori T., Itoh T., Niimura Y., Fujii Y., Habara T., Sakai H.,
RA Sato Y., Wilson G., Kumar K., McCouch S., Juretic N., Hoen D., Wright S.,
RA Bruskiewich R., Bureau T., Miyao A., Hirochika H., Nishikawa T.,
RA Kadowaki K., Sugiura M., Burr B., Sasaki T.;
RT "The map-based sequence of the rice genome.";
RL Nature 436:793-800(2005).
RN [2] {ECO:0000313|Proteomes:UP000000763}
RP GENOME REANNOTATION.
RC STRAIN=cv. Nipponbare {ECO:0000313|Proteomes:UP000000763};
RX PubMed=18089549; DOI=10.1093/nar/gkm978;
RG The rice annotation project (RAP);
RT "The rice annotation project database (RAP-DB): 2008 update.";
RL Nucleic Acids Res. 36:D1028-D1033(2008).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AL662933; CAE04098.3; -; Genomic_DNA.
DR Proteomes; UP000000763; Chromosome 4.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0004523; F:RNA-DNA hybrid ribonuclease activity; IEA:InterPro.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR GO; GO:0006310; P:DNA recombination; IEA:UniProtKB-KW.
DR CDD; cd00303; retropepsin_like; 1.
DR CDD; cd09279; RNase_HI_like; 1.
DR CDD; cd01647; RT_LTR; 1.
DR Gene3D; 1.10.340.70; -; 1.
DR Gene3D; 3.30.70.270; -; 2.
DR Gene3D; 2.40.70.10; Acid Proteases; 1.
DR Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 2.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR InterPro; IPR005162; Retrotrans_gag_dom.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR002156; RNaseH_domain.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR000477; RT_dom.
DR InterPro; IPR041373; RT_RNaseH.
DR PANTHER; PTHR37984:SF7; INTEGRASE CATALYTIC DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR37984; PROTEIN CBG26694; 1.
DR Pfam; PF03732; Retrotrans_gag; 1.
DR Pfam; PF17917; RT_RNaseH; 1.
DR Pfam; PF00665; rve; 1.
DR Pfam; PF00078; RVT_1; 1.
DR Pfam; PF13456; RVT_3; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 2.
DR PROSITE; PS50994; INTEGRASE; 1.
PE 4: Predicted;
KW DNA recombination {ECO:0000256|ARBA:ARBA00023172};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723}.
FT DOMAIN 1654..1820
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT REGION 1..65
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 321..393
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 417..466
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1068..1230
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1478..1526
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 369..385
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 417..453
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1208..1230
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1946 AA; 214619 MW; E4FE23C35838EC5F CRC64;
MAEPAKAHDL SPSMSGDDGE PNPRRRARTP PPPPRQSPRR EKALERAEGS ATSTPTGGGE
RRRDGERRLL VYGDGSTPQG ALQAAGALLR HPPVAHDPES LAQRWLDDVA KLVMTARQRL
DAGGRSSATK ASGAATTGSA SAFVASLRNV RWPPRFRPTI AEKYDGSVNP AEFLQVYTTG
IEAAGGDDRV MANFFPMALK GQARGWLMNL PPASVHSWED LCQQFTMNFQ GTYPRPGEEA
ELHAVQRRDD ESLRSYIQRF CQVRNTIPCI PAHAVIYAFR GGVRHNRMLE KIASKEPQTT
AELFQLADRV ARKEEAWTWN PSGSGVVASA APGSAAQIGR RDRRKKKRSA HADDEGHVLA
VEGASRATRK GRPAGDKKNE AGAPNRERPT GKWCTVHNTS LHDLADCRAV KSLAERTRKW
EEERRQERRE GKSPAVPSDN RRSEAKQKAP AEDIDDGDDD LGFQEPGATI ATVDGGACAH
VSRRSFKAMK RELLAATPTH EATCRARWSE VALTFDQTDH PPCVARGGQI AMVVSPTVCN
VKLGRVLIDG GAALNILSPA AFDAIKAPGM VLRPSQPIIG VTPGHTWPLG HIDLPVTFGG
SANFRTERVN FDVADLSLPY NAVLGRPALV KFMAAVHYAY LQMKMPGPGG PISVHGDLKV
ALACLEQRAD HLAAASKTEG GDERLGASAP ATPRQRMITG DEVPEDALVS FLRANADVFA
WRPADMPGVP RGLIEHRLAV RPGARPVRQK VRRQAPERQA FIREEVARLL EAGFIREVIH
LEWLANPVVV PKANGKLRMC IDYTDLNKAC PKDPYPLPRI DQIVDSTAGC DLLCFLDAYS
GYHQIRMARE DEEKTAFITP IGTYCYTTMP FGLKNAGPTF QRTTRISLGS QLGRNVEAYV
DDLVVKMRNQ EMLLSDLAET FESLRSARIK LNPDKCVFGV PAGKLLGFLV SARGIEANPE
KIRAIERMRP PSKLRDVQCV TGCMAALSRF ISRLGEKALP LFKLLKRSGP FTWTEEAENA
LTQLKAYLSS PPVLVAPEPD EPLLLYLAAT PQVVSAALVV ERDENNSHFT LPHPVPTWPG
REQGGEAPEP NGGPRPPTTG VGPLPACQTV LGAPDPQEGP KATVGRPHLS PFDPEANPVL
TRPRKEQGEE APEPNGGLRP LTTGVGPLPA CPTTPGAPDP QDGPEATVGR PPLLSSDPEV
ISTEDKCAPR GCLDEECPRD AAPSEEDRPH RKVQRPVYFV SEALRDAKTR YPQAQKMLYA
ILMASRKLRH YFQAHRVTVV TSHPLGQILH NRESTGRVVK WAIELSEFDL HFEPRHAIKS
QALADFVAEW TPAPEPVSIP EASTDPSQLP HTAHWVMQFD GSLSLQGAGA GVTLTSPSGD
VLRYLVRLDF RATNNMAEYE GLLARLRVAA GLGIRRLLVL GDSQLVVNQV CKEYRCSDPQ
MDAYVRQVRR MERHFDGIEL RHVPRRDNMI ADELSRLASS RAQTPPGAFE ERLAQPSARP
DPLGETDAPD RPPRPVGVQA SGPEGSAPNS LRLIAWIAEI QAYLTDKTLP EDREGSERVH
RISKRYVLVE GTLYRRAANG ILLKCIPREQ GVVLLADIHE GECGAHSASR TLVGKAFRQG
FYWPTALNDA VDLVRRCRAC QFHAKQIHQP AQALQTIPLS WPFAVWGLDI LGPFRRAPGG
FEYLYVAIDR FTKWPEAYPV VKIDKHSALK FIKGITARFG VPNRIITDNG TQFTSELFGD
YCEDMGIKLC FASPAHPRSN GQVERANAEI LKGLKTKTFN ILKKHGDSWI EELPAVLWAN
RTTPSRATGE TPFFLVYGAE AVLPSELTLR SPRATMYCEA DQDQLRRDDL DYLEERRRRA
ALRAARYQQS LRRYHQRHVR ARSLCVNDLV LRRVQTRAGL SKLSPMWEGP YRVVGVPRPG
SIRLATGDGT ELPNPWNIEH LRRFYP
//