ID Q5W6X8_ORYSJ Unreviewed; 1688 AA.
AC Q5W6X8;
DT 07-DEC-2004, integrated into UniProtKB/TrEMBL.
DT 07-DEC-2004, sequence version 1.
DT 27-MAR-2024, entry version 86.
DE RecName: Full=RNA-directed DNA polymerase {ECO:0000256|ARBA:ARBA00012493};
DE EC=2.7.7.49 {ECO:0000256|ARBA:ARBA00012493};
GN ORFNames=OSJNBa0036C12.19 {ECO:0000313|EMBL:AAV43949.1};
OS Oryza sativa subsp. japonica (Rice).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; Liliopsida; Poales; Poaceae; BOP clade;
OC Oryzoideae; Oryzeae; Oryzinae; Oryza; Oryza sativa.
OX NCBI_TaxID=39947 {ECO:0000313|EMBL:AAV43949.1, ECO:0000313|Proteomes:UP000000763};
RN [1] {ECO:0000313|Proteomes:UP000000763}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Nipponbare {ECO:0000313|Proteomes:UP000000763};
RX PubMed=16100779; DOI=10.1038/nature03895;
RG International rice genome sequencing project (IRGSP);
RA Matsumoto T., Wu J., Kanamori H., Katayose Y., Fujisawa M., Namiki N.,
RA Mizuno H., Yamamoto K., Antonio B.A., Baba T., Sakata K., Nagamura Y.,
RA Aoki H., Arikawa K., Arita K., Bito T., Chiden Y., Fujitsuka N.,
RA Fukunaka R., Hamada M., Harada C., Hayashi A., Hijishita S., Honda M.,
RA Hosokawa S., Ichikawa Y., Idonuma A., Iijima M., Ikeda M., Ikeno M.,
RA Ito K., Ito S., Ito T., Ito Y., Ito Y., Iwabuchi A., Kamiya K.,
RA Karasawa W., Kurita K., Katagiri S., Kikuta A., Kobayashi H., Kobayashi N.,
RA Machita K., Maehara T., Masukawa M., Mizubayashi T., Mukai Y., Nagasaki H.,
RA Nagata Y., Naito S., Nakashima M., Nakama Y., Nakamichi Y., Nakamura M.,
RA Meguro A., Negishi M., Ohta I., Ohta T., Okamoto M., Ono N., Saji S.,
RA Sakaguchi M., Sakai K., Shibata M., Shimokawa T., Song J., Takazaki Y.,
RA Terasawa K., Tsugane M., Tsuji K., Ueda S., Waki K., Yamagata H.,
RA Yamamoto M., Yamamoto S., Yamane H., Yoshiki S., Yoshihara R., Yukawa K.,
RA Zhong H., Yano M., Yuan Q., Ouyang S., Liu J., Jones K.M., Gansberger K.,
RA Moffat K., Hill J., Bera J., Fadrosh D., Jin S., Johri S., Kim M.,
RA Overton L., Reardon M., Tsitrin T., Vuong H., Weaver B., Ciecko A.,
RA Tallon L., Jackson J., Pai G., Aken S.V., Utterback T., Reidmuller S.,
RA Feldblyum T., Hsiao J., Zismann V., Iobst S., de Vazeille A.R., Buell C.R.,
RA Ying K., Li Y., Lu T., Huang Y., Zhao Q., Feng Q., Zhang L., Zhu J.,
RA Weng Q., Mu J., Lu Y., Fan D., Liu Y., Guan J., Zhang Y., Yu S., Liu X.,
RA Zhang Y., Hong G., Han B., Choisne N., Demange N., Orjeda G., Samain S.,
RA Cattolico L., Pelletier E., Couloux A., Segurens B., Wincker P., D'Hont A.,
RA Scarpelli C., Weissenbach J., Salanoubat M., Quetier F., Yu Y., Kim H.R.,
RA Rambo T., Currie J., Collura K., Luo M., Yang T., Ammiraju J.S.S.,
RA Engler F., Soderlund C., Wing R.A., Palmer L.E., de la Bastide M.,
RA Spiegel L., Nascimento L., Zutavern T., O'Shaughnessy A., Dike S.,
RA Dedhia N., Preston R., Balija V., McCombie W.R., Chow T., Chen H.,
RA Chung M., Chen C., Shaw J., Wu H., Hsiao K., Chao Y., Chu M., Cheng C.,
RA Hour A., Lee P., Lin S., Lin Y., Liou J., Liu S., Hsing Y., Raghuvanshi S.,
RA Mohanty A., Bharti A.K., Gaur A., Gupta V., Kumar D., Ravi V., Vij S.,
RA Kapur A., Khurana P., Khurana P., Khurana J.P., Tyagi A.K., Gaikwad K.,
RA Singh A., Dalal V., Srivastava S., Dixit A., Pal A.K., Ghazi I.A.,
RA Yadav M., Pandit A., Bhargava A., Sureshbabu K., Batra K., Sharma T.R.,
RA Mohapatra T., Singh N.K., Messing J., Nelson A.B., Fuks G., Kavchok S.,
RA Keizer G., Linton E., Llaca V., Song R., Tanyolac B., Young S., Ho-Il K.,
RA Hahn J.H., Sangsakoo G., Vanavichit A., de Mattos Luiz.A.T., Zimmer P.D.,
RA Malone G., Dellagostin O., de Oliveira A.C., Bevan M., Bancroft I.,
RA Minx P., Cordum H., Wilson R., Cheng Z., Jin W., Jiang J., Leong S.A.,
RA Iwama H., Gojobori T., Itoh T., Niimura Y., Fujii Y., Habara T., Sakai H.,
RA Sato Y., Wilson G., Kumar K., McCouch S., Juretic N., Hoen D., Wright S.,
RA Bruskiewich R., Bureau T., Miyao A., Hirochika H., Nishikawa T.,
RA Kadowaki K., Sugiura M., Burr B., Sasaki T.;
RT "The map-based sequence of the rice genome.";
RL Nature 436:793-800(2005).
RN [2] {ECO:0000313|Proteomes:UP000000763}
RP GENOME REANNOTATION.
RC STRAIN=cv. Nipponbare {ECO:0000313|Proteomes:UP000000763};
RX PubMed=18089549; DOI=10.1093/nar/gkm978;
RG The rice annotation project (RAP);
RT "The rice annotation project database (RAP-DB): 2008 update.";
RL Nucleic Acids Res. 36:D1028-D1033(2008).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AC121363; AAV43949.1; -; Genomic_DNA.
DR Proteomes; UP000000763; Chromosome 5.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0004523; F:RNA-DNA hybrid ribonuclease activity; IEA:InterPro.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR GO; GO:0006310; P:DNA recombination; IEA:UniProtKB-KW.
DR CDD; cd00303; retropepsin_like; 1.
DR CDD; cd09279; RNase_HI_like; 1.
DR CDD; cd01647; RT_LTR; 1.
DR Gene3D; 1.10.340.70; -; 1.
DR Gene3D; 3.30.70.270; -; 1.
DR Gene3D; 2.40.70.10; Acid Proteases; 1.
DR Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 2.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR002156; RNaseH_domain.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR000477; RT_dom.
DR InterPro; IPR041373; RT_RNaseH.
DR PANTHER; PTHR37984:SF7; INTEGRASE CATALYTIC DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR37984; PROTEIN CBG26694; 1.
DR Pfam; PF17917; RT_RNaseH; 1.
DR Pfam; PF00665; rve; 1.
DR Pfam; PF00078; RVT_1; 1.
DR Pfam; PF13456; RVT_3; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 2.
DR PROSITE; PS50994; INTEGRASE; 1.
PE 4: Predicted;
KW DNA recombination {ECO:0000256|ARBA:ARBA00023172};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723}.
FT DOMAIN 1396..1562
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT REGION 1..59
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 167..244
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 268..311
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 882..1068
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 33..59
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 219..236
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 910..932
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 965..979
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 990..1015
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1044..1068
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1688 AA; 185166 MW; 3735FDB718DA136D CRC64;
MAHNLSPSAS GDDGEQNPRR RARTPLSPSR RSLGREEALE EVERSATSPH AGDGEGRRDR
ERRLLVYGEF LPHGPERIGA GLANELATRV GPLLGGFVPA VHHELPGHIS APSPQHHTVH
PCTRGDLRVQ GGVRHNRMLE KIASKEPQTT AELFQLADRV ARKEEAWTWN SSGSGVAAPA
APGPTARSGR RDRRRKKRSA HSDDEGHVLA VEGASRATRK GRPASDKKKE AGAPSRERPT
GKWCTVHNTS LHDLVDCRAV KSLAERTRKW EEERRQERRE GNTPAAPAGN RRGEAKQKAP
AEDINDGDDD LGFQEPEATV ATVDGGACAH ASRRSLKAMK RELLAVAPTH EATRRARWSE
VALTFDQTDH PPCVARGGQI AMVVSPTVCN VKLGRVLIDG GAALNILSSA AFDAIKAPGM
VLRPSQPIIG VTPGHTWPLG HIDLPVTFGG SANFRTERVN FDVADLSLPY NAVLGRPALV
KYMVAVHYAY LQMKMSGPGG PITVHGDLKV ALACMEQRAD HLAAASKPAG GDERLSTSVP
AAPRQRMITC DEVPIKQEDA LVSFLRANAD VFAWRLADMP GVPREVIEHR LAVRPGARPV
RQKVRRQAPE RQAFIREEVA RLLEAGFIRE VIHPEWLANP VVVPKANGKL RMCIDYTDLN
KACPKDPYPL PHIDQIVDST AGCDLLCFLD AYSGYHQIRM AREDEEKTAF ITPIGTYCYT
TMPFGLKNAG PTFQRTTRIS LGSQIGRNVE AYVDDLVVKT RNQETLLSDL AETFESLRTA
RIKLNPDKCV FGVPVGKLLG FLISARGIEA NPEKIRAIER MRPPASLGMC NASLAEAERA
LTQLKAYLSS PPVLVAPEPD EPLLLYLAAT PQVVSAALVV ERDEDDPRSA HPHPMSTRPG
REQGGEAPEP NGGPRPPTTG AGPLPACPTV PGAPDPQDGL GATAGRPRLS PSDPEVVGTE
AECAPRGLSD EEHPVSTRPG REQGGEAPEP NGCPRPPTTG AGPLPACPTV PGAPDPQDGP
GATAGRPRLS PSDPEVVDTE AECAPRGLSD EKRPGDAAPG EEDRPRQKVQ RPVYFVSEAL
RDAKTRYPQA QKMLYAILMA SRKLRHYFQT HRVTVVTSYP LGQILHNREG TGRVVKWAIE
LSEFDLHFEP RHAIKSQALA DFVAEWTPAP EPVSIPEASS GPSQLPHTAY WVMQFDGSLS
LQGAGAGVTL TSPSGDVLKY LVRLDFRATN NMAEYEGLLA GLRVAAGLGI RRLLVLGDSQ
LVVNQVSKEY QCSDPQMDAY VRQAYLADKT LPEDREGSER VQRISKRYVL VEGTLYRHAA
NGVLLECIPR EQGVELLAAI HEGECGAHSA SRTLVGKAFR QGFYWPTALN DAVDLVRQCR
ACQFHAKQTH QPAQALQTIP LSWPFADWGL DILRPFIRAP GGFEYLYVAI DKFTKWPEAY
PVIKIDKHSA LKFIRGITAR FGVPNRIITD NGTQFTSELF GDYCEDMGIK LCFASPAHPR
SNGQVERANA EILKGLKTKT FNILKKHGDS WIEELPAVLW ANRTTPSRAT GETPFFLVYG
AEAVLPSEVT LRSPRATMYC EADQDQLRRD DLDYLEERRR RAAIRAARYQ QSLRRYHQRH
VRARSLCVDD LVLRRVQTRA GLSKLSPMWE GPYRVIGVPR PGSVRLATGD GTELPNPWNI
EHLRRFYP
//