ID Q7X8X0_ORYSJ Unreviewed; 1831 AA.
AC Q7X8X0;
DT 01-OCT-2003, integrated into UniProtKB/TrEMBL.
DT 05-JUL-2004, sequence version 2.
DT 27-MAR-2024, entry version 101.
DE RecName: Full=RNA-directed DNA polymerase {ECO:0000256|ARBA:ARBA00012493};
DE EC=2.7.7.49 {ECO:0000256|ARBA:ARBA00012493};
GN ORFNames=OSJNBa0060D06.7 {ECO:0000313|EMBL:CAE03541.2},
GN OSJNBb0059K02.25 {ECO:0000313|EMBL:CAE04515.1};
OS Oryza sativa subsp. japonica (Rice).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; Liliopsida; Poales; Poaceae; BOP clade;
OC Oryzoideae; Oryzeae; Oryzinae; Oryza; Oryza sativa.
OX NCBI_TaxID=39947 {ECO:0000313|EMBL:CAE03541.2, ECO:0000313|Proteomes:UP000000763};
RN [1] {ECO:0000313|EMBL:CAE03541.2}
RP NUCLEOTIDE SEQUENCE.
RX PubMed=12447439; DOI=10.1038/nature01183;
RA Feng Q., Zhang Y., Hao P., Wang S., Fu G., Huang Y., Li Y., Zhu J., Liu Y.,
RA Hu X., Jia P., Zhang Y., Zhao Q., Ying K., Yu S., Tang Y., Weng Q.,
RA Zhang L., Lu Y., Mu J., Lu Y., Zhang L.S., Yu Z., Fan D., Liu X., Lu T.,
RA Li C., Wu Y., Sun T., Lei H., Li T., Hu H., Guan J., Wu M., Zhang R.,
RA Zhou B., Chen Z., Chen L., Jin Z., Wang R., Yin H., Cai Z., Ren S., Lv G.,
RA Gu W., Zhu G., Tu Y., Jia J., Zhang Y., Chen J., Kang H., Chen X., Shao C.,
RA Sun Y., Hu Q., Zhang X., Zhang W., Wang L., Ding C., Sheng H., Gu J.,
RA Chen S., Ni L., Zhu F., Chen W., Lan L., Lai Y., Cheng Z., Gu M., Jiang J.,
RA Li J., Hong G., Xue Y., Han B.;
RT "Sequence and analysis of rice chromosome 4.";
RL Nature 420:316-320(2002).
RN [2] {ECO:0000313|Proteomes:UP000000763}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Nipponbare {ECO:0000313|Proteomes:UP000000763};
RX PubMed=16100779; DOI=10.1038/nature03895;
RG International rice genome sequencing project (IRGSP);
RA Matsumoto T., Wu J., Kanamori H., Katayose Y., Fujisawa M., Namiki N.,
RA Mizuno H., Yamamoto K., Antonio B.A., Baba T., Sakata K., Nagamura Y.,
RA Aoki H., Arikawa K., Arita K., Bito T., Chiden Y., Fujitsuka N.,
RA Fukunaka R., Hamada M., Harada C., Hayashi A., Hijishita S., Honda M.,
RA Hosokawa S., Ichikawa Y., Idonuma A., Iijima M., Ikeda M., Ikeno M.,
RA Ito K., Ito S., Ito T., Ito Y., Ito Y., Iwabuchi A., Kamiya K.,
RA Karasawa W., Kurita K., Katagiri S., Kikuta A., Kobayashi H., Kobayashi N.,
RA Machita K., Maehara T., Masukawa M., Mizubayashi T., Mukai Y., Nagasaki H.,
RA Nagata Y., Naito S., Nakashima M., Nakama Y., Nakamichi Y., Nakamura M.,
RA Meguro A., Negishi M., Ohta I., Ohta T., Okamoto M., Ono N., Saji S.,
RA Sakaguchi M., Sakai K., Shibata M., Shimokawa T., Song J., Takazaki Y.,
RA Terasawa K., Tsugane M., Tsuji K., Ueda S., Waki K., Yamagata H.,
RA Yamamoto M., Yamamoto S., Yamane H., Yoshiki S., Yoshihara R., Yukawa K.,
RA Zhong H., Yano M., Yuan Q., Ouyang S., Liu J., Jones K.M., Gansberger K.,
RA Moffat K., Hill J., Bera J., Fadrosh D., Jin S., Johri S., Kim M.,
RA Overton L., Reardon M., Tsitrin T., Vuong H., Weaver B., Ciecko A.,
RA Tallon L., Jackson J., Pai G., Aken S.V., Utterback T., Reidmuller S.,
RA Feldblyum T., Hsiao J., Zismann V., Iobst S., de Vazeille A.R., Buell C.R.,
RA Ying K., Li Y., Lu T., Huang Y., Zhao Q., Feng Q., Zhang L., Zhu J.,
RA Weng Q., Mu J., Lu Y., Fan D., Liu Y., Guan J., Zhang Y., Yu S., Liu X.,
RA Zhang Y., Hong G., Han B., Choisne N., Demange N., Orjeda G., Samain S.,
RA Cattolico L., Pelletier E., Couloux A., Segurens B., Wincker P., D'Hont A.,
RA Scarpelli C., Weissenbach J., Salanoubat M., Quetier F., Yu Y., Kim H.R.,
RA Rambo T., Currie J., Collura K., Luo M., Yang T., Ammiraju J.S.S.,
RA Engler F., Soderlund C., Wing R.A., Palmer L.E., de la Bastide M.,
RA Spiegel L., Nascimento L., Zutavern T., O'Shaughnessy A., Dike S.,
RA Dedhia N., Preston R., Balija V., McCombie W.R., Chow T., Chen H.,
RA Chung M., Chen C., Shaw J., Wu H., Hsiao K., Chao Y., Chu M., Cheng C.,
RA Hour A., Lee P., Lin S., Lin Y., Liou J., Liu S., Hsing Y., Raghuvanshi S.,
RA Mohanty A., Bharti A.K., Gaur A., Gupta V., Kumar D., Ravi V., Vij S.,
RA Kapur A., Khurana P., Khurana P., Khurana J.P., Tyagi A.K., Gaikwad K.,
RA Singh A., Dalal V., Srivastava S., Dixit A., Pal A.K., Ghazi I.A.,
RA Yadav M., Pandit A., Bhargava A., Sureshbabu K., Batra K., Sharma T.R.,
RA Mohapatra T., Singh N.K., Messing J., Nelson A.B., Fuks G., Kavchok S.,
RA Keizer G., Linton E., Llaca V., Song R., Tanyolac B., Young S., Ho-Il K.,
RA Hahn J.H., Sangsakoo G., Vanavichit A., de Mattos Luiz.A.T., Zimmer P.D.,
RA Malone G., Dellagostin O., de Oliveira A.C., Bevan M., Bancroft I.,
RA Minx P., Cordum H., Wilson R., Cheng Z., Jin W., Jiang J., Leong S.A.,
RA Iwama H., Gojobori T., Itoh T., Niimura Y., Fujii Y., Habara T., Sakai H.,
RA Sato Y., Wilson G., Kumar K., McCouch S., Juretic N., Hoen D., Wright S.,
RA Bruskiewich R., Bureau T., Miyao A., Hirochika H., Nishikawa T.,
RA Kadowaki K., Sugiura M., Burr B., Sasaki T.;
RT "The map-based sequence of the rice genome.";
RL Nature 436:793-800(2005).
RN [3] {ECO:0000313|Proteomes:UP000000763}
RP GENOME REANNOTATION.
RC STRAIN=cv. Nipponbare {ECO:0000313|Proteomes:UP000000763};
RX PubMed=18089549; DOI=10.1093/nar/gkm978;
RG The rice annotation project (RAP);
RT "The rice annotation project database (RAP-DB): 2008 update.";
RL Nucleic Acids Res. 36:D1028-D1033(2008).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AL606690; CAE03541.2; -; Genomic_DNA.
DR EMBL; AL606692; CAE04515.1; -; Genomic_DNA.
DR Proteomes; UP000000763; Chromosome 4.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0004523; F:RNA-DNA hybrid ribonuclease activity; IEA:InterPro.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR GO; GO:0006310; P:DNA recombination; IEA:UniProtKB-KW.
DR CDD; cd00303; retropepsin_like; 1.
DR CDD; cd09279; RNase_HI_like; 1.
DR CDD; cd01647; RT_LTR; 1.
DR Gene3D; 3.30.70.270; -; 1.
DR Gene3D; 2.40.70.10; Acid Proteases; 1.
DR Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 2.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR InterPro; IPR005162; Retrotrans_gag_dom.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR002156; RNaseH_domain.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR000477; RT_dom.
DR InterPro; IPR041373; RT_RNaseH.
DR PANTHER; PTHR37984:SF7; INTEGRASE CATALYTIC DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR37984; PROTEIN CBG26694; 1.
DR Pfam; PF03732; Retrotrans_gag; 1.
DR Pfam; PF17917; RT_RNaseH; 1.
DR Pfam; PF00665; rve; 1.
DR Pfam; PF00078; RVT_1; 1.
DR Pfam; PF13456; RVT_3; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 2.
DR PROSITE; PS50994; INTEGRASE; 1.
PE 4: Predicted;
KW DNA recombination {ECO:0000256|ARBA:ARBA00023172};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723}.
FT DOMAIN 1543..1705
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT REGION 1..79
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 117..218
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 399..475
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 500..540
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1115..1273
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 121..143
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 166..193
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 450..467
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 500..536
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1247..1273
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1831 AA; 201264 MW; D85314807182EE88 CRC64;
MAEQAKAHDL SPSVSGDDGE PNPRHRARTP PLPPRQSPKR GEALERVEKS AASPVAGGAG
ERRDGERRLL VYGDGSTPQG ALQAVGALLR HPPVVPDPES PAQRWLDDVA NLVMTAQQRL
GAGGRSATTK TSGVATTGSV SSRRRARRAA AAARHSATTP SSAPPTREDQ HEEPDARLDI
ERRRSDRRAP RATEGASSSR VSPRHGREDQ PSVPPVGGVG CRAFVASLRN VRWPPRFRPT
ITEKYDGSVN PTEFLQVYTT GIEAARGDDR VMANFFPMAL KGQARGWLMN LPPASVHSWE
DLCQQFTMNF QGTYPRPGEE ADLHAVQRGD DESLRSYIQR FCQVRNTIPC IPAHAVIYAF
RGGVRHNRML EKIASKEPQT TAELFQLADR VARKEEAWTW NPSGSGVAAS AAPGSAAQTG
RRDRRRKKRS VHSGDEGHVL AVEGAPRATR KGRPASDKKK EAGTPSRERP ASKWCSVHNT
SLHDLADCRA VKNLAERTRK WEEDRRQERR EGKSAAVPSG KRRSEARQKA PAVDIDDGDD
DLGFQEPGAT IATVDGGACA HVSRRSFKAM RRELLAAAPT HEATRRARWS EVALTFDQTD
HPPCVARGGQ IAMVVSPTIC NVKLGRVLID GGAALNILSP AAFDAIKAPG MVLQPSQPII
GVTPGHTWPL GHIDLPVTFG GSANFRTERV NFDVADLSLP YNAVLGRPAL VKFMAAVHYA
YLQMKMPGPG GPISVRGDLK VALACMEQRA DHLAAATKPE GGDERLGTSA PTAPRQRIAT
CDEVPEDALV SFLRANADVF AWRPADMPGV PREVIEHRLA VRPGARPVRQ KVRRQAPERQ
AFIREEVARL LEAGFIREVI HPEWLANPVV VPKANGKLRM CIDYTDLNKA CPKDPYPLPR
IDQIVDSTAG CDLLCFLDAY SGYHQIRMAR EDEEKTAFIT PVGTYCYTSM PFGLKNAGPT
FQRTTRISLG SQIGRNVEAY VDDLVVKTRN QETLLSDLAE TFENLRSARI KLNPDNKLRD
VQCVTGCMAA LSRFISRLGE KALPLFKLLK RSGPFTWTEE AEHALTQLKA YLSSPPVLVA
PEPNEPLLLY LAATPQVVSA ALVVERDEDN PHFVHPHSVL TWPGSERGGE APESDGGLRP
LTTGVGPLPA CQTVLGAPDP QEGPKATAGR PHLTPSDPEA NPVLTRPGKE QGEEAPESNG
GQRPLTTGVG PLPACPTMPG APDPQDGPEA TEGRPPLSSS DPEVIGTEDK CAPKGHLNEE
RPGDTAPSRE DRPHRKVQWP VYFVSEALRD AKTRYPQAQK MLYAILMASR KLRHYFQAHR
VTVVTSYPLG QILHNREGTG RVVKWAIELS EFDLHFEPRH AIKSQALADF VAEWTPAPET
VSIPEASTNP SQLPHTTHWV MQFDGSLSLQ GAGAGVTLTS PTGDVLRYLV RLDFRATNNM
AEYEGLLAGL RVAAGLGIRR LLVLGDSQLV VNQEQGVELL ADIHAGECGA HSASRTLVGK
AFRQGFYWPT ALNDAVDLVR RCRACQFHAK QIHQPAQALQ IIPLSWPFAV WGLDILGPFK
RAPGGFEYLY VAIDKFTKWP EAYPVVKIDK HSALKFIKGI TARFGVPNRI ITDNGTQFTS
ELFGDYCEDM GIKLCFASPA HPRSNGQVER ANAEILKGLK TKTFNILKKH GDSWIEELPA
VLWANRTTPS RATGETPFFL VYGAEAVLPS ELTLRSPRAT MYCEADQDQL RRDDLDYLEE
RRRRAALRAA RYQQSLRRYH QRHVRARSLC VDDLVLRRVQ TRAGLSKLSP MWEGPYRVIG
VPRPGSVRLA TGDSTKLPNP WNIEHLRRFY P
//