ID Q7XPH3_ORYSJ Unreviewed; 1626 AA.
AC Q7XPH3;
DT 01-OCT-2003, integrated into UniProtKB/TrEMBL.
DT 15-FEB-2005, sequence version 3.
DT 27-MAR-2024, entry version 107.
DE RecName: Full=RNA-directed DNA polymerase {ECO:0000256|ARBA:ARBA00012493};
DE EC=2.7.7.49 {ECO:0000256|ARBA:ARBA00012493};
GN Name=OSJNBb0003B01.10 {ECO:0000313|EMBL:CAE03619.3};
OS Oryza sativa subsp. japonica (Rice).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; Liliopsida; Poales; Poaceae; BOP clade;
OC Oryzoideae; Oryzeae; Oryzinae; Oryza; Oryza sativa.
OX NCBI_TaxID=39947 {ECO:0000313|EMBL:CAE03619.3, ECO:0000313|Proteomes:UP000000763};
RN [1] {ECO:0000313|Proteomes:UP000000763}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Nipponbare {ECO:0000313|Proteomes:UP000000763};
RX PubMed=16100779; DOI=10.1038/nature03895;
RG International rice genome sequencing project (IRGSP);
RA Matsumoto T., Wu J., Kanamori H., Katayose Y., Fujisawa M., Namiki N.,
RA Mizuno H., Yamamoto K., Antonio B.A., Baba T., Sakata K., Nagamura Y.,
RA Aoki H., Arikawa K., Arita K., Bito T., Chiden Y., Fujitsuka N.,
RA Fukunaka R., Hamada M., Harada C., Hayashi A., Hijishita S., Honda M.,
RA Hosokawa S., Ichikawa Y., Idonuma A., Iijima M., Ikeda M., Ikeno M.,
RA Ito K., Ito S., Ito T., Ito Y., Ito Y., Iwabuchi A., Kamiya K.,
RA Karasawa W., Kurita K., Katagiri S., Kikuta A., Kobayashi H., Kobayashi N.,
RA Machita K., Maehara T., Masukawa M., Mizubayashi T., Mukai Y., Nagasaki H.,
RA Nagata Y., Naito S., Nakashima M., Nakama Y., Nakamichi Y., Nakamura M.,
RA Meguro A., Negishi M., Ohta I., Ohta T., Okamoto M., Ono N., Saji S.,
RA Sakaguchi M., Sakai K., Shibata M., Shimokawa T., Song J., Takazaki Y.,
RA Terasawa K., Tsugane M., Tsuji K., Ueda S., Waki K., Yamagata H.,
RA Yamamoto M., Yamamoto S., Yamane H., Yoshiki S., Yoshihara R., Yukawa K.,
RA Zhong H., Yano M., Yuan Q., Ouyang S., Liu J., Jones K.M., Gansberger K.,
RA Moffat K., Hill J., Bera J., Fadrosh D., Jin S., Johri S., Kim M.,
RA Overton L., Reardon M., Tsitrin T., Vuong H., Weaver B., Ciecko A.,
RA Tallon L., Jackson J., Pai G., Aken S.V., Utterback T., Reidmuller S.,
RA Feldblyum T., Hsiao J., Zismann V., Iobst S., de Vazeille A.R., Buell C.R.,
RA Ying K., Li Y., Lu T., Huang Y., Zhao Q., Feng Q., Zhang L., Zhu J.,
RA Weng Q., Mu J., Lu Y., Fan D., Liu Y., Guan J., Zhang Y., Yu S., Liu X.,
RA Zhang Y., Hong G., Han B., Choisne N., Demange N., Orjeda G., Samain S.,
RA Cattolico L., Pelletier E., Couloux A., Segurens B., Wincker P., D'Hont A.,
RA Scarpelli C., Weissenbach J., Salanoubat M., Quetier F., Yu Y., Kim H.R.,
RA Rambo T., Currie J., Collura K., Luo M., Yang T., Ammiraju J.S.S.,
RA Engler F., Soderlund C., Wing R.A., Palmer L.E., de la Bastide M.,
RA Spiegel L., Nascimento L., Zutavern T., O'Shaughnessy A., Dike S.,
RA Dedhia N., Preston R., Balija V., McCombie W.R., Chow T., Chen H.,
RA Chung M., Chen C., Shaw J., Wu H., Hsiao K., Chao Y., Chu M., Cheng C.,
RA Hour A., Lee P., Lin S., Lin Y., Liou J., Liu S., Hsing Y., Raghuvanshi S.,
RA Mohanty A., Bharti A.K., Gaur A., Gupta V., Kumar D., Ravi V., Vij S.,
RA Kapur A., Khurana P., Khurana P., Khurana J.P., Tyagi A.K., Gaikwad K.,
RA Singh A., Dalal V., Srivastava S., Dixit A., Pal A.K., Ghazi I.A.,
RA Yadav M., Pandit A., Bhargava A., Sureshbabu K., Batra K., Sharma T.R.,
RA Mohapatra T., Singh N.K., Messing J., Nelson A.B., Fuks G., Kavchok S.,
RA Keizer G., Linton E., Llaca V., Song R., Tanyolac B., Young S., Ho-Il K.,
RA Hahn J.H., Sangsakoo G., Vanavichit A., de Mattos Luiz.A.T., Zimmer P.D.,
RA Malone G., Dellagostin O., de Oliveira A.C., Bevan M., Bancroft I.,
RA Minx P., Cordum H., Wilson R., Cheng Z., Jin W., Jiang J., Leong S.A.,
RA Iwama H., Gojobori T., Itoh T., Niimura Y., Fujii Y., Habara T., Sakai H.,
RA Sato Y., Wilson G., Kumar K., McCouch S., Juretic N., Hoen D., Wright S.,
RA Bruskiewich R., Bureau T., Miyao A., Hirochika H., Nishikawa T.,
RA Kadowaki K., Sugiura M., Burr B., Sasaki T.;
RT "The map-based sequence of the rice genome.";
RL Nature 436:793-800(2005).
RN [2] {ECO:0000313|Proteomes:UP000000763}
RP GENOME REANNOTATION.
RC STRAIN=cv. Nipponbare {ECO:0000313|Proteomes:UP000000763};
RX PubMed=18089549; DOI=10.1093/nar/gkm978;
RG The rice annotation project (RAP);
RT "The rice annotation project database (RAP-DB): 2008 update.";
RL Nucleic Acids Res. 36:D1028-D1033(2008).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AL606649; CAE03619.3; -; Genomic_DNA.
DR Proteomes; UP000000763; Chromosome 4.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR CDD; cd09274; RNase_HI_RT_Ty3; 1.
DR CDD; cd01647; RT_LTR; 1.
DR Gene3D; 1.10.340.70; -; 1.
DR Gene3D; 3.30.70.270; -; 2.
DR Gene3D; 2.40.70.10; Acid Proteases; 1.
DR Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR Gene3D; 4.10.60.10; Zinc finger, CCHC-type; 1.
DR InterPro; IPR016197; Chromo-like_dom_sf.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR041588; Integrase_H2C2.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR InterPro; IPR005162; Retrotrans_gag_dom.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR000477; RT_dom.
DR InterPro; IPR041373; RT_RNaseH.
DR InterPro; IPR001878; Znf_CCHC.
DR InterPro; IPR036875; Znf_CCHC_sf.
DR PANTHER; PTHR24559:SF434; RNA-DIRECTED DNA POLYMERASE HOMOLOG; 1.
DR PANTHER; PTHR24559; TRANSPOSON TY3-I GAG-POL POLYPROTEIN; 1.
DR Pfam; PF17921; Integrase_H2C2; 1.
DR Pfam; PF03732; Retrotrans_gag; 1.
DR Pfam; PF17917; RT_RNaseH; 1.
DR Pfam; PF08284; RVP_2; 1.
DR Pfam; PF00078; RVT_1; 1.
DR Pfam; PF00098; zf-CCHC; 1.
DR SMART; SM00343; ZnF_C2HC; 1.
DR SUPFAM; SSF50630; Acid proteases; 1.
DR SUPFAM; SSF54160; Chromo domain-like; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF57756; Retrovirus zinc finger-like domains; 1.
DR PROSITE; PS50878; RT_POL; 1.
DR PROSITE; PS50158; ZF_CCHC; 1.
PE 4: Predicted;
KW Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00047};
KW Zinc {ECO:0000256|PROSITE-ProRule:PRU00047};
KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00047}.
FT DOMAIN 644..659
FT /note="CCHC-type"
FT /evidence="ECO:0000259|PROSITE:PS50158"
FT DOMAIN 876..1055
FT /note="Reverse transcriptase"
FT /evidence="ECO:0000259|PROSITE:PS50878"
FT REGION 210..252
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 298..331
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 546..639
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 236..252
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 546..560
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 570..611
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1626 AA; 184300 MW; A11B20411241915A CRC64;
MDKNDRFVTG LTLLMSSAHS HVPTTRLPPV EVEMAGHPPN HRGEGMIDFV AELARMTLVV
GYPNAPEYTT IPPLSGELPH RVRLEVHGYV GTCLANMAVE ASGGTADHAC QEAAYLMMAR
LRERHNYIFH DTAYRYHPRR ANGDGVSSFR PTAGENDTTF GHMCAVMRGL DRMHSDLHKA
TKALNDGKLM RIVALKDEIA RLKKENAQLK GLPAPGGVRI RTTPRKTTTA PVRIQLAPRN
PPPPAAPAAP PVPPAVPAAP VPAPAFTLSF APASAARGPA SGTGGCLFGM VYTRNGSRAT
GEGSNGEERA DGVHPNSDSG NGPPPLPENP TLAQVMAHQT QMMAAMMQQM QQQHQQMHQR
MMQHAEQQHQ QFGPPPPQSK LPEFLRVRPP TFSSTTNPME ANDWLHAIEK KLNLLQCNDQ
EKVAFATHQL QGPASAWWDN HMATRPPGTE VTWAEFCRSF RKAQVPDGVV AQKKREFRAL
HQGNRTVTEY LHEFNRLARY APEDVRTDAE KQEKFMAGLD DELTNQLISG DYADFERLVD
KAIRQEDQRN KMDRKRKAAQ FRAPQGSHQR PRFTPGQQGG PTTMIIRQHR PFNPSNFHQG
TSGSQNHHGG QPNRGAAPRP SVAPAQSGQP AQAKKETGAK PGSCFNCGEL GHFADKCPKP
RRAGPRFIQA RVNHASAEEA QAAPEVVLEI CGEVWVIVNM KESLVSTPMR VHTPGNSSTS
VSFSPSVLIE IQRSPFLANL ILLESKDLDV ILGMDWLTKF KGVIDCASRT VTLTNEKGET
VVYKSLVSPK KGVSLNQIET EIPVDTVEKN LRKLEDIPIV CEYPEVFPED LTTMPPKREI
EFRIDLAPGT APIYKRPYRM AANELAEVKK QVDEQLQKGY IRPSTSPWGA PVIFVEKKDK
TKRMCVDYRA LNEVTIKNKY PLPRIDDLFD QLKGAKVFSK IDLRSGYHQL RIREEDIPKT
AFTTRYGLYE CTVMSFGLTN APAFFMNLMN KVFMEFLDKF VVVFIDDILI YSKSEEEHEQ
HLRLVLEKLK EHQLYAKFSK CDFWLTEVKF LGHVITAQGV AVDPSNVESV TKWTPPKTVS
QIRSFLGLAG YYRRFIENFS RIARPMTQLL KKDEKFKWTA ECDKSFEELK KKLVSAPVLI
LPDPTKDFQV YCDASRHGLG CVLMQEGRVV AYASRQLRPH EGNYPTHDLE LAAVVHALKI
WRHYLIGNRC EVYTDHKSLK YIFTQPDLNL RQRRWLELIK DYDMSIHYHP GKANVVADAL
SRKSYCTALC IEGMCEELRQ EFEHLNMGIV EHGFVAALEA RPTLVDQVRA AQVNDSEIAE
LKKNMRVGKA RDFHEDEHGT IWMGERLCVP DDKESKDLIL TEAHQTQYSI HPGSTKMYQD
LKEKFWWLVD KSLPYAEFSY NNSYQASLQM APFEALYGRK CRTPLFWDQT GERQLFGTEV
LAEAEEKVRI IRERLRIAQS RQKSYADNRR RELTFEAGDY VYLRVTPLRG VHRFQTKGKL
APRFVGPYKI LERRGEVAYQ LELPSNMIGI HDVFHVSQLK KCLRVPEEQA DSEHIDIQED
LTYVEKPVRI LDTSERRTRN KVTRFCRVQW SHHSEEEATW EREDELKAAH PHLFTSSSES
RGRDSV
//