ID Q6F2P7_ORYSJ Unreviewed; 1992 AA.
AC Q6F2P7;
DT 16-AUG-2004, integrated into UniProtKB/TrEMBL.
DT 16-AUG-2004, sequence version 1.
DT 27-MAR-2024, entry version 84.
DE SubName: Full=Polyprotein {ECO:0000313|EMBL:AAT73678.1};
GN Name=OSJNBa0075J06.3 {ECO:0000313|EMBL:AAT73678.1};
OS Oryza sativa subsp. japonica (Rice).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; Liliopsida; Poales; Poaceae; BOP clade;
OC Oryzoideae; Oryzeae; Oryzinae; Oryza; Oryza sativa.
OX NCBI_TaxID=39947 {ECO:0000313|EMBL:AAT73678.1, ECO:0000313|Proteomes:UP000000763};
RN [1] {ECO:0000313|Proteomes:UP000000763}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Nipponbare {ECO:0000313|Proteomes:UP000000763};
RX PubMed=16100779; DOI=10.1038/nature03895;
RG International rice genome sequencing project (IRGSP);
RA Matsumoto T., Wu J., Kanamori H., Katayose Y., Fujisawa M., Namiki N.,
RA Mizuno H., Yamamoto K., Antonio B.A., Baba T., Sakata K., Nagamura Y.,
RA Aoki H., Arikawa K., Arita K., Bito T., Chiden Y., Fujitsuka N.,
RA Fukunaka R., Hamada M., Harada C., Hayashi A., Hijishita S., Honda M.,
RA Hosokawa S., Ichikawa Y., Idonuma A., Iijima M., Ikeda M., Ikeno M.,
RA Ito K., Ito S., Ito T., Ito Y., Ito Y., Iwabuchi A., Kamiya K.,
RA Karasawa W., Kurita K., Katagiri S., Kikuta A., Kobayashi H., Kobayashi N.,
RA Machita K., Maehara T., Masukawa M., Mizubayashi T., Mukai Y., Nagasaki H.,
RA Nagata Y., Naito S., Nakashima M., Nakama Y., Nakamichi Y., Nakamura M.,
RA Meguro A., Negishi M., Ohta I., Ohta T., Okamoto M., Ono N., Saji S.,
RA Sakaguchi M., Sakai K., Shibata M., Shimokawa T., Song J., Takazaki Y.,
RA Terasawa K., Tsugane M., Tsuji K., Ueda S., Waki K., Yamagata H.,
RA Yamamoto M., Yamamoto S., Yamane H., Yoshiki S., Yoshihara R., Yukawa K.,
RA Zhong H., Yano M., Yuan Q., Ouyang S., Liu J., Jones K.M., Gansberger K.,
RA Moffat K., Hill J., Bera J., Fadrosh D., Jin S., Johri S., Kim M.,
RA Overton L., Reardon M., Tsitrin T., Vuong H., Weaver B., Ciecko A.,
RA Tallon L., Jackson J., Pai G., Aken S.V., Utterback T., Reidmuller S.,
RA Feldblyum T., Hsiao J., Zismann V., Iobst S., de Vazeille A.R., Buell C.R.,
RA Ying K., Li Y., Lu T., Huang Y., Zhao Q., Feng Q., Zhang L., Zhu J.,
RA Weng Q., Mu J., Lu Y., Fan D., Liu Y., Guan J., Zhang Y., Yu S., Liu X.,
RA Zhang Y., Hong G., Han B., Choisne N., Demange N., Orjeda G., Samain S.,
RA Cattolico L., Pelletier E., Couloux A., Segurens B., Wincker P., D'Hont A.,
RA Scarpelli C., Weissenbach J., Salanoubat M., Quetier F., Yu Y., Kim H.R.,
RA Rambo T., Currie J., Collura K., Luo M., Yang T., Ammiraju J.S.S.,
RA Engler F., Soderlund C., Wing R.A., Palmer L.E., de la Bastide M.,
RA Spiegel L., Nascimento L., Zutavern T., O'Shaughnessy A., Dike S.,
RA Dedhia N., Preston R., Balija V., McCombie W.R., Chow T., Chen H.,
RA Chung M., Chen C., Shaw J., Wu H., Hsiao K., Chao Y., Chu M., Cheng C.,
RA Hour A., Lee P., Lin S., Lin Y., Liou J., Liu S., Hsing Y., Raghuvanshi S.,
RA Mohanty A., Bharti A.K., Gaur A., Gupta V., Kumar D., Ravi V., Vij S.,
RA Kapur A., Khurana P., Khurana P., Khurana J.P., Tyagi A.K., Gaikwad K.,
RA Singh A., Dalal V., Srivastava S., Dixit A., Pal A.K., Ghazi I.A.,
RA Yadav M., Pandit A., Bhargava A., Sureshbabu K., Batra K., Sharma T.R.,
RA Mohapatra T., Singh N.K., Messing J., Nelson A.B., Fuks G., Kavchok S.,
RA Keizer G., Linton E., Llaca V., Song R., Tanyolac B., Young S., Ho-Il K.,
RA Hahn J.H., Sangsakoo G., Vanavichit A., de Mattos Luiz.A.T., Zimmer P.D.,
RA Malone G., Dellagostin O., de Oliveira A.C., Bevan M., Bancroft I.,
RA Minx P., Cordum H., Wilson R., Cheng Z., Jin W., Jiang J., Leong S.A.,
RA Iwama H., Gojobori T., Itoh T., Niimura Y., Fujii Y., Habara T., Sakai H.,
RA Sato Y., Wilson G., Kumar K., McCouch S., Juretic N., Hoen D., Wright S.,
RA Bruskiewich R., Bureau T., Miyao A., Hirochika H., Nishikawa T.,
RA Kadowaki K., Sugiura M., Burr B., Sasaki T.;
RT "The map-based sequence of the rice genome.";
RL Nature 436:793-800(2005).
RN [2] {ECO:0000313|Proteomes:UP000000763}
RP GENOME REANNOTATION.
RC STRAIN=cv. Nipponbare {ECO:0000313|Proteomes:UP000000763};
RX PubMed=18089549; DOI=10.1093/nar/gkm978;
RG The rice annotation project (RAP);
RT "The rice annotation project database (RAP-DB): 2008 update.";
RL Nucleic Acids Res. 36:D1028-D1033(2008).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AC136220; AAT73678.1; -; Genomic_DNA.
DR Proteomes; UP000000763; Chromosome 5.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0004523; F:RNA-DNA hybrid ribonuclease activity; IEA:InterPro.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR GO; GO:0006310; P:DNA recombination; IEA:UniProtKB-KW.
DR CDD; cd00303; retropepsin_like; 1.
DR CDD; cd09279; RNase_HI_like; 1.
DR CDD; cd01647; RT_LTR; 1.
DR Gene3D; 1.10.340.70; -; 1.
DR Gene3D; 3.30.70.270; -; 2.
DR Gene3D; 2.40.70.10; Acid Proteases; 1.
DR Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 2.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR InterPro; IPR005162; Retrotrans_gag_dom.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR002156; RNaseH_domain.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR000477; RT_dom.
DR InterPro; IPR041577; RT_RNaseH_2.
DR PANTHER; PTHR37984:SF7; INTEGRASE CATALYTIC DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR37984; PROTEIN CBG26694; 1.
DR Pfam; PF03732; Retrotrans_gag; 1.
DR Pfam; PF17919; RT_RNaseH_2; 1.
DR Pfam; PF00665; rve; 1.
DR Pfam; PF00078; RVT_1; 1.
DR Pfam; PF13456; RVT_3; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 2.
DR PROSITE; PS50994; INTEGRASE; 1.
DR PROSITE; PS50879; RNASE_H_1; 1.
PE 4: Predicted;
KW DNA recombination {ECO:0000256|ARBA:ARBA00023172};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723}.
FT DOMAIN 1414..1543
FT /note="RNase H type-1"
FT /evidence="ECO:0000259|PROSITE:PS50879"
FT DOMAIN 1700..1878
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT REGION 72..126
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 202..261
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 318..409
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 586..639
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 674..700
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 924..957
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 72..86
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 104..126
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 352..409
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 675..700
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1992 AA; 225399 MW; 3314172ED97C61E5 CRC64;
MGFVSGIDDL VFPPGQTFRF GSLDFVTNNF GKISLLDSES DQSGENRISV PFGFPNAAES
YFKTISTESA SNHSAEIASS PTTPDQDDGA YPPVLMKLPD DLAAVSTTTA SSSRRPRRKS
ASTPISPSFR EVGVILQPLG TVSTDQLDDY NSSPTVDSRP TDIVEYDEFS SRYNDFDDFD
GSLDDNYTPL YFGVFMANNE TEEQRQARET EERRRLEEEH QAQEQERLRR EQQDRERTAK
EAEDRRQRAL EAGRRARDLI GQQDVEGTPV FRTPQQNAVA AITLLDTLLN QNELNQSDHV
VNILNQTKTM IAASVPVNSE SVHTPTGSRV PHFRSHDYRH PSLSITGAGS NRRSRGHDER
SVHSPPDHHR ERRVERPRSP PRRRPVDLRD TIIQRRAARG YHHSPDRHDD DLDGVAAFTD
DLRRVDWPAG FKPTGIEKYD GTTNPESWLT VYGLAIRAAG GDSKAMANYL PVALADSARS
WLHGLPRGTI GSWAELRDHF IANFQGTFER PGTQYDLYNV IQKSGESLRD YIRRFSEQRN
KISDITDDVI IAAFTKGIRH EELVGKFGRK PPRTVKLMFE KANEYAKAED AQSGPSWKPK
KDTPASGGGG SNNHKDRKRK PAELVATASH SSRQRSRVNT FDKIMNSQCP HHPNSNHVAK
DCFVYKQFAD QYTKTTRKNP DEEQSTSRKK DDGDTPAGFQ DHRKELNHIF GGPLAYESKR
KQKLTEREIN AVQPDMPQYL RWSEIAIKFD RSDHPDRVVH PGRYPLVLDP VVRNVKLRRT
LIDGGSALNI LFAKTLDDMQ IPRSELKPSN APFHGVIPGL SATPLGQITL PVTFGTRENF
RTENISFEVA DFETAYHAIL GRPALAKFMA VPHYTYMMMK MPGPRGVLSL RSDIKQAVTC
DKESCDMAQT REMASAREDI RLAAATASEG EVPATKTSKS GESEAKTKKI PLDPSDPTKT
AVIGAEEVIE HSLHVKEDAK PIKQRLRRFA QDRKDAIKEE LTKLLAAGFI KEVLHPDWLA
NPVLVRKKTG QWRMCVDYTD LNKSCPKDPF GLPRIDQVVD STAGCELLSF LDCYSGYHQI
RLKESDCLKT SFITPFGAYC YVTMPFGLKN AGATYQRMIQ RCFSTQIGRN VEAYVDDVVV
KTKQKDDLIS DLEETFASIR AFRMKLNPEK CTFGVPSGKL LGFMVSHRGI QANPEKVTAI
LNMKPPSTQK DVQKLTGCMA ALSRFVSRLG ERGMPFFKLL KKTDDFQWGP EAQKAFEDFK
KLLTEPPVLA SPHPQEPLLL YVSATSQVVS TVLVVEREEE GHVQKVQRPI YFVSEVLSDS
KTRYPQVQKL LYGILITTRK LSHYFQGHSV TVVTSFPLGD ILHNREANGR IAKWALELMS
LDISFKPRIS IKSQALADFV AEWTECQEDT HAEKMEHWTM HFDGSKRLSG TGAGVVLISP
AGERLSYVLW IHFSASHNVA EYEALLHGLR IAISLGIKRL IVRGDSQLVV NQVMKEWSCL
DDNMMAYRQE VRKLEDKFDG LELSHVLRHN NEAADRLANF GSKREVAPSD VFVEHLYTPT
VPHKDTTQVA GTHDVAMIEA DWREPLIRFL TSQELPQDKD EAERISRRSK LYVMHEAELY
KRSPSGILQR CVSLEEGRQL LKDIHSGICG NHAAARTIVG KAYRQGFFWP TAVSDADKIV
RTCEGCQFFA RQIHLPAQEL QTIPLSWPFA VWGLDMVGPF KKAAGGYTHL FVAIDKFSKW
IEAKPVVTIT ADNARDFFIN IVHRFGVPNR IITDNGRQFT GGVFKDFCED FGIKICYASV
AHPMSNGQVE RANGMILQGI KARVFDRLKP YAGKWVQQLP SVLWSLRTTP SRATGQSLFF
LVYGAEAMLP SEVEFESLRF RNFREERYEE DRVDDLHRLE EVREAALIQS ARYLQGLRRY
HNRNVRSRAF LVGDLVLRKI QTTRDRHKLS PLWEGPFIIS EVTRPGSYRL KREDSTIVDN
SWNIEHLRRF YA
//