ID Q688H9_ORYSJ Unreviewed; 1528 AA.
AC Q688H9;
DT 11-OCT-2004, integrated into UniProtKB/TrEMBL.
DT 11-OCT-2004, sequence version 1.
DT 27-MAR-2024, entry version 78.
DE SubName: Full=Polyprotein {ECO:0000313|EMBL:AAU10841.1};
GN ORFNames=P0686B10.6 {ECO:0000313|EMBL:AAU10841.1};
OS Oryza sativa subsp. japonica (Rice).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; Liliopsida; Poales; Poaceae; BOP clade;
OC Oryzoideae; Oryzeae; Oryzinae; Oryza; Oryza sativa.
OX NCBI_TaxID=39947 {ECO:0000313|EMBL:AAU10841.1, ECO:0000313|Proteomes:UP000000763};
RN [1] {ECO:0000313|Proteomes:UP000000763}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Nipponbare {ECO:0000313|Proteomes:UP000000763};
RX PubMed=16100779; DOI=10.1038/nature03895;
RG International rice genome sequencing project (IRGSP);
RA Matsumoto T., Wu J., Kanamori H., Katayose Y., Fujisawa M., Namiki N.,
RA Mizuno H., Yamamoto K., Antonio B.A., Baba T., Sakata K., Nagamura Y.,
RA Aoki H., Arikawa K., Arita K., Bito T., Chiden Y., Fujitsuka N.,
RA Fukunaka R., Hamada M., Harada C., Hayashi A., Hijishita S., Honda M.,
RA Hosokawa S., Ichikawa Y., Idonuma A., Iijima M., Ikeda M., Ikeno M.,
RA Ito K., Ito S., Ito T., Ito Y., Ito Y., Iwabuchi A., Kamiya K.,
RA Karasawa W., Kurita K., Katagiri S., Kikuta A., Kobayashi H., Kobayashi N.,
RA Machita K., Maehara T., Masukawa M., Mizubayashi T., Mukai Y., Nagasaki H.,
RA Nagata Y., Naito S., Nakashima M., Nakama Y., Nakamichi Y., Nakamura M.,
RA Meguro A., Negishi M., Ohta I., Ohta T., Okamoto M., Ono N., Saji S.,
RA Sakaguchi M., Sakai K., Shibata M., Shimokawa T., Song J., Takazaki Y.,
RA Terasawa K., Tsugane M., Tsuji K., Ueda S., Waki K., Yamagata H.,
RA Yamamoto M., Yamamoto S., Yamane H., Yoshiki S., Yoshihara R., Yukawa K.,
RA Zhong H., Yano M., Yuan Q., Ouyang S., Liu J., Jones K.M., Gansberger K.,
RA Moffat K., Hill J., Bera J., Fadrosh D., Jin S., Johri S., Kim M.,
RA Overton L., Reardon M., Tsitrin T., Vuong H., Weaver B., Ciecko A.,
RA Tallon L., Jackson J., Pai G., Aken S.V., Utterback T., Reidmuller S.,
RA Feldblyum T., Hsiao J., Zismann V., Iobst S., de Vazeille A.R., Buell C.R.,
RA Ying K., Li Y., Lu T., Huang Y., Zhao Q., Feng Q., Zhang L., Zhu J.,
RA Weng Q., Mu J., Lu Y., Fan D., Liu Y., Guan J., Zhang Y., Yu S., Liu X.,
RA Zhang Y., Hong G., Han B., Choisne N., Demange N., Orjeda G., Samain S.,
RA Cattolico L., Pelletier E., Couloux A., Segurens B., Wincker P., D'Hont A.,
RA Scarpelli C., Weissenbach J., Salanoubat M., Quetier F., Yu Y., Kim H.R.,
RA Rambo T., Currie J., Collura K., Luo M., Yang T., Ammiraju J.S.S.,
RA Engler F., Soderlund C., Wing R.A., Palmer L.E., de la Bastide M.,
RA Spiegel L., Nascimento L., Zutavern T., O'Shaughnessy A., Dike S.,
RA Dedhia N., Preston R., Balija V., McCombie W.R., Chow T., Chen H.,
RA Chung M., Chen C., Shaw J., Wu H., Hsiao K., Chao Y., Chu M., Cheng C.,
RA Hour A., Lee P., Lin S., Lin Y., Liou J., Liu S., Hsing Y., Raghuvanshi S.,
RA Mohanty A., Bharti A.K., Gaur A., Gupta V., Kumar D., Ravi V., Vij S.,
RA Kapur A., Khurana P., Khurana P., Khurana J.P., Tyagi A.K., Gaikwad K.,
RA Singh A., Dalal V., Srivastava S., Dixit A., Pal A.K., Ghazi I.A.,
RA Yadav M., Pandit A., Bhargava A., Sureshbabu K., Batra K., Sharma T.R.,
RA Mohapatra T., Singh N.K., Messing J., Nelson A.B., Fuks G., Kavchok S.,
RA Keizer G., Linton E., Llaca V., Song R., Tanyolac B., Young S., Ho-Il K.,
RA Hahn J.H., Sangsakoo G., Vanavichit A., de Mattos Luiz.A.T., Zimmer P.D.,
RA Malone G., Dellagostin O., de Oliveira A.C., Bevan M., Bancroft I.,
RA Minx P., Cordum H., Wilson R., Cheng Z., Jin W., Jiang J., Leong S.A.,
RA Iwama H., Gojobori T., Itoh T., Niimura Y., Fujii Y., Habara T., Sakai H.,
RA Sato Y., Wilson G., Kumar K., McCouch S., Juretic N., Hoen D., Wright S.,
RA Bruskiewich R., Bureau T., Miyao A., Hirochika H., Nishikawa T.,
RA Kadowaki K., Sugiura M., Burr B., Sasaki T.;
RT "The map-based sequence of the rice genome.";
RL Nature 436:793-800(2005).
RN [2] {ECO:0000313|Proteomes:UP000000763}
RP GENOME REANNOTATION.
RC STRAIN=cv. Nipponbare {ECO:0000313|Proteomes:UP000000763};
RX PubMed=18089549; DOI=10.1093/nar/gkm978;
RG The rice annotation project (RAP);
RT "The rice annotation project database (RAP-DB): 2008 update.";
RL Nucleic Acids Res. 36:D1028-D1033(2008).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AC135928; AAU10841.1; -; Genomic_DNA.
DR Proteomes; UP000000763; Chromosome 5.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0004523; F:RNA-DNA hybrid ribonuclease activity; IEA:InterPro.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR CDD; cd00303; retropepsin_like; 1.
DR CDD; cd09279; RNase_HI_like; 1.
DR CDD; cd01647; RT_LTR; 1.
DR Gene3D; 3.30.70.270; -; 1.
DR Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 2.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR005162; Retrotrans_gag_dom.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR002156; RNaseH_domain.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR000477; RT_dom.
DR InterPro; IPR041577; RT_RNaseH_2.
DR PANTHER; PTHR37984:SF7; INTEGRASE CATALYTIC DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR37984; PROTEIN CBG26694; 1.
DR Pfam; PF03732; Retrotrans_gag; 1.
DR Pfam; PF17919; RT_RNaseH_2; 1.
DR Pfam; PF00078; RVT_1; 1.
DR Pfam; PF13456; RVT_3; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 2.
DR PROSITE; PS50994; INTEGRASE; 1.
DR PROSITE; PS50879; RNASE_H_1; 1.
PE 4: Predicted;
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723}.
FT DOMAIN 1073..1202
FT /note="RNase H type-1"
FT /evidence="ECO:0000259|PROSITE:PS50879"
FT DOMAIN 1235..1405
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT REGION 63..105
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 195..215
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 432..465
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1528 AA; 168454 MW; 3CDF219B9652B97D CRC64;
MYFAGQPSTT AAHPAPHVQL QAQPVAAQVH PQPPGQVPQT LVEGASALQA QLQAFLQQLS
QPHNISSTAP SARPEGNTSQ GAPSWLPSNQ PGLGASPCSQ GPQFDSINAA QAPTVRPQAP
TPGFGANQAP NQVAITWSQP TFDPFMAAQQ ASPIGAGQPN AMAQPHAQAV ISPFATPYPQ
QGAVNRARGE KGLPLSGGIK TRPIPPQFKF PPVPRYSGET DQKEFLSIYE SAIETAHGDE
NTKALANSIY SWEQFRDVFV LNFRGTYEEP KTQQHLLGIR HRPGESIRKY MRRFSQARCQ
VQDITEASVI NAASAGLHEG ELTRKIANKE PQTLEHLLRI IDGFARGEED SKRRQAIQAE
YDKASIAVAQ AQQQVQVAEP PPLVVRQPQP AVQAQPPRQG QAPMTWRKFR TDRAGKVVMA
VEEVQALRKE FDAQQASNHQ QPVRKKVRKT STAPSTDALR TPRSNAETFG NVATREGAPQ
YLNQQIFFGP EDAEGVMFPH QDPLVISAEI AGFEVRRILV NGGSSADVIF AEAYAKMGLT
TQALTPALTS LRGFGGEAVQ VLGQAHLIVA FGTGENRREE QNYNTIFGRA TLNKFEAISH
HNYLKLKMPG PAGVIVVKGL QPSAASKGDL AIINRTVHNV EAEPHDRAKH APKPAPHGKI
VKMQIDDADP TKLVSLGGDM GEEEAKNILE VLKKNIDIFA WGPVEVGGVS ADLIMHHLVV
KPDAKPRKQK LRKMSADRQE AAKAEVQRLL KAGVIQEIDH PEWLANPVLV RKSNGKWRMC
VDFTDLNKAY PKDDFPLPRI DQLVDSTAGC ELMSFLDAYS RYHQIHMNPA DIPKTAFITP
FGTICHLRMP FGLRNAGATF ARLVYKQMKP PSSIREVQKL ADRIAALIRF LSKAAERGLP
FFKTLRGAGK FNWTPECQAA FNELKQYLQS RPALISAASG SELLLYLAAL PVAVSAALFQ
ETDSGQKPIY FVSEALQGAK TRYIKMEKLA YALVMASRKV KHYFQAHKVI VPSQYPLGEI
LWGKEVTGRL SKWAAELSPF NFHFVARTAV KSQVLANFVA EWTSAFAPDP EPVEQPWVMY
FDGSWSHRGA GIAAVLTSPN GAPIRYAARL QFDTTNNAAE YEAILLGLRK AKALGVRCLL
IRTDSKLVAG HIDKSFEAKE EGMKKYLEAV QSMEKCFTGI TVEHLPRGQN EEADALAKSA
AFGGPHSPGI FFEVLYAPSV PMDSLEVMTI DQLELGEDPY DWRTPFVKHL ETGWLPEDEA
EAKRLQLRAT KYKMVSGQLY RSGAEPLGAI TSAAVQKFVW KNIVCRFGVP KEFITDNGKQ
FDSDKFREMC EGLNLEIRFA SVAHPQSNGA AERTNGKILE ALKKRLEGAT KGKWPEEILS
VLWALRTTPT RPTKFSPFML FYGDEAMTPI ELGANSPRVT FSGGEEGREL SLELLEGVRV
EALEHMQKYA TGTSATYNKK VRPTELLPGH LVLRKKVNPI AVGKLESKWE GPYLIKNKSR
TGSFRLATLE GEEFDHSWNT ASLKRFYV
//