ID Q8S7I8_ORYSJ Unreviewed; 1497 AA.
AC Q8S7I8;
DT 01-JUN-2002, integrated into UniProtKB/TrEMBL.
DT 01-JUN-2002, sequence version 1.
DT 27-MAR-2024, entry version 86.
DE SubName: Full=Gag-pol protein {ECO:0000313|EMBL:AAM19013.1};
GN Name=OSJNBa0010I09.7 {ECO:0000313|EMBL:AAM19013.1};
OS Oryza sativa subsp. japonica (Rice).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; Liliopsida; Poales; Poaceae; BOP clade;
OC Oryzoideae; Oryzeae; Oryzinae; Oryza; Oryza sativa.
OX NCBI_TaxID=39947 {ECO:0000313|EMBL:AAM19013.1, ECO:0000313|Proteomes:UP000000763};
RN [1] {ECO:0000313|Proteomes:UP000000763}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Nipponbare {ECO:0000313|Proteomes:UP000000763};
RX PubMed=16100779; DOI=10.1038/nature03895;
RG International rice genome sequencing project (IRGSP);
RA Matsumoto T., Wu J., Kanamori H., Katayose Y., Fujisawa M., Namiki N.,
RA Mizuno H., Yamamoto K., Antonio B.A., Baba T., Sakata K., Nagamura Y.,
RA Aoki H., Arikawa K., Arita K., Bito T., Chiden Y., Fujitsuka N.,
RA Fukunaka R., Hamada M., Harada C., Hayashi A., Hijishita S., Honda M.,
RA Hosokawa S., Ichikawa Y., Idonuma A., Iijima M., Ikeda M., Ikeno M.,
RA Ito K., Ito S., Ito T., Ito Y., Ito Y., Iwabuchi A., Kamiya K.,
RA Karasawa W., Kurita K., Katagiri S., Kikuta A., Kobayashi H., Kobayashi N.,
RA Machita K., Maehara T., Masukawa M., Mizubayashi T., Mukai Y., Nagasaki H.,
RA Nagata Y., Naito S., Nakashima M., Nakama Y., Nakamichi Y., Nakamura M.,
RA Meguro A., Negishi M., Ohta I., Ohta T., Okamoto M., Ono N., Saji S.,
RA Sakaguchi M., Sakai K., Shibata M., Shimokawa T., Song J., Takazaki Y.,
RA Terasawa K., Tsugane M., Tsuji K., Ueda S., Waki K., Yamagata H.,
RA Yamamoto M., Yamamoto S., Yamane H., Yoshiki S., Yoshihara R., Yukawa K.,
RA Zhong H., Yano M., Yuan Q., Ouyang S., Liu J., Jones K.M., Gansberger K.,
RA Moffat K., Hill J., Bera J., Fadrosh D., Jin S., Johri S., Kim M.,
RA Overton L., Reardon M., Tsitrin T., Vuong H., Weaver B., Ciecko A.,
RA Tallon L., Jackson J., Pai G., Aken S.V., Utterback T., Reidmuller S.,
RA Feldblyum T., Hsiao J., Zismann V., Iobst S., de Vazeille A.R., Buell C.R.,
RA Ying K., Li Y., Lu T., Huang Y., Zhao Q., Feng Q., Zhang L., Zhu J.,
RA Weng Q., Mu J., Lu Y., Fan D., Liu Y., Guan J., Zhang Y., Yu S., Liu X.,
RA Zhang Y., Hong G., Han B., Choisne N., Demange N., Orjeda G., Samain S.,
RA Cattolico L., Pelletier E., Couloux A., Segurens B., Wincker P., D'Hont A.,
RA Scarpelli C., Weissenbach J., Salanoubat M., Quetier F., Yu Y., Kim H.R.,
RA Rambo T., Currie J., Collura K., Luo M., Yang T., Ammiraju J.S.S.,
RA Engler F., Soderlund C., Wing R.A., Palmer L.E., de la Bastide M.,
RA Spiegel L., Nascimento L., Zutavern T., O'Shaughnessy A., Dike S.,
RA Dedhia N., Preston R., Balija V., McCombie W.R., Chow T., Chen H.,
RA Chung M., Chen C., Shaw J., Wu H., Hsiao K., Chao Y., Chu M., Cheng C.,
RA Hour A., Lee P., Lin S., Lin Y., Liou J., Liu S., Hsing Y., Raghuvanshi S.,
RA Mohanty A., Bharti A.K., Gaur A., Gupta V., Kumar D., Ravi V., Vij S.,
RA Kapur A., Khurana P., Khurana P., Khurana J.P., Tyagi A.K., Gaikwad K.,
RA Singh A., Dalal V., Srivastava S., Dixit A., Pal A.K., Ghazi I.A.,
RA Yadav M., Pandit A., Bhargava A., Sureshbabu K., Batra K., Sharma T.R.,
RA Mohapatra T., Singh N.K., Messing J., Nelson A.B., Fuks G., Kavchok S.,
RA Keizer G., Linton E., Llaca V., Song R., Tanyolac B., Young S., Ho-Il K.,
RA Hahn J.H., Sangsakoo G., Vanavichit A., de Mattos Luiz.A.T., Zimmer P.D.,
RA Malone G., Dellagostin O., de Oliveira A.C., Bevan M., Bancroft I.,
RA Minx P., Cordum H., Wilson R., Cheng Z., Jin W., Jiang J., Leong S.A.,
RA Iwama H., Gojobori T., Itoh T., Niimura Y., Fujii Y., Habara T., Sakai H.,
RA Sato Y., Wilson G., Kumar K., McCouch S., Juretic N., Hoen D., Wright S.,
RA Bruskiewich R., Bureau T., Miyao A., Hirochika H., Nishikawa T.,
RA Kadowaki K., Sugiura M., Burr B., Sasaki T.;
RT "The map-based sequence of the rice genome.";
RL Nature 436:793-800(2005).
RN [2] {ECO:0000313|Proteomes:UP000000763}
RP GENOME REANNOTATION.
RC STRAIN=cv. Nipponbare {ECO:0000313|Proteomes:UP000000763};
RX PubMed=18089549; DOI=10.1093/nar/gkm978;
RG The rice annotation project (RAP);
RT "The rice annotation project database (RAP-DB): 2008 update.";
RL Nucleic Acids Res. 36:D1028-D1033(2008).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AC084748; AAM19013.1; -; Genomic_DNA.
DR Proteomes; UP000000763; Chromosome 3.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR CDD; cd00303; retropepsin_like; 1.
DR CDD; cd01647; RT_LTR; 1.
DR Gene3D; 3.30.70.270; -; 2.
DR Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR005162; Retrotrans_gag_dom.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR000477; RT_dom.
DR InterPro; IPR041577; RT_RNaseH_2.
DR PANTHER; PTHR37984:SF7; INTEGRASE CATALYTIC DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR37984; PROTEIN CBG26694; 1.
DR Pfam; PF03732; Retrotrans_gag; 1.
DR Pfam; PF17919; RT_RNaseH_2; 1.
DR Pfam; PF00665; rve; 1.
DR Pfam; PF00078; RVT_1; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
PE 4: Predicted;
FT DOMAIN 1207..1371
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT REGION 67..129
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 306..360
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 398..424
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 67..86
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 88..122
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 311..334
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 407..424
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1497 AA; 168569 MW; 7BC94F69FDFAAB8E CRC64;
MGSVSGIDDF AFPPGQAFRF GSLNFITNDF GKISLHDSDS NQSGRNQVPV PFGILNSAEA
YSKIISPESA PNHSDEIQST LPRPDQDDGN SVKPEKRTKI AHGKKLNVAD WRKSDSDRSE
NDYSVNNKTA NGLQRRLKID GNAPWNLGAE RESSLGNKTT TNPESWLTVY GLAIRAAGGD
SKAMANYLPV ALADSARSWL HGLPRGTIGS WAELRDHFIA NFQGTFERPG TQFDLYNVVQ
KSGESLRDYI RRFSELRNKI SDITDDVIIA ALTKGIRHED LVGKFGRKPP RTVKQMFKKA
NEYAKAEDAI TASKQSGTTW KPKKDTPTAG GSGSNNHKDR KRKPEELVAT TSPPSRQRSR
VNTFDKIMNS QCPHHPNSNH VAKDCFVYKQ FAEQYVKNAR KPSDGDQGTS NKNDDEDDAP
TGFQDHRKEL NHIFGGPLAY ESKRKQKLTE REINAVQPNT PQYLRWSETA IKFDRSDHPD
RVVHPGRYPL VLDPVVRNVK LRRTLIDGGS ALNILFANTL DDMQIPRTEL KPSNAPFHGV
IPGLSATPLG QITLPVTFGT QENFRTENVC FEVADFETAY HAILGRPALA EFMAVPHYTY
MMMKMPGPRG VISLQSDIKQ AVTCDKESCE MAQTHEITLA REEIQLAATT ASEGEVPATK
LTKTDESDAK TKKITLDPSD PDKTANNKDI FAWKPSDMPG IPREVIEHSL HVKEDAKPIK
QRLRRFAQNR KDAIKEELTK LLAVGFIKEV LHPDWLANPV LVRKKTEQWR MCVDYTDLNK
SCPKDPFGLP RIDQVVDSTA GCELLSFLDC YSGYHQIRLK ESDCLKTSFI TPFGAYCYIT
MPFGLKNAGA TYQRMIQRCF STQIGRNVEA YVDDVVVKTK HKDDLIADLE ETFASIRAFR
MKLNPEKCIY GVPSGKLLGF MVSQRGIQAN PEKINAILNM KPPSSQKDVQ KLTGCMAALS
RFVSQLGERG MPFFKLLKKT DNFQWGPEAQ KAFEDFKKLL TTPPVLASPH PQEPLLLYVS
ATSQVVSMVL VVEREEEGHI QKVQRPIYFV SEVLADSKTR YPQVQKLLYG VLITVRKLSH
YFQSHSVTVV TSFPLGDILH NREANGRIAK WALELMSLDI SFKPRTSIKS QALADFLAEW
TECQEDMQEE KMEYWTMHFD GSKRITGTGA GEAAPSDVFV EHLYEPTVPR KETIEAMDTQ
GPKSCKPSRC PGRLRSGGLT WSARLKGAVG GYTHLFVAID KFSKWIEAKP VITITADKAR
DFFINIVHRF GVPNRIITDN GTQFTGGAFK DFCEDFCIKI CYASVAHPMS NGQVEHANGM
ILQGIKARVF DRLRPYAGKW VDQLPSVLWS LRTTPSRATG QSPFFLVYGA EAMLPSEVEF
ESLRFRNFNE EGYEEGRVDD INRLEEAREA ALIQSTRYLQ GLRRYHNRNV RSRAFLVGDL
VLRKIQTTQD RHKLSPLWEG PFIIAEVTQP GSYRLKREDG TLINNSWNIE HLRRFYA
//