ID Q7Y1M7_ORYSJ Unreviewed; 1161 AA.
AC Q7Y1M7;
DT 01-OCT-2003, integrated into UniProtKB/TrEMBL.
DT 01-OCT-2003, sequence version 1.
DT 27-MAR-2024, entry version 81.
DE SubName: Full=Polyprotein {ECO:0000313|EMBL:AAP44605.1};
GN Name=OSJNBa0053G10.9 {ECO:0000313|EMBL:AAP44605.1};
OS Oryza sativa subsp. japonica (Rice).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; Liliopsida; Poales; Poaceae; BOP clade;
OC Oryzoideae; Oryzeae; Oryzinae; Oryza; Oryza sativa.
OX NCBI_TaxID=39947 {ECO:0000313|EMBL:AAP44605.1, ECO:0000313|Proteomes:UP000000763};
RN [1] {ECO:0000313|Proteomes:UP000000763}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Nipponbare {ECO:0000313|Proteomes:UP000000763};
RX PubMed=16100779; DOI=10.1038/nature03895;
RG International rice genome sequencing project (IRGSP);
RA Matsumoto T., Wu J., Kanamori H., Katayose Y., Fujisawa M., Namiki N.,
RA Mizuno H., Yamamoto K., Antonio B.A., Baba T., Sakata K., Nagamura Y.,
RA Aoki H., Arikawa K., Arita K., Bito T., Chiden Y., Fujitsuka N.,
RA Fukunaka R., Hamada M., Harada C., Hayashi A., Hijishita S., Honda M.,
RA Hosokawa S., Ichikawa Y., Idonuma A., Iijima M., Ikeda M., Ikeno M.,
RA Ito K., Ito S., Ito T., Ito Y., Ito Y., Iwabuchi A., Kamiya K.,
RA Karasawa W., Kurita K., Katagiri S., Kikuta A., Kobayashi H., Kobayashi N.,
RA Machita K., Maehara T., Masukawa M., Mizubayashi T., Mukai Y., Nagasaki H.,
RA Nagata Y., Naito S., Nakashima M., Nakama Y., Nakamichi Y., Nakamura M.,
RA Meguro A., Negishi M., Ohta I., Ohta T., Okamoto M., Ono N., Saji S.,
RA Sakaguchi M., Sakai K., Shibata M., Shimokawa T., Song J., Takazaki Y.,
RA Terasawa K., Tsugane M., Tsuji K., Ueda S., Waki K., Yamagata H.,
RA Yamamoto M., Yamamoto S., Yamane H., Yoshiki S., Yoshihara R., Yukawa K.,
RA Zhong H., Yano M., Yuan Q., Ouyang S., Liu J., Jones K.M., Gansberger K.,
RA Moffat K., Hill J., Bera J., Fadrosh D., Jin S., Johri S., Kim M.,
RA Overton L., Reardon M., Tsitrin T., Vuong H., Weaver B., Ciecko A.,
RA Tallon L., Jackson J., Pai G., Aken S.V., Utterback T., Reidmuller S.,
RA Feldblyum T., Hsiao J., Zismann V., Iobst S., de Vazeille A.R., Buell C.R.,
RA Ying K., Li Y., Lu T., Huang Y., Zhao Q., Feng Q., Zhang L., Zhu J.,
RA Weng Q., Mu J., Lu Y., Fan D., Liu Y., Guan J., Zhang Y., Yu S., Liu X.,
RA Zhang Y., Hong G., Han B., Choisne N., Demange N., Orjeda G., Samain S.,
RA Cattolico L., Pelletier E., Couloux A., Segurens B., Wincker P., D'Hont A.,
RA Scarpelli C., Weissenbach J., Salanoubat M., Quetier F., Yu Y., Kim H.R.,
RA Rambo T., Currie J., Collura K., Luo M., Yang T., Ammiraju J.S.S.,
RA Engler F., Soderlund C., Wing R.A., Palmer L.E., de la Bastide M.,
RA Spiegel L., Nascimento L., Zutavern T., O'Shaughnessy A., Dike S.,
RA Dedhia N., Preston R., Balija V., McCombie W.R., Chow T., Chen H.,
RA Chung M., Chen C., Shaw J., Wu H., Hsiao K., Chao Y., Chu M., Cheng C.,
RA Hour A., Lee P., Lin S., Lin Y., Liou J., Liu S., Hsing Y., Raghuvanshi S.,
RA Mohanty A., Bharti A.K., Gaur A., Gupta V., Kumar D., Ravi V., Vij S.,
RA Kapur A., Khurana P., Khurana P., Khurana J.P., Tyagi A.K., Gaikwad K.,
RA Singh A., Dalal V., Srivastava S., Dixit A., Pal A.K., Ghazi I.A.,
RA Yadav M., Pandit A., Bhargava A., Sureshbabu K., Batra K., Sharma T.R.,
RA Mohapatra T., Singh N.K., Messing J., Nelson A.B., Fuks G., Kavchok S.,
RA Keizer G., Linton E., Llaca V., Song R., Tanyolac B., Young S., Ho-Il K.,
RA Hahn J.H., Sangsakoo G., Vanavichit A., de Mattos Luiz.A.T., Zimmer P.D.,
RA Malone G., Dellagostin O., de Oliveira A.C., Bevan M., Bancroft I.,
RA Minx P., Cordum H., Wilson R., Cheng Z., Jin W., Jiang J., Leong S.A.,
RA Iwama H., Gojobori T., Itoh T., Niimura Y., Fujii Y., Habara T., Sakai H.,
RA Sato Y., Wilson G., Kumar K., McCouch S., Juretic N., Hoen D., Wright S.,
RA Bruskiewich R., Bureau T., Miyao A., Hirochika H., Nishikawa T.,
RA Kadowaki K., Sugiura M., Burr B., Sasaki T.;
RT "The map-based sequence of the rice genome.";
RL Nature 436:793-800(2005).
RN [2] {ECO:0000313|Proteomes:UP000000763}
RP GENOME REANNOTATION.
RC STRAIN=cv. Nipponbare {ECO:0000313|Proteomes:UP000000763};
RX PubMed=18089549; DOI=10.1093/nar/gkm978;
RG The rice annotation project (RAP);
RT "The rice annotation project database (RAP-DB): 2008 update.";
RL Nucleic Acids Res. 36:D1028-D1033(2008).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AC091233; AAP44605.1; -; Genomic_DNA.
DR AlphaFoldDB; Q7Y1M7; -.
DR Proteomes; UP000000763; Chromosome 3.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR CDD; cd09272; RNase_HI_RT_Ty1; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR025724; GAG-pre-integrase_dom.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR013103; RVT_2.
DR PANTHER; PTHR42648:SF22; RETROVIRUS-RELATED POL POLYPROTEIN FROM TRANSPOSON TNT 1-94; 1.
DR PANTHER; PTHR42648; TRANSPOSASE, PUTATIVE-RELATED; 1.
DR Pfam; PF13976; gag_pre-integrs; 1.
DR Pfam; PF14223; Retrotran_gag_2; 1.
DR Pfam; PF00665; rve; 1.
DR Pfam; PF07727; RVT_2; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
PE 4: Predicted;
FT DOMAIN 358..534
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT REGION 212..243
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 228..243
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1161 AA; 132421 MW; 20C222A1F5BBE57B CRC64;
MAAPAPSNFN LRSILEKEKL TGTNFMDWYR NLRIVLRQEH KEFVLTEPFP ANLPNNAPAA
QRREHEKRCN DYLDISCLML ATMSPELQRQ YEALDAHTII TGLRNMFEDQ ARAERFNTSK
SLFACRLAEG NLVSPHVIKM IGYTESLDKL GFPLSRELAT DLILQSLPPS FEPFIMNFNM
NNLNRTLAEL HGMLKTAEES IKKNSNHVMV MHKRKPNNKK SGQKRKLNSD EITSTSNSKT
KVQKTGSAKD AECFFCKETE GYGFRSVDNG CSVYYNDIFY FHAPMMNGLY IVNLDGCSVY
NINAKRQRPN NLNPTFIWHC CLGHINEKRI EKLHRDGLLH SFDFESFKTC ESCLLGKMTK
APFTGQSERA SELLGLVHTD VCGPMSSTAR GGFGYFITFT DEFSRYGYVY LMRHKSESFE
KFQEFQNEVQ NHLGKTIKYL RSDRGGEYLS LEYGNHLKEC GIVPQLTPPG TPQWNAVSER
RNRILLDMVR SMMSQTDMPL SFWGYALETA AFTLNRVPSK SVDKTPYEIW TGKRPSLSFL
KIWCCEETKG YYFYNREEGK VFVARHGVFL EKEFISRKDS GSMVRLKEIQ ETPENASTST
QPQVEQDVVQ QVEQVVVEPV VEAPASRRSE RIRRTPARYA LLTSGQRDIL LLDNDEPTTY
EEAMVGPDTE KWLGAMKSEI ESMHVNQVWN LVDPPDGVKA IECKWIFKKM TDVDGTVHIY
NARLVAKGFR QIQGVDYDET FSPVAMLKSI RIVLAIAAYF DYEIWQMDVK TAFLNGNLDE
DVYMTQPKGF VDPQSAKKIC KLQKSIYRLK QASRSWNIRF DEVVKALGFV KNEEEPCVYK
KISGSALVFL ILYVDDILLI GNDIPMLESV KTSLKYSFSM KDLGEAAYIL GIRIYRDRSK
RLIGLSQSTY IDKVLKRFNM QDSKKGFLPM SHGINLGKNQ CPQTTDERNK MSVIPYASAI
GSIMYAMLCT RLDVSYALSA TSRYQSDLGE SHWIAVKNIL KYLRRTKDMF LVYGRQEELV
VNGYTDASFQ TDKDDFRSQS GFVFCLNGGA VSWKSSKQDT VADSTTEAEY IAASEAAKEA
VWIKKFVSQL GVMTSASSPM DLYCDNSGAI AQAKEPRSHQ KSKHILRRYH LIREMVGRGD
VKICKIHTDL NVADPLTKPL P
//