ID Q94I69_ORYSJ Unreviewed; 2014 AA.
AC Q94I69;
DT 01-DEC-2001, integrated into UniProtKB/TrEMBL.
DT 01-DEC-2001, sequence version 1.
DT 27-MAR-2024, entry version 101.
DE SubName: Full=Retroelement {ECO:0000313|EMBL:AAK53848.1};
GN Name=OSJNBa0084C09.2 {ECO:0000313|EMBL:AAK53848.1};
OS Oryza sativa subsp. japonica (Rice).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; Liliopsida; Poales; Poaceae; BOP clade;
OC Oryzoideae; Oryzeae; Oryzinae; Oryza; Oryza sativa.
OX NCBI_TaxID=39947 {ECO:0000313|EMBL:AAK53848.1, ECO:0000313|Proteomes:UP000000763};
RN [1] {ECO:0000313|Proteomes:UP000000763}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Nipponbare {ECO:0000313|Proteomes:UP000000763};
RX PubMed=16100779; DOI=10.1038/nature03895;
RG International rice genome sequencing project (IRGSP);
RA Matsumoto T., Wu J., Kanamori H., Katayose Y., Fujisawa M., Namiki N.,
RA Mizuno H., Yamamoto K., Antonio B.A., Baba T., Sakata K., Nagamura Y.,
RA Aoki H., Arikawa K., Arita K., Bito T., Chiden Y., Fujitsuka N.,
RA Fukunaka R., Hamada M., Harada C., Hayashi A., Hijishita S., Honda M.,
RA Hosokawa S., Ichikawa Y., Idonuma A., Iijima M., Ikeda M., Ikeno M.,
RA Ito K., Ito S., Ito T., Ito Y., Ito Y., Iwabuchi A., Kamiya K.,
RA Karasawa W., Kurita K., Katagiri S., Kikuta A., Kobayashi H., Kobayashi N.,
RA Machita K., Maehara T., Masukawa M., Mizubayashi T., Mukai Y., Nagasaki H.,
RA Nagata Y., Naito S., Nakashima M., Nakama Y., Nakamichi Y., Nakamura M.,
RA Meguro A., Negishi M., Ohta I., Ohta T., Okamoto M., Ono N., Saji S.,
RA Sakaguchi M., Sakai K., Shibata M., Shimokawa T., Song J., Takazaki Y.,
RA Terasawa K., Tsugane M., Tsuji K., Ueda S., Waki K., Yamagata H.,
RA Yamamoto M., Yamamoto S., Yamane H., Yoshiki S., Yoshihara R., Yukawa K.,
RA Zhong H., Yano M., Yuan Q., Ouyang S., Liu J., Jones K.M., Gansberger K.,
RA Moffat K., Hill J., Bera J., Fadrosh D., Jin S., Johri S., Kim M.,
RA Overton L., Reardon M., Tsitrin T., Vuong H., Weaver B., Ciecko A.,
RA Tallon L., Jackson J., Pai G., Aken S.V., Utterback T., Reidmuller S.,
RA Feldblyum T., Hsiao J., Zismann V., Iobst S., de Vazeille A.R., Buell C.R.,
RA Ying K., Li Y., Lu T., Huang Y., Zhao Q., Feng Q., Zhang L., Zhu J.,
RA Weng Q., Mu J., Lu Y., Fan D., Liu Y., Guan J., Zhang Y., Yu S., Liu X.,
RA Zhang Y., Hong G., Han B., Choisne N., Demange N., Orjeda G., Samain S.,
RA Cattolico L., Pelletier E., Couloux A., Segurens B., Wincker P., D'Hont A.,
RA Scarpelli C., Weissenbach J., Salanoubat M., Quetier F., Yu Y., Kim H.R.,
RA Rambo T., Currie J., Collura K., Luo M., Yang T., Ammiraju J.S.S.,
RA Engler F., Soderlund C., Wing R.A., Palmer L.E., de la Bastide M.,
RA Spiegel L., Nascimento L., Zutavern T., O'Shaughnessy A., Dike S.,
RA Dedhia N., Preston R., Balija V., McCombie W.R., Chow T., Chen H.,
RA Chung M., Chen C., Shaw J., Wu H., Hsiao K., Chao Y., Chu M., Cheng C.,
RA Hour A., Lee P., Lin S., Lin Y., Liou J., Liu S., Hsing Y., Raghuvanshi S.,
RA Mohanty A., Bharti A.K., Gaur A., Gupta V., Kumar D., Ravi V., Vij S.,
RA Kapur A., Khurana P., Khurana P., Khurana J.P., Tyagi A.K., Gaikwad K.,
RA Singh A., Dalal V., Srivastava S., Dixit A., Pal A.K., Ghazi I.A.,
RA Yadav M., Pandit A., Bhargava A., Sureshbabu K., Batra K., Sharma T.R.,
RA Mohapatra T., Singh N.K., Messing J., Nelson A.B., Fuks G., Kavchok S.,
RA Keizer G., Linton E., Llaca V., Song R., Tanyolac B., Young S., Ho-Il K.,
RA Hahn J.H., Sangsakoo G., Vanavichit A., de Mattos Luiz.A.T., Zimmer P.D.,
RA Malone G., Dellagostin O., de Oliveira A.C., Bevan M., Bancroft I.,
RA Minx P., Cordum H., Wilson R., Cheng Z., Jin W., Jiang J., Leong S.A.,
RA Iwama H., Gojobori T., Itoh T., Niimura Y., Fujii Y., Habara T., Sakai H.,
RA Sato Y., Wilson G., Kumar K., McCouch S., Juretic N., Hoen D., Wright S.,
RA Bruskiewich R., Bureau T., Miyao A., Hirochika H., Nishikawa T.,
RA Kadowaki K., Sugiura M., Burr B., Sasaki T.;
RT "The map-based sequence of the rice genome.";
RL Nature 436:793-800(2005).
RN [2] {ECO:0000313|Proteomes:UP000000763}
RP GENOME REANNOTATION.
RC STRAIN=cv. Nipponbare {ECO:0000313|Proteomes:UP000000763};
RX PubMed=18089549; DOI=10.1093/nar/gkm978;
RG The rice annotation project (RAP);
RT "The rice annotation project database (RAP-DB): 2008 update.";
RL Nucleic Acids Res. 36:D1028-D1033(2008).
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AC016781; AAK53848.1; -; Genomic_DNA.
DR Proteomes; UP000000763; Chromosome 3.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0004523; F:RNA-DNA hybrid ribonuclease activity; IEA:InterPro.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR GO; GO:0006310; P:DNA recombination; IEA:UniProtKB-KW.
DR CDD; cd00303; retropepsin_like; 1.
DR CDD; cd09279; RNase_HI_like; 1.
DR CDD; cd01647; RT_LTR; 1.
DR Gene3D; 1.10.340.70; -; 1.
DR Gene3D; 3.10.20.370; -; 1.
DR Gene3D; 3.30.70.270; -; 2.
DR Gene3D; 2.40.70.10; Acid Proteases; 1.
DR Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 2.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR041588; Integrase_H2C2.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR InterPro; IPR005162; Retrotrans_gag_dom.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR002156; RNaseH_domain.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR000477; RT_dom.
DR InterPro; IPR041577; RT_RNaseH_2.
DR PANTHER; PTHR37984:SF7; INTEGRASE CATALYTIC DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR37984; PROTEIN CBG26694; 1.
DR Pfam; PF13650; Asp_protease_2; 1.
DR Pfam; PF17921; Integrase_H2C2; 1.
DR Pfam; PF03732; Retrotrans_gag; 1.
DR Pfam; PF17919; RT_RNaseH_2; 1.
DR Pfam; PF00665; rve; 1.
DR Pfam; PF00078; RVT_1; 1.
DR Pfam; PF13456; RVT_3; 1.
DR SUPFAM; SSF50630; Acid proteases; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 2.
DR PROSITE; PS50994; INTEGRASE; 1.
PE 4: Predicted;
KW DNA recombination {ECO:0000256|ARBA:ARBA00023172};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723}.
FT DOMAIN 1821..1981
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT REGION 136..186
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 257..278
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 741..761
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 144..178
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 741..760
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2014 AA; 228046 MW; 0BBA523E6B5DD792 CRC64;
MLDQINWHAN IPEGELWTCT GVKQISQASC FTSTRFWHAQ WDQEEVSKQF FLMAEKPPLS
PSSASPSTVK EKIQKLGLTD VNEGNVVPID PEKFTPDQKK DFDAMMQQAR DQFLNSFMQT
SKGTVVQKYK VKVVADDPGT SSSKGEDGKQ APDGSAQPSD NGATDGSLGG QRDNSQGVHG
VQGDGVHGVQ DRVDYAVHNA LINQSGMLVN TLSNMMKSMV DGTIAEYQAA GPVYLQGGVF
PNYRPLITNS HPSVQAAPSI APSAQPMAPA SAPAPTVPPS APGQLVNPRL LIREQPQHAG
QNVDRLTQDQ VASMFLPPQN TVDPTRNSRF SKHPKSNKLC NQVCNTWWAA CATCYKLGCP
RAFSTSRTTR WHDVMKEQFR LRPKDAGNLY RQPYPKWFER VPLPNRFKVP DFSKFSWQDS
TSTYEHISRF LAQCGEASAV DALKVRLFRL SLAGSTFTWF SSLPYGSINS WADLEKQFHS
YFYSGIHEMK LSDLTSIKQK HDEPVHEYIQ RFREMRNKCY SLSLTDAQLA DLAFQGMIAP
IREKLSSEDF ESLSHLTQKV ALHEQRYAEA RKNSRKVNHV CPYKYGSDDE DDDSEIAAAE
WVRSKKVIPC QWVKNSGKEE RYDFDISKAD KIFDLLLREK QIQLPAGHTI PSAEELGKKR
YCKWHNSGSH TTNDCKVFRQ QIQAAIEGGK IKFDDSKRLM KVDGNPFPVN MVHTAGRTAD
RGRARGFQVN SAKIINKYQR KYDKQQEKHH EEDDDGLRLP SIEDCPGCSD IAKNSSRSYG
RGNRLRQTRV PVHQRLGPVN QDHGQEDNEI RKNQWCPSGI FTKNQKRRKT KSRQEWRVKN
QVPVADEATA EEAKRLAKGK SVVTASVNMV FTLPAEIGTK QADVDEVEEE SAKLILLPEQ
AIFEKPEGTE NRHLKPLYIN GYVNGKPMSK MMVDGGAEVN LMPYATFRKL GRNVEDLIKT
NMVLIDFGGN LSETKGVSNV ELTVGNKTIP TTFFVIDGNG SYSLLLGRDW IHANCCIPST
MHQCLIQWQD DKIEIVPADS QLKMENPSCY FEGVVEGSNV YIKDTVDDLD DKQGQGFISA
DDLEEIDIGP EFRAKLIELL KEFRDCFAWE YYEMPGLSRS IVEHRLPIKP GVRPHQQPLR
RCKADMLEPV KVEIKRLYDA CFIRPCRYAE WVSSIVPVIK KNDLNKATPK DEYPMPVADQ
LVDAASGNKI LSYMDGNVGY NQIFMAEEDI HKTAFRCPGA IGLFEWVVMT FGLKSAGATY
QRAMNYIYHD LIGWLVEVYI DDVVVKSKEI EDHIADLRKV FERTRKYGLK MNPTKCAFGV
SVGQFLGFLV HERGIEVTQR SVNAIKKIQP PENKTELQEM NGKINFVRRF ISYLSGRLEP
FTPLLRLKAD QQFTWGAEQQ KALDDIKEYL SSPPVLIPPQ KGISFRLYLS AGEKSIGSVL
IQELDGKERV VFYLSRRLLD VETRYSPMEK LCLCLYFSCT RLSHYLLSNE CTVICKADVI
KYMLSAPILK GRVGKWIFSL TEFDLRYESP KAIKGQAIAD FIVEHCDDSI GSVEVVPWTS
FFDGSVCTHD CGIGLVIISP RGACFKFAYT IKPYATNNQA EYEAVLKGLQ LLKEVEADAI
EIMGDSLLVI SQLAGEYECK SDTLMVYNEK CQELMQEFRL VTLKHVSREQ NIEANDLAQG
ASGYKPMIKD VKAEIAAITA GDWRYDVHQY LHNPSQSASR KLRYKALKYT LLDDELYYRT
IDGVLLKCLS ADQAKVAIGE VHEGICGTHQ SAHKMKWLLR RARYFWPTML EDCFKYYKGC
QDCQKFGAIQ RAPASAMNPI IKPWPFRGWG IDMIGMISRP SSKGHKFILV ATDYFTKLVE
AIPLKKVDSG DAIQFVQEHI IYRFGIPQTI TTDQGSIFVS DEFVQFVDSM GIKLLNSSPY
YAQANGQAEA SNKSLIKLIK RKNSDYPRQW HTRLDEALWS YRMACHGSIQ VPPYKLVYGH
EAILPWEVRI GSRRTELLNG KYLKKYYLSV WVNA
//