ID Q8H7J5_ORYSJ Unreviewed; 1146 AA.
AC Q8H7J5;
DT 01-MAR-2003, integrated into UniProtKB/TrEMBL.
DT 01-MAR-2003, sequence version 1.
DT 27-MAR-2024, entry version 61.
DE SubName: Full=Retroelement {ECO:0000313|EMBL:AAN34944.1};
GN Name=OJA1325D05.1 {ECO:0000313|EMBL:AAN34944.1};
OS Oryza sativa subsp. japonica (Rice).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; Liliopsida; Poales; Poaceae; BOP clade;
OC Oryzoideae; Oryzeae; Oryzinae; Oryza; Oryza sativa.
OX NCBI_TaxID=39947 {ECO:0000313|EMBL:AAN34944.1, ECO:0000313|Proteomes:UP000000763};
RN [1] {ECO:0000313|Proteomes:UP000000763}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Nipponbare {ECO:0000313|Proteomes:UP000000763};
RX PubMed=16100779; DOI=10.1038/nature03895;
RG International rice genome sequencing project (IRGSP);
RA Matsumoto T., Wu J., Kanamori H., Katayose Y., Fujisawa M., Namiki N.,
RA Mizuno H., Yamamoto K., Antonio B.A., Baba T., Sakata K., Nagamura Y.,
RA Aoki H., Arikawa K., Arita K., Bito T., Chiden Y., Fujitsuka N.,
RA Fukunaka R., Hamada M., Harada C., Hayashi A., Hijishita S., Honda M.,
RA Hosokawa S., Ichikawa Y., Idonuma A., Iijima M., Ikeda M., Ikeno M.,
RA Ito K., Ito S., Ito T., Ito Y., Ito Y., Iwabuchi A., Kamiya K.,
RA Karasawa W., Kurita K., Katagiri S., Kikuta A., Kobayashi H., Kobayashi N.,
RA Machita K., Maehara T., Masukawa M., Mizubayashi T., Mukai Y., Nagasaki H.,
RA Nagata Y., Naito S., Nakashima M., Nakama Y., Nakamichi Y., Nakamura M.,
RA Meguro A., Negishi M., Ohta I., Ohta T., Okamoto M., Ono N., Saji S.,
RA Sakaguchi M., Sakai K., Shibata M., Shimokawa T., Song J., Takazaki Y.,
RA Terasawa K., Tsugane M., Tsuji K., Ueda S., Waki K., Yamagata H.,
RA Yamamoto M., Yamamoto S., Yamane H., Yoshiki S., Yoshihara R., Yukawa K.,
RA Zhong H., Yano M., Yuan Q., Ouyang S., Liu J., Jones K.M., Gansberger K.,
RA Moffat K., Hill J., Bera J., Fadrosh D., Jin S., Johri S., Kim M.,
RA Overton L., Reardon M., Tsitrin T., Vuong H., Weaver B., Ciecko A.,
RA Tallon L., Jackson J., Pai G., Aken S.V., Utterback T., Reidmuller S.,
RA Feldblyum T., Hsiao J., Zismann V., Iobst S., de Vazeille A.R., Buell C.R.,
RA Ying K., Li Y., Lu T., Huang Y., Zhao Q., Feng Q., Zhang L., Zhu J.,
RA Weng Q., Mu J., Lu Y., Fan D., Liu Y., Guan J., Zhang Y., Yu S., Liu X.,
RA Zhang Y., Hong G., Han B., Choisne N., Demange N., Orjeda G., Samain S.,
RA Cattolico L., Pelletier E., Couloux A., Segurens B., Wincker P., D'Hont A.,
RA Scarpelli C., Weissenbach J., Salanoubat M., Quetier F., Yu Y., Kim H.R.,
RA Rambo T., Currie J., Collura K., Luo M., Yang T., Ammiraju J.S.S.,
RA Engler F., Soderlund C., Wing R.A., Palmer L.E., de la Bastide M.,
RA Spiegel L., Nascimento L., Zutavern T., O'Shaughnessy A., Dike S.,
RA Dedhia N., Preston R., Balija V., McCombie W.R., Chow T., Chen H.,
RA Chung M., Chen C., Shaw J., Wu H., Hsiao K., Chao Y., Chu M., Cheng C.,
RA Hour A., Lee P., Lin S., Lin Y., Liou J., Liu S., Hsing Y., Raghuvanshi S.,
RA Mohanty A., Bharti A.K., Gaur A., Gupta V., Kumar D., Ravi V., Vij S.,
RA Kapur A., Khurana P., Khurana P., Khurana J.P., Tyagi A.K., Gaikwad K.,
RA Singh A., Dalal V., Srivastava S., Dixit A., Pal A.K., Ghazi I.A.,
RA Yadav M., Pandit A., Bhargava A., Sureshbabu K., Batra K., Sharma T.R.,
RA Mohapatra T., Singh N.K., Messing J., Nelson A.B., Fuks G., Kavchok S.,
RA Keizer G., Linton E., Llaca V., Song R., Tanyolac B., Young S., Ho-Il K.,
RA Hahn J.H., Sangsakoo G., Vanavichit A., de Mattos Luiz.A.T., Zimmer P.D.,
RA Malone G., Dellagostin O., de Oliveira A.C., Bevan M., Bancroft I.,
RA Minx P., Cordum H., Wilson R., Cheng Z., Jin W., Jiang J., Leong S.A.,
RA Iwama H., Gojobori T., Itoh T., Niimura Y., Fujii Y., Habara T., Sakai H.,
RA Sato Y., Wilson G., Kumar K., McCouch S., Juretic N., Hoen D., Wright S.,
RA Bruskiewich R., Bureau T., Miyao A., Hirochika H., Nishikawa T.,
RA Kadowaki K., Sugiura M., Burr B., Sasaki T.;
RT "The map-based sequence of the rice genome.";
RL Nature 436:793-800(2005).
RN [2] {ECO:0000313|Proteomes:UP000000763}
RP GENOME REANNOTATION.
RC STRAIN=cv. Nipponbare {ECO:0000313|Proteomes:UP000000763};
RX PubMed=18089549; DOI=10.1093/nar/gkm978;
RG The rice annotation project (RAP);
RT "The rice annotation project database (RAP-DB): 2008 update.";
RL Nucleic Acids Res. 36:D1028-D1033(2008).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AC134517; AAN34944.1; -; Genomic_DNA.
DR AlphaFoldDB; Q8H7J5; -.
DR Proteomes; UP000000763; Chromosome 10.
DR CDD; cd09272; RNase_HI_RT_Ty1; 1.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR025724; GAG-pre-integrase_dom.
DR InterPro; IPR013103; RVT_2.
DR PANTHER; PTHR45895:SF165; INTEGRASE CATALYTIC DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR45895; INTEGRASE CATALYTIC DOMAIN-CONTAINING PROTEIN-RELATED; 1.
DR Pfam; PF13976; gag_pre-integrs; 1.
DR Pfam; PF14223; Retrotran_gag_2; 1.
DR Pfam; PF07727; RVT_2; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
PE 4: Predicted;
FT DOMAIN 487..543
FT /note="GAG-pre-integrase"
FT /evidence="ECO:0000259|Pfam:PF13976"
FT DOMAIN 758..1000
FT /note="Reverse transcriptase Ty1/copia-type"
FT /evidence="ECO:0000259|Pfam:PF07727"
FT REGION 240..300
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 570..594
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 606..648
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 240..297
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1146 AA; 124478 MW; FBA3B76AC6FECB9B CRC64;
MASTSSGSTP SSAASTGDST GSIFVNPYAT VNVKTHIPIT LELKHPNFNK WKTFFTSMCG
KFGLLPHIDG TAPPRPDDST WAQADCCVQG WLFGSVSDAI LDVVMETDQT ARDLWLAIDD
LFQANKEPRT IYLSHEFHSM TQGDMPIADY CQKVKTAADA LRDVGHPVTE SQLVLNLLSG
LNSRFSSTAD NIASAPVLPS FASAHNTLLL KELRIANAHK VQAETTMVVA ASSANACTSG
TCASSSSSQS HGDVSNNSGR RNHSGGGNGG RQNNSNGGGY GNFGNNFQQQ QHQAGPRTTG
PWVCFNPWAV QQQQSPWRPS NSAGLLGPYP QAHTTFAGPY VSPPMPGLPP MQQSQPNWDQ
AGLIAALNQL SVQSPSPWVL DTGATSHMSS TDGILDTRLP NSYTFITVGN GHTIPVICHG
TSFLPIGTTK FDLKNILVAP SLVRNLLSIR QFTRDNNCSI EFDEFGFSVK GLRTRRVILR
CNSRGDLYTL PIAAPAIAAH SFLAQSSTLW HRRLGHPSSA AIQTLHKLAI LPCTKIDHSL
CHASIPQTAS SFDFLLQGVS KAVAPSSLDV ERPRSLPVAP SPEVEPPDDL AGPDMSQLLL
HNRRLQSSQA PEPAILAPSS ASSSADASTT GGSMSSSSPV QTISSSSIAQ PPALHGVASA
RPLISPYHHT YVRRTQPVAV HGPIRTTRAF RAAQHSQDQQ RHAMVTRSQT GNLHPAQRFN
YTAMQTSVSP VPSNYRSALA DPNWHAAMAD EYKALVDNNT WHLVPQPPGA NVVTGKWIFR
HKFHSDGTLA RHKARCVVRG YSQQHGIDYN ETFSPVVKPA TIHVVLSIAA SRSWPIHQLD
VKNAFLHGNL EETVYCQQPF GFVDPSEPTA VCLLQKSLYG LKQAPRAWYQ RFATYIRQLG
FTSSASDTSL FIYKDGDNVA YLLLYVDDIL LTASSTSLLQ HITARLHSEF AMTDLGDLHF
FLGISVTRSA DGLFLSQWQY AVDLLRRAGM SECHPTATPV DARCKLSATD GAPVSDPMEY
RSLAGALQYL TLTRPEIAYA IQQVCLFMHD PREPHLALVK RILRYIKGTL HICLHIGTAP
VDSLTAYSDA DWVGCPDSRH STSGYCVFLG ENLVSWSSKR QTTVSRSSAE AEYRAVAHAV
AECCWL
//