ID Q7XN29_ORYSJ Unreviewed; 1563 AA.
AC Q7XN29; Q7X8P8;
DT 01-OCT-2003, integrated into UniProtKB/TrEMBL.
DT 01-MAR-2004, sequence version 2.
DT 27-MAR-2024, entry version 87.
DE SubName: Full=OSJNBa0021F22.2 protein {ECO:0000313|EMBL:CAE03708.2};
DE SubName: Full=OSJNBa0083I11.15 protein {ECO:0000313|EMBL:CAE04305.2};
GN ORFNames=OSJNBa0021F22.2 {ECO:0000313|EMBL:CAE03708.2},
GN OSJNBa0083I11.15 {ECO:0000313|EMBL:CAE04305.2};
OS Oryza sativa subsp. japonica (Rice).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; Liliopsida; Poales; Poaceae; BOP clade;
OC Oryzoideae; Oryzeae; Oryzinae; Oryza; Oryza sativa.
OX NCBI_TaxID=39947 {ECO:0000313|EMBL:CAE04305.2, ECO:0000313|Proteomes:UP000000763};
RN [1] {ECO:0000313|EMBL:CAE04305.2}
RP NUCLEOTIDE SEQUENCE.
RX PubMed=12447439; DOI=10.1038/nature01183;
RA Feng Q., Zhang Y., Hao P., Wang S., Fu G., Huang Y., Li Y., Zhu J., Liu Y.,
RA Hu X., Jia P., Zhang Y., Zhao Q., Ying K., Yu S., Tang Y., Weng Q.,
RA Zhang L., Lu Y., Mu J., Lu Y., Zhang L.S., Yu Z., Fan D., Liu X., Lu T.,
RA Li C., Wu Y., Sun T., Lei H., Li T., Hu H., Guan J., Wu M., Zhang R.,
RA Zhou B., Chen Z., Chen L., Jin Z., Wang R., Yin H., Cai Z., Ren S., Lv G.,
RA Gu W., Zhu G., Tu Y., Jia J., Zhang Y., Chen J., Kang H., Chen X., Shao C.,
RA Sun Y., Hu Q., Zhang X., Zhang W., Wang L., Ding C., Sheng H., Gu J.,
RA Chen S., Ni L., Zhu F., Chen W., Lan L., Lai Y., Cheng Z., Gu M., Jiang J.,
RA Li J., Hong G., Xue Y., Han B.;
RT "Sequence and analysis of rice chromosome 4.";
RL Nature 420:316-320(2002).
RN [2] {ECO:0000313|Proteomes:UP000000763}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Nipponbare {ECO:0000313|Proteomes:UP000000763};
RX PubMed=16100779; DOI=10.1038/nature03895;
RG International rice genome sequencing project (IRGSP);
RA Matsumoto T., Wu J., Kanamori H., Katayose Y., Fujisawa M., Namiki N.,
RA Mizuno H., Yamamoto K., Antonio B.A., Baba T., Sakata K., Nagamura Y.,
RA Aoki H., Arikawa K., Arita K., Bito T., Chiden Y., Fujitsuka N.,
RA Fukunaka R., Hamada M., Harada C., Hayashi A., Hijishita S., Honda M.,
RA Hosokawa S., Ichikawa Y., Idonuma A., Iijima M., Ikeda M., Ikeno M.,
RA Ito K., Ito S., Ito T., Ito Y., Ito Y., Iwabuchi A., Kamiya K.,
RA Karasawa W., Kurita K., Katagiri S., Kikuta A., Kobayashi H., Kobayashi N.,
RA Machita K., Maehara T., Masukawa M., Mizubayashi T., Mukai Y., Nagasaki H.,
RA Nagata Y., Naito S., Nakashima M., Nakama Y., Nakamichi Y., Nakamura M.,
RA Meguro A., Negishi M., Ohta I., Ohta T., Okamoto M., Ono N., Saji S.,
RA Sakaguchi M., Sakai K., Shibata M., Shimokawa T., Song J., Takazaki Y.,
RA Terasawa K., Tsugane M., Tsuji K., Ueda S., Waki K., Yamagata H.,
RA Yamamoto M., Yamamoto S., Yamane H., Yoshiki S., Yoshihara R., Yukawa K.,
RA Zhong H., Yano M., Yuan Q., Ouyang S., Liu J., Jones K.M., Gansberger K.,
RA Moffat K., Hill J., Bera J., Fadrosh D., Jin S., Johri S., Kim M.,
RA Overton L., Reardon M., Tsitrin T., Vuong H., Weaver B., Ciecko A.,
RA Tallon L., Jackson J., Pai G., Aken S.V., Utterback T., Reidmuller S.,
RA Feldblyum T., Hsiao J., Zismann V., Iobst S., de Vazeille A.R., Buell C.R.,
RA Ying K., Li Y., Lu T., Huang Y., Zhao Q., Feng Q., Zhang L., Zhu J.,
RA Weng Q., Mu J., Lu Y., Fan D., Liu Y., Guan J., Zhang Y., Yu S., Liu X.,
RA Zhang Y., Hong G., Han B., Choisne N., Demange N., Orjeda G., Samain S.,
RA Cattolico L., Pelletier E., Couloux A., Segurens B., Wincker P., D'Hont A.,
RA Scarpelli C., Weissenbach J., Salanoubat M., Quetier F., Yu Y., Kim H.R.,
RA Rambo T., Currie J., Collura K., Luo M., Yang T., Ammiraju J.S.S.,
RA Engler F., Soderlund C., Wing R.A., Palmer L.E., de la Bastide M.,
RA Spiegel L., Nascimento L., Zutavern T., O'Shaughnessy A., Dike S.,
RA Dedhia N., Preston R., Balija V., McCombie W.R., Chow T., Chen H.,
RA Chung M., Chen C., Shaw J., Wu H., Hsiao K., Chao Y., Chu M., Cheng C.,
RA Hour A., Lee P., Lin S., Lin Y., Liou J., Liu S., Hsing Y., Raghuvanshi S.,
RA Mohanty A., Bharti A.K., Gaur A., Gupta V., Kumar D., Ravi V., Vij S.,
RA Kapur A., Khurana P., Khurana P., Khurana J.P., Tyagi A.K., Gaikwad K.,
RA Singh A., Dalal V., Srivastava S., Dixit A., Pal A.K., Ghazi I.A.,
RA Yadav M., Pandit A., Bhargava A., Sureshbabu K., Batra K., Sharma T.R.,
RA Mohapatra T., Singh N.K., Messing J., Nelson A.B., Fuks G., Kavchok S.,
RA Keizer G., Linton E., Llaca V., Song R., Tanyolac B., Young S., Ho-Il K.,
RA Hahn J.H., Sangsakoo G., Vanavichit A., de Mattos Luiz.A.T., Zimmer P.D.,
RA Malone G., Dellagostin O., de Oliveira A.C., Bevan M., Bancroft I.,
RA Minx P., Cordum H., Wilson R., Cheng Z., Jin W., Jiang J., Leong S.A.,
RA Iwama H., Gojobori T., Itoh T., Niimura Y., Fujii Y., Habara T., Sakai H.,
RA Sato Y., Wilson G., Kumar K., McCouch S., Juretic N., Hoen D., Wright S.,
RA Bruskiewich R., Bureau T., Miyao A., Hirochika H., Nishikawa T.,
RA Kadowaki K., Sugiura M., Burr B., Sasaki T.;
RT "The map-based sequence of the rice genome.";
RL Nature 436:793-800(2005).
RN [3] {ECO:0000313|Proteomes:UP000000763}
RP GENOME REANNOTATION.
RC STRAIN=cv. Nipponbare {ECO:0000313|Proteomes:UP000000763};
RX PubMed=18089549; DOI=10.1093/nar/gkm978;
RG The rice annotation project (RAP);
RT "The rice annotation project database (RAP-DB): 2008 update.";
RL Nucleic Acids Res. 36:D1028-D1033(2008).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AL662960; CAE03708.2; -; Genomic_DNA.
DR EMBL; AL662982; CAE04305.2; -; Genomic_DNA.
DR Proteomes; UP000000763; Chromosome 4.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR CDD; cd00303; retropepsin_like; 1.
DR Gene3D; 2.40.70.10; Acid Proteases; 1.
DR Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR025724; GAG-pre-integrase_dom.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR InterPro; IPR005162; Retrotrans_gag_dom.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR PANTHER; PTHR42648:SF18; INTEGRASE CATALYTIC DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR42648; TRANSPOSASE, PUTATIVE-RELATED; 1.
DR Pfam; PF13975; gag-asp_proteas; 1.
DR Pfam; PF13976; gag_pre-integrs; 1.
DR Pfam; PF03732; Retrotrans_gag; 1.
DR Pfam; PF00665; rve; 1.
DR SUPFAM; SSF50630; Acid proteases; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
PE 4: Predicted;
FT DOMAIN 1281..1447
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT REGION 1..43
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 84..136
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1512..1563
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 104..121
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1563 AA; 175902 MW; 3F212A0A62BE6A42 CRC64;
MAEKTAPPSP GSGKGKPPKP GLTDIIDDNV LPVDPEKFTP EQKQDFEAMM QQARDQYQSS
YMQTRKGSLV QKYKLKLVAD IPGIGSSKDG DVKRDPDGSA QPSVKGATEG SAGNQGDNLP
EVHGTPNRVY QELPGGNFSQ GLQDSFNNFQ DRIDYAVHHA LINQSGVLAN MLSNMAAPSV
APIAQPTASA STSVPAASAT AAAQLMNPRL LMREYPQPAE QITNQPTQDQ VAAMFLPPPV
VNSVQQQPIQ QAPPVQPTVQ PVQQQVAPPV QPMVQPVQQQ VAQPVQQASL PQQHFQSNQQ
TPPRQRLQLI QQTPVRHQPI QHLGSANASA GFAAPWGQLV QPFGNQATPE HLVHHIQPDG
TVIPQMVPEH LARNIQSDLH DYQGGNLSYR YQPSAIQAQY QPGGSAQQQF APQFGQFESM
QQQPQGAAQQ RQWADVIADV MRKQFGLKLK ETGSLYRQPY PEWFERVPLP NRFKIPDFSK
FSGQEGVSTY EHISRYLAQY GEASAVDALR VRMFRLSLSG SAFTWFSSLP YGSVNSWADL
EKQFHSYFYS GVHEMKLSDL TAIKQRYDEP VHEYIQRFRE MRNKCFSLSL TDAQLADLAF
QGMIPPIREK FSSEDFDSLS HLIQKVTLHE HRSADVRRSS KKVNHVCQYM YESDDEDDDS
EIAAAEWVRS KKVIPCQWVK SPGKEKKYDF DITKADKIFD LLLREKQIQL PAGHTIPSAE
ELGKKRYCKW HNSGSHSTND CKVFRQQIQV AIEGGKIKFD DSKKPMKVDG NPFPVNMVHT
ASQAADGSRA KGFQVHSAKI INKYQRRYDK QQGRRYEEND NGFDPHWDCE FFRFCWNERM
RLPSIKNCPG CSDIIESSSR PHSRVGEESA KLALSSEQAV FEKPEGTENR HLKPLYVNGY
INGKPMSKMM VDGGAAVNLM PYATFRKLGR NAEDLIKTNM VLKDFGSNPS ETKGVLNVEL
TVGNKTIPTT FFVIDGKGSY SLLLGRDWIH ANCCIPSTMH QCLIQWQGDK IEVVPADSRL
KMENPSYYFE GVVEGSEACA KDTVDDLDDK QGQGFMSADD LEEIDIGLGD RPRPTFISKG
LSSEFRTKLI ELLKEFRDCF AWEYYEMPGL SRSIVEHRLP FKPGFRPHQQ PPRRCKADML
EPVKAEIKRL YDAGFIRPCR YAEWVSSIVP VIKKNGKVRV CIDFRNLNKA TPKDEYPMPV
ADQLVDAASE NKILSFMDGN ADVMFWHRRL GHVGFDHLTR LSGLDLIRGL PKLKKDHDLV
CTQCRHAKMV STSHAPIVSV MTDAPGQLLH MDTVGPARVQ SIGGKWYVLV IVDNFSRYSW
VFFMATKDEA FQHFRGLFLR LELEFPGSLK RIRSDNGGEF NNASFEQFCN ERGLEHEFSS
PRVPQQNGVV ERKNRVLVEM ARTMLDEYKT PRKFWAEAIN TACYISNRVF LRSKLGKTSY
ELRFGHQPKV SHLRVFGCKC FVLKSGNLDK FEARSTDGLF LGYPAHTRGY RVLILGTNKI
IETCEVSFDE ASPGTRPEIA GTLSQVQGED GRIFEDESDY DDDDEVGSAG QTGRQAGQTA
ETQ
//