ID Q7XP54_ORYSJ Unreviewed; 1264 AA.
AC Q7XP54;
DT 01-OCT-2003, integrated into UniProtKB/TrEMBL.
DT 01-MAR-2004, sequence version 2.
DT 27-MAR-2024, entry version 94.
DE SubName: Full=OSJNBa0013K16.13 protein {ECO:0000313|EMBL:CAE03764.2};
GN Name=OSJNBa0013K16.13 {ECO:0000313|EMBL:CAE03764.2};
OS Oryza sativa subsp. japonica (Rice).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; Liliopsida; Poales; Poaceae; BOP clade;
OC Oryzoideae; Oryzeae; Oryzinae; Oryza; Oryza sativa.
OX NCBI_TaxID=39947 {ECO:0000313|EMBL:CAE03764.2, ECO:0000313|Proteomes:UP000000763};
RN [1] {ECO:0000313|Proteomes:UP000000763}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=cv. Nipponbare {ECO:0000313|Proteomes:UP000000763};
RX PubMed=16100779; DOI=10.1038/nature03895;
RG International rice genome sequencing project (IRGSP);
RA Matsumoto T., Wu J., Kanamori H., Katayose Y., Fujisawa M., Namiki N.,
RA Mizuno H., Yamamoto K., Antonio B.A., Baba T., Sakata K., Nagamura Y.,
RA Aoki H., Arikawa K., Arita K., Bito T., Chiden Y., Fujitsuka N.,
RA Fukunaka R., Hamada M., Harada C., Hayashi A., Hijishita S., Honda M.,
RA Hosokawa S., Ichikawa Y., Idonuma A., Iijima M., Ikeda M., Ikeno M.,
RA Ito K., Ito S., Ito T., Ito Y., Ito Y., Iwabuchi A., Kamiya K.,
RA Karasawa W., Kurita K., Katagiri S., Kikuta A., Kobayashi H., Kobayashi N.,
RA Machita K., Maehara T., Masukawa M., Mizubayashi T., Mukai Y., Nagasaki H.,
RA Nagata Y., Naito S., Nakashima M., Nakama Y., Nakamichi Y., Nakamura M.,
RA Meguro A., Negishi M., Ohta I., Ohta T., Okamoto M., Ono N., Saji S.,
RA Sakaguchi M., Sakai K., Shibata M., Shimokawa T., Song J., Takazaki Y.,
RA Terasawa K., Tsugane M., Tsuji K., Ueda S., Waki K., Yamagata H.,
RA Yamamoto M., Yamamoto S., Yamane H., Yoshiki S., Yoshihara R., Yukawa K.,
RA Zhong H., Yano M., Yuan Q., Ouyang S., Liu J., Jones K.M., Gansberger K.,
RA Moffat K., Hill J., Bera J., Fadrosh D., Jin S., Johri S., Kim M.,
RA Overton L., Reardon M., Tsitrin T., Vuong H., Weaver B., Ciecko A.,
RA Tallon L., Jackson J., Pai G., Aken S.V., Utterback T., Reidmuller S.,
RA Feldblyum T., Hsiao J., Zismann V., Iobst S., de Vazeille A.R., Buell C.R.,
RA Ying K., Li Y., Lu T., Huang Y., Zhao Q., Feng Q., Zhang L., Zhu J.,
RA Weng Q., Mu J., Lu Y., Fan D., Liu Y., Guan J., Zhang Y., Yu S., Liu X.,
RA Zhang Y., Hong G., Han B., Choisne N., Demange N., Orjeda G., Samain S.,
RA Cattolico L., Pelletier E., Couloux A., Segurens B., Wincker P., D'Hont A.,
RA Scarpelli C., Weissenbach J., Salanoubat M., Quetier F., Yu Y., Kim H.R.,
RA Rambo T., Currie J., Collura K., Luo M., Yang T., Ammiraju J.S.S.,
RA Engler F., Soderlund C., Wing R.A., Palmer L.E., de la Bastide M.,
RA Spiegel L., Nascimento L., Zutavern T., O'Shaughnessy A., Dike S.,
RA Dedhia N., Preston R., Balija V., McCombie W.R., Chow T., Chen H.,
RA Chung M., Chen C., Shaw J., Wu H., Hsiao K., Chao Y., Chu M., Cheng C.,
RA Hour A., Lee P., Lin S., Lin Y., Liou J., Liu S., Hsing Y., Raghuvanshi S.,
RA Mohanty A., Bharti A.K., Gaur A., Gupta V., Kumar D., Ravi V., Vij S.,
RA Kapur A., Khurana P., Khurana P., Khurana J.P., Tyagi A.K., Gaikwad K.,
RA Singh A., Dalal V., Srivastava S., Dixit A., Pal A.K., Ghazi I.A.,
RA Yadav M., Pandit A., Bhargava A., Sureshbabu K., Batra K., Sharma T.R.,
RA Mohapatra T., Singh N.K., Messing J., Nelson A.B., Fuks G., Kavchok S.,
RA Keizer G., Linton E., Llaca V., Song R., Tanyolac B., Young S., Ho-Il K.,
RA Hahn J.H., Sangsakoo G., Vanavichit A., de Mattos Luiz.A.T., Zimmer P.D.,
RA Malone G., Dellagostin O., de Oliveira A.C., Bevan M., Bancroft I.,
RA Minx P., Cordum H., Wilson R., Cheng Z., Jin W., Jiang J., Leong S.A.,
RA Iwama H., Gojobori T., Itoh T., Niimura Y., Fujii Y., Habara T., Sakai H.,
RA Sato Y., Wilson G., Kumar K., McCouch S., Juretic N., Hoen D., Wright S.,
RA Bruskiewich R., Bureau T., Miyao A., Hirochika H., Nishikawa T.,
RA Kadowaki K., Sugiura M., Burr B., Sasaki T.;
RT "The map-based sequence of the rice genome.";
RL Nature 436:793-800(2005).
RN [2] {ECO:0000313|Proteomes:UP000000763}
RP GENOME REANNOTATION.
RC STRAIN=cv. Nipponbare {ECO:0000313|Proteomes:UP000000763};
RX PubMed=18089549; DOI=10.1093/nar/gkm978;
RG The rice annotation project (RAP);
RT "The rice annotation project database (RAP-DB): 2008 update.";
RL Nucleic Acids Res. 36:D1028-D1033(2008).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AL662957; CAE03764.2; -; Genomic_DNA.
DR AlphaFoldDB; Q7XP54; -.
DR Proteomes; UP000000763; Chromosome 4.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR CDD; cd09272; RNase_HI_RT_Ty1; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR Gene3D; 4.10.60.10; Zinc finger, CCHC-type; 1.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR025724; GAG-pre-integrase_dom.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR013103; RVT_2.
DR InterPro; IPR001878; Znf_CCHC.
DR InterPro; IPR036875; Znf_CCHC_sf.
DR PANTHER; PTHR42648:SF22; RETROVIRUS-RELATED POL POLYPROTEIN FROM TRANSPOSON TNT 1-94; 1.
DR PANTHER; PTHR42648; TRANSPOSASE, PUTATIVE-RELATED; 1.
DR Pfam; PF13976; gag_pre-integrs; 1.
DR Pfam; PF14223; Retrotran_gag_2; 1.
DR Pfam; PF00665; rve; 1.
DR Pfam; PF07727; RVT_2; 1.
DR Pfam; PF00098; zf-CCHC; 1.
DR SMART; SM00343; ZnF_C2HC; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF57756; Retrovirus zinc finger-like domains; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
DR PROSITE; PS50158; ZF_CCHC; 1.
PE 4: Predicted;
KW Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00047};
KW Zinc {ECO:0000256|PROSITE-ProRule:PRU00047};
KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00047}.
FT DOMAIN 242..257
FT /note="CCHC-type"
FT /evidence="ECO:0000259|PROSITE:PS50158"
FT DOMAIN 485..650
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT REGION 188..239
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 729..774
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 197..239
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 729..751
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 752..774
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1264 AA; 142098 MW; D810E9A9EDC27FB4 CRC64;
MAGFADALRP DKFTGVHFKR WQIRVTLWLT AMKCFWVSTG KPEGVLTAEQ QKQFEEATTL
FVGCILSILG DRLVEVYMHM TDAKELWDAL NTKFGATDAS NDLYIMEQFH DYKMADNRFV
VEQAHEIQTM AKELELLKCV LPDKFVAGCI IAKLPPSWRS FGTALKHKRQ EYSVEGLIAS
LDVEEKAREK DAASKGDGGQ SSANVVHKAQ NKSKGKYKAQ QTTNFKKQKK NNNNPNQDER
TCFVCGQVGH LARKCPQRKG MKAPAGQTSK SANVTIGNTG DGSGYGNLPT VFSVNQSTNW
WVDTGANVHF CADISLFSSY QVARGSTVLM GNGSHASVHG VGTVDLKFTS GKIVQLKNVQ
HVPSIDRNLV SGSRLTRNGF KLVFESNKVV VSKHGYFIGK GYECGGLFRF SLSDFCNKSV
NHICGSVDDE ANVWHSRLCH INFGLMSRLS SMCLIPKFSI VKGSKCHSCV QSKQPRKPHK
AAEERNLAPL ELLHSDLCEM NGVLTKGGKR YFMTLIDDAT RFCYVYLLKT KDEALDYFKI
YKAEVENQLD RKIKRLRSDR GGEFFSNEFD LFCEEHGIIH ERTPPYSPES NRIAERKNRT
LTDLVNAMLD TAGLPKAWWG EALLTSNHVL NRVPNRNKDK TPYEIWIGRK PSLSYLRTWG
CLAKVNVPIT KKRKLGPKTV DCVFLGYAYH SIAYRFLLVK SEVPDMHVGT IMESRDATFF
ESFFPMKDTH SGSNQPSEII PSSITPPEQT EHTHELVSEE DVSEAPRRSK RQRTAKSFGD
DFTVYLVDDT PKSISEAYAS PDADYWKEAV RSEMDSIIAN GTWEVIERPY GCKPVGYKWV
FKKKLRPDGT IENTSHGLLV HQMDVKTAFL NGELDEEIYM DQPDGFVVEG QEGKVCKLLK
SLYGLKQAPK QWHEKFDKTL TSAGFAVNEA DKCVYYRHGG GEGVILCLYV DDILIFGTNL
EVINEVKSFL SQNFDMKDLG VADIILNIKL IRGENGITLL QSHYVEKILN RFGYIDSKPS
PTPYDPSLLL RKNKRIARNQ LEYSQIIGSL MYLASATRPD ISFAVSKLSR FTSNPGDDHW
RALERVMRYL KGTVGLGVHY TGYPAVLEGY SDSNWISDVD EIKATSGYVF TLGGGAVSWR
SCKQTILTRS TMEAELTALD TATVEAEWLR DLLMDLPVVE KPVPAILMNC DNQTVIVKVN
SSKDNMKSSR HVKRRLKSVR KLRNSGVITL DYIQTARNLA DPFTKGLSRN VIDNASKEMG
LRPM
//