ID B9G2F3_ORYSJ Unreviewed; 521 AA.
AC B9G2F3;
DT 24-MAR-2009, integrated into UniProtKB/TrEMBL.
DT 24-MAR-2009, sequence version 1.
DT 27-MAR-2024, entry version 58.
DE RecName: Full=Gypsy retrotransposon integrase-like protein 1 {ECO:0000256|ARBA:ARBA00039658};
GN ORFNames=OsJ_28580 {ECO:0000313|EMBL:EEE69299.1};
OS Oryza sativa subsp. japonica (Rice).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; Liliopsida; Poales; Poaceae; BOP clade;
OC Oryzoideae; Oryzeae; Oryzinae; Oryza; Oryza sativa.
OX NCBI_TaxID=39947 {ECO:0000313|EMBL:EEE69299.1};
RN [1] {ECO:0000313|EMBL:EEE69299.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=15685292; DOI=10.1371/journal.pbio.0030038;
RA Yu J., Wang J., Lin W., Li S., Li H., Zhou J., Ni P., Dong W., Hu S.,
RA Zeng C., Zhang J., Zhang Y., Li R., Xu Z., Li S., Li X., Zheng H., Cong L.,
RA Lin L., Yin J., Geng J., Li G., Shi J., Liu J., Lv H., Li J., Wang J.,
RA Deng Y., Ran L., Shi X., Wang X., Wu Q., Li C., Ren X., Wang J., Wang X.,
RA Li D., Liu D., Zhang X., Ji Z., Zhao W., Sun Y., Zhang Z., Bao J., Han Y.,
RA Dong L., Ji J., Chen P., Wu S., Liu J., Xiao Y., Bu D., Tan J., Yang L.,
RA Ye C., Zhang J., Xu J., Zhou Y., Yu Y., Zhang B., Zhuang S., Wei H.,
RA Liu B., Lei M., Yu H., Li Y., Xu H., Wei S., He X., Fang L., Zhang Z.,
RA Zhang Y., Huang X., Su Z., Tong W., Li J., Tong Z., Li S., Ye J., Wang L.,
RA Fang L., Lei T., Chen C., Chen H., Xu Z., Li H., Huang H., Zhang F., Xu H.,
RA Li N., Zhao C., Li S., Dong L., Huang Y., Li L., Xi Y., Qi Q., Li W.,
RA Zhang B., Hu W., Zhang Y., Tian X., Jiao Y., Liang X., Jin J., Gao L.,
RA Zheng W., Hao B., Liu S., Wang W., Yuan L., Cao M., McDermott J.,
RA Samudrala R., Wang J., Wong G.K., Yang H.;
RT "The genomes of Oryza sativa: a history of duplications.";
RL PLoS Biol. 3:266-281(2005).
RN [2] {ECO:0000313|EMBL:EEE69299.1}
RP NUCLEOTIDE SEQUENCE.
RA Wang J., Li R., Fan W., Huang Q., Zhang J., Zhou Y., Hu Y., Zi S., Li J.,
RA Ni P., Zheng H., Zhang Y., Zhao M., Hao Q., McDermott J., Samudrala R.,
RA Kristiansen K., Wong G.K.-S.;
RT "Improved gene annotation of the rice (Oryza sativa) genomes.";
RL Submitted (DEC-2008) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; CM000146; EEE69299.1; -; Genomic_DNA.
DR AlphaFoldDB; B9G2F3; -.
DR Proteomes; UP000007752; Chromosome 9.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0004523; F:RNA-DNA hybrid ribonuclease activity; IEA:InterPro.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR CDD; cd09279; RNase_HI_like; 1.
DR Gene3D; 1.10.340.70; -; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 2.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR041588; Integrase_H2C2.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR002156; RNaseH_domain.
DR InterPro; IPR036397; RNaseH_sf.
DR PANTHER; PTHR47266; ENDONUCLEASE-RELATED; 1.
DR PANTHER; PTHR47266:SF28; GYPSY RETROTRANSPOSON INTEGRASE-LIKE PROTEIN 1; 1.
DR Pfam; PF17921; Integrase_H2C2; 1.
DR Pfam; PF00665; rve; 1.
DR Pfam; PF13456; RVT_3; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 2.
DR PROSITE; PS50994; INTEGRASE; 1.
DR PROSITE; PS50879; RNASE_H_1; 1.
PE 4: Predicted;
FT DOMAIN 36..165
FT /note="RNase H type-1"
FT /evidence="ECO:0000259|PROSITE:PS50879"
FT DOMAIN 322..488
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
SQ SEQUENCE 521 AA; 58973 MW; 42C2663A4E589004 CRC64;
MSLDISFKPR ISVKSQALAD FVAEWTECQE DTPAENMEHW TMHFDGSKRL SGTGAGVVLI
SPTGERLSYV LWIHFLASHN VAEYEALLHG LRIAISLGIK RLIVRGDSQL VVNQVMKEWS
CLDDNMMAYR QEVRKLEDKF DGLELSHVLR HNNEAADRLA NFGSKREAAP SDVFVEHLYT
PTVPHKDITQ DADTHDVAMV EVDWREPLIR FLTSQELPQD KDEAERISRR SKLYVLHEAE
LYKKSPSGIL QRCVSLEEGR QLLKDIHSGI CGNHAAARTI VGKAYRQGFF WPTAVSDADK
IVRTCEGCQF FARQIHLPAQ ELQTIPLSWP FAVWGLDMVG PFKKAVGGYT HLFVAIDKFS
KWIEAKPVVT ITADNARDFF ISIVHRFGVP NRIITDNGTQ FTGGVFKDFC EDFGIKICYA
SVAHPMSNGQ VERANGMILQ GIKARVFDRL KPYAGKWVQQ LPSVLWSLRT TPSRATGQSP
FFLVYGAEAM LPSEVEFESL RFRNFREERY ERGPIGLLAP I
//