ID Q10JS2_ORYSJ Unreviewed; 1659 AA.
AC Q10JS2;
DT 22-AUG-2006, integrated into UniProtKB/TrEMBL.
DT 22-AUG-2006, sequence version 1.
DT 27-MAR-2024, entry version 74.
DE SubName: Full=Retrotransposon protein, putative, Ty3-gypsy subclass {ECO:0000313|EMBL:ABF96555.1};
GN OrderedLocusNames=LOC_Os03g29650 {ECO:0000313|EMBL:ABF96555.1};
OS Oryza sativa subsp. japonica (Rice).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; Liliopsida; Poales; Poaceae; BOP clade;
OC Oryzoideae; Oryzeae; Oryzinae; Oryza; Oryza sativa.
OX NCBI_TaxID=39947 {ECO:0000313|EMBL:ABF96555.1};
RN [1] {ECO:0000313|EMBL:ABF96555.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=16109971; DOI=10.1101/gr.3869505;
RG Rice Chromosome 3 Sequencing Consortium;
RA Buell C.R., Yuan Q., Ouyang S., Liu J., Zhu W., Wang A., Maiti R., Haas B.,
RA Wortman J., Pertea M., Jones K.M., Kim M., Overton L., Tsitrin T.,
RA Fadrosh D., Bera J., Weaver B., Jin S., Johri S., Reardon M., Webb K.,
RA Hill J., Moffat K., Tallon L., Van Aken S., Lewis M., Utterback T.,
RA Feldblyum T., Zismann V., Iobst S., Hsiao J., de Vazeille A.R.,
RA Salzberg S.L., White O., Fraser C., Yu Y., Kim H., Rambo T., Currie J.,
RA Collura K., Kernodle-Thompson S., Wei F., Kudrna K., Ammiraju J.S., Luo M.,
RA Goicoechea J.L., Wing R.A., Henry D., Oates R., Palmer M., Pries G.,
RA Saski C., Simmons J., Soderlund C., Nelson W., de la Bastide M.,
RA Spiegel L., Nascimento L., Huang E., Preston R., Zutavern T., Palmer L.,
RA O'Shaughnessy A., Dike S., McCombie W.R., Minx P., Cordum H., Wilson R.,
RA Jin W., Lee H.R., Jiang J., Jackson S.;
RT "Sequence, annotation, and analysis of synteny between rice chromosome 3
RT and diverged grass species.";
RL Genome Res. 15:1284-1291(2005).
RN [2] {ECO:0000313|EMBL:ABF96555.1}
RP NUCLEOTIDE SEQUENCE.
RA Buell R., Wing R.A., McCombie W.A., Ouyang S.;
RL Submitted (JUN-2006) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; DP000009; ABF96555.1; -; Genomic_DNA.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0004523; F:RNA-DNA hybrid ribonuclease activity; IEA:InterPro.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR CDD; cd09279; RNase_HI_like; 1.
DR CDD; cd01647; RT_LTR; 1.
DR Gene3D; 1.10.340.70; -; 1.
DR Gene3D; 3.30.70.270; -; 1.
DR Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 2.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR005162; Retrotrans_gag_dom.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR002156; RNaseH_domain.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR000477; RT_dom.
DR InterPro; IPR041577; RT_RNaseH_2.
DR PANTHER; PTHR37984:SF7; INTEGRASE CATALYTIC DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR37984; PROTEIN CBG26694; 1.
DR Pfam; PF03732; Retrotrans_gag; 1.
DR Pfam; PF17919; RT_RNaseH_2; 1.
DR Pfam; PF00078; RVT_1; 1.
DR Pfam; PF13456; RVT_3; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 2.
DR PROSITE; PS50994; INTEGRASE; 1.
DR PROSITE; PS50879; RNASE_H_1; 1.
PE 4: Predicted;
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723}.
FT DOMAIN 1097..1226
FT /note="RNase H type-1"
FT /evidence="ECO:0000259|PROSITE:PS50879"
FT DOMAIN 1375..1533
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT REGION 109..128
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 233..303
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 508..564
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 602..650
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 109..125
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 511..537
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 610..636
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1659 AA; 190250 MW; D85C407410BFDEFB CRC64;
MAVIFQLGTG YRIKIEYLHE GSVGAPRRGS VKFLPTKCDG FRLQHRRLRL PPGQAFRFGS
LNFITNNFGK LSLLDSDSNQ SGRNQVLSPF GIPNSADVYS KIVSPELVSN HSDEIQSTPQ
RPDQDDGSYP SILMRLPDDL ATVFTAKASP VRRSRRASAS APIQSRSREV GVILQSLGTV
STEELDGYLS SPDVSSRPTE ILDYGDFDNF DDSYEDNYTP LFCGVFMADN ETEEQRRARE
ADEERTRQET ERRRLEEERQ RQERERLQHE DHDRCLASSQ LRKHPHPDWE RRVERPHSPH
RRRPVDLRDT INQLRAARGN HSPDRYDDDM DGVAAFTSEL RRVDWPADFK PTGIEKYDGT
TNPESWLTVY GLAVRAVGGD SKAMANYLPV ALADSARSWL HGLPRGTIGS WAELRDHFIA
NFQGTFERPG THFDLYNIVQ KSGESLRDYI RRFSKQRNKI SDITDDVIIA AFTKGIRHED
LVGKFGRKPP KTVKQMFEKA NEYAKAEDAI TASKQSGTTW KPKKDTPTAG GSGSNNHSNN
HKDRKRKPEE LVAITSPSSR QRSRVNTFNK IMNSQCLHHP NSNHVAKDCF VYKQFAEQYV
KNARKPSDGD QGTSNKKDDE DDAPTGFQDH RKELNHIFGG PLSYESKRKQ KLTEREINAV
QPNTPQDLRW SEIAIKFDRS DHPDRVVHPG RYPLNNKDIF AWKPSDMPGI PREVIEHSLH
VKEDTKPIKQ RLRRFAQDMK DAIKEELTKL LAAGFIKEVL HPDWLANPVL VRKKTGQWRM
CVDYTDLNKS CPKDPFGLPR IDQVIDSTAG CELLSFLDCY SGYHQIRLKE SDCLKTSFIT
PFGAYCYITM PFGLKNAGAT YQRMIQRCFS TQIGRNVEAY VDDVVIKTKQ KDDLIADLEE
TFASIRAFRM KLNPEKCIFR VPSGKLLRFM WGPEAEKAFP DFKKLLTTPP VLASPRPQEP
LLLYVSGTSQ VVSMVLVVER EEEGHIQKVQ RPIYFVSDVL ANSKTRYPQV QKLLYGVLIT
VRKLSHYFQS HSVTVVTSFP LGDILHNREA NGRIAKWALE LMPLDISFKP RTLIKSQALA
DYLTEWTEYQ EDMPEEKMEY WTMHFDGSKR LTGAGAGVVL ISPTGERLSY VLWIHFSASH
NMAEYEALLH GLRIAISLGI RRLIVRGDSQ LVVNQVMKEW SCQDDNMTAY RQEVRKLEDK
FDGLELTHVL RHNNEAADRL ANFGSKREAI PSDVFVEHLY EPTVPRKEKI EAMNTQGVSM
IEADWREPLI RFLTKQELPQ DKNEAEQISR RSRLYVIHKT ELYKKSPSGI LQRCVSLEEG
RQLQKDIHSG ICGNHAAART IVGKAYRQGF FWPKAVSDAD KIVRTCEGCQ FFARQTHLPA
QELQTIPLSW PFAVWGLDML WKKFSKWIEA KPVVTITANK ARDFFINIVH RFGVPNRIIT
DNGTQFTGGI FKDFCEDFGI KICYASVSHP MSNGQVKRAN GMVLQGIKAR VFDRLHPYTG
KWVEQLPSVL WSLRTTPSRA TGQSPFFLVY GAEAMLPSEV EFESLRFCNF REERYEEDRV
DDINRLEEAR EAVLIQSARY LQGLRHYHNR NVRSRAFLVG DLVLRRIQTT QDRHKLSPLW
EGPFIIAEVT RPGSYRLKRE DGTLVNNSWN IEHLRRFYA
//