ID Q2QT15_ORYSJ Unreviewed; 1889 AA.
AC Q2QT15;
DT 24-JAN-2006, integrated into UniProtKB/TrEMBL.
DT 24-JAN-2006, sequence version 1.
DT 27-MAR-2024, entry version 76.
DE SubName: Full=Retrotransposon protein, putative, Ty3-gypsy subclass {ECO:0000313|EMBL:ABA97854.1};
GN OrderedLocusNames=LOC_Os12g22200 {ECO:0000313|EMBL:ABA97854.1};
OS Oryza sativa subsp. japonica (Rice).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; Liliopsida; Poales; Poaceae; BOP clade;
OC Oryzoideae; Oryzeae; Oryzinae; Oryza; Oryza sativa.
OX NCBI_TaxID=39947 {ECO:0000313|EMBL:ABA97854.1};
RN [1] {ECO:0000313|EMBL:ABA97854.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=16188032; DOI=10.1186/1741-7007-3-20;
RG The rice chromosomes 11 and 12 sequencing consortia;
RT "The sequence of rice chromosomes 11 and 12, rich in disease resistance
RT genes and recent gene duplications.";
RL BMC Biol. 3:20-20(2005).
RN [2] {ECO:0000313|EMBL:ABA97854.1}
RP NUCLEOTIDE SEQUENCE.
RA Buell C.R., Wing R.A., McCombie W.A., Ouyang S.;
RL Submitted (APR-2005) to the EMBL/GenBank/DDBJ databases.
RN [3] {ECO:0000313|EMBL:ABA97854.1}
RP NUCLEOTIDE SEQUENCE.
RA Buell R.;
RL Submitted (JAN-2006) to the EMBL/GenBank/DDBJ databases.
CC -!- SUBCELLULAR LOCATION: Nucleus {ECO:0000256|ARBA:ARBA00004123}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; DP000011; ABA97854.1; -; Genomic_DNA.
DR GO; GO:0046872; F:metal ion binding; IEA:UniProtKB-KW.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0004523; F:RNA-DNA hybrid ribonuclease activity; IEA:InterPro.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR GO; GO:0006310; P:DNA recombination; IEA:UniProtKB-KW.
DR CDD; cd00303; retropepsin_like; 1.
DR CDD; cd09279; RNase_HI_like; 1.
DR CDD; cd01647; RT_LTR; 1.
DR Gene3D; 1.10.340.70; -; 1.
DR Gene3D; 3.10.20.370; -; 1.
DR Gene3D; 3.30.70.270; -; 2.
DR Gene3D; 2.40.70.10; Acid Proteases; 1.
DR Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 2.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR041588; Integrase_H2C2.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR InterPro; IPR005162; Retrotrans_gag_dom.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR002156; RNaseH_domain.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR000477; RT_dom.
DR InterPro; IPR041577; RT_RNaseH_2.
DR PANTHER; PTHR37984:SF7; INTEGRASE CATALYTIC DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR37984; PROTEIN CBG26694; 1.
DR Pfam; PF17921; Integrase_H2C2; 1.
DR Pfam; PF03732; Retrotrans_gag; 1.
DR Pfam; PF17919; RT_RNaseH_2; 1.
DR Pfam; PF00665; rve; 1.
DR Pfam; PF00078; RVT_1; 1.
DR Pfam; PF13456; RVT_3; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 2.
DR PROSITE; PS50994; INTEGRASE; 1.
DR PROSITE; PS50879; RNASE_H_1; 1.
PE 4: Predicted;
KW DNA recombination {ECO:0000256|ARBA:ARBA00023172};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723}.
FT DOMAIN 1396..1525
FT /note="RNase H type-1"
FT /evidence="ECO:0000259|PROSITE:PS50879"
FT DOMAIN 1672..1838
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT REGION 1..32
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 71..128
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 509..532
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 552..617
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 90..104
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 105..128
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 555..569
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 570..584
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 595..611
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1889 AA; 212848 MW; FB9A2B7D670EB041 CRC64;
MAKTSTAAGG NETTQEERIA GAVVEETGKT PMSDVELAEL VQAQGAVMVS KGQYEELQKE
LQRLQTLHNQ TVGAGGSSDQ CAIPGAREKQ SDQAKGDAQG ENKEGESIKA TQAQDLPRTQ
APSQVQNPSQ AQIPILTQNT PQAQITIPTQ NLAQTLIPSQ NQIPIQTHIS FQPHIPIQTQ
NPNQVQYPQT QLHQAHPVLP NQIQIPQPHI LQSNIPRTQP PQNDTPQIQV PQTQIPQIQT
SLPNFTHTQS TNTQAAQNHH QVPQGFSLFH MQQPEMMFEQ GYIPQMANHI QYAGQIPNQA
FQFQPTQPYF QPTRAIDTKN PLSQNLQMAP WPINFKLSNI TKYKGDTDPN EYLRVYETAV
EAAGGDDTTK AKILPTMLEG VALSWYTTIP PMTIYSWEHM RDTFRAGFIG AYEEPKEADD
LYAMKQLPGE TLRSFIVKFS RVRCQIRHVD DEMLIAAAKR ALLPGPLRFD LARNRPKTAK
DLFERMESFA RGEEDELRVQ EEEAVLLGKK QSKNKQISQG EEQKGENTGK PWKKFKYDYK
QDQKKQVNFI GDGYNNEREK GKHQWDNTRR GRNNWGQSGK GRGQWWNSGR GRGRWWNNER
GKGRENKPDQ TKFCQTHGPG GHSTEECYSK FCHIHGPGGH STEECRQMTH LLEKHVNRYE
DKYEGARDQR GQNAIEAPQV MKIEAIEEVP KRVINAITGG SSLGVESKRQ RKAYVRQVNH
VGTSYQSNPP VYSKTVISFG PEDAEGILFP HQDPLVVSVE IAQCEVQRVL IDGGSSADVL
FYDAFKKMQI PEDRLTNAGV PLQGFGGQQV HAIGKISLQV VFGKGTNVRK EEIVFDVVDM
PYQYNAILGR STINIFEAII HHNYICMKLP GPRGVITVRG EQLAARKYEL QGTPSVKGVH
VVDQKQGEYI KIQKPIPEGK TKKVQLDEHD PGKFILIGEN LEKHIEEEIL KVVKENMAVF
AWSPDELQGV DRSLIEHNLA IKSGYKPKKQ KLRRMSTDRQ QAAKIELEKL LKAKVIREVM
HPEWLANPVL VKKANGKWRM CIDFTDLNKA CPKDDFPLPR IDQLVDATAG CELMSFLDAY
SGYHQVFMVK EDEEKTSFIT PFGSYCFIRM PFGLKNAGAT FARLIGKVLA KQLGRNVEAY
VDDIVVKSKQ AFTHGKDLQE MFENLRKCSV KLNPEKGIEA NPDKIAAIHQ MEPPRNTREV
QRLTGRMASL SRFLSKSAEK GLPFFKTLRG ANTFEWTAEC QQAFDDLKKY LHEMPTLASP
PKGQPLLMYV AATPATVSAV LVQEEENRQV PVYFVSEALQ GPKTRYSEVE KLIYAIVMAS
RKLRHYFLSH DITIPSAYPI GEVLTNKEVA GRIAKWAMEL LPFDLKYISR TAIKSQVLAD
FVAEWTPNEV EQQEEVEKPW IVFSDGACNA AGAGAAAVVK TPMKQTLKYS VQLAFPSTNN
TAEYEGVLLA MRKARALRAR RLIVKTDSKL VGGHFSKSFE VKEETMAKYL EEARLNEKHF
LGITVKAITR EENGEADELA KAAATGQPLE NSFFDIITQP SYEKKEVACI QREGDWREPI
LKYLVSAQLP EKEEEAKRIQ LTSKKYKVVE GQLYKSGVTA PLLKCVTREE GMKMVVEIHE
GLCGAHQAPR SVASKVIRQG IYWPTIMKDT EKYIKTCKAC QKFGPMTKAP PKELQPIPLV
WPFYRWGIDI VGPLPRAKGD LRFVIVAIEY FSRWIEAEAV ARITSAAVQK FVWKNVICRF
GIPKEIVCDN GKQFESGKFQ DMCKGLNLQI NFASVGHPQT NGAVERANGK IMEAIKKRLE
GSAKGKWPED LLSVLWALRT TVVRSTGMTP FRLVYGDEAM TPSKVGAHSP RMIFDQKDEE
GREITLEMLD EIRVEALEKM ASYIEGTKS
//