ID Q10CN4_ORYSJ Unreviewed; 1460 AA.
AC Q10CN4;
DT 22-AUG-2006, integrated into UniProtKB/TrEMBL.
DT 22-AUG-2006, sequence version 1.
DT 27-MAR-2024, entry version 69.
DE SubName: Full=Retrotransposon protein, putative, unclassified, expressed {ECO:0000313|EMBL:ABF98943.1};
GN OrderedLocusNames=LOC_Os03g54804 {ECO:0000313|EMBL:ABF98943.1};
OS Oryza sativa subsp. japonica (Rice).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; Liliopsida; Poales; Poaceae; BOP clade;
OC Oryzoideae; Oryzeae; Oryzinae; Oryza; Oryza sativa.
OX NCBI_TaxID=39947 {ECO:0000313|EMBL:ABF98943.1};
RN [1] {ECO:0000313|EMBL:ABF98943.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=16109971; DOI=10.1101/gr.3869505;
RG Rice Chromosome 3 Sequencing Consortium;
RA Buell C.R., Yuan Q., Ouyang S., Liu J., Zhu W., Wang A., Maiti R., Haas B.,
RA Wortman J., Pertea M., Jones K.M., Kim M., Overton L., Tsitrin T.,
RA Fadrosh D., Bera J., Weaver B., Jin S., Johri S., Reardon M., Webb K.,
RA Hill J., Moffat K., Tallon L., Van Aken S., Lewis M., Utterback T.,
RA Feldblyum T., Zismann V., Iobst S., Hsiao J., de Vazeille A.R.,
RA Salzberg S.L., White O., Fraser C., Yu Y., Kim H., Rambo T., Currie J.,
RA Collura K., Kernodle-Thompson S., Wei F., Kudrna K., Ammiraju J.S., Luo M.,
RA Goicoechea J.L., Wing R.A., Henry D., Oates R., Palmer M., Pries G.,
RA Saski C., Simmons J., Soderlund C., Nelson W., de la Bastide M.,
RA Spiegel L., Nascimento L., Huang E., Preston R., Zutavern T., Palmer L.,
RA O'Shaughnessy A., Dike S., McCombie W.R., Minx P., Cordum H., Wilson R.,
RA Jin W., Lee H.R., Jiang J., Jackson S.;
RT "Sequence, annotation, and analysis of synteny between rice chromosome 3
RT and diverged grass species.";
RL Genome Res. 15:1284-1291(2005).
RN [2] {ECO:0000313|EMBL:ABF98943.1}
RP NUCLEOTIDE SEQUENCE.
RA Buell R., Wing R.A., McCombie W.A., Ouyang S.;
RL Submitted (JUN-2006) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; DP000009; ABF98943.1; -; Genomic_DNA.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR CDD; cd09272; RNase_HI_RT_Ty1; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR Gene3D; 4.10.60.10; Zinc finger, CCHC-type; 1.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR025724; GAG-pre-integrase_dom.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR013103; RVT_2.
DR InterPro; IPR001878; Znf_CCHC.
DR InterPro; IPR036875; Znf_CCHC_sf.
DR PANTHER; PTHR42648:SF22; RETROVIRUS-RELATED POL POLYPROTEIN FROM TRANSPOSON TNT 1-94; 1.
DR PANTHER; PTHR42648; TRANSPOSASE, PUTATIVE-RELATED; 1.
DR Pfam; PF13976; gag_pre-integrs; 1.
DR Pfam; PF14223; Retrotran_gag_2; 1.
DR Pfam; PF00665; rve; 1.
DR Pfam; PF07727; RVT_2; 1.
DR Pfam; PF00098; zf-CCHC; 1.
DR SMART; SM00343; ZnF_C2HC; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF57756; Retrovirus zinc finger-like domains; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
DR PROSITE; PS50158; ZF_CCHC; 1.
PE 4: Predicted;
KW Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00047};
KW Zinc {ECO:0000256|PROSITE-ProRule:PRU00047};
KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00047}.
FT DOMAIN 242..257
FT /note="CCHC-type"
FT /evidence="ECO:0000259|PROSITE:PS50158"
FT DOMAIN 485..650
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT REGION 188..239
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 729..774
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1315..1384
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1436..1460
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 197..239
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 729..751
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 752..774
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1324..1340
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1460 AA; 162903 MW; 03F2431D8E97C830 CRC64;
MAGFADALRP DKFTGVHFKR WQIRVTLWLT AMKCFWVSTG KPEGVLTAEQ QKQFEEATTL
FVGCILSVLG DRLVEVYMHM TDAKELWDAL NTKFGATDAS NDLYIMEQFH DYKMADNRSV
VEQAHEIQTM AKELELLKCV LPDKFVAGCI IAKLPPSWRS FGTALKHKRQ EYSVEGLIAS
LDVEEKAREK DAASKGDGGQ SSANVVHKAQ NKSKGKYKAQ QTTNFKKQKK NNNNPNQDER
TCFVCGQVGH LARKCPQRKG MKAPAGQTSK SANVTIGNTG DGSGYGNLPT VFSVNQSTNW
WVDTGANVHV CADISLFSSY QVARGSTVLM GNGSHASVHG VGTVDLKFTS GKIVQLKNVQ
HVPSIDRNLV SGSRLTRDGF KLVFESNKVV VSKHGYFIGK GYECGGLFRF SLSDFCNKSV
NHICGSVDDE ANVWHSRLCH INFGLMSRLS SMCLIPKFSI VKGSKCHSCV QSKQPRKPHK
AAEERNLAPL ELLHSDLCEM NGVLTKGGKR YFMTLIDDAT RFCYVYLLKT KDEALDYFKI
YKAEVENQLD RKIKRLRSDR GGEFFSNEFD LFCEEHGIIH ERTPPYSPES NGIAERKNRT
LTDLVNAMLD TAGLPKAWWG EALLTSNHVL NRVPNRNKDK TPYEIWIGRK PSLSYLRTWG
CLAKVNVPIT KKRKLGPKTV DCVFLGYAHH SIAYRFLIVK SEVPDMHVGT IMESRDATFF
ESFFPMKDTH SGSNQPSEII PSSITPPEQT EHTHELVSEE DVSEAPRRSK RQRTAKSFGD
DFTVYLVDDT PKSISEAYAS PDADYWKEAV RSEMDSIIAN GTWEVTERPY GCKPVGCKWV
FKKKLRPDGT IEKYKARLVA KGYTQKEGED FFDTYSPVAR LTTIRVLLSL AASHGLLVHQ
MDVKTAFLNG ELDEEIYMDQ PDGFVLEGQE GKVCKLLKSL YGLKQAPKQW HEKFDKTLTS
AGFAVNEADK CVYYRHGGGE GVILCLYVDD ILIFGTNLEV INEVKSFLSQ NFDMKDLGVA
DVILNIKLIR GENGITLLQS HYVEKILNRF GYIDSKPSPT PYDPSLSLRK NKRIARNQLE
YSQIIGSLMY LASATRPDIS FAVSKLSRFT SNPGDDHWRA LERVMRYLKG TVELGLHYTG
YPAVLEGYSD SNWISDVDEI KATSGYVFTL GGGAVSWRSC KQTILTRSTM EAELTALDTA
TVEAEWLRDL LMDLPVVEKP VPAILMNCDN QTTCEKTIEV CQEIKKLRSY NVGLHPNSEK
PGRSLHEGTI TKCDRQCIEG DGFETHSILE GTHLCELDCW SQSMKILGES SRKLTKDLGR
QHQPRIEQQQ SNQQRSKQQW WWRPAGGDGF GGGSSKSAAG GRFGCGSARN SARTPGGGED
SKTAVRSIPI LLEDDDGGGT DSVVAQANPP PVVVWQSREH IQWWRRGRGE LIAAVDGGSG
CGSARSSARM PSGGEDSKTR
//