ID Q852C7_ORYSJ Unreviewed; 1969 AA.
AC Q852C7;
DT 01-JUN-2003, integrated into UniProtKB/TrEMBL.
DT 01-JUN-2003, sequence version 1.
DT 27-MAR-2024, entry version 107.
DE SubName: Full=Retrotransposon protein, putative, unclassified {ECO:0000313|EMBL:ABF99896.1};
GN OrderedLocusNames=LOC_Os03g63124 {ECO:0000313|EMBL:ABF99896.1};
OS Oryza sativa subsp. japonica (Rice).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; Liliopsida; Poales; Poaceae; BOP clade;
OC Oryzoideae; Oryzeae; Oryzinae; Oryza; Oryza sativa.
OX NCBI_TaxID=39947 {ECO:0000313|EMBL:ABF99896.1};
RN [1] {ECO:0000313|EMBL:ABF99896.1}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=16109971; DOI=10.1101/gr.3869505;
RG Rice Chromosome 3 Sequencing Consortium;
RA Buell C.R., Yuan Q., Ouyang S., Liu J., Zhu W., Wang A., Maiti R., Haas B.,
RA Wortman J., Pertea M., Jones K.M., Kim M., Overton L., Tsitrin T.,
RA Fadrosh D., Bera J., Weaver B., Jin S., Johri S., Reardon M., Webb K.,
RA Hill J., Moffat K., Tallon L., Van Aken S., Lewis M., Utterback T.,
RA Feldblyum T., Zismann V., Iobst S., Hsiao J., de Vazeille A.R.,
RA Salzberg S.L., White O., Fraser C., Yu Y., Kim H., Rambo T., Currie J.,
RA Collura K., Kernodle-Thompson S., Wei F., Kudrna K., Ammiraju J.S., Luo M.,
RA Goicoechea J.L., Wing R.A., Henry D., Oates R., Palmer M., Pries G.,
RA Saski C., Simmons J., Soderlund C., Nelson W., de la Bastide M.,
RA Spiegel L., Nascimento L., Huang E., Preston R., Zutavern T., Palmer L.,
RA O'Shaughnessy A., Dike S., McCombie W.R., Minx P., Cordum H., Wilson R.,
RA Jin W., Lee H.R., Jiang J., Jackson S.;
RT "Sequence, annotation, and analysis of synteny between rice chromosome 3
RT and diverged grass species.";
RL Genome Res. 15:1284-1291(2005).
RN [2] {ECO:0000313|EMBL:ABF99896.1}
RP NUCLEOTIDE SEQUENCE.
RA Buell R., Wing R.A., McCombie W.A., Ouyang S.;
RL Submitted (JUN-2006) to the EMBL/GenBank/DDBJ databases.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; DP000009; ABF99896.1; -; Genomic_DNA.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR CDD; cd09272; RNase_HI_RT_Ty1; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR025724; GAG-pre-integrase_dom.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR013103; RVT_2.
DR InterPro; IPR001878; Znf_CCHC.
DR InterPro; IPR036875; Znf_CCHC_sf.
DR PANTHER; PTHR42648:SF18; INTEGRASE CATALYTIC DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR42648; TRANSPOSASE, PUTATIVE-RELATED; 1.
DR Pfam; PF13976; gag_pre-integrs; 1.
DR Pfam; PF14223; Retrotran_gag_2; 1.
DR Pfam; PF00665; rve; 1.
DR Pfam; PF07727; RVT_2; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF57756; Retrovirus zinc finger-like domains; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
DR PROSITE; PS50158; ZF_CCHC; 1.
PE 4: Predicted;
KW Metal-binding {ECO:0000256|PROSITE-ProRule:PRU00047};
KW Zinc {ECO:0000256|PROSITE-ProRule:PRU00047};
KW Zinc-finger {ECO:0000256|PROSITE-ProRule:PRU00047}.
FT DOMAIN 279..294
FT /note="CCHC-type"
FT /evidence="ECO:0000259|PROSITE:PS50158"
FT DOMAIN 941..1107
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT REGION 291..316
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 679..728
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1171..1264
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1833..1852
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 296..316
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 679..697
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 707..728
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1185..1202
FT /note="Acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1211..1238
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 1969 AA; 220123 MW; A5872AEA110147E2 CRC64;
MEEKYSAKPF VFDGHNFVIW KARMEAYLQS QGHNVWNKVK SPYTVPDDAD ITPANMAQVD
FNYRARNAII GGISSGEFNR VQHHKSAHDM WTALCNFHEG NNDIQLVRQN QFHKEYQRFE
MHPGESIDSY FKRFGEIISK LRSVGKEFSD NDNARHLLNC LDYGVWEMKV TSITESAPLS
DLTMDKLYSK LKTHEMDVFH RKGLKHSMAL VADPSGSTSS NDSAFVYGGF SLAALHSVTE
EQLEKIPEDD LALFARKFSR AYKNVRDRKR GKTNEPFVCF ECGEPNHKRV NCPKLKKKSD
KTTKKPEGQG RKGKKDLMKK AIHKVLAALE EVQLSDIDSD DDDQEKGDKD FSGMCCLANN
EDFINLCLMA LEDKDDSSEH PEVCLDDIPS LDGSLCDDSC SDNDSVDDEL SKERMAHLMI
EISDKYRSSK YKIEKLKSEN DGMALEIARL RSMIPEEDTC STCASYLSEI NLLKDKLKSC
ALGAGNPSSA SAACSTCYEM KVDMGLLEME LKELKEKFVH DRIGRCENCP ILTSDNDELR
QQVAMLKTKN DLLESFATKE PIHSSCANCA ILETELKDAK TVIDSIKSID SCSSCISLKV
DLESAKKENS YLQQSLERFA QGKKKLNMIL DQSKVSINNQ GIGYDFAESL RIGTHEILGV
TDGMIELAQK PITFKSAGFI GNTSSSTPKT SEPKMVPMTS KSKPVELPRP KNPKQVEHKQ
NQRQTKPVEK TKYECTYCGK AGHLDFGVGR SNSWLVDSGC SRHMTGEAKW FTSLTRASGD
ETITFGDASS GRVMAKGTIK VNDKFMLKDV ALVSKLKYNL LSVSQLCDEN LEVRFKKDRS
RVLDASESPV FDISRVGRVF FANFDSSAPG PSRCLIASEN RDLFFWHRRL GHIGFDHLSR
ISGMDLIRGL PKLKVQKDLV CAPCRHGKMT SSSHKPVTMV MTDGPGQLLH MDTVGPARVQ
SVGGKWYVLV VVDDFSRYSW VYFLESKEET FGFFQSLARS LALEFPGALR AIRSDNGSEF
KNSAFESFCD SSGVEHQFSS PYVPQQNGVV ERKNRTLVEM ARTMLDEFTT PRKFWTEAIS
AACFISNRVF LRTILHKTPY ELRFGRRPKV SHLRVFGCKC FVLKSGNLDK FESRSLDGIF
LGYATHSRAY RVYVLSTNKI VETCEVTFDE ASPGARPEIS GVPDESIFVD EDSDDDDDDS
IPPPLDSTPP VQETGSPSTT SPSGDAPTTS SSAAEEIDGG TSGPTAPRHI QNRHPPDSMI
GGLGERVTRN RSYELVNSAF VASFEPKNVC HALSDENWVN AMHEELENFE RNKVWSLVEP
PLGFNVIGTK WVFKNKLGED GSIVRNKARL VAQGFTQVEG LDFEETFAPV ARLEAIRILL
AFAASKGFKL FQMDVKSAFL NGVIEEEVYV KQPPGFENPK FPNHVFKLEK ALYGLKQAPR
AWYERLKTFL LQNGFEMGAV DKTLFTLHSG IDFLLVQIYV DDIIFGGSSH ALVAQFSDVM
SREFEMSMMG ELTFFLGLQI KQTKEGIFVH QTKYSKELLK KFDMADCKPI ATPMATTSSL
GPDEDGEEVD QREYRSMIGS LLYLTASRPD IHFSVCLCAR FQASPRTSHR QAVKRIFRYI
KSTLEYGIWY SCSSALSVRA FSDADFAGCK IDRKSTSGTC HFLGTSLVSW SSRKQSSVAQ
STAEAEYVAA ASACSQVLWM ISTLKDYGLS FSGVPLLCDN TSAINIAKNP VQHSRTKHIE
IRYHFLRDNV EKGTIVLEFV ESEKQLADIF TKPLDRSRFE FLRSELGVIG VDLGQRRIGT
RLKEELRSKA IKDAGEAKPI QPFHVAQDGE WLGSAGLHRP SSDGPRRRPM AHGLRRCRQL
ACARTRAGKP YAATSVSRDS IGGGCRSAGP TRDFASARGS GSAGRVAASG WLGAGERALR
KRRRNAHDAA GQRGKFGFVK TESTRIHQQE DHVRVRVDEV AIIWSFSFE
//