ID A5B1N0_VITVI Unreviewed; 2301 AA.
AC A5B1N0;
DT 12-JUN-2007, integrated into UniProtKB/TrEMBL.
DT 12-JUN-2007, sequence version 1.
DT 27-MAR-2024, entry version 58.
DE RecName: Full=RNA-directed DNA polymerase {ECO:0000256|ARBA:ARBA00012493};
DE EC=2.7.7.49 {ECO:0000256|ARBA:ARBA00012493};
GN ORFNames=VITISV_032155 {ECO:0000313|EMBL:CAN76312.1};
OS Vitis vinifera (Grape).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; Vitales; Vitaceae; Viteae; Vitis.
OX NCBI_TaxID=29760 {ECO:0000313|EMBL:CAN76312.1};
RN [1] {ECO:0000313|EMBL:CAN76312.1}
RP NUCLEOTIDE SEQUENCE.
RA Velasco R., Zharkikh A., Troggio M., Cartwright D.A., Cestaro A., Pruss D.,
RA Pindo M., FitzGerald L.M., Vezzulli S., Reid J., Malacarne G., Iliev D.,
RA Coppola G., Wardell B., Micheletti D., Macalma T., Facci M., Mitchell J.T.,
RA Perazzolli M., Eldredge G., Gatto P., Oyzerski R., Moretto M., Gutin N.,
RA Stefanini M., Chen Y., Segala C., Davenport C., Dematte L., Mraz A.,
RA Battilana J., Stormo K., Costa F., Tao Q., Si-Ammour A., Harkins T.,
RA Lackey A., Perbost C., Taillon B., Stella A., Solovyev V., Fawcett J.A.,
RA Sterck L., Vandepoele K., Grando S.M., Toppo S., Moser C., Lanchbury J.,
RA Bogden R., Skolnick M., Sgaramella V., Bhatnagar S.K., Fontana P.,
RA Gutin A., Van de Peer Y., Salamini F., Viola R.;
RT "The first genome sequence of an elite grapevine cultivar (Pinot noir Vitis
RT vinifera L.): coping with a highly heterozygous genome.";
RL PLoS ONE 2:e1326-e1326(2007).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AM443527; CAN76312.1; -; Genomic_DNA.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR CDD; cd00303; retropepsin_like; 1.
DR CDD; cd09274; RNase_HI_RT_Ty3; 1.
DR CDD; cd01647; RT_LTR; 1.
DR Gene3D; 1.10.340.70; -; 1.
DR Gene3D; 3.30.70.270; -; 2.
DR Gene3D; 2.40.70.10; Acid Proteases; 1.
DR Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR041588; Integrase_H2C2.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR InterPro; IPR005162; Retrotrans_gag_dom.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR000477; RT_dom.
DR InterPro; IPR041373; RT_RNaseH.
DR PANTHER; PTHR37984:SF7; INTEGRASE CATALYTIC DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR37984; PROTEIN CBG26694; 1.
DR Pfam; PF13650; Asp_protease_2; 1.
DR Pfam; PF17921; Integrase_H2C2; 1.
DR Pfam; PF03732; Retrotrans_gag; 1.
DR Pfam; PF17917; RT_RNaseH; 1.
DR Pfam; PF00665; rve; 1.
DR Pfam; PF00078; RVT_1; 1.
DR SUPFAM; SSF50630; Acid proteases; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
PE 4: Predicted;
FT DOMAIN 1455..1619
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT REGION 337..356
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 423..447
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 462..504
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1893..1965
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2240..2269
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1893..1916
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1917..1950
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 2301 AA; 264158 MW; 02FD86E3AA58A29E CRC64;
MPKWIRDSGG RLVKCDTPHN KEFKLSLNIM EATPEDQHSH QGRQDNLNEF RSMRDRMHPP
RMSAPSCIVP PTEQLVIRPY LVPLLPTFHG MESENPYAHI KEFEDVCNTF QEGGASIDLM
RLKLFPFTLK DKXKIWLNSL RPRSIRSWTD LQAEFLKKFF PTHRTNGLKR QISNFSAKEN
EKFYKCWERY MEAINACPHH GFDTWLLVSY FYDGEVGKMK SQLSAFNAKA GMYTLKEDDD
MKTKLAAMTR RLEELELKRI HEVQAVAEAL VQVKLCQNCK SYEHLVEECP AISAEREMFR
DQANVVGQFK PNNNAPYGNT YNSSWRNHPN FSWKAKATQY QQPDQPSQQS SSLEQAMANL
SKVVGDFVGN QEATNAQINQ RIDRVESTLN KRMDGMQNDI SQKFDNIQYS ISRLTNLNTV
QEKGRFPSQP HQNPKGVHEM ESQEGESSQM KDVKALITLR SGKKIEKPTP KPHVEKEEEE
IKKGDEREDK ESEIGEKKDS DSTMNAIPEK ELLKEEMLKK STSPPFTQAL HGKKGIRNAT
EILEILRQVK VNIPLLDMIK QVPTYAKFLK DLCTIKRGLT VNKKAFLTEQ VSAILQCKSP
LKYKDPESPT ISVMIGGKVV EKTLLDLGAN VNLLPYSVYK QLGLGELKPT TITLSLADRS
VKIPRGVIED VLVQVDNFYY PIDFIVLDTD PTVKEANLIP IILGRPFLAT SNAIINCRNG
LMQLTFGNMT LDLNIFYMSK KQTTPEEEEG PEELCIIDTL VEEHCNQKMQ DKLNKSLADF
EEGLSKSPNE LATLQSWRKI EEILPLFNKE EEAAADKEIP KLNLKPLPME LKYTYLEENN
QCPVVISSSL TNHQVNCLME VLKRCKKAIG WQISDLKGIS PLVCTHHIYM EEEAKPIRQL
QRRLNPHLQE VVRAEVLKLL QACIIYPISD SPWVSPIQVV PKKSGITVVQ NEKGEEITTR
LTSGWRVCID YRKLNAVTRK DHFPLPFIDQ VLERVSGHPF YCFLDGYSGY FQIEIDVADQ
EKTTFTCPFG TYAYRRMPFG LCNAPATFQR SNYGGTFEEC LINLEAVLHR CIEKDLVLNW
EKCHFMVRQG IVLGHIISGK GIEVDKAKVE LIVKLPSPIT VKGVRQFLGH AGFYRRFIKG
FSSLSKPLCE LLAKDAKFIW DERCQNSFDQ LKKLLTTTPI VRAPNWQLPF ELMCDASDFA
IGVVLGQRED GKPYVIYYAS KTLNEAQRNY TTTEKELLXV VFVLDKFRAY LVGSFIIVFT
NHSALKYLLT KQDAKARLIR WILLLQEFDL QIKDKKGVET VVADHLSRLV IAHNSHPLPI
NDDFPKESLM FLVKTPWYAH IANYLVTGEI PSEWNAQDRK HFFAKIHAYY WEEPFLFKYC
ADQIIRKCVP EDEQQGILNY CHENAWGGHF ASQKTAMKVL QSGFTWPSLF KDAHIMCRSC
DICQRLGKLT KRNQIPMNPI LKVELFDVWG IDFMGPFPIS FGNSYILVGV DYVSTWVEAI
PCRQNDHRVV LKFLKENIFS RFGVPKAIIS DGGAHFCNKP FEALLSKYGV KHKVATPYHP
QTFGQVELAN REIKNILMKV VNSSRKDWSI RLHDSLWAYR TAYKTILGMS PYRLVYGKAC
HLPVEVEYKA WWAIKKLNMD LIRAGAKRYL DLNEMEELRN DAYINSKVAK QRMKKWHDKL
ISNKKFQKGQ RVLLYDARLH IFPGKLKSRW IGPFIIHQVY VNGVVELLNS NGKDTFRVNG
YRGRDCKEIE RKKSERNRSK NRVKTEQKQG STRFHSLRKS SAKLALCCQT IPQLIGILYE
NFRRCEVDFG TRVPLRSTGA PNSQLRNGCE AIKQRAILAD CSSPCLSPTP HKPPFIFSGY
QFRPNFGNPK WREPEELNLP LLQAARKSXE RSPFQIPSLS LRAAETTGEA LSHQSPEPSP
VPSPVPSPVS SPAPQAEPQE PQPSLPEPQI PSEIAPEEII RHSMLTQPPI EGNLDYRARS
FHSELCFDTT TFQLRPELAQ SFHLLRRYHM EHLLTPRDFF YPRVAMDFYQ SMTTKQVRDP
TLIHFTIDGR HDILGARHIA EALQIPYEPS HFEDFRVWTN PTKLEMVHIL SRGASTRPHL
LRGELPPIMF LIDAFLRHNL YPLQHWTQRR GVQLEALFKM SEGYFFGPHH LIMAALLYFE
ENVHKKKLQR ADTIPLLFPR LLCQILEHLG YPSKPQLERK RICREVFTLD KWNNMTAYRV
EHPERPQPAA RRASPRHIPE GIPIAAPXIP KAPPVTPASS EPSTSAEPRM AIPISEYREL
CHSLQTLTAS QSSLVQEMAA I
//