ID A5AEM1_VITVI Unreviewed; 2497 AA.
AC A5AEM1;
DT 12-JUN-2007, integrated into UniProtKB/TrEMBL.
DT 12-JUN-2007, sequence version 1.
DT 27-MAR-2024, entry version 50.
DE RecName: Full=RNA-directed DNA polymerase {ECO:0000256|ARBA:ARBA00012493};
DE EC=2.7.7.49 {ECO:0000256|ARBA:ARBA00012493};
GN ORFNames=VITISV_001382 {ECO:0000313|EMBL:CAN77867.1};
OS Vitis vinifera (Grape).
OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
OC Spermatophyta; Magnoliopsida; eudicotyledons; Gunneridae; Pentapetalae;
OC rosids; Vitales; Vitaceae; Viteae; Vitis.
OX NCBI_TaxID=29760 {ECO:0000313|EMBL:CAN77867.1};
RN [1] {ECO:0000313|EMBL:CAN77867.1}
RP NUCLEOTIDE SEQUENCE.
RA Velasco R., Zharkikh A., Troggio M., Cartwright D.A., Cestaro A., Pruss D.,
RA Pindo M., FitzGerald L.M., Vezzulli S., Reid J., Malacarne G., Iliev D.,
RA Coppola G., Wardell B., Micheletti D., Macalma T., Facci M., Mitchell J.T.,
RA Perazzolli M., Eldredge G., Gatto P., Oyzerski R., Moretto M., Gutin N.,
RA Stefanini M., Chen Y., Segala C., Davenport C., Dematte L., Mraz A.,
RA Battilana J., Stormo K., Costa F., Tao Q., Si-Ammour A., Harkins T.,
RA Lackey A., Perbost C., Taillon B., Stella A., Solovyev V., Fawcett J.A.,
RA Sterck L., Vandepoele K., Grando S.M., Toppo S., Moser C., Lanchbury J.,
RA Bogden R., Skolnick M., Sgaramella V., Bhatnagar S.K., Fontana P.,
RA Gutin A., Van de Peer Y., Salamini F., Viola R.;
RT "The first genome sequence of an elite grapevine cultivar (Pinot noir Vitis
RT vinifera L.): coping with a highly heterozygous genome.";
RL PLoS ONE 2:e1326-e1326(2007).
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AM424660; CAN77867.1; -; Genomic_DNA.
DR GO; GO:0003676; F:nucleic acid binding; IEA:InterPro.
DR GO; GO:0015074; P:DNA integration; IEA:InterPro.
DR CDD; cd00303; retropepsin_like; 1.
DR CDD; cd09274; RNase_HI_RT_Ty3; 1.
DR CDD; cd01647; RT_LTR; 1.
DR Gene3D; 1.10.340.70; -; 1.
DR Gene3D; 3.30.70.270; -; 2.
DR Gene3D; 2.40.70.10; Acid Proteases; 1.
DR Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 1.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR041588; Integrase_H2C2.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR InterPro; IPR005162; Retrotrans_gag_dom.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR000477; RT_dom.
DR InterPro; IPR041373; RT_RNaseH.
DR PANTHER; PTHR37984:SF7; INTEGRASE CATALYTIC DOMAIN-CONTAINING PROTEIN; 1.
DR PANTHER; PTHR37984; PROTEIN CBG26694; 1.
DR Pfam; PF13650; Asp_protease_2; 1.
DR Pfam; PF17921; Integrase_H2C2; 1.
DR Pfam; PF03732; Retrotrans_gag; 1.
DR Pfam; PF17917; RT_RNaseH; 1.
DR Pfam; PF00665; rve; 1.
DR Pfam; PF00078; RVT_1; 1.
DR SUPFAM; SSF50630; Acid proteases; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
PE 4: Predicted;
FT DOMAIN 1536..1696
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT REGION 75..105
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 410..449
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 516..546
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 561..605
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1961..2077
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2351..2419
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 82..97
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 516..533
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2026..2042
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2061..2075
FT /note="Pro residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT UNSURE 1436
FT /note="I or L"
FT /evidence="ECO:0000313|EMBL:CAN77867.1"
SQ SEQUENCE 2497 AA; 285323 MW; 4BE5F6299E4B6616 CRC64;
MDFGRQQHQA EGQFRTPLFK VRNPQSTVRE FRTPQTNMRN WNSAYCNLCM PYWIRDQEGR
LVRIENPQDT ELDICVNIMD PPPEDQNSQQ GQGGNPNAYL SMRDRMHPPR MSAPSCILPP
LEQLVIRPHI VPLLPTFHGM ESENPYSHIK EFEEVCNTFR EGGASIDLMR LKLFPFTLKD
KAKIWLNSLR PRSIRNWVDL QAEFLKKYFP THRTNGLKRQ ISNFSAKENE KFHECWERYM
EAINACPHHG FDTWLLVSYF YDGMSSSMKQ ILETMCGGDF MSKNPDEAMD FLSYVAEVSR
GWDEPNSREK GKFPSQXXQN PKAGMYMLSE DVDMKAKVAT IARRLEELEL KKMHDVQAIS
ETQAHAMPCT ICQSCDHVVD ECPTMPAVRE MLGDQVNVVG QFRPNTNAPY GNTYNSSWRN
HPNFSWKPRP PPYQPQGQTQ APQQPSSVEQ AIANLSKVMN DFVGEQRAIN SQLHQKIENV
ESSLNKRMDG MQNDLYHKID NIQYSISRLT NLNTVNEKGK FPSQPSQNPK GVHEVETQEG
DSSKLREVKA VITLRSGKEV DQPLPKVKQD EELMTKRPLV KESKNQEEQS GKKSASKSSI
EEEPRIVIKE DMMKKHMPPP FPQALHGKKE IKNSSEILEV LRQVKVNIPL LDMIKQVPTY
AKFLKDLCTV KRGLQVTKNA FLTEQVSAII QSKSPVKYKD PGCPTISVNI GGTHVEKALL
DLGASVNLLP YSVYKQLGLG GLKPTTMTLS LADRSVKIPR GVIEDVLIQV DKFYYPVDFV
VLDTDSSVKE ENYVPIILGR PFLATSNAIV NCRNGVMQLT FGNMTLEEEG FEEVCLINTL
VEEHCDKSLE ESLNENLEVL EDGFPEPSDV LAIMSPWRRR EEILPLFNQD DSQGVAVEDP
PKLILKPLPV ELKYAYLEDD EKCPVVVAST LTSDQEDSLL GVLRKCKKAI GWQISDLKGI
SPLVCTHHIY MEDDAKPVRQ PQRRLNPHMQ EVVRSEVLKL LQAGIIYPIS DSLWVSPTQV
VPKKSGITVI QNEKGEEVST RPTSGWRVCI DYRRLNSVTR KDHFPLPFMD QVLERVSGHP
FYCFLDGYSG YFQIEIDLED QEKTTFTCPF GTFAYRRMPF GLCNAPATFQ RCMLSIFSDM
VERIMEVFMD DITVYGSSYE ECLMHLEAVL HRCIEKDLVL NWEKCHFMVQ KGIVLGHIIS
KNGIEVDKAK VELIVKLPPP TNVKGIRQFL GHAGFYRRFI KDFSKISKPL CELLVKDAKF
VWDEKCQRSF EELKQFLTTA PIVRAPNWKL PFEVMCDSSD LAMGAVLGQR EDGKPYVIYY
ASRTLNEAQK NYTTTEKELL AVVFALDKFR AYLVGSSIVV FTDHSALKYL LTKQDAKARL
IRWILLLQEF NLQIRDKKGV ENVVADHLSR LVISHDSHGL PINDDFPEES LMSVDXAPWY
SHIANFLVTG EVPSEWSAQD KRHFLAKIHA YYWEEPFLFK YCADQIIRKC VPEQEQSGIL
SHCHDSACGG HFASQKTAMK VIQSGFWWPS LFKDAHSMCK ACDRCQRLDF MGPFPMSFGH
SYILVGVDYV SKWVEAIPCR TNDHKVVLKF LKENIFSRFG VPKAIISDGG THFCNKPFET
LLAKYGVKHK VATPYHPQTS GQVELANREI KNILMKVVNV NRKDWSIKLL DSLWAYRTAY
KTILGMSPYR LVYGKACHLP VEIEYKAWWA IKKLNMDLSR AGLKRCLDLN ELEELRNDAY
LNSKIAKARL KEWHDQLVNQ KNFTKGQKVL LYDSKLHLFP GEESSSKAEQ GXKAKEAEHP
ISRCQNFRTP DFKVRKFRTP QNQGAKSLGQ NKXISHTPTS RCEINFKVRN PFSRCEFPKI
QFRTPLWKAH VCQFRTPQAH FRTVRNGVRK FRTPLFKMRK FRTPHFKVRI LLSKGGHFRT
PKSKVRKFQS KVRKFQFKVR KFRTPLKPRN LTVSPAHKLR HLWNPEHPPP FISAMAKTRG
GHSASPSSPT PRPQRAAMGA APSPXVQAPA IPPTEGEVPS QRRYPTRRPP ADPVPPVDQA
TSPVSRPPAK RTRFSGPGEP SHXPQPEPXT EEPRIPVDMP PEAIIRRPMI AGPPIEGNLD
CRDRPFHSET YFDIEALRQQ PELRDSFRLL QRYHMESFLT PRQFYYPRVV IDFYQSMTTR
GLRNPTLIQF TIDGRQGAIG ARHIAEALRI PYEPVFQADF REWSSFSQSD MVRILSRGTS
TASVLTRREL PSGMLLIDVL LRANLFPLQH KVQRRGAILE ALFRISEGYF FGPHHLIMTS
LLHFEEKVHQ KKLQRADGIP LLFPRLLCQI LEHLGYPEEP RLERRRHCRE DFSLDKWHHL
VAYFAXQGAP AVPAPPELPR DEQVPQAQQD EILTETPPPA PAAHPSVHMP EAIHSTSPIT
QGAPPVVPAT PAPPPSSEAT VTVSLTEFRG LERSLRILST AQDSIIHQMA TIRAHQDQII
ATQAQHTTIL HQIQQHLSLQ TPLGHDRSAP SEPLVPR
//