ID Q1KWW3_9HIV1 Unreviewed; 1010 AA.
AC Q1KWW3;
DT 30-MAY-2006, integrated into UniProtKB/TrEMBL.
DT 30-MAY-2006, sequence version 1.
DT 27-MAR-2024, entry version 111.
DE SubName: Full=Pol protein {ECO:0000313|EMBL:ABF00617.1};
DE Flags: Fragment;
GN Name=pol {ECO:0000313|EMBL:ABF00617.1};
OS Human immunodeficiency virus 1.
OC Viruses; Riboviria; Pararnavirae; Artverviricota; Revtraviricetes;
OC Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus.
OX NCBI_TaxID=11676 {ECO:0000313|EMBL:ABF00617.1, ECO:0000313|Proteomes:UP000146099};
OH NCBI_TaxID=9606; Homo sapiens (Human).
RN [1] {ECO:0000313|EMBL:ABF00617.1, ECO:0000313|Proteomes:UP000146099}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=03ZAPS113MB2 {ECO:0000313|EMBL:ABF00617.1};
RX PubMed=16701907; DOI=10.1016/j.jviromet.2006.04.009;
RA Rousseau C.M., Birditt B.A., McKay A.R., Stoddard J.N., Lee T.C.,
RA McLaughlin S., Moore S.W., Shindo N., Learn G.H., Korber B.T., Brander C.,
RA Goulder P.J., Kiepiela P., Walker B.D., Mullins J.I.;
RT "Large-scale amplification, cloning and sequencing of near full-length HIV-
RT 1 subtype C genomes.";
RL J. Virol. Methods 136:118-125(2006).
CC -!- CATALYTIC ACTIVITY:
CC Reaction=3'-end directed exonucleolytic cleavage of viral RNA-DNA
CC hybrid.; EC=3.1.13.2; Evidence={ECO:0000256|ARBA:ARBA00000379};
CC -!- CATALYTIC ACTIVITY:
CC Reaction=Endohydrolysis of RNA in RNA/DNA hybrids. Three different
CC cleavage modes: 1. sequence-specific internal cleavage of RNA. Human
CC immunodeficiency virus type 1 and Moloney murine leukemia virus
CC enzymes prefer to cleave the RNA strand one nucleotide away from the
CC RNA-DNA junction. 2. RNA 5'-end directed cleavage 13-19 nucleotides
CC from the RNA end. 3. DNA 3'-end directed cleavage 15-20 nucleotides
CC away from the primer terminus.; EC=3.1.26.13;
CC Evidence={ECO:0000256|ARBA:ARBA00023415};
CC -!- SIMILARITY: Belongs to the retroviral Pol polyprotein family.
CC {ECO:0000256|RuleBase:RU004064}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; DQ396365; ABF00617.1; -; Genomic_RNA.
DR Proteomes; UP000146099; Genome.
DR GO; GO:0004190; F:aspartic-type endopeptidase activity; IEA:UniProtKB-KW.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0004533; F:exoribonuclease H activity; IEA:UniProtKB-EC.
DR GO; GO:0003964; F:RNA-directed DNA polymerase activity; IEA:UniProtKB-KW.
DR GO; GO:0004523; F:RNA-DNA hybrid ribonuclease activity; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0015074; P:DNA integration; IEA:UniProtKB-KW.
DR GO; GO:0006310; P:DNA recombination; IEA:UniProtKB-KW.
DR GO; GO:0075713; P:establishment of integrated proviral latency; IEA:UniProtKB-KW.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR GO; GO:0046718; P:viral entry into host cell; IEA:UniProtKB-KW.
DR GO; GO:0044826; P:viral genome integration into host DNA; IEA:UniProtKB-KW.
DR CDD; cd05482; HIV_retropepsin_like; 1.
DR CDD; cd01645; RT_Rtv; 1.
DR Gene3D; 1.10.10.200; -; 1.
DR Gene3D; 3.30.70.270; -; 3.
DR Gene3D; 2.40.70.10; Acid Proteases; 1.
DR Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR Gene3D; 2.30.30.10; Integrase, C-terminal domain superfamily, retroviral; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 2.
DR InterPro; IPR001969; Aspartic_peptidase_AS.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR017856; Integrase-like_N.
DR InterPro; IPR036862; Integrase_C_dom_sf_retrovir.
DR InterPro; IPR001037; Integrase_C_retrovir.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR003308; Integrase_Zn-bd_dom_N.
DR InterPro; IPR001995; Peptidase_A2_cat.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR InterPro; IPR034170; Retropepsin-like_cat_dom.
DR InterPro; IPR018061; Retropepsins.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR002156; RNaseH_domain.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR000477; RT_dom.
DR InterPro; IPR010659; RVT_connect.
DR InterPro; IPR010661; RVT_thumb.
DR PANTHER; PTHR41694; ENDOGENOUS RETROVIRUS GROUP K MEMBER POL PROTEIN; 1.
DR PANTHER; PTHR41694:SF3; RNA-DIRECTED DNA POLYMERASE-RELATED; 1.
DR Pfam; PF00552; IN_DBD_C; 1.
DR Pfam; PF02022; Integrase_Zn; 1.
DR Pfam; PF00075; RNase_H; 1.
DR Pfam; PF00665; rve; 1.
DR Pfam; PF00077; RVP; 1.
DR Pfam; PF00078; RVT_1; 1.
DR Pfam; PF06815; RVT_connect; 1.
DR Pfam; PF06817; RVT_thumb; 1.
DR SUPFAM; SSF50630; Acid proteases; 1.
DR SUPFAM; SSF50122; DNA-binding domain of retroviral integrase; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF46919; N-terminal Zn binding domain of HIV integrase; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 2.
DR PROSITE; PS50175; ASP_PROT_RETROV; 1.
DR PROSITE; PS00141; ASP_PROTEASE; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
DR PROSITE; PS51027; INTEGRASE_DBD; 1.
DR PROSITE; PS50879; RNASE_H_1; 1.
DR PROSITE; PS50878; RT_POL; 1.
DR PROSITE; PS50876; ZF_INTEGRASE; 1.
PE 3: Inferred from homology;
KW Aspartyl protease {ECO:0000256|ARBA:ARBA00022750,
KW ECO:0000256|RuleBase:RU004064};
KW DNA integration {ECO:0000256|ARBA:ARBA00022908};
KW DNA recombination {ECO:0000256|ARBA:ARBA00023172};
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW Endonuclease {ECO:0000256|ARBA:ARBA00022759};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801, ECO:0000256|RuleBase:RU004064};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Multifunctional enzyme {ECO:0000256|ARBA:ARBA00023268};
KW Nuclease {ECO:0000256|ARBA:ARBA00022722};
KW Nucleotidyltransferase {ECO:0000256|ARBA:ARBA00022695};
KW Protease {ECO:0000256|ARBA:ARBA00022670, ECO:0000256|RuleBase:RU004064};
KW RNA-directed DNA polymerase {ECO:0000256|ARBA:ARBA00022918};
KW Transferase {ECO:0000256|ARBA:ARBA00022679};
KW Viral genome integration {ECO:0000256|ARBA:ARBA00023195};
KW Viral release from host cell {ECO:0000256|ARBA:ARBA00023113};
KW Virion maturation {ECO:0000256|ARBA:ARBA00023113};
KW Virus entry into host cell {ECO:0000256|ARBA:ARBA00023296};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771, ECO:0000256|PROSITE-
KW ProRule:PRU00450}.
FT DOMAIN 83..152
FT /note="Peptidase A2"
FT /evidence="ECO:0000259|PROSITE:PS50175"
FT DOMAIN 206..396
FT /note="Reverse transcriptase"
FT /evidence="ECO:0000259|PROSITE:PS50878"
FT DOMAIN 596..719
FT /note="RNase H type-1"
FT /evidence="ECO:0000259|PROSITE:PS50879"
FT DOMAIN 725..766
FT /note="Integrase-type"
FT /evidence="ECO:0000259|PROSITE:PS50876"
FT DOMAIN 776..926
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT DOMAIN 945..992
FT /note="Integrase-type"
FT /evidence="ECO:0000259|PROSITE:PS51027"
FT DNA_BIND 945..992
FT /note="Integrase-type"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00506"
FT REGION 1..57
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 17..43
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:ABF00617.1"
SQ SEQUENCE 1010 AA; 114335 MW; 3346233D46252D7B CRC64;
FFREDLAFPQ GKAREFPSEQ TRANSPTTEQ ARANSPTSRE LQVRRNNPRS EAGAERQGTT
FNLPQITLWQ RPLVSVKVGG QIKEALLDTG ADDTVLEDIN LPGKWKPKMI GGIGGFIKVK
QYDQIVIEIC GKKAIGSVLV GPTPVNIIGR NMLTQLGCTL NFPISPIETV PVMLKPGMDG
PKVKQWPLTE EKIKALTAIC EEMEKEGKIT KIGPENPYNT PVFAIKKKDS TKWRKLVDFR
ELNKRTQDFW EVQLGIPHPA GLKRKKSVTV LDVGDAYFSV PLDENFRKYT AFTIPSINNE
TPGIRYQYNV LPQGWKGSPA IFQSSMTRIL EPFRAQNPEI VIYQYMDDLY VGSDLEIGQH
RAKIEELRKH LLRWGFTTPD KKHQKEPPFL WMGYELHPDK WTVQPIQLPT KDSWTVNDIQ
KLVGKLNWAS QIYPGIKVRQ LCKLLRGAKA LTDIVTLTEE AELELAENRE ILKEPVHGVY
YDPSKDLIAE IQKQGNGQWT YQIYQEPFKN LKTGKYAKMR TAHTNDVKQL TEVVQKIAME
SIVIWGKTPK FRLPIQKETW ETWWTDYWQA TWIPEWEFVN TPPLVKLWYQ LEKDPIVGAE
TFYVDGAASR ETKMGKAGYV TDKGRQKIVH LTETTNQKTE LQAIQLALQD SGPEVNIVTD
SQYALGIIHA QPDKSESELV SQIIEQLIGK ERVYLSWVPA HKGIGGNEQI DKLVSSGIRK
VLFLDGIDKA QDEHEKYHSN WRAMASDFNL PPIVAKEIVA SCDKCQLKGE AIHGQVDCSP
GIWQLDCTHL EGKVILVAVH VASGYIEAEV IPAETGQETA YYILKLAGRW PVKVIHTDNG
SNFTSAAVKA ACWWAGIQQE FGIPYNPQSQ GVVESMNKEL KKIIGQVRDQ AEHLKTAVQM
AVFIHNFKRK GGIGGYSAGE RIIDIIATDI QTRELQKQII KIQNFRVYYR DSRDPIWKGP
AKLLWKGEGA VVIQDNSDIK VVPRRKVKII KDYGKQMAGA DCVAGRQDED
//