ID F1K1M2_9HIV1 Unreviewed; 1007 AA.
AC F1K1M2;
DT 03-MAY-2011, integrated into UniProtKB/TrEMBL.
DT 03-MAY-2011, sequence version 1.
DT 27-MAR-2024, entry version 69.
DE SubName: Full=Pol protein {ECO:0000313|EMBL:ADZ32672.1};
DE Flags: Fragment;
GN Name=pol {ECO:0000313|EMBL:ADZ32672.1};
OS Human immunodeficiency virus 1.
OC Viruses; Riboviria; Pararnavirae; Artverviricota; Revtraviricetes;
OC Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus.
OX NCBI_TaxID=11676 {ECO:0000313|EMBL:ADZ32672.1, ECO:0000313|Proteomes:UP000141921};
OH NCBI_TaxID=9606; Homo sapiens (Human).
RN [1] {ECO:0000313|Proteomes:UP000141921}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RA Rolland M., Tovanabutra S., deCamp A.C., Frahm N., Gilbert P.B.,
RA Sanders-Buell E., Heath L., Magaret C.A., Bose M., Bradfield A.,
RA O'Sullivan A., Crossler J., Jones T., Nau M., Wong K., Zhao H., Ragui D.N.,
RA Sorensen S., Stoddard J.N., Maust B.S., Deng W., Hural J., Dubey S.,
RA Michael N.L., Shiver J., Corey L., Li F., Self S.G., Kim J., Buchbinder S.,
RA Casimiro D.R., Robertson M.N., Duerr A., McElrath M.J., McCutchan F.E.,
RA Mullins J.I.;
RL Submitted (FEB-2011) to the EMBL/GenBank/DDBJ databases.
CC -!- CATALYTIC ACTIVITY:
CC Reaction=3'-end directed exonucleolytic cleavage of viral RNA-DNA
CC hybrid.; EC=3.1.13.2; Evidence={ECO:0000256|ARBA:ARBA00000379};
CC -!- CATALYTIC ACTIVITY:
CC Reaction=Endohydrolysis of RNA in RNA/DNA hybrids. Three different
CC cleavage modes: 1. sequence-specific internal cleavage of RNA. Human
CC immunodeficiency virus type 1 and Moloney murine leukemia virus
CC enzymes prefer to cleave the RNA strand one nucleotide away from the
CC RNA-DNA junction. 2. RNA 5'-end directed cleavage 13-19 nucleotides
CC from the RNA end. 3. DNA 3'-end directed cleavage 15-20 nucleotides
CC away from the primer terminus.; EC=3.1.26.13;
CC Evidence={ECO:0000256|ARBA:ARBA00023415};
CC -!- SIMILARITY: Belongs to the retroviral Pol polyprotein family.
CC {ECO:0000256|RuleBase:RU004064}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JF320027; ADZ32672.1; -; Genomic_RNA.
DR Proteomes; UP000141921; Genome.
DR GO; GO:0004190; F:aspartic-type endopeptidase activity; IEA:UniProtKB-KW.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0004533; F:exoribonuclease H activity; IEA:UniProtKB-EC.
DR GO; GO:0003964; F:RNA-directed DNA polymerase activity; IEA:UniProtKB-KW.
DR GO; GO:0004523; F:RNA-DNA hybrid ribonuclease activity; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0015074; P:DNA integration; IEA:UniProtKB-KW.
DR GO; GO:0006310; P:DNA recombination; IEA:UniProtKB-KW.
DR GO; GO:0075713; P:establishment of integrated proviral latency; IEA:UniProtKB-KW.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR GO; GO:0046718; P:viral entry into host cell; IEA:UniProtKB-KW.
DR GO; GO:0044826; P:viral genome integration into host DNA; IEA:UniProtKB-KW.
DR CDD; cd05482; HIV_retropepsin_like; 1.
DR CDD; cd01645; RT_Rtv; 1.
DR Gene3D; 1.10.10.200; -; 1.
DR Gene3D; 3.30.70.270; -; 3.
DR Gene3D; 2.40.70.10; Acid Proteases; 1.
DR Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR Gene3D; 2.30.30.10; Integrase, C-terminal domain superfamily, retroviral; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 2.
DR InterPro; IPR001969; Aspartic_peptidase_AS.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR017856; Integrase-like_N.
DR InterPro; IPR036862; Integrase_C_dom_sf_retrovir.
DR InterPro; IPR001037; Integrase_C_retrovir.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR003308; Integrase_Zn-bd_dom_N.
DR InterPro; IPR001995; Peptidase_A2_cat.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR InterPro; IPR034170; Retropepsin-like_cat_dom.
DR InterPro; IPR018061; Retropepsins.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR002156; RNaseH_domain.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR000477; RT_dom.
DR InterPro; IPR010659; RVT_connect.
DR InterPro; IPR010661; RVT_thumb.
DR PANTHER; PTHR41694; ENDOGENOUS RETROVIRUS GROUP K MEMBER POL PROTEIN; 1.
DR PANTHER; PTHR41694:SF3; RNA-DIRECTED DNA POLYMERASE-RELATED; 1.
DR Pfam; PF00552; IN_DBD_C; 1.
DR Pfam; PF02022; Integrase_Zn; 1.
DR Pfam; PF00075; RNase_H; 1.
DR Pfam; PF00665; rve; 1.
DR Pfam; PF00077; RVP; 1.
DR Pfam; PF00078; RVT_1; 1.
DR Pfam; PF06815; RVT_connect; 1.
DR Pfam; PF06817; RVT_thumb; 1.
DR SUPFAM; SSF50630; Acid proteases; 1.
DR SUPFAM; SSF50122; DNA-binding domain of retroviral integrase; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF46919; N-terminal Zn binding domain of HIV integrase; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 2.
DR PROSITE; PS50175; ASP_PROT_RETROV; 1.
DR PROSITE; PS00141; ASP_PROTEASE; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
DR PROSITE; PS51027; INTEGRASE_DBD; 1.
DR PROSITE; PS50879; RNASE_H_1; 1.
DR PROSITE; PS50878; RT_POL; 1.
DR PROSITE; PS50876; ZF_INTEGRASE; 1.
PE 3: Inferred from homology;
KW Aspartyl protease {ECO:0000256|ARBA:ARBA00022750,
KW ECO:0000256|RuleBase:RU004064};
KW DNA integration {ECO:0000256|ARBA:ARBA00022908};
KW DNA recombination {ECO:0000256|ARBA:ARBA00023172};
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW Endonuclease {ECO:0000256|ARBA:ARBA00022759};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801, ECO:0000256|RuleBase:RU004064};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Multifunctional enzyme {ECO:0000256|ARBA:ARBA00023268};
KW Nuclease {ECO:0000256|ARBA:ARBA00022722};
KW Nucleotidyltransferase {ECO:0000256|ARBA:ARBA00022695};
KW Protease {ECO:0000256|ARBA:ARBA00022670, ECO:0000256|RuleBase:RU004064};
KW RNA-directed DNA polymerase {ECO:0000256|ARBA:ARBA00022918};
KW Transferase {ECO:0000256|ARBA:ARBA00022679};
KW Viral genome integration {ECO:0000256|ARBA:ARBA00023195};
KW Viral release from host cell {ECO:0000256|ARBA:ARBA00023113};
KW Virion maturation {ECO:0000256|ARBA:ARBA00023113};
KW Virus entry into host cell {ECO:0000256|ARBA:ARBA00023296};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771, ECO:0000256|PROSITE-
KW ProRule:PRU00450}.
FT DOMAIN 80..149
FT /note="Peptidase A2"
FT /evidence="ECO:0000259|PROSITE:PS50175"
FT DOMAIN 203..393
FT /note="Reverse transcriptase"
FT /evidence="ECO:0000259|PROSITE:PS50878"
FT DOMAIN 593..716
FT /note="RNase H type-1"
FT /evidence="ECO:0000259|PROSITE:PS50879"
FT DOMAIN 722..763
FT /note="Integrase-type"
FT /evidence="ECO:0000259|PROSITE:PS50876"
FT DOMAIN 773..923
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT DOMAIN 942..989
FT /note="Integrase-type"
FT /evidence="ECO:0000259|PROSITE:PS51027"
FT DNA_BIND 942..989
FT /note="Integrase-type"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00506"
FT REGION 12..48
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:ADZ32672.1"
SQ SEQUENCE 1007 AA; 114347 MW; 89A86CBB1497D0AD CRC64;
FFREDLAFQQ REAREFPSEQ TRANSPTRRE LQVCRRDNNS LSEAGADEGV DRAGPISFNL
PQITLWQRPL VTIKIGGQLK EALLDTGADD TVLEDMNLQG KWKPKMIGGI GGFIKVRQYD
QIPIEICGHK TIGTVLVGPT PVNIIGRNLL TQIGCTLNFP ISPIDTVPVK LKPGMDGPKV
KQWPLTEEKI KALVEICTEM EKEGKISKIG PENPYNTPVF AIKKKDSTKW RKLVDFRELN
KRTQDFWEVQ LGIPHPAGLK QKKSVTVLDV GDAYFSVPLD KEFRKYTAFT IPSINNETPG
IRYQYNVLPQ GWKGSPAIFQ SSMTKILEPF RKQNPEIVIY QYMDDLYVGS DLEIGQHRTK
IEELRQHLLR WGLTTPDKKH QKEPPFLWMG YELHPDKWTV QPIMLPEKDS WTVNDIQKLV
GKLNWASQIY PGIKVRQLCK LLRGTKALTE VIPLTEEAEL ELAENREILK EPVHGAYYDP
SKNLIAEVQK QGQGQWTYQI YQEPFKNLKT GKYAKMRGAH TNDVKQLTEV VQKISTESIV
IWGKTPKFRL PIQKETWEAW WTEYWQATWI PEWEFVNTPP LVKLWYQLEK EPIMGAETFY
VDGAANRETK LGKAGYVTDR GRQKAVPLTD TTNQKTELQA IYLALQDSGL EVNIVTDSQY
ALGIIQAQPD KSESEIVNQI IEQLIKKEKV YLAWVPAHKG IGGNEQVDKL VSAGIRKVLF
LDGIEKAQDE HEKYHSNWRA MASDFNLPPV VAKEIVASCD KCQLKGEAMH GQVDCSPGIW
QLDCTHLEGK IILVAVHVAS GYIEAEVIPA ETGQETAYFI LKLAGRWPVK TIHTDNGSNF
TSTTVKAACW WAGIKQEFGI PYNPQSQGVV ESMNNELKKI IGQVRDQAEH LKTAVQMAVF
IHNFKRKGGI GGYSAGERIV DIIATDLQTK ELQKQITKIQ NFRVYYRDNK DPLWKGPAKL
LWKGEGAVVI QDNSEIKVVP RRKAKIIRDY GKQMAGDDCV ASRQDED
//