ID L7TDC0_9HIV1 Unreviewed; 984 AA.
AC L7TDC0;
DT 03-APR-2013, integrated into UniProtKB/TrEMBL.
DT 03-APR-2013, sequence version 1.
DT 27-MAR-2024, entry version 57.
DE SubName: Full=Pol protein {ECO:0000313|EMBL:AGC32095.1};
DE Flags: Fragment;
GN Name=pol {ECO:0000313|EMBL:AGC32095.1};
OS Human immunodeficiency virus 1.
OC Viruses; Riboviria; Pararnavirae; Artverviricota; Revtraviricetes;
OC Ortervirales; Retroviridae; Orthoretrovirinae; Lentivirus.
OX NCBI_TaxID=11676 {ECO:0000313|EMBL:AGC32095.1};
OH NCBI_TaxID=9606; Homo sapiens (Human).
RN [1] {ECO:0000313|EMBL:AGC32095.1}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=0B191647_5_F2 {ECO:0000313|EMBL:AGC32095.1};
RX PubMed=23123038; DOI=10.1016/j.virol.2012.10.009;
RA Baalwa J., Wang S., Parrish N.F., Decker J.M., Keele B.F., Learn G.H.,
RA Yue L., Ruzagira E., Ssemwanga D., Kamali A., Amornkul P.N., Price M.A.,
RA Kappes J.C., Karita E., Kaleebu P., Sanders E., Gilmour J., Allen S.,
RA Hunter E., Montefiori D.C., Haynes B.F., Cormier E., Hahn B.H., Shaw G.M.;
RT "Molecular identification, cloning and characterization of
RT transmitted/founder HIV-1 subtype A, D and A/D infectious molecular
RT clones.";
RL Virology 436:33-48(2013).
CC -!- CATALYTIC ACTIVITY:
CC Reaction=3'-end directed exonucleolytic cleavage of viral RNA-DNA
CC hybrid.; EC=3.1.13.2; Evidence={ECO:0000256|ARBA:ARBA00000379};
CC -!- CATALYTIC ACTIVITY:
CC Reaction=Endohydrolysis of RNA in RNA/DNA hybrids. Three different
CC cleavage modes: 1. sequence-specific internal cleavage of RNA. Human
CC immunodeficiency virus type 1 and Moloney murine leukemia virus
CC enzymes prefer to cleave the RNA strand one nucleotide away from the
CC RNA-DNA junction. 2. RNA 5'-end directed cleavage 13-19 nucleotides
CC from the RNA end. 3. DNA 3'-end directed cleavage 15-20 nucleotides
CC away from the primer terminus.; EC=3.1.26.13;
CC Evidence={ECO:0000256|ARBA:ARBA00023415};
CC -!- SIMILARITY: Belongs to the retroviral Pol polyprotein family.
CC {ECO:0000256|RuleBase:RU004064}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JX202830; AGC32095.1; -; Genomic_RNA.
DR GO; GO:0004190; F:aspartic-type endopeptidase activity; IEA:UniProtKB-KW.
DR GO; GO:0003677; F:DNA binding; IEA:UniProtKB-KW.
DR GO; GO:0004533; F:exoribonuclease H activity; IEA:UniProtKB-EC.
DR GO; GO:0003964; F:RNA-directed DNA polymerase activity; IEA:UniProtKB-KW.
DR GO; GO:0004523; F:RNA-DNA hybrid ribonuclease activity; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0015074; P:DNA integration; IEA:UniProtKB-KW.
DR GO; GO:0006310; P:DNA recombination; IEA:UniProtKB-KW.
DR GO; GO:0075713; P:establishment of integrated proviral latency; IEA:UniProtKB-KW.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR GO; GO:0046718; P:viral entry into host cell; IEA:UniProtKB-KW.
DR GO; GO:0044826; P:viral genome integration into host DNA; IEA:UniProtKB-KW.
DR CDD; cd05482; HIV_retropepsin_like; 1.
DR CDD; cd01645; RT_Rtv; 1.
DR Gene3D; 1.10.10.200; -; 1.
DR Gene3D; 3.30.70.270; -; 3.
DR Gene3D; 2.40.70.10; Acid Proteases; 1.
DR Gene3D; 3.10.10.10; HIV Type 1 Reverse Transcriptase, subunit A, domain 1; 1.
DR Gene3D; 2.30.30.10; Integrase, C-terminal domain superfamily, retroviral; 1.
DR Gene3D; 3.30.420.10; Ribonuclease H-like superfamily/Ribonuclease H; 2.
DR InterPro; IPR001969; Aspartic_peptidase_AS.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR017856; Integrase-like_N.
DR InterPro; IPR036862; Integrase_C_dom_sf_retrovir.
DR InterPro; IPR001037; Integrase_C_retrovir.
DR InterPro; IPR001584; Integrase_cat-core.
DR InterPro; IPR003308; Integrase_Zn-bd_dom_N.
DR InterPro; IPR001995; Peptidase_A2_cat.
DR InterPro; IPR021109; Peptidase_aspartic_dom_sf.
DR InterPro; IPR034170; Retropepsin-like_cat_dom.
DR InterPro; IPR018061; Retropepsins.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR012337; RNaseH-like_sf.
DR InterPro; IPR002156; RNaseH_domain.
DR InterPro; IPR036397; RNaseH_sf.
DR InterPro; IPR000477; RT_dom.
DR InterPro; IPR010659; RVT_connect.
DR InterPro; IPR010661; RVT_thumb.
DR PANTHER; PTHR41694; ENDOGENOUS RETROVIRUS GROUP K MEMBER POL PROTEIN; 1.
DR PANTHER; PTHR41694:SF3; RNA-DIRECTED DNA POLYMERASE-RELATED; 1.
DR Pfam; PF00552; IN_DBD_C; 1.
DR Pfam; PF02022; Integrase_Zn; 1.
DR Pfam; PF00075; RNase_H; 1.
DR Pfam; PF00665; rve; 1.
DR Pfam; PF00077; RVP; 1.
DR Pfam; PF00078; RVT_1; 1.
DR Pfam; PF06815; RVT_connect; 1.
DR Pfam; PF06817; RVT_thumb; 1.
DR SUPFAM; SSF50630; Acid proteases; 1.
DR SUPFAM; SSF50122; DNA-binding domain of retroviral integrase; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF46919; N-terminal Zn binding domain of HIV integrase; 1.
DR SUPFAM; SSF53098; Ribonuclease H-like; 2.
DR PROSITE; PS50175; ASP_PROT_RETROV; 1.
DR PROSITE; PS00141; ASP_PROTEASE; 1.
DR PROSITE; PS50994; INTEGRASE; 1.
DR PROSITE; PS51027; INTEGRASE_DBD; 1.
DR PROSITE; PS50879; RNASE_H_1; 1.
DR PROSITE; PS50878; RT_POL; 1.
DR PROSITE; PS50876; ZF_INTEGRASE; 1.
PE 3: Inferred from homology;
KW Aspartyl protease {ECO:0000256|ARBA:ARBA00022750,
KW ECO:0000256|RuleBase:RU004064};
KW DNA integration {ECO:0000256|ARBA:ARBA00022908};
KW DNA recombination {ECO:0000256|ARBA:ARBA00023172};
KW DNA-binding {ECO:0000256|ARBA:ARBA00023125};
KW Endonuclease {ECO:0000256|ARBA:ARBA00022759};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801, ECO:0000256|RuleBase:RU004064};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Multifunctional enzyme {ECO:0000256|ARBA:ARBA00023268};
KW Nuclease {ECO:0000256|ARBA:ARBA00022722};
KW Nucleotidyltransferase {ECO:0000256|ARBA:ARBA00022695};
KW Protease {ECO:0000256|ARBA:ARBA00022670, ECO:0000256|RuleBase:RU004064};
KW RNA-directed DNA polymerase {ECO:0000256|ARBA:ARBA00022918};
KW Transferase {ECO:0000256|ARBA:ARBA00022679};
KW Viral genome integration {ECO:0000256|ARBA:ARBA00023195};
KW Viral release from host cell {ECO:0000256|ARBA:ARBA00023113};
KW Virion maturation {ECO:0000256|ARBA:ARBA00023113};
KW Virus entry into host cell {ECO:0000256|ARBA:ARBA00023296};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771, ECO:0000256|PROSITE-
KW ProRule:PRU00450}.
FT DOMAIN 75..144
FT /note="Peptidase A2"
FT /evidence="ECO:0000259|PROSITE:PS50175"
FT DOMAIN 198..388
FT /note="Reverse transcriptase"
FT /evidence="ECO:0000259|PROSITE:PS50878"
FT DOMAIN 588..711
FT /note="RNase H type-1"
FT /evidence="ECO:0000259|PROSITE:PS50879"
FT DOMAIN 717..758
FT /note="Integrase-type"
FT /evidence="ECO:0000259|PROSITE:PS50876"
FT DOMAIN 768..918
FT /note="Integrase catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50994"
FT DOMAIN 937..984
FT /note="Integrase-type"
FT /evidence="ECO:0000259|PROSITE:PS51027"
FT DNA_BIND 937..984
FT /note="Integrase-type"
FT /evidence="ECO:0000256|PROSITE-ProRule:PRU00506"
FT REGION 13..34
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 17..34
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT NON_TER 1
FT /evidence="ECO:0000313|EMBL:AGC32095.1"
FT NON_TER 984
FT /evidence="ECO:0000313|EMBL:AGC32095.1"
SQ SEQUENCE 984 AA; 111588 MW; FC4206AE947463C6 CRC64;
FFRENLVFPQ GKARELPSEQ TRANSPTSRE LQVWGGDNTL SETGAERQGA VSFNFPQITL
WQRPLVTVKI GGQLREALLD TGADDTVLEE ISLPGKWKPK MIGGIGGFIK VKQYDQIPVE
ICGYKAIGTV LVGPTPVNII GRNLLTQIGC TLNFPISPIE TVPVKLKPGM DGPKVKQWPL
TEEKIKALTE ICAEMEKEGK ISRIGPENPY NTPIFAIKKK DSTKWRKLVD FRELNKRTQD
FWEVQLGIPH PAGLKKKKSV TVLDVGDAYF SVPLYEDFRK YTAFTIPSTN NETPGIRYQY
NVLPQGWKGS PAIFQSSMTK ILDPFRKQNP EMVIYQYMDD LYVGSDLEIG QHRLKIEELR
EHLLKWGLTT PDKKHQKEPP FLWMGYELHP DKWTVQPIKL PEKESWTVND IQKLVGKLNW
ASQIYSGIKV KQLCKCLRGA KALTEVVPLT EEAELELAEN REILKEPVHG VYYDPSKDLI
AEIQKQGQGQ WSYQIYQEQY QNLKTGKYAK TRGTHTNDVK QLTEVVQKIA QESIVIWGKT
PKFRLPIQKE TWETWWTEYW QATWIPEWEF VNTPPLVKLW YQLEKEPIVG AETFYVDGAA
NRETKLGKAG YVTDRGRQKV VSLTDTTNQK TELQAINLAL QDSGLEVNIV TDSQYALGII
QAQPDKSESE LVSQIIEQLI KKEKVYLSWV PAHKGIGGNE QVDKLVSNGI RKILFLDGID
KAQEEHEKYH NNWRAMASDF NLPPVVAKEI VASCDKCQLK GEALHGQVDC SPGIWQLDCT
HLEGKVILVA VHVASGYIEA EVIPAETGQE TAYFLLKLAG RWPVKVVHTD NGSNFTSGAV
KAACWWAGIK QEFGIPYNPQ SQGVVESMNK ELKKIIGQVR DQAEHLKTAV QMAVFIHNFK
RRGGIGGYSA GERIIDIIAT DIQTKELQKQ IIKIQNFRVY YRDSRDPVWK GPAKLLWKGE
GAVVIQDNSD IKVVPRRKVK IIRD
//