ID A4K464_9HEPC Unreviewed; 2237 AA.
AC A4K464;
DT 01-MAY-2007, integrated into UniProtKB/TrEMBL.
DT 01-MAY-2007, sequence version 1.
DT 27-MAR-2024, entry version 113.
DE RecName: Full=Genome polyprotein {ECO:0000256|ARBA:ARBA00020107};
DE Flags: Fragment;
OS Hepacivirus hominis.
OC Viruses; Riboviria; Orthornavirae; Kitrinoviricota; Flasuviricetes;
OC Amarillovirales; Flaviviridae; Hepacivirus.
OX NCBI_TaxID=3052230 {ECO:0000313|EMBL:ABD85061.2};
RN [1] {ECO:0000313|EMBL:ABD85061.2}
RP NUCLEOTIDE SEQUENCE.
RC STRAIN=TN9-23del {ECO:0000313|EMBL:ABD85061.2};
RX PubMed=17493654; DOI=10.1016/j.virol.2007.04.003;
RA Bernardin F., Stramer S.L., Rehermann B., Page-Shafer K., Cooper S.,
RA Bangsberg D.R., Hahn J., Tobler L., Busch M., Delwart E.;
RT "High levels of subgenomic HCV plasma RNA in immunosilent infections.";
RL Virology 365:446-456(2007).
CC -!- CATALYTIC ACTIVITY:
CC Reaction=ATP + H2O = ADP + H(+) + phosphate; Xref=Rhea:RHEA:13065,
CC ChEBI:CHEBI:15377, ChEBI:CHEBI:15378, ChEBI:CHEBI:30616,
CC ChEBI:CHEBI:43474, ChEBI:CHEBI:456216; EC=3.6.4.13;
CC Evidence={ECO:0000256|ARBA:ARBA00001556};
CC -!- CATALYTIC ACTIVITY:
CC Reaction=Hydrolysis of four peptide bonds in the viral precursor
CC polyprotein, commonly with Asp or Glu in the P6 position, Cys or Thr
CC in P1 and Ser or Ala in P1'.; EC=3.4.21.98;
CC Evidence={ECO:0000256|ARBA:ARBA00001117};
CC -!- CATALYTIC ACTIVITY:
CC Reaction=a ribonucleoside 5'-triphosphate + H2O = a ribonucleoside 5'-
CC diphosphate + H(+) + phosphate; Xref=Rhea:RHEA:23680,
CC ChEBI:CHEBI:15377, ChEBI:CHEBI:15378, ChEBI:CHEBI:43474,
CC ChEBI:CHEBI:57930, ChEBI:CHEBI:61557; EC=3.6.1.15;
CC Evidence={ECO:0000256|ARBA:ARBA00001491};
CC -!- COFACTOR:
CC Name=Mg(2+); Xref=ChEBI:CHEBI:18420;
CC Evidence={ECO:0000256|ARBA:ARBA00001946};
CC -!- COFACTOR:
CC Name=Zn(2+); Xref=ChEBI:CHEBI:29105;
CC Evidence={ECO:0000256|ARBA:ARBA00001947};
CC -!- SUBCELLULAR LOCATION: Cell membrane {ECO:0000256|ARBA:ARBA00004236}.
CC Cytoplasm {ECO:0000256|ARBA:ARBA00004496}. Endoplasmic reticulum
CC membrane {ECO:0000256|ARBA:ARBA00004477}; Multi-pass membrane protein
CC {ECO:0000256|ARBA:ARBA00004477}. Endoplasmic reticulum membrane
CC {ECO:0000256|ARBA:ARBA00004406}; Peripheral membrane protein
CC {ECO:0000256|ARBA:ARBA00004406}. Host cell membrane
CC {ECO:0000256|ARBA:ARBA00004165}. Host cytoplasm, host perinuclear
CC region {ECO:0000256|ARBA:ARBA00004407}. Host endoplasmic reticulum
CC membrane {ECO:0000256|ARBA:ARBA00004153}; Multi-pass membrane protein
CC {ECO:0000256|ARBA:ARBA00004153}. Host endoplasmic reticulum membrane
CC {ECO:0000256|ARBA:ARBA00004291}; Peripheral membrane protein
CC {ECO:0000256|ARBA:ARBA00004291}. Host mitochondrion
CC {ECO:0000256|ARBA:ARBA00004181}. Host nucleus
CC {ECO:0000256|ARBA:ARBA00004147}. Membrane
CC {ECO:0000256|ARBA:ARBA00004141}; Multi-pass membrane protein
CC {ECO:0000256|ARBA:ARBA00004141}. Membrane
CC {ECO:0000256|ARBA:ARBA00004170}; Peripheral membrane protein
CC {ECO:0000256|ARBA:ARBA00004170}. Mitochondrion
CC {ECO:0000256|ARBA:ARBA00004173}. Nucleus
CC {ECO:0000256|ARBA:ARBA00004123}. Virion
CC {ECO:0000256|ARBA:ARBA00004328}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; DQ430818; ABD85061.2; -; Genomic_RNA.
DR MEROPS; S29.001; -.
DR GO; GO:0005737; C:cytoplasm; IEA:UniProtKB-SubCell.
DR GO; GO:0044167; C:host cell endoplasmic reticulum membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0033650; C:host cell mitochondrion; IEA:UniProtKB-SubCell.
DR GO; GO:0042025; C:host cell nucleus; IEA:UniProtKB-SubCell.
DR GO; GO:0044220; C:host cell perinuclear region of cytoplasm; IEA:UniProtKB-SubCell.
DR GO; GO:0020002; C:host cell plasma membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005886; C:plasma membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0019028; C:viral capsid; IEA:UniProtKB-KW.
DR GO; GO:0005524; F:ATP binding; IEA:UniProtKB-KW.
DR GO; GO:0004197; F:cysteine-type endopeptidase activity; IEA:InterPro.
DR GO; GO:0005216; F:monoatomic ion channel activity; IEA:UniProtKB-KW.
DR GO; GO:0017111; F:ribonucleoside triphosphate phosphatase activity; IEA:UniProtKB-EC.
DR GO; GO:0003723; F:RNA binding; IEA:UniProtKB-KW.
DR GO; GO:0003724; F:RNA helicase activity; IEA:UniProtKB-EC.
DR GO; GO:0003968; F:RNA-dependent RNA polymerase activity; IEA:UniProtKB-KW.
DR GO; GO:0004252; F:serine-type endopeptidase activity; IEA:InterPro.
DR GO; GO:0005198; F:structural molecule activity; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0039563; P:disruption by virus of host JAK-STAT cascade via inhibition of STAT1 activity; IEA:UniProtKB-KW.
DR GO; GO:0039527; P:disruption by virus of host TRAF-mediated signal transduction; IEA:UniProtKB-KW.
DR GO; GO:0039654; P:fusion of virus membrane with host endosome membrane; IEA:UniProtKB-KW.
DR GO; GO:0039520; P:induction by virus of host autophagy; IEA:UniProtKB-KW.
DR GO; GO:0039645; P:perturbation by virus of host G1/S transition checkpoint; IEA:UniProtKB-KW.
DR GO; GO:0051259; P:protein complex oligomerization; IEA:UniProtKB-KW.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR GO; GO:0039502; P:suppression by virus of host type I interferon-mediated signaling pathway; IEA:UniProtKB-KW.
DR GO; GO:0039545; P:suppression by virus of host viral-induced cytoplasmic pattern recognition receptor signaling pathway via inhibition of MAVS activity; IEA:UniProtKB-KW.
DR GO; GO:0019087; P:transformation of host cell by virus; IEA:InterPro.
DR GO; GO:0046718; P:viral entry into host cell; IEA:UniProtKB-KW.
DR GO; GO:0039694; P:viral RNA genome replication; IEA:InterPro.
DR GO; GO:0019062; P:virion attachment to host cell; IEA:UniProtKB-KW.
DR GO; GO:0039707; P:virus-mediated pore formation in host cell membrane; IEA:UniProtKB-KW.
DR CDD; cd23202; Hepacivirus_RdRp; 1.
DR Gene3D; 2.40.10.120; -; 1.
DR Gene3D; 3.30.70.270; -; 2.
DR Gene3D; 6.10.250.1610; -; 1.
DR Gene3D; 6.10.250.2920; -; 1.
DR Gene3D; 2.20.25.210; Hepatitis C NS5A, domain 1B; 1.
DR Gene3D; 2.30.30.710; Hepatitis C virus non-structural protein NS2, C-terminal domain; 1.
DR Gene3D; 1.20.1280.150; Hepatitis C virus non-structural protein NS2, N-terminal domain; 1.
DR Gene3D; 2.20.25.220; Hepatitis C virus NS5A, 1B domain; 1.
DR Gene3D; 3.40.50.300; P-loop containing nucleotide triphosphate hydrolases; 2.
DR Gene3D; 1.10.820.10; RNA Helicase Chain A , domain 3; 1.
DR Gene3D; 2.40.10.10; Trypsin-like serine proteases; 1.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR011492; Flavi_DEAD.
DR InterPro; IPR002521; HCV_Core_C.
DR InterPro; IPR044896; HCV_core_chain_A.
DR InterPro; IPR002522; HCV_core_N.
DR InterPro; IPR002518; HCV_NS2.
DR InterPro; IPR042205; HCV_NS2_C.
DR InterPro; IPR042209; HCV_NS2_N.
DR InterPro; IPR000745; HCV_NS4a.
DR InterPro; IPR001490; HCV_NS4b.
DR InterPro; IPR002868; HCV_NS5a.
DR InterPro; IPR013192; HCV_NS5A_1a.
DR InterPro; IPR013193; HCV_NS5a_1B_dom.
DR InterPro; IPR038568; HCV_NS5A_1B_sf.
DR InterPro; IPR024350; HCV_NS5a_C.
DR InterPro; IPR014001; Helicase_ATP-bd.
DR InterPro; IPR001650; Helicase_C.
DR InterPro; IPR004109; HepC_NS3_protease.
DR InterPro; IPR038170; NS5A_1a_sf.
DR InterPro; IPR027417; P-loop_NTPase.
DR InterPro; IPR009003; Peptidase_S1_PA.
DR InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR007094; RNA-dir_pol_PSvirus.
DR InterPro; IPR002166; RNA_pol_HCV.
DR Pfam; PF07652; Flavi_DEAD; 1.
DR Pfam; PF01543; HCV_capsid; 1.
DR Pfam; PF01542; HCV_core; 1.
DR Pfam; PF01538; HCV_NS2; 1.
DR Pfam; PF01006; HCV_NS4a; 1.
DR Pfam; PF01001; HCV_NS4b; 1.
DR Pfam; PF01506; HCV_NS5a; 1.
DR Pfam; PF08300; HCV_NS5a_1a; 1.
DR Pfam; PF08301; HCV_NS5a_1b; 1.
DR Pfam; PF12941; HCV_NS5a_C; 1.
DR Pfam; PF02907; Peptidase_S29; 1.
DR Pfam; PF00998; RdRP_3; 1.
DR SMART; SM00487; DEXDc; 1.
DR SMART; SM00490; HELICc; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF52540; P-loop containing nucleoside triphosphate hydrolases; 2.
DR SUPFAM; SSF50494; Trypsin-like serine proteases; 1.
DR PROSITE; PS51693; HCV_NS2_PRO; 1.
DR PROSITE; PS51192; HELICASE_ATP_BIND_1; 1.
DR PROSITE; PS51194; HELICASE_CTER; 1.
DR PROSITE; PS51822; HV_PV_NS3_PRO; 1.
DR PROSITE; PS50507; RDRP_SSRNA_POS; 1.
PE 4: Predicted;
KW Activation of host autophagy by virus {ECO:0000256|ARBA:ARBA00023050};
KW ATP-binding {ECO:0000256|ARBA:ARBA00022840};
KW Capsid protein {ECO:0000256|ARBA:ARBA00022561};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Fusion of virus membrane with host endosomal membrane
KW {ECO:0000256|ARBA:ARBA00022510};
KW Fusion of virus membrane with host membrane
KW {ECO:0000256|ARBA:ARBA00022506};
KW G1/S host cell cycle checkpoint dysregulation by virus
KW {ECO:0000256|ARBA:ARBA00023309};
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Helicase {ECO:0000256|ARBA:ARBA00022806};
KW Host cell membrane {ECO:0000256|ARBA:ARBA00022511};
KW Host endoplasmic reticulum {ECO:0000256|ARBA:ARBA00023184};
KW Host membrane {ECO:0000256|ARBA:ARBA00022870};
KW Host nucleus {ECO:0000256|ARBA:ARBA00022562};
KW Host-virus interaction {ECO:0000256|ARBA:ARBA00022581};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Inhibition of host innate immune response by virus
KW {ECO:0000256|ARBA:ARBA00022632};
KW Inhibition of host interferon signaling pathway by virus
KW {ECO:0000256|ARBA:ARBA00022830};
KW Inhibition of host MAVS by virus {ECO:0000256|ARBA:ARBA00022986};
KW Inhibition of host RLR pathway by virus {ECO:0000256|ARBA:ARBA00022482};
KW Inhibition of host STAT1 by virus {ECO:0000256|ARBA:ARBA00022961};
KW Inhibition of host TRAFs by virus {ECO:0000256|ARBA:ARBA00022647};
KW Interferon antiviral system evasion {ECO:0000256|ARBA:ARBA00023258};
KW Ion channel {ECO:0000256|ARBA:ARBA00023303};
KW Ion transport {ECO:0000256|ARBA:ARBA00023065};
KW Lipoprotein {ECO:0000256|ARBA:ARBA00023288};
KW Magnesium {ECO:0000256|ARBA:ARBA00022842};
KW Membrane {ECO:0000256|ARBA:ARBA00023136};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Modulation of host cell cycle by virus {ECO:0000256|ARBA:ARBA00022504};
KW Multifunctional enzyme {ECO:0000256|ARBA:ARBA00023268};
KW Nucleotide-binding {ECO:0000256|ARBA:ARBA00022741};
KW Nucleotidyltransferase {ECO:0000256|ARBA:ARBA00022695};
KW Palmitate {ECO:0000256|ARBA:ARBA00023139};
KW Phosphoprotein {ECO:0000256|ARBA:ARBA00022553};
KW Protease {ECO:0000256|ARBA:ARBA00022670};
KW RNA-binding {ECO:0000256|ARBA:ARBA00022884};
KW RNA-directed RNA polymerase {ECO:0000256|ARBA:ARBA00022484};
KW Serine protease {ECO:0000256|ARBA:ARBA00022825};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015};
KW Transferase {ECO:0000256|ARBA:ARBA00022679};
KW Transmembrane {ECO:0000256|ARBA:ARBA00022692};
KW Transmembrane helix {ECO:0000256|ARBA:ARBA00022989};
KW Transport {ECO:0000256|ARBA:ARBA00023065};
KW Viral attachment to host cell {ECO:0000256|ARBA:ARBA00022804};
KW Viral immunoevasion {ECO:0000256|ARBA:ARBA00023280};
KW Viral ion channel {ECO:0000256|ARBA:ARBA00023039};
KW Viral penetration into host cytoplasm {ECO:0000256|ARBA:ARBA00022595};
KW Viral RNA replication {ECO:0000256|ARBA:ARBA00022953};
KW Virion {ECO:0000256|ARBA:ARBA00022844};
KW Virus entry into host cell {ECO:0000256|ARBA:ARBA00023296};
KW Zinc {ECO:0000256|ARBA:ARBA00022833}.
FT DOMAIN 233..356
FT /note="Peptidase C18"
FT /evidence="ECO:0000259|PROSITE:PS51693"
FT DOMAIN 357..538
FT /note="Peptidase S29"
FT /evidence="ECO:0000259|PROSITE:PS51822"
FT DOMAIN 547..699
FT /note="Helicase ATP-binding"
FT /evidence="ECO:0000259|PROSITE:PS51192"
FT DOMAIN 691..868
FT /note="Helicase C-terminal"
FT /evidence="ECO:0000259|PROSITE:PS51194"
FT DOMAIN 1982..2100
FT /note="RdRp catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50507"
FT REGION 1..79
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1671..1755
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1..19
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 47..74
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 1680..1711
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT UNSURE 2193
FT /note="I or L"
FT /evidence="ECO:0000313|EMBL:ABD85061.2"
FT NON_TER 2237
FT /evidence="ECO:0000313|EMBL:ABD85061.2"
SQ SEQUENCE 2237 AA; 241887 MW; 8025AA93E043CC33 CRC64;
MSTDPKPQRK TKRNTNRRPQ DVKFPGGGQI VGGVYLLPRR GPRLGVRAAR KTSERSQPRG
RRQPIPKDRR SPGKSWGKPG YPWPLYGNEG CGWAGWLLSP RGSRPTWGPT DPRHRSRNLG
KVIDTITCGF ADLMGYIPVV GAPVGGVARA LAHGVRVLED GINYXTGNLP GCSFSIFLLA
LLSCLTVPAS AVEVRNISSG YYTTNDCSNS SISLVFEVTK WLLAVLGPAY LLRASLLRVP
YFVRAHALLR VCTLVRHLAG ARYIQMLLIT XGRWTGTYIY DHLSPLSTWA AQGLRDLAVA
VEPVVFSPME KKVIVWGAEX VACGDILHGL PVSARLGRXV LLGPADGYXS KGWKLLAPIT
AYTQQTRGLL GAIVVSLTGR DKXEQAGQVQ VLSSVTQSFL GTCISGVLWT VYHGAGNKTL
AGSKGPITQM YTXAXGDLVG WPSPPGTKSL DPCTCGAVDL YLVTRNADVI PVRRKBDRRG
ALLSPRPLST LKGSSGGPVL CXMGHAVGLF RAAVCSRGVA KSIDFXPVES LDIATRSPSF
SDNSTPPAVP QTYQVGYLHA PTGSGKSTKV PAAYASQGYK VLVLNPSVAA TLGFGAYMSK
AHGINPNIRT GVRTVTTGDP ITYSTYGKFL ADGGCSXGAY DVIICDECHA VDATTILGIG
TVLDQAETAG ARLVVLATAT PPGTVTTPHS NIEEVALGHE GEIPFYGKAI PLAXIKGGRH
LIFCHSKKKC DELAXALRGM GVNAVAYYRG LDVSVIPAQG DVVVVATDAL MTGFTGDFDS
VIDCNVAVTQ IVDFSLDPTF TITTQTVPQD AVSRSQRRGR TGRGRLGIYR YVSSGERPSG
MFDSXVLCEC YDAGAAWYEL XPAETTVRLR AYFNTPGLPV CQDHLEFWEA VFTGLTHIDA
HFLSQTKQXG DNFAYLXAYQ ATVCARAKAP PPSWDVMWKC LNRLKPTLTG PTPLLYRLGA
VTNEVTLTHP VTKYIATCMQ ADLEIMTSTW VLAGGVLAAV AAYCLATGCI SIIGRLHLNN
RVVVAPNKEV LYEAFDEMEE CASKAALIEE GQRMAEMLKS KIQGLLQQAT KQAQDIQPAI
QSSWPKLEQF WAKHMWNFIS GIQYLAGLST LPGNPAVASM MAFSAALTSP LPTSTTILLN
IMGGWLASQI APPAGATGFV VSGLVGAAVG SIGLGKILVD VLAGYGAGIS GALVAFKIMS
GEKPXVEDVV NLLPAILSPG ALVVGVICAA ILRRHVGQGE GAVQWMNRLI AFASRGNHVA
PTHYVAESDA SQRVTQVLSS LTITSLLRRL HAWITEDCPV PCSGSWLRDI WDWVCSILTD
FKNWLSSKLL PKMPGLPFIS CQKGYRGVWA GTGIMTTRCP CGANISGHVR LGTMKITGPK
TCLNLWQGTF PINCYTEGPC VPKPPPNYKT AIWRVAASEY VEXTQHGSFS YVTGLTSDNL
KVPCQVPAPE FFSWVDGVQI HRFAPTPGPF FRDEVTFTVG LNSFVVGSQL PCDPEPDTEV
LASMLTDPSH ITAEAAARRL ARGSPPSQAS SSASQLSAPS LKATCTTHKM XYDCDMVDAN
LFMGGDVTRI ESDSKVIXLD SLDSMTEXED DREPSIPSEY LIXRKKFPPA LPPWAHPDYN
PPTIETWKEP GYXPPTVLGC ALPPTPQXPV PPPRRRRAKV LTQDNVEGVL REMADKVLSP
LQSCNDSGHS TGVDTGGDSV QQPSDEIAAS EAGSLSSMPP LEGEPGDPDL EFEPAGSASP
SEGEREVIDS DSKSWSTVSD QEDSVVCCSM SYSWTGALIT PCGPEEEKLP INPLSNSLMR
FHNKVYSTTS RSASLRAKKV TFDRVQMLDA HYDSVLQDVK RAASEVSARL LSVEEACALT
PPHSARSRYG FGAKEVRSLS RRAVNHIQSV WEDLLEDQHT PIQTTIMAKN EVFCVDPAKG
GKKAARLIVY PDLGVRVCEK MALYDIAQKL PKAIMGASYG FQYSPAERVD FLLKAWGSKK
DPMGFSYDTR CFDSTVTERD IRTEESIYQA CSLPQEARTV IHSLTERLYV GGPMTNSKGQ
SCGYRRCRAS GVFTTSMGNT MTCYIKALAA CKAAGIVNPV MLVCGDDLVV ISESQGNEED
ERNLRAFTEA MTRYSAPPGD LPKPEYDLEL ITSCSSNVSV ALDSRGRRRY YLTRDPTTPI
SRAAWETVRH SPVNSWLGNI IQYAPTIWVR MVXMTHFFSI LLAQDTLNQN LNFEMYGAVY
SVNPLDLPAI IERLHGL
//