ID K0H4B8_9HEPC Unreviewed; 3016 AA.
AC K0H4B8;
DT 28-NOV-2012, integrated into UniProtKB/TrEMBL.
DT 06-MAR-2013, sequence version 2.
DT 27-MAR-2024, entry version 71.
DE RecName: Full=Genome polyprotein {ECO:0000256|ARBA:ARBA00020107};
OS Hepacivirus hominis.
OC Viruses; Riboviria; Orthornavirae; Kitrinoviricota; Flasuviricetes;
OC Amarillovirales; Flaviviridae; Hepacivirus.
OX NCBI_TaxID=3052230 {ECO:0000313|EMBL:AFU34524.1, ECO:0000313|Proteomes:UP000163528};
RN [1] {ECO:0000313|EMBL:AFU34524.1, ECO:0000313|Proteomes:UP000163528}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=TV476 {ECO:0000313|EMBL:AFU34524.1};
RX PubMed=23015745; DOI=10.1099/vir.0.047506-0;
RA Wang H., Yuan Z., Barnes E., Yuan M., Li C., Fu Y., Xia X., Li G.,
RA Newton P., Vongsouvath M., Klenerman P., Pybus O.G., Murphy D., Abe K.,
RA Lu L.;
RT "Eight novel hepatitis C virus genomes reveal the changing taxonomic
RT structure of genotype 6.";
RL J. Gen. Virol. 94:76-80(2013).
CC -!- CATALYTIC ACTIVITY:
CC Reaction=ATP + H2O = ADP + H(+) + phosphate; Xref=Rhea:RHEA:13065,
CC ChEBI:CHEBI:15377, ChEBI:CHEBI:15378, ChEBI:CHEBI:30616,
CC ChEBI:CHEBI:43474, ChEBI:CHEBI:456216; EC=3.6.4.13;
CC Evidence={ECO:0000256|ARBA:ARBA00001556};
CC -!- CATALYTIC ACTIVITY:
CC Reaction=Hydrolysis of four peptide bonds in the viral precursor
CC polyprotein, commonly with Asp or Glu in the P6 position, Cys or Thr
CC in P1 and Ser or Ala in P1'.; EC=3.4.21.98;
CC Evidence={ECO:0000256|ARBA:ARBA00001117};
CC -!- CATALYTIC ACTIVITY:
CC Reaction=a ribonucleoside 5'-triphosphate + H2O = a ribonucleoside 5'-
CC diphosphate + H(+) + phosphate; Xref=Rhea:RHEA:23680,
CC ChEBI:CHEBI:15377, ChEBI:CHEBI:15378, ChEBI:CHEBI:43474,
CC ChEBI:CHEBI:57930, ChEBI:CHEBI:61557; EC=3.6.1.15;
CC Evidence={ECO:0000256|ARBA:ARBA00001491};
CC -!- COFACTOR:
CC Name=Mg(2+); Xref=ChEBI:CHEBI:18420;
CC Evidence={ECO:0000256|ARBA:ARBA00001946};
CC -!- COFACTOR:
CC Name=Zn(2+); Xref=ChEBI:CHEBI:29105;
CC Evidence={ECO:0000256|ARBA:ARBA00001947};
CC -!- SUBCELLULAR LOCATION: Cell membrane {ECO:0000256|ARBA:ARBA00004236}.
CC Cytoplasm {ECO:0000256|ARBA:ARBA00004496}. Endoplasmic reticulum
CC membrane {ECO:0000256|ARBA:ARBA00004477}; Multi-pass membrane protein
CC {ECO:0000256|ARBA:ARBA00004477}. Endoplasmic reticulum membrane
CC {ECO:0000256|ARBA:ARBA00004406}; Peripheral membrane protein
CC {ECO:0000256|ARBA:ARBA00004406}. Endoplasmic reticulum membrane
CC {ECO:0000256|ARBA:ARBA00004115}; Single-pass type I membrane protein
CC {ECO:0000256|ARBA:ARBA00004115}. Host cell membrane
CC {ECO:0000256|ARBA:ARBA00004165}. Host cytoplasm, host perinuclear
CC region {ECO:0000256|ARBA:ARBA00004407}. Host endoplasmic reticulum
CC membrane {ECO:0000256|ARBA:ARBA00004153}; Multi-pass membrane protein
CC {ECO:0000256|ARBA:ARBA00004153}. Host endoplasmic reticulum membrane
CC {ECO:0000256|ARBA:ARBA00004291}; Peripheral membrane protein
CC {ECO:0000256|ARBA:ARBA00004291}. Host endoplasmic reticulum membrane
CC {ECO:0000256|ARBA:ARBA00004482}; Single-pass type I membrane protein
CC {ECO:0000256|ARBA:ARBA00004482}. Host lipid droplet
CC {ECO:0000256|ARBA:ARBA00004338}. Host mitochondrion
CC {ECO:0000256|ARBA:ARBA00004181}. Host nucleus
CC {ECO:0000256|ARBA:ARBA00004147}. Lipid droplet
CC {ECO:0000256|ARBA:ARBA00004502}. Membrane
CC {ECO:0000256|ARBA:ARBA00004141}; Multi-pass membrane protein
CC {ECO:0000256|ARBA:ARBA00004141}. Membrane
CC {ECO:0000256|ARBA:ARBA00004170}; Peripheral membrane protein
CC {ECO:0000256|ARBA:ARBA00004170}. Membrane
CC {ECO:0000256|ARBA:ARBA00004479}; Single-pass type I membrane protein
CC {ECO:0000256|ARBA:ARBA00004479}. Mitochondrion
CC {ECO:0000256|ARBA:ARBA00004173}. Nucleus
CC {ECO:0000256|ARBA:ARBA00004123}. Virion membrane
CC {ECO:0000256|ARBA:ARBA00004563}; Single-pass type I membrane protein
CC {ECO:0000256|ARBA:ARBA00004563}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; JX183552; AFU34524.1; -; Genomic_RNA.
DR Proteomes; UP000163528; Segment.
DR GO; GO:0005737; C:cytoplasm; IEA:UniProtKB-SubCell.
DR GO; GO:0044167; C:host cell endoplasmic reticulum membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0044186; C:host cell lipid droplet; IEA:UniProtKB-SubCell.
DR GO; GO:0033650; C:host cell mitochondrion; IEA:UniProtKB-SubCell.
DR GO; GO:0044220; C:host cell perinuclear region of cytoplasm; IEA:UniProtKB-SubCell.
DR GO; GO:0020002; C:host cell plasma membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005811; C:lipid droplet; IEA:UniProtKB-SubCell.
DR GO; GO:0005886; C:plasma membrane; IEA:UniProtKB-SubCell.
DR GO; GO:1990904; C:ribonucleoprotein complex; IEA:UniProtKB-KW.
DR GO; GO:0019031; C:viral envelope; IEA:UniProtKB-KW.
DR GO; GO:0019013; C:viral nucleocapsid; IEA:UniProtKB-KW.
DR GO; GO:0055036; C:virion membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005524; F:ATP binding; IEA:UniProtKB-KW.
DR GO; GO:0004197; F:cysteine-type endopeptidase activity; IEA:InterPro.
DR GO; GO:0005216; F:monoatomic ion channel activity; IEA:UniProtKB-KW.
DR GO; GO:0017111; F:ribonucleoside triphosphate phosphatase activity; IEA:UniProtKB-EC.
DR GO; GO:0003723; F:RNA binding; IEA:UniProtKB-KW.
DR GO; GO:0003724; F:RNA helicase activity; IEA:UniProtKB-EC.
DR GO; GO:0003968; F:RNA-dependent RNA polymerase activity; IEA:UniProtKB-KW.
DR GO; GO:0004252; F:serine-type endopeptidase activity; IEA:InterPro.
DR GO; GO:0005198; F:structural molecule activity; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0039563; P:disruption by virus of host JAK-STAT cascade via inhibition of STAT1 activity; IEA:UniProtKB-KW.
DR GO; GO:0039527; P:disruption by virus of host TRAF-mediated signal transduction; IEA:UniProtKB-KW.
DR GO; GO:0039654; P:fusion of virus membrane with host endosome membrane; IEA:UniProtKB-KW.
DR GO; GO:0039520; P:induction by virus of host autophagy; IEA:UniProtKB-KW.
DR GO; GO:0039645; P:perturbation by virus of host G1/S transition checkpoint; IEA:UniProtKB-KW.
DR GO; GO:0051259; P:protein complex oligomerization; IEA:UniProtKB-KW.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR GO; GO:0039502; P:suppression by virus of host type I interferon-mediated signaling pathway; IEA:UniProtKB-KW.
DR GO; GO:0039545; P:suppression by virus of host viral-induced cytoplasmic pattern recognition receptor signaling pathway via inhibition of MAVS activity; IEA:UniProtKB-KW.
DR GO; GO:0019087; P:transformation of host cell by virus; IEA:InterPro.
DR GO; GO:0046718; P:viral entry into host cell; IEA:UniProtKB-KW.
DR GO; GO:0039694; P:viral RNA genome replication; IEA:InterPro.
DR GO; GO:0019062; P:virion attachment to host cell; IEA:UniProtKB-KW.
DR GO; GO:0039707; P:virus-mediated pore formation in host cell membrane; IEA:UniProtKB-KW.
DR CDD; cd20903; HCV_p7; 1.
DR CDD; cd23202; Hepacivirus_RdRp; 1.
DR Gene3D; 2.40.10.120; -; 1.
DR Gene3D; 3.30.70.270; -; 2.
DR Gene3D; 6.10.250.1610; -; 1.
DR Gene3D; 6.10.250.1750; -; 1.
DR Gene3D; 6.10.250.2920; -; 1.
DR Gene3D; 2.20.25.210; Hepatitis C NS5A, domain 1B; 1.
DR Gene3D; 3.30.160.890; Hepatitis C virus envelope glycoprotein E1, chain C; 1.
DR Gene3D; 2.30.30.710; Hepatitis C virus non-structural protein NS2, C-terminal domain; 1.
DR Gene3D; 1.20.1280.150; Hepatitis C virus non-structural protein NS2, N-terminal domain; 1.
DR Gene3D; 2.20.25.220; Hepatitis C virus NS5A, 1B domain; 1.
DR Gene3D; 3.40.50.300; P-loop containing nucleotide triphosphate hydrolases; 2.
DR Gene3D; 1.10.820.10; RNA Helicase Chain A , domain 3; 1.
DR Gene3D; 2.40.10.10; Trypsin-like serine proteases; 1.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR011492; Flavi_DEAD.
DR InterPro; IPR002521; HCV_Core_C.
DR InterPro; IPR044896; HCV_core_chain_A.
DR InterPro; IPR002522; HCV_core_N.
DR InterPro; IPR002519; HCV_Env.
DR InterPro; IPR002531; HCV_NS1.
DR InterPro; IPR002518; HCV_NS2.
DR InterPro; IPR042205; HCV_NS2_C.
DR InterPro; IPR042209; HCV_NS2_N.
DR InterPro; IPR000745; HCV_NS4a.
DR InterPro; IPR001490; HCV_NS4b.
DR InterPro; IPR002868; HCV_NS5a.
DR InterPro; IPR013192; HCV_NS5A_1a.
DR InterPro; IPR013193; HCV_NS5a_1B_dom.
DR InterPro; IPR038568; HCV_NS5A_1B_sf.
DR InterPro; IPR024350; HCV_NS5a_C.
DR InterPro; IPR014001; Helicase_ATP-bd.
DR InterPro; IPR001650; Helicase_C.
DR InterPro; IPR004109; HepC_NS3_protease.
DR InterPro; IPR038170; NS5A_1a_sf.
DR InterPro; IPR027417; P-loop_NTPase.
DR InterPro; IPR009003; Peptidase_S1_PA.
DR InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR007094; RNA-dir_pol_PSvirus.
DR InterPro; IPR002166; RNA_pol_HCV.
DR Pfam; PF07652; Flavi_DEAD; 1.
DR Pfam; PF01543; HCV_capsid; 1.
DR Pfam; PF01542; HCV_core; 1.
DR Pfam; PF01539; HCV_env; 1.
DR Pfam; PF01560; HCV_NS1; 1.
DR Pfam; PF01538; HCV_NS2; 1.
DR Pfam; PF01006; HCV_NS4a; 1.
DR Pfam; PF01001; HCV_NS4b; 1.
DR Pfam; PF01506; HCV_NS5a; 1.
DR Pfam; PF08300; HCV_NS5a_1a; 1.
DR Pfam; PF08301; HCV_NS5a_1b; 1.
DR Pfam; PF12941; HCV_NS5a_C; 1.
DR Pfam; PF02907; Peptidase_S29; 1.
DR Pfam; PF00998; RdRP_3; 1.
DR SMART; SM00487; DEXDc; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF52540; P-loop containing nucleoside triphosphate hydrolases; 2.
DR SUPFAM; SSF50494; Trypsin-like serine proteases; 1.
DR PROSITE; PS51693; HCV_NS2_PRO; 1.
DR PROSITE; PS51192; HELICASE_ATP_BIND_1; 1.
DR PROSITE; PS51194; HELICASE_CTER; 1.
DR PROSITE; PS51822; HV_PV_NS3_PRO; 1.
DR PROSITE; PS50507; RDRP_SSRNA_POS; 1.
PE 4: Predicted;
KW Acetylation {ECO:0000256|ARBA:ARBA00022990};
KW Activation of host autophagy by virus {ECO:0000256|ARBA:ARBA00023050};
KW Apoptosis {ECO:0000256|ARBA:ARBA00022703};
KW ATP-binding {ECO:0000256|ARBA:ARBA00022840};
KW Capsid protein {ECO:0000256|ARBA:ARBA00022561};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Fusion of virus membrane with host endosomal membrane
KW {ECO:0000256|ARBA:ARBA00022510};
KW Fusion of virus membrane with host membrane
KW {ECO:0000256|ARBA:ARBA00022506};
KW G1/S host cell cycle checkpoint dysregulation by virus
KW {ECO:0000256|ARBA:ARBA00023309};
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Helicase {ECO:0000256|ARBA:ARBA00022806};
KW Host cell membrane {ECO:0000256|ARBA:ARBA00022511};
KW Host cytoplasm {ECO:0000256|ARBA:ARBA00023200};
KW Host endoplasmic reticulum {ECO:0000256|ARBA:ARBA00023184};
KW Host lipid droplet {ECO:0000256|ARBA:ARBA00023190};
KW Host membrane {ECO:0000256|ARBA:ARBA00022870};
KW Host mitochondrion {ECO:0000256|ARBA:ARBA00023147};
KW Host nucleus {ECO:0000256|ARBA:ARBA00022562};
KW Host-virus interaction {ECO:0000256|ARBA:ARBA00022581};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Inhibition of host innate immune response by virus
KW {ECO:0000256|ARBA:ARBA00022632};
KW Inhibition of host interferon signaling pathway by virus
KW {ECO:0000256|ARBA:ARBA00022830};
KW Inhibition of host MAVS by virus {ECO:0000256|ARBA:ARBA00022986};
KW Inhibition of host RLR pathway by virus {ECO:0000256|ARBA:ARBA00022482};
KW Inhibition of host STAT1 by virus {ECO:0000256|ARBA:ARBA00022961};
KW Inhibition of host TRAFs by virus {ECO:0000256|ARBA:ARBA00022647};
KW Interferon antiviral system evasion {ECO:0000256|ARBA:ARBA00023258};
KW Ion channel {ECO:0000256|ARBA:ARBA00023303};
KW Ion transport {ECO:0000256|ARBA:ARBA00023065};
KW Lipoprotein {ECO:0000256|ARBA:ARBA00023288};
KW Magnesium {ECO:0000256|ARBA:ARBA00022842};
KW Membrane {ECO:0000256|ARBA:ARBA00023136, ECO:0000256|SAM:Phobius};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Modulation of host cell cycle by virus {ECO:0000256|ARBA:ARBA00022504};
KW Multifunctional enzyme {ECO:0000256|ARBA:ARBA00023268};
KW Nucleotide-binding {ECO:0000256|ARBA:ARBA00022741};
KW Nucleotidyltransferase {ECO:0000256|ARBA:ARBA00022695};
KW Palmitate {ECO:0000256|ARBA:ARBA00023139};
KW Phosphoprotein {ECO:0000256|ARBA:ARBA00022553};
KW Protease {ECO:0000256|ARBA:ARBA00022670};
KW Ribonucleoprotein {ECO:0000256|ARBA:ARBA00023274};
KW RNA-binding {ECO:0000256|ARBA:ARBA00022884};
KW RNA-directed RNA polymerase {ECO:0000256|ARBA:ARBA00022484};
KW Serine protease {ECO:0000256|ARBA:ARBA00022825};
KW Thiol protease {ECO:0000256|ARBA:ARBA00022807};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015};
KW Transferase {ECO:0000256|ARBA:ARBA00022679};
KW Transmembrane {ECO:0000256|ARBA:ARBA00022692, ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|ARBA:ARBA00022989,
KW ECO:0000256|SAM:Phobius}; Transport {ECO:0000256|ARBA:ARBA00022448};
KW Viral attachment to host cell {ECO:0000256|ARBA:ARBA00022804};
KW Viral envelope protein {ECO:0000256|ARBA:ARBA00022879};
KW Viral immunoevasion {ECO:0000256|ARBA:ARBA00023280};
KW Viral ion channel {ECO:0000256|ARBA:ARBA00023039};
KW Viral nucleoprotein {ECO:0000256|ARBA:ARBA00023086};
KW Viral penetration into host cytoplasm {ECO:0000256|ARBA:ARBA00022595};
KW Viral RNA replication {ECO:0000256|ARBA:ARBA00022953};
KW Virion {ECO:0000256|ARBA:ARBA00022844};
KW Virus entry into host cell {ECO:0000256|ARBA:ARBA00023296};
KW Zinc {ECO:0000256|ARBA:ARBA00022833}.
FT TRANSMEM 264..287
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 720..742
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 754..781
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 787..805
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 817..837
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 1662..1685
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 1801..1820
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 1853..1872
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 1884..1904
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 2996..3013
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 905..1028
FT /note="Peptidase C18"
FT /evidence="ECO:0000259|PROSITE:PS51693"
FT DOMAIN 1029..1210
FT /note="Peptidase S29"
FT /evidence="ECO:0000259|PROSITE:PS51822"
FT DOMAIN 1219..1371
FT /note="Helicase ATP-binding"
FT /evidence="ECO:0000259|PROSITE:PS51192"
FT DOMAIN 1363..1540
FT /note="Helicase C-terminal"
FT /evidence="ECO:0000259|PROSITE:PS51194"
FT DOMAIN 2639..2757
FT /note="RdRp catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50507"
FT REGION 1..79
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 1479..1501
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2184..2207
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2315..2414
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 47..61
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2355..2380
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT UNSURE 140
FT /note="I or L"
FT /evidence="ECO:0000313|EMBL:AFU34524.1"
SQ SEQUENCE 3016 AA; 328171 MW; FD28174C10D22C52 CRC64;
MSTLPKPQRQ TKRNTNRRPT DVKFPGGGQI VGGVYLLPRR GPRLGVRATR KTSERSQPRG
RRQPIPKARQ PQGRHWAQPG YPWPLYGNEG CGWAGWLLSP RGSRPNWGPN DPRRRSRNLG
KVIDTLTCGL ADLMGYIPVL GAPLGGVAAA LAHGVRAVED GINYATGNLP GCSFSIFLLA
LFSCLTTPAS AIHYKNTSGI YHLTNDCPNS SIIYEADNII MHTPGCVPCV RTDNISRCWV
PVAPTLAVTN VSVPVRGFRS HXDLLVGSAA FCSALYIGDL CGGLFLIGQV FTFRPRRYTT
VQECNCSIYT GHVTGHRMAW DMMMNWSPAL TFAVSSALRV PQLLLEAXFE RHWGVIGAVL
YYSMXANWAK VIAVLLLFAG VEGTTTIGHS VGRSTSGLTS LFSTGPQQNL QLVNTNGSWH
INRTALNCND TLHTGFLAGL FAHHRFNSSG CPERMSSCRP LRDFDQGWGP LSYANVSGPS
SDKPYCWHYP PRPCEVVPAR NVCGPVYCFT PSPVVVGTTD RRGHPTYTWG ENESDVFLLQ
TTRPPGGGWF GCTWMNSTGF VKTCGAPPCK TRPDTKNNNT LLCPTDCFRK HPDATYGKCG
SGPWLTXRCI VDYPYRLWHY PCTVNYTIHK VRMYVGGVEH RFNAACNWTR GERCELDDRD
RVEMSPLLFS TTELSILPCS FTTMPALSTG LIHLHQNVVD VQYLYGVSSA IVSWAVKWEY
VVLAFLVLAD ARVCACLWLM FLVGQAEAAL DKLIILNAAS AASTQGWLCG IAFICAAWYI
RGRAVPITTY AILQMWPLLL LVLALPRRAY ASNGEEAASL GLLVIAIITI FTLTPAYKTL
LISTLWWIQY FIARAEAIMH EWVPPLQVRG GRDAVILLTC LLHPXLGFEV TKIILALMGP
IYILQYSLLK TPYFVRAHIL LRACMFLRGV MGGKYLQAAL LRLGAWTGTY IYDHLSPLSD
WASDGLRDLA VAVEPVIFSP MEKKVITWGA DTAACGDIIS GLPVSARRGN LIFLGPADDV
KGGGWRLLAP ITAYAQQTRG LVGTIVTSLT GRDKNEVEGE IQVVSTATQS FLATAINGVL
WTVYHGAGSK TLAGPKGPVC QMYTNVDQDL VGWPAPPGCR SLTPCSCGSS DLYLVTRNAD
VIPARRRGDT RAALLSPRPI STLKGSSGGP MLCPSGHVAG IFRAAVCTRG VAKSLDFIPV
ESMQSTVRSP SFSDNSSPPA VPQTYQVGYL HAPTGSGXST KVPAAYAXQG YKVLVLNPSV
AATLGFGSYM STAHGIDPNI RTGVRTITTG GAITYSTYGK FLADGGCSGG AYDIIICDEC
HSTDPTTILG IGTVLDQAGT AGVRLTVLAT ATPPGSITVP HPNITETALP TTGEVPFYGK
AIPLEYIKGG RHLIFCHSKK KCDELAKQLT SLGLNAVAFY RGVDVAVIPT SGDVVVCATD
ALMTGYTGDF DSVIDCNVSV TQVVDFSLDP TFTIETTTVP QDAVSRSQRR GRTGRGKHGD
YRYVSQGERP SGLFXTVVLC EAYDTGXAWY ELTPAETTVR LRAYLNTPGL PVCQDHLEFW
EGVFTGLTHI DAHFLSQTKQ AGENLAYLVA YQATVCARAK APPPSWDTMW KCLIRLKPVL
TGPTPLLYRL GAVQNEITTT HPITKYIMTC MSADLEVITS TWVLVGGVLA ALAAYCLSVG
CVVICGRITT TGRPVVVPDR EVLYQQFDEM EECSRHIPYL VEGQHLAEQF KQKVLGLIQS
TTKQAEELKP AVHSAWPKLE QFWQKHLWNF VSGIQYLAGL STLPGNPAVA SLMSFPASLT
CPLNTTTTLL LNILGGWVAA QLANPTASTA FVVSGLAGAA VGSIGLGRVL VDVIAGYGAG
VSGALVAFKI MSGETPSAED MVNMLPALLS PGALVVGVVC AAILRRHAGP TEGATQWMNR
LIAFASRGNH VSPTHYVPET DTSRQIMTIL SSLTVTSLLR KLHEWINSDW STPCSGSWLR
DIRDWVCTVL TDFKVWLKTK LVPALPGVPF LSCQRGFRGV WRGDGICRTT CPCGADIVGH
VKNGSMRISG SRWCSNIWHG TFPINATTTG PSVPIPEPNY KRALWRVAAE EYVEVVRVGD
SHFVVGATNQ DLKCPCQVPA PEFFTEVDGV RLHRYAPXCR PLIRDEISFX VGLNSYAIGS
QLPCEPEPDV TVVTSMLVDP SHITAETAAR RXARGSPPSL ASSSASQLSA PSLKATCTTH
GAHPDADLIE ADLLWRQEIG GNITRVESEN KVIVLDSFDP LVPEYDDREP SIPAECHRPN
RPKFPPALPI WARPDYNPPL IETWKKPEYV PPTVHGCALP PSGQPPIPPP RRKRTVRLDE
STVAGALAEL AQKSFPTLPA STTDSDSGHP TANGSSDQAD DEADTASDAG SYSSMPPLEG
EPGDPDLSSG SWSTVSEEGD SVVCCSMSYS WTGALITPCA AEEEKLPINP LSNSLIRHHN
LVYSTTSRSA AMRQKKVTFD RXQVLDQHYN DVVKEIKARA SGVTAKLLSV EEACDLTPPH
SARSKFGYGA KDVRSHASKA INHINSVWKD LLEDPQTPIP TTIMAKNEVF CVDASKGGRK
PARLIVYPDL GVRVCEKRAL FDITRKLPXA IMGEAYGFQY SPXQRVDYLL KMWRSKKTPM
GFSYDTRCFD STVTEHDIKT ERDIYLSCNL DSEARVAIQS LTERLYVGGP MYNSQGQKCG
TRRCRASGVL TTSLGNTMTC FIKAEAACRA AGITNYDMLV CGDDLVVVAE SAGIQEDASA
LRAFTEAMTR YSAPPGDQPQ PAYDLELITS CSSNVSVAHD HTGQRYYYLT RDPVIPLSRA
AWETARHTPV NSWLGNIIMY APTIWVRMVL MTHFFQILQS QEQLDKVLDF DIYGVTYSIN
PLQLPAIIQR LHGMAAFSLH GYSPDELNRV GACLRKLGAP PLRAWRHRAR AVRAKLIAQG
GNAAICGRYL FNWAVKTKLK LTPLPGAANL DLSGWFVSGF SGGDIYHSVS RARPRMLLLC
LLLLTVGVGI FLLPTR
//