ID Q99IB2_9HEPC Unreviewed; 3032 AA.
AC Q99IB2;
DT 01-JUN-2001, integrated into UniProtKB/TrEMBL.
DT 01-JUN-2001, sequence version 1.
DT 27-MAR-2024, entry version 152.
DE RecName: Full=Genome polyprotein {ECO:0000256|ARBA:ARBA00020107};
OS Hepacivirus hominis.
OC Viruses; Riboviria; Orthornavirae; Kitrinoviricota; Flasuviricetes;
OC Amarillovirales; Flaviviridae; Hepacivirus.
OX NCBI_TaxID=3052230 {ECO:0000313|EMBL:BAB32878.1, ECO:0000313|Proteomes:UP000141063};
RN [1] {ECO:0000313|EMBL:BAB32878.1, ECO:0000313|Proteomes:UP000141063}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX PubMed=11424123; DOI=10.1002/jmv.1055;
RA Kato T., Furusaka A., Miyamoto M., Date T., Yasui K., Hiramoto J.,
RA Nagayama K., Tanaka T., Wakita T.;
RT "Sequence analysis of hepatitis C virus isolated from a fulminant hepatitis
RT patient.";
RL J. Med. Virol. 64:334-339(2001).
CC -!- CATALYTIC ACTIVITY:
CC Reaction=ATP + H2O = ADP + H(+) + phosphate; Xref=Rhea:RHEA:13065,
CC ChEBI:CHEBI:15377, ChEBI:CHEBI:15378, ChEBI:CHEBI:30616,
CC ChEBI:CHEBI:43474, ChEBI:CHEBI:456216; EC=3.6.4.13;
CC Evidence={ECO:0000256|ARBA:ARBA00001556};
CC -!- CATALYTIC ACTIVITY:
CC Reaction=Hydrolysis of four peptide bonds in the viral precursor
CC polyprotein, commonly with Asp or Glu in the P6 position, Cys or Thr
CC in P1 and Ser or Ala in P1'.; EC=3.4.21.98;
CC Evidence={ECO:0000256|ARBA:ARBA00001117};
CC -!- CATALYTIC ACTIVITY:
CC Reaction=a ribonucleoside 5'-triphosphate + H2O = a ribonucleoside 5'-
CC diphosphate + H(+) + phosphate; Xref=Rhea:RHEA:23680,
CC ChEBI:CHEBI:15377, ChEBI:CHEBI:15378, ChEBI:CHEBI:43474,
CC ChEBI:CHEBI:57930, ChEBI:CHEBI:61557; EC=3.6.1.15;
CC Evidence={ECO:0000256|ARBA:ARBA00001491};
CC -!- COFACTOR:
CC Name=Mg(2+); Xref=ChEBI:CHEBI:18420;
CC Evidence={ECO:0000256|ARBA:ARBA00001946};
CC -!- COFACTOR:
CC Name=Zn(2+); Xref=ChEBI:CHEBI:29105;
CC Evidence={ECO:0000256|ARBA:ARBA00001947};
CC -!- SUBCELLULAR LOCATION: Cell membrane {ECO:0000256|ARBA:ARBA00004236}.
CC Cytoplasm {ECO:0000256|ARBA:ARBA00004496}. Endoplasmic reticulum
CC membrane {ECO:0000256|ARBA:ARBA00004477}; Multi-pass membrane protein
CC {ECO:0000256|ARBA:ARBA00004477}. Endoplasmic reticulum membrane
CC {ECO:0000256|ARBA:ARBA00004406}; Peripheral membrane protein
CC {ECO:0000256|ARBA:ARBA00004406}. Endoplasmic reticulum membrane
CC {ECO:0000256|ARBA:ARBA00004115}; Single-pass type I membrane protein
CC {ECO:0000256|ARBA:ARBA00004115}. Host cell membrane
CC {ECO:0000256|ARBA:ARBA00004165}. Host cytoplasm, host perinuclear
CC region {ECO:0000256|ARBA:ARBA00004407}. Host endoplasmic reticulum
CC membrane {ECO:0000256|ARBA:ARBA00004153}; Multi-pass membrane protein
CC {ECO:0000256|ARBA:ARBA00004153}. Host endoplasmic reticulum membrane
CC {ECO:0000256|ARBA:ARBA00004291}; Peripheral membrane protein
CC {ECO:0000256|ARBA:ARBA00004291}. Host endoplasmic reticulum membrane
CC {ECO:0000256|ARBA:ARBA00004482}; Single-pass type I membrane protein
CC {ECO:0000256|ARBA:ARBA00004482}. Host lipid droplet
CC {ECO:0000256|ARBA:ARBA00004338}. Host mitochondrion
CC {ECO:0000256|ARBA:ARBA00004181}. Host nucleus
CC {ECO:0000256|ARBA:ARBA00004147}. Lipid droplet
CC {ECO:0000256|ARBA:ARBA00004502}. Membrane
CC {ECO:0000256|ARBA:ARBA00004141}; Multi-pass membrane protein
CC {ECO:0000256|ARBA:ARBA00004141}. Membrane
CC {ECO:0000256|ARBA:ARBA00004170}; Peripheral membrane protein
CC {ECO:0000256|ARBA:ARBA00004170}. Membrane
CC {ECO:0000256|ARBA:ARBA00004479}; Single-pass type I membrane protein
CC {ECO:0000256|ARBA:ARBA00004479}. Mitochondrion
CC {ECO:0000256|ARBA:ARBA00004173}. Nucleus
CC {ECO:0000256|ARBA:ARBA00004123}. Virion membrane
CC {ECO:0000256|ARBA:ARBA00004563}; Single-pass type I membrane protein
CC {ECO:0000256|ARBA:ARBA00004563}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; AB047645; BAB32878.1; -; Genomic_RNA.
DR MEROPS; S29.001; -.
DR euHCVdb; AB047645; -.
DR Proteomes; UP000141063; Genome.
DR GO; GO:0005737; C:cytoplasm; IEA:UniProtKB-SubCell.
DR GO; GO:0044167; C:host cell endoplasmic reticulum membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0044186; C:host cell lipid droplet; IEA:UniProtKB-SubCell.
DR GO; GO:0033650; C:host cell mitochondrion; IEA:UniProtKB-SubCell.
DR GO; GO:0044220; C:host cell perinuclear region of cytoplasm; IEA:UniProtKB-SubCell.
DR GO; GO:0020002; C:host cell plasma membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005811; C:lipid droplet; IEA:UniProtKB-SubCell.
DR GO; GO:0005886; C:plasma membrane; IEA:UniProtKB-SubCell.
DR GO; GO:1990904; C:ribonucleoprotein complex; IEA:UniProtKB-KW.
DR GO; GO:0019031; C:viral envelope; IEA:UniProtKB-KW.
DR GO; GO:0019013; C:viral nucleocapsid; IEA:UniProtKB-KW.
DR GO; GO:0055036; C:virion membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0005524; F:ATP binding; IEA:UniProtKB-KW.
DR GO; GO:0004197; F:cysteine-type endopeptidase activity; IEA:InterPro.
DR GO; GO:0005216; F:monoatomic ion channel activity; IEA:UniProtKB-KW.
DR GO; GO:0017111; F:ribonucleoside triphosphate phosphatase activity; IEA:UniProtKB-EC.
DR GO; GO:0003723; F:RNA binding; IEA:UniProtKB-KW.
DR GO; GO:0003724; F:RNA helicase activity; IEA:UniProtKB-EC.
DR GO; GO:0003968; F:RNA-dependent RNA polymerase activity; IEA:UniProtKB-KW.
DR GO; GO:0004252; F:serine-type endopeptidase activity; IEA:InterPro.
DR GO; GO:0005198; F:structural molecule activity; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0039563; P:disruption by virus of host JAK-STAT cascade via inhibition of STAT1 activity; IEA:UniProtKB-KW.
DR GO; GO:0039527; P:disruption by virus of host TRAF-mediated signal transduction; IEA:UniProtKB-KW.
DR GO; GO:0039654; P:fusion of virus membrane with host endosome membrane; IEA:UniProtKB-KW.
DR GO; GO:0039520; P:induction by virus of host autophagy; IEA:UniProtKB-KW.
DR GO; GO:0039645; P:perturbation by virus of host G1/S transition checkpoint; IEA:UniProtKB-KW.
DR GO; GO:0051259; P:protein complex oligomerization; IEA:UniProtKB-KW.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR GO; GO:0039502; P:suppression by virus of host type I interferon-mediated signaling pathway; IEA:UniProtKB-KW.
DR GO; GO:0039545; P:suppression by virus of host viral-induced cytoplasmic pattern recognition receptor signaling pathway via inhibition of MAVS activity; IEA:UniProtKB-KW.
DR GO; GO:0019087; P:transformation of host cell by virus; IEA:InterPro.
DR GO; GO:0046718; P:viral entry into host cell; IEA:UniProtKB-KW.
DR GO; GO:0039694; P:viral RNA genome replication; IEA:InterPro.
DR GO; GO:0019062; P:virion attachment to host cell; IEA:UniProtKB-KW.
DR GO; GO:0039707; P:virus-mediated pore formation in host cell membrane; IEA:UniProtKB-KW.
DR CDD; cd20903; HCV_p7; 1.
DR CDD; cd23202; Hepacivirus_RdRp; 1.
DR Gene3D; 2.40.10.120; -; 1.
DR Gene3D; 3.30.70.270; -; 2.
DR Gene3D; 6.10.250.1610; -; 1.
DR Gene3D; 6.10.250.1750; -; 1.
DR Gene3D; 6.10.250.2920; -; 1.
DR Gene3D; 2.20.25.210; Hepatitis C NS5A, domain 1B; 1.
DR Gene3D; 3.30.160.890; Hepatitis C virus envelope glycoprotein E1, chain C; 1.
DR Gene3D; 2.30.30.710; Hepatitis C virus non-structural protein NS2, C-terminal domain; 1.
DR Gene3D; 1.20.1280.150; Hepatitis C virus non-structural protein NS2, N-terminal domain; 1.
DR Gene3D; 2.20.25.220; Hepatitis C virus NS5A, 1B domain; 1.
DR Gene3D; 3.40.50.300; P-loop containing nucleotide triphosphate hydrolases; 2.
DR Gene3D; 1.10.820.10; RNA Helicase Chain A , domain 3; 1.
DR Gene3D; 2.40.10.10; Trypsin-like serine proteases; 1.
DR InterPro; IPR043502; DNA/RNA_pol_sf.
DR InterPro; IPR011492; Flavi_DEAD.
DR InterPro; IPR002521; HCV_Core_C.
DR InterPro; IPR044896; HCV_core_chain_A.
DR InterPro; IPR002522; HCV_core_N.
DR InterPro; IPR002519; HCV_Env.
DR InterPro; IPR002531; HCV_NS1.
DR InterPro; IPR002518; HCV_NS2.
DR InterPro; IPR042205; HCV_NS2_C.
DR InterPro; IPR042209; HCV_NS2_N.
DR InterPro; IPR000745; HCV_NS4a.
DR InterPro; IPR001490; HCV_NS4b.
DR InterPro; IPR002868; HCV_NS5a.
DR InterPro; IPR013192; HCV_NS5A_1a.
DR InterPro; IPR013193; HCV_NS5a_1B_dom.
DR InterPro; IPR038568; HCV_NS5A_1B_sf.
DR InterPro; IPR024350; HCV_NS5a_C.
DR InterPro; IPR014001; Helicase_ATP-bd.
DR InterPro; IPR001650; Helicase_C.
DR InterPro; IPR004109; HepC_NS3_protease.
DR InterPro; IPR038170; NS5A_1a_sf.
DR InterPro; IPR027417; P-loop_NTPase.
DR InterPro; IPR009003; Peptidase_S1_PA.
DR InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR InterPro; IPR007094; RNA-dir_pol_PSvirus.
DR InterPro; IPR002166; RNA_pol_HCV.
DR Pfam; PF07652; Flavi_DEAD; 1.
DR Pfam; PF01543; HCV_capsid; 1.
DR Pfam; PF01542; HCV_core; 1.
DR Pfam; PF01539; HCV_env; 1.
DR Pfam; PF01560; HCV_NS1; 1.
DR Pfam; PF01538; HCV_NS2; 1.
DR Pfam; PF01006; HCV_NS4a; 1.
DR Pfam; PF01001; HCV_NS4b; 1.
DR Pfam; PF01506; HCV_NS5a; 1.
DR Pfam; PF08300; HCV_NS5a_1a; 1.
DR Pfam; PF08301; HCV_NS5a_1b; 1.
DR Pfam; PF12941; HCV_NS5a_C; 1.
DR Pfam; PF02907; Peptidase_S29; 1.
DR Pfam; PF00998; RdRP_3; 1.
DR SMART; SM00487; DEXDc; 1.
DR SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR SUPFAM; SSF52540; P-loop containing nucleoside triphosphate hydrolases; 2.
DR SUPFAM; SSF50494; Trypsin-like serine proteases; 1.
DR PROSITE; PS51693; HCV_NS2_PRO; 1.
DR PROSITE; PS51192; HELICASE_ATP_BIND_1; 1.
DR PROSITE; PS51194; HELICASE_CTER; 1.
DR PROSITE; PS51822; HV_PV_NS3_PRO; 1.
DR PROSITE; PS50507; RDRP_SSRNA_POS; 1.
PE 4: Predicted;
KW Acetylation {ECO:0000256|ARBA:ARBA00022990};
KW Activation of host autophagy by virus {ECO:0000256|ARBA:ARBA00023050};
KW Apoptosis {ECO:0000256|ARBA:ARBA00022703};
KW ATP-binding {ECO:0000256|ARBA:ARBA00022840};
KW Capsid protein {ECO:0000256|ARBA:ARBA00022561};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Fusion of virus membrane with host endosomal membrane
KW {ECO:0000256|ARBA:ARBA00022510};
KW Fusion of virus membrane with host membrane
KW {ECO:0000256|ARBA:ARBA00022506};
KW G1/S host cell cycle checkpoint dysregulation by virus
KW {ECO:0000256|ARBA:ARBA00023309};
KW Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW Helicase {ECO:0000256|ARBA:ARBA00022806};
KW Host cell membrane {ECO:0000256|ARBA:ARBA00022511};
KW Host cytoplasm {ECO:0000256|ARBA:ARBA00023200};
KW Host endoplasmic reticulum {ECO:0000256|ARBA:ARBA00023184};
KW Host lipid droplet {ECO:0000256|ARBA:ARBA00023190};
KW Host membrane {ECO:0000256|ARBA:ARBA00022870};
KW Host mitochondrion {ECO:0000256|ARBA:ARBA00023147};
KW Host nucleus {ECO:0000256|ARBA:ARBA00022562};
KW Host-virus interaction {ECO:0000256|ARBA:ARBA00022581};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Inhibition of host innate immune response by virus
KW {ECO:0000256|ARBA:ARBA00022632};
KW Inhibition of host interferon signaling pathway by virus
KW {ECO:0000256|ARBA:ARBA00022830};
KW Inhibition of host MAVS by virus {ECO:0000256|ARBA:ARBA00022986};
KW Inhibition of host RLR pathway by virus {ECO:0000256|ARBA:ARBA00022482};
KW Inhibition of host STAT1 by virus {ECO:0000256|ARBA:ARBA00022961};
KW Inhibition of host TRAFs by virus {ECO:0000256|ARBA:ARBA00022647};
KW Interferon antiviral system evasion {ECO:0000256|ARBA:ARBA00023258};
KW Ion channel {ECO:0000256|ARBA:ARBA00023303};
KW Ion transport {ECO:0000256|ARBA:ARBA00023065};
KW Lipoprotein {ECO:0000256|ARBA:ARBA00023288};
KW Magnesium {ECO:0000256|ARBA:ARBA00022842};
KW Membrane {ECO:0000256|ARBA:ARBA00023136, ECO:0000256|SAM:Phobius};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Modulation of host cell cycle by virus {ECO:0000256|ARBA:ARBA00022504};
KW Multifunctional enzyme {ECO:0000256|ARBA:ARBA00023268};
KW Nucleotide-binding {ECO:0000256|ARBA:ARBA00022741};
KW Nucleotidyltransferase {ECO:0000256|ARBA:ARBA00022695};
KW Palmitate {ECO:0000256|ARBA:ARBA00023139};
KW Phosphoprotein {ECO:0000256|ARBA:ARBA00022553};
KW Protease {ECO:0000256|ARBA:ARBA00022670};
KW Ribonucleoprotein {ECO:0000256|ARBA:ARBA00023274};
KW RNA-binding {ECO:0000256|ARBA:ARBA00022884};
KW RNA-directed RNA polymerase {ECO:0000256|ARBA:ARBA00022484};
KW Serine protease {ECO:0000256|ARBA:ARBA00022825};
KW Thiol protease {ECO:0000256|ARBA:ARBA00022807};
KW Transcription {ECO:0000256|ARBA:ARBA00023163};
KW Transcription regulation {ECO:0000256|ARBA:ARBA00023015};
KW Transferase {ECO:0000256|ARBA:ARBA00022679};
KW Transmembrane {ECO:0000256|ARBA:ARBA00022692, ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|ARBA:ARBA00022989,
KW ECO:0000256|SAM:Phobius}; Transport {ECO:0000256|ARBA:ARBA00022448};
KW Viral attachment to host cell {ECO:0000256|ARBA:ARBA00022804};
KW Viral envelope protein {ECO:0000256|ARBA:ARBA00022879};
KW Viral immunoevasion {ECO:0000256|ARBA:ARBA00023280};
KW Viral ion channel {ECO:0000256|ARBA:ARBA00023039};
KW Viral nucleoprotein {ECO:0000256|ARBA:ARBA00023086};
KW Viral penetration into host cytoplasm {ECO:0000256|ARBA:ARBA00022595};
KW Viral RNA replication {ECO:0000256|ARBA:ARBA00022953};
KW Virion {ECO:0000256|ARBA:ARBA00022844};
KW Virus entry into host cell {ECO:0000256|ARBA:ARBA00023296};
KW Zinc {ECO:0000256|ARBA:ARBA00022833}.
FT TRANSMEM 359..381
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 721..744
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 756..781
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 788..806
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 818..838
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 876..896
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 1662..1686
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 1790..1815
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 1827..1848
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 1854..1873
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 1885..1905
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 3012..3031
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 906..1029
FT /note="Peptidase C18"
FT /evidence="ECO:0000259|PROSITE:PS51693"
FT DOMAIN 1030..1211
FT /note="Peptidase S29"
FT /evidence="ECO:0000259|PROSITE:PS51822"
FT DOMAIN 1220..1372
FT /note="Helicase ATP-binding"
FT /evidence="ECO:0000259|PROSITE:PS51192"
FT DOMAIN 1364..1541
FT /note="Helicase C-terminal"
FT /evidence="ECO:0000259|PROSITE:PS51194"
FT DOMAIN 2655..2773
FT /note="RdRp catalytic"
FT /evidence="ECO:0000259|PROSITE:PS50507"
FT REGION 1..79
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2307..2332
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT REGION 2349..2430
FT /note="Disordered"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 47..74
FT /note="Basic and acidic residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
FT COMPBIAS 2367..2390
FT /note="Polar residues"
FT /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ SEQUENCE 3032 AA; 329395 MW; 1FE2A9DBB3442677 CRC64;
MSTNPKPQRK TKRNTNRRPQ DVKFPGGGQI VGGVYLLPRR GPRLGVRATR KTSERSQPRG
RRQPIPKDRR STGKSWGKPG YPWPLYGNEG LGWAGWLLSP RGSRPSWGPT DPRHRSRNVG
KVIDTLTCGF ADLMGYIPVV GAPLGGVARA LAHGVRVLED GVNYATGNLP GCSFSIFLLA
LLSCITTPVS AAQVKNTSDI YMVTNDCSND SITWQLQAAA LHVPGCVPCE RVGNTSRCWI
PVSPNVAVRQ PGALTQGLRT HIDMVVMSAT LCSALYVGDL CGGVMLAAQM FIISPQHHWF
VQECNCSIYP GTITGHRMAW DMMMNWSPTT TMILAYVMRV PEVIIDIIGG AHWGVMFGLA
YFSMQGAWAK VVVILLLAAG VDAHTRTGSS VGYATSGIVG LFTSGPKQNI QLINTNGSWH
INRTALNCND SLNTGFIVSL FYARNFNSTG CPERLSACRG IEGFRIGWGT LQYEDNVTNP
EDMRPYCWHY PPKQCGIVPA GSVCGPVYCF TPSPVVVGTT DRLGVPTYTW GENETDVFLL
NSTRPPVGSW FGCTWMNSTG FTKTCGAPPC RIRADFNAST DLLCPTDCFR KHPEATYIKC
GSGPWLTPRC LVDYPYRLWH YPCTVNYSIF KIRMYVGGVE HRLTAACNFT RGDRCNLEDR
DRSQLTPLLH STTEWAILPC TYSDLPALST GLLHLHQNIV DVQYMYGLSP ALTKYVVRWE
WVVLLFLLLA DARVCACVWM LILLGQAEAA LEKLVVLHAA STASCNGFLY FVIFFVAAWY
IKGRAVPLAT YSLTGLWPFC LLLLALPQQA YAYDASVHGQ VGVVLLILIT LFTLTPGYKT
LLSRCLWWLC YILTLGEAMV QEWAPSMQAR GGRDGIIWAA TIFCPGIVFD ITKWLLAVLG
PAYLLRDALT RVPYFVRAHA LLRMCTMVRH LAGGRYVQMA LLALGRWTGT YIYDHLTPMS
DWAASGLRDL AVAVEPIIFS PMEKKVIVWG AETAACGDIL HGLPVSARLG REILLGPADG
YTSKGWRLLA PITAYAQQTR GLLGTIVVSM TGRDKTEQAG EIQVLSTVTQ SFLGTSIAGV
LWTVYHGAGN KTLAGSRGPV TQMYSSAEGD LVGWPSPPGT KSLEPCTCGS VDLYLVTRNA
DVIPVRRRGD KRGALLSPRP LSTLKGSSGG PVLCPRGHAV GIFRAAVCSR GVAKSIDFIP
VETLDIVTRS PTFSDNSTPP AVPQTYQVGY LHAPTGSGKS TKVPVAYAAQ GYKVLVLNPS
VAATLGFGAY LSKAHGINPN IRTGVRTVTT GEAITYSTYG KFLADGGCSG GAYDIIICDE
CHATDSTTIL GIGTVLDQAE TAGVRLTVLA TATPPGSVTT PHPNIEEVAL GQEGEIPFYG
KAIPLSYIKG GRHLIFCHSK KKCEELAVAL RAMGLNAVAY YRGLDVSIIP TQGDVVVVAT
DALMTGYTGD FDSVIDCNVA VTQVIDFSLD PTFTITTQTV PQDAVSRSQR RGRTGRGRLG
TYRYVSTGER ASGMFDSVVL CECYDAGAAW YELTPAETTV RLRAYFNTPG LPVCQDHLEF
WEGAFTDLTH IDAHFLSQTK QAGENFAYLV AYQATVCARA KAPPPSWDIM WKCLTRLKPT
LVGPTPLLYR LGSVTNEVTL THPVTKYIAT CMQADLEVMT STWILAGGVL AAVAAYCLAT
GCVSIIGRLH VNQRAVVAPD KEVPYEAFDE MEECASRAAL IEEGQRIAEM LKSKIQGLLQ
QASKQAQNIQ PAVQTSWPKV EQFWAKHMWN FISGIQYLAG LSTLPGNPAV ASMMAFSAAL
TSPLSTSTTI LLNILGGWLA SQIAPPAGAT GFVVSGLVGA AIGSIGLGKV LVDILAGYGA
GISGALVAFK IMSGEKPSME DVVNLLPGIL SPGALVVGVI CAAILRRHVG PGEGAVQWMN
RLIAFASRGN HVAPTHYVTE SDASQRVTQL LGSLTITSLL RRLHNWITED CPIPCAGSWL
RDVWDWVCSI LTDFKNWLTS KLFPKMPGLP FISCQKGYKG VWAGTGIMTT RCPCGAVISG
NVRLGSMRIT GPKTCMNTWQ GTFPINCYTE GQCVPKPAPN FKTAIWRVAA SEYAEVTQHG
SYSYITGLTT DNLKVPCQLP SPEFFSWVDG VQIHRFAPTP KPFFRDEVSF CVGLNSFVVG
SQLPCDPEPD TDVLMSMLTD PSHITAEAAA RRLARGSPPS EASSSASQLS AASLRATCTT
HGKTYDVDMV DASLFMRGDV TRIESESKVV VLDYLDPMVE ETSDREPSIP SEYMLPRNRF
PPALPAWARP DYNPPLVESW KRPDYQPPTV AGCALPPPKK TPTPPPRRRR TVGLSESTIG
DALQQLAVKS FGQPPPSGDS GLSTGADAAD SGGQTSPDEL ALSETGSISS IPPLEGEPGD
PDLEPEQVEL QPPPQGGEVA PGSDSGSWST CSEEDDAVVC CSMSYSWTGA LITPCSPEEE
KLPINPLSNS LLRYHNKVYC TTSKSASLRA KKVTFDRMQV LDTYYDSVLK DIKLAASKVS
ARLLTLEEAC QLTPPHSARS KYGFGAKEVR SLTRRAVNHI KSVWEDLLED SQTPIPTTIM
AKNEVFCVDP TKGGKKAARL IVYPDLGVRV CEKMALYDVT QKLPQAVMGA SYGFQYSPSQ
RVEFLLKAWA EKKGPMGFSY DTRCFDSTVT ERDIKTEESI YQACSLPEEA RTAIHSLTER
LYVGGPMFNS KGQSCGYRRC RASGVLTTSM GNTITCYVKA LAACKAAGIV APTMLVCGDD
LVVISESQGT EEDERNLRAF TEAMTRYSAP PGDPPRPEYD LELITSCSSN VSVALGPQGR
RRYYLTRDPA TPIARAARET VRHSPVNSWL GNIIQYAPTI WVRMVLMTHF FSILMAQDTL
DQNLNFEMYG SVYSVSPLDL PAIIERLHGL GAFSLHTYTP HELTRVASAL RKLGAPPLRT
WKSRARAVRA SLISRGGRAA VCGRYLFNWA VRTKLKLTPL PEARLLDLSS WFTVGAGGGD
IYHSVSRARP RLLLLSLLLL FVGVGLFLLP AR
//