GenomeNet

Database: UniProt
Entry: F8WTL9_9HEPC
LinkDB: F8WTL9_9HEPC
Original site: F8WTL9_9HEPC 
ID   F8WTL9_9HEPC            Unreviewed;      3034 AA.
AC   F8WTL9;
DT   19-OCT-2011, integrated into UniProtKB/TrEMBL.
DT   19-OCT-2011, sequence version 1.
DT   27-MAR-2024, entry version 78.
DE   RecName: Full=Genome polyprotein {ECO:0000256|ARBA:ARBA00020107};
OS   Hepatitis C virus subtype 2b.
OC   Viruses; Riboviria; Orthornavirae; Kitrinoviricota; Flasuviricetes;
OC   Amarillovirales; Flaviviridae; Hepacivirus; Hepacivirus hominis.
OX   NCBI_TaxID=31650 {ECO:0000313|EMBL:BAK61636.1, ECO:0000313|Proteomes:UP000097155};
RN   [1] {ECO:0000313|Proteomes:UP000097155}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RX   PubMed=21935415;
RA   Kadokura M., Maekawa S., Sueki R., Miura M., Komase K., Shindo H.,
RA   Amemiya F., Uetake T., Inoue T., Sakamoto M., Nakagawa M., Sakamoto N.,
RA   Watanabe M., Enomoto N.;
RT   "Analysis of the complete open reading frame of genotype 2b hepatitis C
RT   virus in association with the response to peginterferon and ribavirin
RT   therapy.";
RL   PLoS ONE 6:e24514-e24514(2011).
CC   -!- CATALYTIC ACTIVITY:
CC       Reaction=ATP + H2O = ADP + H(+) + phosphate; Xref=Rhea:RHEA:13065,
CC         ChEBI:CHEBI:15377, ChEBI:CHEBI:15378, ChEBI:CHEBI:30616,
CC         ChEBI:CHEBI:43474, ChEBI:CHEBI:456216; EC=3.6.4.13;
CC         Evidence={ECO:0000256|ARBA:ARBA00001556};
CC   -!- CATALYTIC ACTIVITY:
CC       Reaction=Hydrolysis of four peptide bonds in the viral precursor
CC         polyprotein, commonly with Asp or Glu in the P6 position, Cys or Thr
CC         in P1 and Ser or Ala in P1'.; EC=3.4.21.98;
CC         Evidence={ECO:0000256|ARBA:ARBA00001117};
CC   -!- CATALYTIC ACTIVITY:
CC       Reaction=a ribonucleoside 5'-triphosphate + H2O = a ribonucleoside 5'-
CC         diphosphate + H(+) + phosphate; Xref=Rhea:RHEA:23680,
CC         ChEBI:CHEBI:15377, ChEBI:CHEBI:15378, ChEBI:CHEBI:43474,
CC         ChEBI:CHEBI:57930, ChEBI:CHEBI:61557; EC=3.6.1.15;
CC         Evidence={ECO:0000256|ARBA:ARBA00001491};
CC   -!- COFACTOR:
CC       Name=Mg(2+); Xref=ChEBI:CHEBI:18420;
CC         Evidence={ECO:0000256|ARBA:ARBA00001946};
CC   -!- COFACTOR:
CC       Name=Zn(2+); Xref=ChEBI:CHEBI:29105;
CC         Evidence={ECO:0000256|ARBA:ARBA00001947};
CC   -!- SUBCELLULAR LOCATION: Cell membrane {ECO:0000256|ARBA:ARBA00004236}.
CC       Cytoplasm {ECO:0000256|ARBA:ARBA00004496}. Endoplasmic reticulum
CC       membrane {ECO:0000256|ARBA:ARBA00004477}; Multi-pass membrane protein
CC       {ECO:0000256|ARBA:ARBA00004477}. Endoplasmic reticulum membrane
CC       {ECO:0000256|ARBA:ARBA00004406}; Peripheral membrane protein
CC       {ECO:0000256|ARBA:ARBA00004406}. Endoplasmic reticulum membrane
CC       {ECO:0000256|ARBA:ARBA00004389}; Single-pass membrane protein
CC       {ECO:0000256|ARBA:ARBA00004389}. Endoplasmic reticulum membrane
CC       {ECO:0000256|ARBA:ARBA00004115}; Single-pass type I membrane protein
CC       {ECO:0000256|ARBA:ARBA00004115}. Host cell membrane
CC       {ECO:0000256|ARBA:ARBA00004165}. Host cytoplasm, host perinuclear
CC       region {ECO:0000256|ARBA:ARBA00004407}. Host endoplasmic reticulum
CC       membrane {ECO:0000256|ARBA:ARBA00004153}; Multi-pass membrane protein
CC       {ECO:0000256|ARBA:ARBA00004153}. Host endoplasmic reticulum membrane
CC       {ECO:0000256|ARBA:ARBA00004291}; Peripheral membrane protein
CC       {ECO:0000256|ARBA:ARBA00004291}. Host endoplasmic reticulum membrane
CC       {ECO:0000256|ARBA:ARBA00004517}; Single-pass membrane protein
CC       {ECO:0000256|ARBA:ARBA00004517}. Host endoplasmic reticulum membrane
CC       {ECO:0000256|ARBA:ARBA00004482}; Single-pass type I membrane protein
CC       {ECO:0000256|ARBA:ARBA00004482}. Host lipid droplet
CC       {ECO:0000256|ARBA:ARBA00004338}. Host mitochondrion membrane
CC       {ECO:0000256|ARBA:ARBA00004458}; Single-pass type I membrane protein
CC       {ECO:0000256|ARBA:ARBA00004458}. Host nucleus
CC       {ECO:0000256|ARBA:ARBA00004147}. Lipid droplet
CC       {ECO:0000256|ARBA:ARBA00004502}. Membrane
CC       {ECO:0000256|ARBA:ARBA00004141}; Multi-pass membrane protein
CC       {ECO:0000256|ARBA:ARBA00004141}. Membrane
CC       {ECO:0000256|ARBA:ARBA00004170}; Peripheral membrane protein
CC       {ECO:0000256|ARBA:ARBA00004170}. Membrane
CC       {ECO:0000256|ARBA:ARBA00004167}; Single-pass membrane protein
CC       {ECO:0000256|ARBA:ARBA00004167}. Membrane
CC       {ECO:0000256|ARBA:ARBA00004479}; Single-pass type I membrane protein
CC       {ECO:0000256|ARBA:ARBA00004479}. Mitochondrion membrane
CC       {ECO:0000256|ARBA:ARBA00004583}; Single-pass type I membrane protein
CC       {ECO:0000256|ARBA:ARBA00004583}. Nucleus
CC       {ECO:0000256|ARBA:ARBA00004123}. Virion membrane
CC       {ECO:0000256|ARBA:ARBA00004563}; Single-pass type I membrane protein
CC       {ECO:0000256|ARBA:ARBA00004563}.
CC   -!- SIMILARITY: Belongs to the hepacivirus polyprotein family.
CC       {ECO:0000256|ARBA:ARBA00008286}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; AB661404; BAK61636.1; -; Genomic_RNA.
DR   MEROPS; S29.001; -.
DR   Proteomes; UP000097155; Genome.
DR   GO; GO:0005737; C:cytoplasm; IEA:UniProtKB-SubCell.
DR   GO; GO:0044167; C:host cell endoplasmic reticulum membrane; IEA:UniProtKB-SubCell.
DR   GO; GO:0044186; C:host cell lipid droplet; IEA:UniProtKB-SubCell.
DR   GO; GO:0044191; C:host cell mitochondrial membrane; IEA:UniProtKB-SubCell.
DR   GO; GO:0044220; C:host cell perinuclear region of cytoplasm; IEA:UniProtKB-SubCell.
DR   GO; GO:0020002; C:host cell plasma membrane; IEA:UniProtKB-SubCell.
DR   GO; GO:0005811; C:lipid droplet; IEA:UniProtKB-SubCell.
DR   GO; GO:0005886; C:plasma membrane; IEA:UniProtKB-SubCell.
DR   GO; GO:1990904; C:ribonucleoprotein complex; IEA:UniProtKB-KW.
DR   GO; GO:0019031; C:viral envelope; IEA:UniProtKB-KW.
DR   GO; GO:0019013; C:viral nucleocapsid; IEA:UniProtKB-KW.
DR   GO; GO:0055036; C:virion membrane; IEA:UniProtKB-SubCell.
DR   GO; GO:0005524; F:ATP binding; IEA:UniProtKB-KW.
DR   GO; GO:0004197; F:cysteine-type endopeptidase activity; IEA:InterPro.
DR   GO; GO:0005216; F:monoatomic ion channel activity; IEA:UniProtKB-KW.
DR   GO; GO:0017111; F:ribonucleoside triphosphate phosphatase activity; IEA:UniProtKB-EC.
DR   GO; GO:0003723; F:RNA binding; IEA:UniProtKB-KW.
DR   GO; GO:0003724; F:RNA helicase activity; IEA:UniProtKB-EC.
DR   GO; GO:0003968; F:RNA-dependent RNA polymerase activity; IEA:UniProtKB-KW.
DR   GO; GO:0004252; F:serine-type endopeptidase activity; IEA:InterPro.
DR   GO; GO:0005198; F:structural molecule activity; IEA:InterPro.
DR   GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR   GO; GO:0075512; P:clathrin-dependent endocytosis of virus by host cell; IEA:UniProtKB-KW.
DR   GO; GO:0039563; P:disruption by virus of host JAK-STAT cascade via inhibition of STAT1 activity; IEA:UniProtKB-KW.
DR   GO; GO:0039527; P:disruption by virus of host TRAF-mediated signal transduction; IEA:UniProtKB-KW.
DR   GO; GO:0039654; P:fusion of virus membrane with host endosome membrane; IEA:UniProtKB-KW.
DR   GO; GO:0039520; P:induction by virus of host autophagy; IEA:UniProtKB-KW.
DR   GO; GO:0039645; P:perturbation by virus of host G1/S transition checkpoint; IEA:UniProtKB-KW.
DR   GO; GO:0051259; P:protein complex oligomerization; IEA:UniProtKB-KW.
DR   GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR   GO; GO:0039502; P:suppression by virus of host type I interferon-mediated signaling pathway; IEA:UniProtKB-KW.
DR   GO; GO:0039545; P:suppression by virus of host viral-induced cytoplasmic pattern recognition receptor signaling pathway via inhibition of MAVS activity; IEA:UniProtKB-KW.
DR   GO; GO:0019087; P:transformation of host cell by virus; IEA:InterPro.
DR   GO; GO:0039694; P:viral RNA genome replication; IEA:InterPro.
DR   GO; GO:0019062; P:virion attachment to host cell; IEA:UniProtKB-KW.
DR   GO; GO:0039707; P:virus-mediated pore formation in host cell membrane; IEA:UniProtKB-KW.
DR   CDD; cd20903; HCV_p7; 1.
DR   CDD; cd23202; Hepacivirus_RdRp; 1.
DR   Gene3D; 2.40.10.120; -; 1.
DR   Gene3D; 3.30.70.270; -; 2.
DR   Gene3D; 6.10.250.1610; -; 1.
DR   Gene3D; 6.10.250.1750; -; 1.
DR   Gene3D; 6.10.250.2920; -; 1.
DR   Gene3D; 2.20.25.210; Hepatitis C NS5A, domain 1B; 1.
DR   Gene3D; 3.30.160.890; Hepatitis C virus envelope glycoprotein E1, chain C; 1.
DR   Gene3D; 2.30.30.710; Hepatitis C virus non-structural protein NS2, C-terminal domain; 1.
DR   Gene3D; 1.20.1280.150; Hepatitis C virus non-structural protein NS2, N-terminal domain; 1.
DR   Gene3D; 2.20.25.220; Hepatitis C virus NS5A, 1B domain; 1.
DR   Gene3D; 3.40.50.300; P-loop containing nucleotide triphosphate hydrolases; 2.
DR   Gene3D; 1.10.820.10; RNA Helicase Chain A , domain 3; 1.
DR   Gene3D; 2.40.10.10; Trypsin-like serine proteases; 1.
DR   InterPro; IPR043502; DNA/RNA_pol_sf.
DR   InterPro; IPR011492; Flavi_DEAD.
DR   InterPro; IPR002521; HCV_Core_C.
DR   InterPro; IPR044896; HCV_core_chain_A.
DR   InterPro; IPR002522; HCV_core_N.
DR   InterPro; IPR002519; HCV_Env.
DR   InterPro; IPR002531; HCV_NS1.
DR   InterPro; IPR002518; HCV_NS2.
DR   InterPro; IPR042205; HCV_NS2_C.
DR   InterPro; IPR042209; HCV_NS2_N.
DR   InterPro; IPR000745; HCV_NS4a.
DR   InterPro; IPR001490; HCV_NS4b.
DR   InterPro; IPR002868; HCV_NS5a.
DR   InterPro; IPR013192; HCV_NS5A_1a.
DR   InterPro; IPR013193; HCV_NS5a_1B_dom.
DR   InterPro; IPR038568; HCV_NS5A_1B_sf.
DR   InterPro; IPR024350; HCV_NS5a_C.
DR   InterPro; IPR014001; Helicase_ATP-bd.
DR   InterPro; IPR001650; Helicase_C.
DR   InterPro; IPR004109; HepC_NS3_protease.
DR   InterPro; IPR038170; NS5A_1a_sf.
DR   InterPro; IPR027417; P-loop_NTPase.
DR   InterPro; IPR009003; Peptidase_S1_PA.
DR   InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR   InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR   InterPro; IPR007094; RNA-dir_pol_PSvirus.
DR   InterPro; IPR002166; RNA_pol_HCV.
DR   Pfam; PF07652; Flavi_DEAD; 1.
DR   Pfam; PF01543; HCV_capsid; 1.
DR   Pfam; PF01542; HCV_core; 1.
DR   Pfam; PF01539; HCV_env; 1.
DR   Pfam; PF01560; HCV_NS1; 1.
DR   Pfam; PF01538; HCV_NS2; 1.
DR   Pfam; PF01006; HCV_NS4a; 1.
DR   Pfam; PF01001; HCV_NS4b; 1.
DR   Pfam; PF01506; HCV_NS5a; 1.
DR   Pfam; PF08300; HCV_NS5a_1a; 1.
DR   Pfam; PF08301; HCV_NS5a_1b; 1.
DR   Pfam; PF12941; HCV_NS5a_C; 1.
DR   Pfam; PF02907; Peptidase_S29; 1.
DR   Pfam; PF00998; RdRP_3; 1.
DR   SMART; SM00487; DEXDc; 1.
DR   SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR   SUPFAM; SSF52540; P-loop containing nucleoside triphosphate hydrolases; 2.
DR   SUPFAM; SSF50494; Trypsin-like serine proteases; 1.
DR   PROSITE; PS51693; HCV_NS2_PRO; 1.
DR   PROSITE; PS51192; HELICASE_ATP_BIND_1; 1.
DR   PROSITE; PS51194; HELICASE_CTER; 1.
DR   PROSITE; PS51822; HV_PV_NS3_PRO; 1.
DR   PROSITE; PS50507; RDRP_SSRNA_POS; 1.
PE   3: Inferred from homology;
KW   Acetylation {ECO:0000256|ARBA:ARBA00022990};
KW   Activation of host autophagy by virus {ECO:0000256|ARBA:ARBA00023050};
KW   Apoptosis {ECO:0000256|ARBA:ARBA00022703};
KW   ATP-binding {ECO:0000256|ARBA:ARBA00022840};
KW   Capsid protein {ECO:0000256|ARBA:ARBA00022561};
KW   Clathrin-mediated endocytosis of virus by host
KW   {ECO:0000256|ARBA:ARBA00022570};
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Fusion of virus membrane with host endosomal membrane
KW   {ECO:0000256|ARBA:ARBA00022510};
KW   Fusion of virus membrane with host membrane
KW   {ECO:0000256|ARBA:ARBA00022506};
KW   G1/S host cell cycle checkpoint dysregulation by virus
KW   {ECO:0000256|ARBA:ARBA00023309};
KW   Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW   Helicase {ECO:0000256|ARBA:ARBA00022806};
KW   Host cell membrane {ECO:0000256|ARBA:ARBA00022511};
KW   Host cytoplasm {ECO:0000256|ARBA:ARBA00023200};
KW   Host endoplasmic reticulum {ECO:0000256|ARBA:ARBA00023184};
KW   Host lipid droplet {ECO:0000256|ARBA:ARBA00023190};
KW   Host membrane {ECO:0000256|ARBA:ARBA00022870};
KW   Host mitochondrion {ECO:0000256|ARBA:ARBA00023147};
KW   Host nucleus {ECO:0000256|ARBA:ARBA00022562};
KW   Host-virus interaction {ECO:0000256|ARBA:ARBA00022581};
KW   Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW   Inhibition of host innate immune response by virus
KW   {ECO:0000256|ARBA:ARBA00022632};
KW   Inhibition of host interferon signaling pathway by virus
KW   {ECO:0000256|ARBA:ARBA00022830};
KW   Inhibition of host MAVS by virus {ECO:0000256|ARBA:ARBA00022986};
KW   Inhibition of host RLR pathway by virus {ECO:0000256|ARBA:ARBA00022482};
KW   Inhibition of host STAT1 by virus {ECO:0000256|ARBA:ARBA00022961};
KW   Inhibition of host TRAFs by virus {ECO:0000256|ARBA:ARBA00022647};
KW   Interferon antiviral system evasion {ECO:0000256|ARBA:ARBA00023258};
KW   Ion channel {ECO:0000256|ARBA:ARBA00023303};
KW   Ion transport {ECO:0000256|ARBA:ARBA00023065};
KW   Lipoprotein {ECO:0000256|ARBA:ARBA00023288};
KW   Magnesium {ECO:0000256|ARBA:ARBA00022842};
KW   Membrane {ECO:0000256|ARBA:ARBA00023136, ECO:0000256|SAM:Phobius};
KW   Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW   Modulation of host cell cycle by virus {ECO:0000256|ARBA:ARBA00022504};
KW   Multifunctional enzyme {ECO:0000256|ARBA:ARBA00023268};
KW   Nucleotide-binding {ECO:0000256|ARBA:ARBA00022741};
KW   Nucleotidyltransferase {ECO:0000256|ARBA:ARBA00022695};
KW   Palmitate {ECO:0000256|ARBA:ARBA00023139};
KW   Phosphoprotein {ECO:0000256|ARBA:ARBA00022553};
KW   Protease {ECO:0000256|ARBA:ARBA00022670};
KW   Ribonucleoprotein {ECO:0000256|ARBA:ARBA00023274};
KW   RNA-binding {ECO:0000256|ARBA:ARBA00022884};
KW   RNA-directed RNA polymerase {ECO:0000256|ARBA:ARBA00022484};
KW   Serine protease {ECO:0000256|ARBA:ARBA00022825};
KW   Thiol protease {ECO:0000256|ARBA:ARBA00022807};
KW   Transcription {ECO:0000256|ARBA:ARBA00023163};
KW   Transcription regulation {ECO:0000256|ARBA:ARBA00023015};
KW   Transferase {ECO:0000256|ARBA:ARBA00022679};
KW   Transmembrane {ECO:0000256|ARBA:ARBA00022692, ECO:0000256|SAM:Phobius};
KW   Transmembrane helix {ECO:0000256|ARBA:ARBA00022989,
KW   ECO:0000256|SAM:Phobius}; Transport {ECO:0000256|ARBA:ARBA00022448};
KW   Ubl conjugation {ECO:0000256|ARBA:ARBA00022843};
KW   Viral attachment to host cell {ECO:0000256|ARBA:ARBA00022804};
KW   Viral envelope protein {ECO:0000256|ARBA:ARBA00022879};
KW   Viral immunoevasion {ECO:0000256|ARBA:ARBA00023280};
KW   Viral ion channel {ECO:0000256|ARBA:ARBA00023039};
KW   Viral nucleoprotein {ECO:0000256|ARBA:ARBA00023086};
KW   Viral penetration into host cytoplasm {ECO:0000256|ARBA:ARBA00022595};
KW   Viral RNA replication {ECO:0000256|ARBA:ARBA00022953};
KW   Virion {ECO:0000256|ARBA:ARBA00022844};
KW   Virus endocytosis by host {ECO:0000256|ARBA:ARBA00022890};
KW   Virus entry into host cell {ECO:0000256|ARBA:ARBA00023296};
KW   Zinc {ECO:0000256|ARBA:ARBA00022833}.
FT   TRANSMEM        722..743
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        755..781
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        787..808
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        820..839
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        1664..1687
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        1828..1848
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        1855..1874
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        1886..1906
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        3014..3033
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   DOMAIN          907..1030
FT                   /note="Peptidase C18"
FT                   /evidence="ECO:0000259|PROSITE:PS51693"
FT   DOMAIN          1031..1212
FT                   /note="Peptidase S29"
FT                   /evidence="ECO:0000259|PROSITE:PS51822"
FT   DOMAIN          1221..1373
FT                   /note="Helicase ATP-binding"
FT                   /evidence="ECO:0000259|PROSITE:PS51192"
FT   DOMAIN          1365..1542
FT                   /note="Helicase C-terminal"
FT                   /evidence="ECO:0000259|PROSITE:PS51194"
FT   DOMAIN          2657..2775
FT                   /note="RdRp catalytic"
FT                   /evidence="ECO:0000259|PROSITE:PS50507"
FT   REGION          1..31
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          45..79
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2193..2212
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2353..2432
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        47..74
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2198..2212
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2358..2391
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   3034 AA;  330459 MW;  6AB1017FD3E68312 CRC64;
     MSTNPKPQRK TKRNTNRRPQ DVKFPGGGQI VGGVYVLPRR SLRLGVRATR KTPERSQPRG
     RRQPIPKDRR STGKSWGKPG YPWPLYGNEG CGWAGWLLSP RGSRPTWGPT DPRHRSRNLG
     KVIDTLTCGF ADLMGYIPVV GAPVGGVARA LAHGVRVLED GVNYATGNLP GCSFSIFLLA
     LLSCVTVPVS AVEVRNISTS YYATNDCSNN SITWQLTNAV LHLPGCVPCE NDNGILRCWI
     QVTPNVAVKH RGALTHNLRT HVDVIVMAAT VCSALYVGDV CGAVMIVSQA LIISPERHNF
     TQECNCSIYQ GHITGHRMAW DMMLNWSPTL TMILAYAARV PEMVLEVTFG GHWGVVFGLA
     YFSLQGAWAK VIAILLLVAG VDAGTYTTGG EAGRSTKGIA SLFDLGAQQR LSLVNTNGSW
     HINRTALNCN DSLQTGFIAA LFYANKFNSS GCPERLSSCR GLDYFRIGWG TLEYETNVTN
     DEDMRPYCWH YPPKPCGIVS AKTVCGPVYC FTPSPVVVGT TDKQGVPTHT WGENETDVFL
     LNSTRPPQGA WFGCTWMNGT GFTKTCGAPP CRTRKDFNST SDLLCPTDCF RKHPDATYLK
     CGAGPWLTPR CLIDYPYRLW HYPCTVNFTI FKVRMYVGGV EHRFSAACNF TRGDRCGLED
     RDRGQQSPLL HSTTEWAVLP CTYSDLPALS TGLLHLHQSI VDVQYLYGLS PAITRYIVKW
     EWVVLLFLLL ADARVCACLW MLIILGQAEA ALEKLIILHS ASAASANGPL WFLIFFTAAW
     YLKGRVVPMA TYSALGLWSF LLLVLTLPQR AYALDGAEQG ELGLVVLVII SIFTLTPAYK
     TLLSRSVWWL CYMLVLAEDQ IQQWVPPLEV RGGRDGIIWV AVVLYPRTVF EVTKWLLAVL
     GPAHLLKASL LRVPYFVRAH ALLRVCTLVR HLAGGRYIQM LLITMGRWTG TYIYDHLSPL
     STWAAQGLRD LAVAVEPVVF SPMEKKVIEW GAETVACGDI LHGLPVSARL GREVLLGPAD
     SYTSKGWRLL APITAYTQQT RGVLGAIVVS LTGRDKNEQA GQVQVLSSVT QSFLGTTISG
     VLWTVYHGAG NKTLAGPKGP VTQMYTSAEG DLVGWPSPPG TKSLDPCTCG AVDLYLVTRN
     ADVIPVRRKD DRRGALLSPR PLSTLKGSSG GPVLCSRGHV VGLFRAAVCT RGVAKSIDFI
     PVESLDITAR TPSFSDNSTP PAVPQTYQVG YLHAPTGSGK STKVPAAYAS QGYKVLVLNP
     SVAATLGFGA YMSKAHGIDP NIRTGVRTVT TGDPITYSTY GKFLADGGCS AGAYDIIICD
     ECHSVDATTI LGIGTVLDQA ETAGVRLVVL ATATPPGTVT TPHSNIEEVA LGHEGEIPFY
     GKAIPLASIK GGRHLIFCHS KKKCDELAAA LRGMGINAVA YYRGLDVSII PTQGDVVVVA
     TDALMTGYTG DFDSVIDCNV AVTQIVDFSL DPTFTITTQT VPQDAVSRSQ RRGRTGRGRL
     GIYRYVSSGE RPSGMFDSVV LCECYDAGAA WYELTPAETT VRLRAYFNTP GLPVCQDHLE
     FWEAVFTGLT HIDAHFLSQT KQGGDNFAYL MAYQATVCAR AKAPPPSWDV MWKCLARLKP
     ILTGPTPLLY RLGTVTNEVT LTHPVTKYIA TCMQADLEIM TSTWVLAGGV LAAVAAYCLA
     TGCISIIGRL HLNDQVVVTP DKEILYEAFD EMEECASKAA LIEEGQRMAE MLKSKVQGLL
     QQATRQAQDI QPAIQSSWPK LEQFWAKHMW NFISGIQYLA GLSTLPGNPA VASMMAFSAA
     LTSPLPTSTT ILLNIMGGWL ASQIAPPAGA TGFVVSGLVG AAVGSIGLGK ILVDILAGYG
     AGISGALVAF KIMSGEKPTV EDVVNLLPAI LSPGALVVGV ICAAILRRHV GQGEGAVQWM
     NRLIAFASRG NHVAPTHYVA ESDASHRVTQ VLSSLTITSL LRRLHAWISA DCPVPCSGSW
     LRDVWDWVCS ILTDFKNWLS SKLLPKMPGI PFISCQKGYR GVWSGTGIMT TRCPCGANIS
     GHVRMGTMKI TGPKTCLNLW QGTFPINCYT EGPCVPKPSP NYKTAIWRVA ASEYVEVTQH
     SSFSYITGLT SDNLKVPCQV PAPEFFSWVD GVQIHRFAPT PGPFFRDEVT FTVGLNSFVV
     GSQLPCDPEP DTEVLASMLT DPSHITAEAA ARRLARGSPP SQASSSASQL SAPSLKATCT
     THKMAYDCDM VDVTLFMRED VTRIESDSKV IVLDYLDSMT EVKDDREPSV PSEYLVPRKK
     FPPALPPWAH PDYNPPVLET WKRSDYEPPT VLGCALPPTP QAPVPPPRRR RARVLTQDNV
     EGVLREMADK VLSPLQDHND SGRPTGADTG GDSVQQPSDG ADASETGSLS SMPPLEGEPG
     DPDLEFEPAG SAPPSEEERE AVNSGSSSWS TVSDQEDSVV CCSMSYSWTG ALITPCGPEE
     EKLPINPLSN SLMRFHNKVY STTSRSASLR AKKVTFDRVQ VLDAHYDSVL QDVKRAASTV
     SARLLSVEEA CALTPPHSAK SRYGFGAKEV RSLSRRAVNH IRSVWKDLLE DQHTPIDTTI
     IMAKNEVFCV DPTKGGKKPA RLIVYPDLGV RVCEKMALYD IAQKLPKAVM GPSYGFQYSP
     AERVDFLLKA WRGKKDPMGF SYDTRCFDST VTERDIRTEE SIYQACSLPQ EAKTVIHSLT
     ERLYVGGPMT NSRGQSCGYR RCRASGVFTT SIGNTMTCYI KALAACKSAG IVDPIMLVCG
     DDLVVISESQ GNEEDERSLR AFTEAMTRYS APPGDLPRPE YDLERITSCS SNVSVALDSR
     GRRRYFLTRD PTTPITRAAW ETVRHSPVNS WLGNIIQYAP TIWVRMVIMT HFFSILLAQD
     TLNQNLNFEM YGAVYSVNPL DLPAIIERLH GLDAFSLHTY SPHELSRVAA TLRKLGAPPL
     RAWKSRARAV RASLIAQGGR AATCGRYLFN WAVKTKLKLT PLPEASRLDL SGWFTVGAGG
     GDIYHSVSHA RPRLLLLCLL LLSVGVGIFL LPAR
//
DBGET integrated database retrieval system