GenomeNet

Database: UniProt
Entry: A7XAV1_9HEPC
LinkDB: A7XAV1_9HEPC
Original site: A7XAV1_9HEPC 
ID   A7XAV1_9HEPC            Unreviewed;      3016 AA.
AC   A7XAV1;
DT   23-OCT-2007, integrated into UniProtKB/TrEMBL.
DT   23-OCT-2007, sequence version 1.
DT   27-MAR-2024, entry version 116.
DE   RecName: Full=Genome polyprotein {ECO:0000256|ARBA:ARBA00020107};
OS   Hepacivirus hominis.
OC   Viruses; Riboviria; Orthornavirae; Kitrinoviricota; Flasuviricetes;
OC   Amarillovirales; Flaviviridae; Hepacivirus.
OX   NCBI_TaxID=3052230 {ECO:0000313|EMBL:ABR27361.1, ECO:0000313|Proteomes:UP000141947};
RN   [1] {ECO:0000313|EMBL:ABR27361.1, ECO:0000313|Proteomes:UP000141947}
RP   NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC   STRAIN=1002 {ECO:0000313|EMBL:ABR27361.1};
RX   PubMed=17522222; DOI=10.1128/JVI.00487-07;
RG   Virahep-C Study Group;
RA   Donlin M.J., Cannon N.A., Yao E., Li J., Wahed A., Taylor M.W., Belle S.H.,
RA   Di Bisceglie A.M., Aurora R., Tavis J.E.;
RT   "Pretreatment sequence diversity differences in the full-length hepatitis C
RT   virus open reading frame correlate with early response to therapy.";
RL   J. Virol. 81:8211-8224(2007).
CC   -!- CATALYTIC ACTIVITY:
CC       Reaction=ATP + H2O = ADP + H(+) + phosphate; Xref=Rhea:RHEA:13065,
CC         ChEBI:CHEBI:15377, ChEBI:CHEBI:15378, ChEBI:CHEBI:30616,
CC         ChEBI:CHEBI:43474, ChEBI:CHEBI:456216; EC=3.6.4.13;
CC         Evidence={ECO:0000256|ARBA:ARBA00001556};
CC   -!- CATALYTIC ACTIVITY:
CC       Reaction=Hydrolysis of four peptide bonds in the viral precursor
CC         polyprotein, commonly with Asp or Glu in the P6 position, Cys or Thr
CC         in P1 and Ser or Ala in P1'.; EC=3.4.21.98;
CC         Evidence={ECO:0000256|ARBA:ARBA00001117};
CC   -!- CATALYTIC ACTIVITY:
CC       Reaction=a ribonucleoside 5'-triphosphate + H2O = a ribonucleoside 5'-
CC         diphosphate + H(+) + phosphate; Xref=Rhea:RHEA:23680,
CC         ChEBI:CHEBI:15377, ChEBI:CHEBI:15378, ChEBI:CHEBI:43474,
CC         ChEBI:CHEBI:57930, ChEBI:CHEBI:61557; EC=3.6.1.15;
CC         Evidence={ECO:0000256|ARBA:ARBA00001491};
CC   -!- COFACTOR:
CC       Name=Mg(2+); Xref=ChEBI:CHEBI:18420;
CC         Evidence={ECO:0000256|ARBA:ARBA00001946};
CC   -!- COFACTOR:
CC       Name=Zn(2+); Xref=ChEBI:CHEBI:29105;
CC         Evidence={ECO:0000256|ARBA:ARBA00001947};
CC   -!- SUBCELLULAR LOCATION: Cell membrane {ECO:0000256|ARBA:ARBA00004236}.
CC       Cytoplasm {ECO:0000256|ARBA:ARBA00004496}. Endoplasmic reticulum
CC       membrane {ECO:0000256|ARBA:ARBA00004477}; Multi-pass membrane protein
CC       {ECO:0000256|ARBA:ARBA00004477}. Endoplasmic reticulum membrane
CC       {ECO:0000256|ARBA:ARBA00004406}; Peripheral membrane protein
CC       {ECO:0000256|ARBA:ARBA00004406}. Endoplasmic reticulum membrane
CC       {ECO:0000256|ARBA:ARBA00004115}; Single-pass type I membrane protein
CC       {ECO:0000256|ARBA:ARBA00004115}. Host cell membrane
CC       {ECO:0000256|ARBA:ARBA00004165}. Host cytoplasm, host perinuclear
CC       region {ECO:0000256|ARBA:ARBA00004407}. Host endoplasmic reticulum
CC       membrane {ECO:0000256|ARBA:ARBA00004153}; Multi-pass membrane protein
CC       {ECO:0000256|ARBA:ARBA00004153}. Host endoplasmic reticulum membrane
CC       {ECO:0000256|ARBA:ARBA00004291}; Peripheral membrane protein
CC       {ECO:0000256|ARBA:ARBA00004291}. Host endoplasmic reticulum membrane
CC       {ECO:0000256|ARBA:ARBA00004482}; Single-pass type I membrane protein
CC       {ECO:0000256|ARBA:ARBA00004482}. Host lipid droplet
CC       {ECO:0000256|ARBA:ARBA00004338}. Host mitochondrion
CC       {ECO:0000256|ARBA:ARBA00004181}. Host nucleus
CC       {ECO:0000256|ARBA:ARBA00004147}. Lipid droplet
CC       {ECO:0000256|ARBA:ARBA00004502}. Membrane
CC       {ECO:0000256|ARBA:ARBA00004141}; Multi-pass membrane protein
CC       {ECO:0000256|ARBA:ARBA00004141}. Membrane
CC       {ECO:0000256|ARBA:ARBA00004170}; Peripheral membrane protein
CC       {ECO:0000256|ARBA:ARBA00004170}. Membrane
CC       {ECO:0000256|ARBA:ARBA00004479}; Single-pass type I membrane protein
CC       {ECO:0000256|ARBA:ARBA00004479}. Mitochondrion
CC       {ECO:0000256|ARBA:ARBA00004173}. Nucleus
CC       {ECO:0000256|ARBA:ARBA00004123}. Virion membrane
CC       {ECO:0000256|ARBA:ARBA00004563}; Single-pass type I membrane protein
CC       {ECO:0000256|ARBA:ARBA00004563}.
CC   ---------------------------------------------------------------------------
CC   Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC   Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC   ---------------------------------------------------------------------------
DR   EMBL; EF407412; ABR27361.1; -; Genomic_RNA.
DR   MEROPS; S29.001; -.
DR   euHCVdb; EF407412; -.
DR   Proteomes; UP000141947; Genome.
DR   GO; GO:0005737; C:cytoplasm; IEA:UniProtKB-SubCell.
DR   GO; GO:0044167; C:host cell endoplasmic reticulum membrane; IEA:UniProtKB-SubCell.
DR   GO; GO:0044186; C:host cell lipid droplet; IEA:UniProtKB-SubCell.
DR   GO; GO:0033650; C:host cell mitochondrion; IEA:UniProtKB-SubCell.
DR   GO; GO:0042025; C:host cell nucleus; IEA:UniProtKB-SubCell.
DR   GO; GO:0044220; C:host cell perinuclear region of cytoplasm; IEA:UniProtKB-SubCell.
DR   GO; GO:0020002; C:host cell plasma membrane; IEA:UniProtKB-SubCell.
DR   GO; GO:0005811; C:lipid droplet; IEA:UniProtKB-SubCell.
DR   GO; GO:0005886; C:plasma membrane; IEA:UniProtKB-SubCell.
DR   GO; GO:1990904; C:ribonucleoprotein complex; IEA:UniProtKB-KW.
DR   GO; GO:0019031; C:viral envelope; IEA:UniProtKB-KW.
DR   GO; GO:0019013; C:viral nucleocapsid; IEA:UniProtKB-KW.
DR   GO; GO:0055036; C:virion membrane; IEA:UniProtKB-SubCell.
DR   GO; GO:0005524; F:ATP binding; IEA:UniProtKB-KW.
DR   GO; GO:0004197; F:cysteine-type endopeptidase activity; IEA:InterPro.
DR   GO; GO:0005216; F:monoatomic ion channel activity; IEA:UniProtKB-KW.
DR   GO; GO:0017111; F:ribonucleoside triphosphate phosphatase activity; IEA:UniProtKB-EC.
DR   GO; GO:0003723; F:RNA binding; IEA:UniProtKB-KW.
DR   GO; GO:0003724; F:RNA helicase activity; IEA:UniProtKB-EC.
DR   GO; GO:0003968; F:RNA-dependent RNA polymerase activity; IEA:UniProtKB-KW.
DR   GO; GO:0004252; F:serine-type endopeptidase activity; IEA:InterPro.
DR   GO; GO:0005198; F:structural molecule activity; IEA:InterPro.
DR   GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR   GO; GO:0039563; P:disruption by virus of host JAK-STAT cascade via inhibition of STAT1 activity; IEA:UniProtKB-KW.
DR   GO; GO:0039527; P:disruption by virus of host TRAF-mediated signal transduction; IEA:UniProtKB-KW.
DR   GO; GO:0039654; P:fusion of virus membrane with host endosome membrane; IEA:UniProtKB-KW.
DR   GO; GO:0039520; P:induction by virus of host autophagy; IEA:UniProtKB-KW.
DR   GO; GO:0039645; P:perturbation by virus of host G1/S transition checkpoint; IEA:UniProtKB-KW.
DR   GO; GO:0051259; P:protein complex oligomerization; IEA:UniProtKB-KW.
DR   GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR   GO; GO:0039502; P:suppression by virus of host type I interferon-mediated signaling pathway; IEA:UniProtKB-KW.
DR   GO; GO:0039545; P:suppression by virus of host viral-induced cytoplasmic pattern recognition receptor signaling pathway via inhibition of MAVS activity; IEA:UniProtKB-KW.
DR   GO; GO:0019087; P:transformation of host cell by virus; IEA:InterPro.
DR   GO; GO:0046718; P:viral entry into host cell; IEA:UniProtKB-KW.
DR   GO; GO:0039694; P:viral RNA genome replication; IEA:InterPro.
DR   GO; GO:0019062; P:virion attachment to host cell; IEA:UniProtKB-KW.
DR   GO; GO:0039707; P:virus-mediated pore formation in host cell membrane; IEA:UniProtKB-KW.
DR   CDD; cd17931; DEXHc_viral_Ns3; 1.
DR   CDD; cd20903; HCV_p7; 1.
DR   CDD; cd23202; Hepacivirus_RdRp; 1.
DR   Gene3D; 2.40.10.120; -; 1.
DR   Gene3D; 3.30.70.270; -; 2.
DR   Gene3D; 6.10.250.1610; -; 1.
DR   Gene3D; 6.10.250.1750; -; 1.
DR   Gene3D; 6.10.250.2920; -; 1.
DR   Gene3D; 2.20.25.210; Hepatitis C NS5A, domain 1B; 1.
DR   Gene3D; 3.30.160.890; Hepatitis C virus envelope glycoprotein E1, chain C; 1.
DR   Gene3D; 2.30.30.710; Hepatitis C virus non-structural protein NS2, C-terminal domain; 1.
DR   Gene3D; 1.20.1280.150; Hepatitis C virus non-structural protein NS2, N-terminal domain; 1.
DR   Gene3D; 2.20.25.220; Hepatitis C virus NS5A, 1B domain; 1.
DR   Gene3D; 3.40.50.300; P-loop containing nucleotide triphosphate hydrolases; 2.
DR   Gene3D; 1.10.820.10; RNA Helicase Chain A , domain 3; 1.
DR   Gene3D; 2.40.10.10; Trypsin-like serine proteases; 1.
DR   InterPro; IPR043502; DNA/RNA_pol_sf.
DR   InterPro; IPR011492; Flavi_DEAD.
DR   InterPro; IPR002521; HCV_Core_C.
DR   InterPro; IPR044896; HCV_core_chain_A.
DR   InterPro; IPR002522; HCV_core_N.
DR   InterPro; IPR002519; HCV_Env.
DR   InterPro; IPR002531; HCV_NS1.
DR   InterPro; IPR002518; HCV_NS2.
DR   InterPro; IPR042205; HCV_NS2_C.
DR   InterPro; IPR042209; HCV_NS2_N.
DR   InterPro; IPR000745; HCV_NS4a.
DR   InterPro; IPR001490; HCV_NS4b.
DR   InterPro; IPR002868; HCV_NS5a.
DR   InterPro; IPR013192; HCV_NS5A_1a.
DR   InterPro; IPR013193; HCV_NS5a_1B_dom.
DR   InterPro; IPR038568; HCV_NS5A_1B_sf.
DR   InterPro; IPR024350; HCV_NS5a_C.
DR   InterPro; IPR014001; Helicase_ATP-bd.
DR   InterPro; IPR001650; Helicase_C.
DR   InterPro; IPR004109; HepC_NS3_protease.
DR   InterPro; IPR038170; NS5A_1a_sf.
DR   InterPro; IPR027417; P-loop_NTPase.
DR   InterPro; IPR009003; Peptidase_S1_PA.
DR   InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR   InterPro; IPR043128; Rev_trsase/Diguanyl_cyclase.
DR   InterPro; IPR007094; RNA-dir_pol_PSvirus.
DR   InterPro; IPR002166; RNA_pol_HCV.
DR   Pfam; PF07652; Flavi_DEAD; 1.
DR   Pfam; PF01543; HCV_capsid; 1.
DR   Pfam; PF01542; HCV_core; 1.
DR   Pfam; PF01539; HCV_env; 1.
DR   Pfam; PF01560; HCV_NS1; 1.
DR   Pfam; PF01538; HCV_NS2; 1.
DR   Pfam; PF01006; HCV_NS4a; 1.
DR   Pfam; PF01001; HCV_NS4b; 1.
DR   Pfam; PF01506; HCV_NS5a; 1.
DR   Pfam; PF08300; HCV_NS5a_1a; 1.
DR   Pfam; PF08301; HCV_NS5a_1b; 1.
DR   Pfam; PF12941; HCV_NS5a_C; 1.
DR   Pfam; PF02907; Peptidase_S29; 1.
DR   Pfam; PF00998; RdRP_3; 1.
DR   SMART; SM00487; DEXDc; 1.
DR   SMART; SM00490; HELICc; 1.
DR   SUPFAM; SSF56672; DNA/RNA polymerases; 1.
DR   SUPFAM; SSF52540; P-loop containing nucleoside triphosphate hydrolases; 2.
DR   SUPFAM; SSF50494; Trypsin-like serine proteases; 1.
DR   PROSITE; PS51693; HCV_NS2_PRO; 1.
DR   PROSITE; PS51192; HELICASE_ATP_BIND_1; 1.
DR   PROSITE; PS51194; HELICASE_CTER; 1.
DR   PROSITE; PS51822; HV_PV_NS3_PRO; 1.
DR   PROSITE; PS50507; RDRP_SSRNA_POS; 1.
PE   4: Predicted;
KW   Acetylation {ECO:0000256|ARBA:ARBA00022990};
KW   Activation of host autophagy by virus {ECO:0000256|ARBA:ARBA00023050};
KW   Apoptosis {ECO:0000256|ARBA:ARBA00022703};
KW   ATP-binding {ECO:0000256|ARBA:ARBA00022840};
KW   Capsid protein {ECO:0000256|ARBA:ARBA00022561};
KW   Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW   Fusion of virus membrane with host endosomal membrane
KW   {ECO:0000256|ARBA:ARBA00022510};
KW   Fusion of virus membrane with host membrane
KW   {ECO:0000256|ARBA:ARBA00022506};
KW   G1/S host cell cycle checkpoint dysregulation by virus
KW   {ECO:0000256|ARBA:ARBA00023309};
KW   Glycoprotein {ECO:0000256|ARBA:ARBA00023180};
KW   Helicase {ECO:0000256|ARBA:ARBA00022806};
KW   Host cell membrane {ECO:0000256|ARBA:ARBA00022511};
KW   Host cytoplasm {ECO:0000256|ARBA:ARBA00023200};
KW   Host endoplasmic reticulum {ECO:0000256|ARBA:ARBA00023184};
KW   Host lipid droplet {ECO:0000256|ARBA:ARBA00023190};
KW   Host membrane {ECO:0000256|ARBA:ARBA00022870};
KW   Host mitochondrion {ECO:0000256|ARBA:ARBA00023147};
KW   Host nucleus {ECO:0000256|ARBA:ARBA00022562};
KW   Host-virus interaction {ECO:0000256|ARBA:ARBA00022581};
KW   Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW   Inhibition of host innate immune response by virus
KW   {ECO:0000256|ARBA:ARBA00022632};
KW   Inhibition of host interferon signaling pathway by virus
KW   {ECO:0000256|ARBA:ARBA00022830};
KW   Inhibition of host MAVS by virus {ECO:0000256|ARBA:ARBA00022986};
KW   Inhibition of host RLR pathway by virus {ECO:0000256|ARBA:ARBA00022482};
KW   Inhibition of host STAT1 by virus {ECO:0000256|ARBA:ARBA00022961};
KW   Inhibition of host TRAFs by virus {ECO:0000256|ARBA:ARBA00022647};
KW   Interferon antiviral system evasion {ECO:0000256|ARBA:ARBA00023258};
KW   Ion channel {ECO:0000256|ARBA:ARBA00023303};
KW   Ion transport {ECO:0000256|ARBA:ARBA00023065};
KW   Lipoprotein {ECO:0000256|ARBA:ARBA00023288};
KW   Magnesium {ECO:0000256|ARBA:ARBA00022842};
KW   Membrane {ECO:0000256|ARBA:ARBA00023136, ECO:0000256|SAM:Phobius};
KW   Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW   Modulation of host cell cycle by virus {ECO:0000256|ARBA:ARBA00022504};
KW   Multifunctional enzyme {ECO:0000256|ARBA:ARBA00023268};
KW   Nucleotide-binding {ECO:0000256|ARBA:ARBA00022741};
KW   Nucleotidyltransferase {ECO:0000256|ARBA:ARBA00022695};
KW   Palmitate {ECO:0000256|ARBA:ARBA00023139};
KW   Phosphoprotein {ECO:0000256|ARBA:ARBA00022553};
KW   Protease {ECO:0000256|ARBA:ARBA00022670};
KW   Ribonucleoprotein {ECO:0000256|ARBA:ARBA00023274};
KW   RNA-binding {ECO:0000256|ARBA:ARBA00022884};
KW   RNA-directed RNA polymerase {ECO:0000256|ARBA:ARBA00022484};
KW   Serine protease {ECO:0000256|ARBA:ARBA00022825};
KW   Thiol protease {ECO:0000256|ARBA:ARBA00022807};
KW   Transcription {ECO:0000256|ARBA:ARBA00023163};
KW   Transcription regulation {ECO:0000256|ARBA:ARBA00023015};
KW   Transferase {ECO:0000256|ARBA:ARBA00022679};
KW   Transmembrane {ECO:0000256|ARBA:ARBA00022692, ECO:0000256|SAM:Phobius};
KW   Transmembrane helix {ECO:0000256|ARBA:ARBA00022989,
KW   ECO:0000256|SAM:Phobius}; Transport {ECO:0000256|ARBA:ARBA00022448};
KW   Viral attachment to host cell {ECO:0000256|ARBA:ARBA00022804};
KW   Viral envelope protein {ECO:0000256|ARBA:ARBA00022879};
KW   Viral immunoevasion {ECO:0000256|ARBA:ARBA00023280};
KW   Viral ion channel {ECO:0000256|ARBA:ARBA00023039};
KW   Viral nucleoprotein {ECO:0000256|ARBA:ARBA00023086};
KW   Viral penetration into host cytoplasm {ECO:0000256|ARBA:ARBA00022595};
KW   Viral RNA replication {ECO:0000256|ARBA:ARBA00022953};
KW   Virion {ECO:0000256|ARBA:ARBA00022844};
KW   Virus entry into host cell {ECO:0000256|ARBA:ARBA00023296};
KW   Zinc {ECO:0000256|ARBA:ARBA00022833}.
FT   TRANSMEM        723..745
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        757..782
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        788..807
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        819..837
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        878..906
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        1821..1845
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        1852..1875
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        1887..1907
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   TRANSMEM        2996..3013
FT                   /note="Helical"
FT                   /evidence="ECO:0000256|SAM:Phobius"
FT   DOMAIN          904..1031
FT                   /note="Peptidase C18"
FT                   /evidence="ECO:0000259|PROSITE:PS51693"
FT   DOMAIN          1032..1213
FT                   /note="Peptidase S29"
FT                   /evidence="ECO:0000259|PROSITE:PS51822"
FT   DOMAIN          1222..1374
FT                   /note="Helicase ATP-binding"
FT                   /evidence="ECO:0000259|PROSITE:PS51192"
FT   DOMAIN          1366..1543
FT                   /note="Helicase C-terminal"
FT                   /evidence="ECO:0000259|PROSITE:PS51194"
FT   DOMAIN          2639..2757
FT                   /note="RdRp catalytic"
FT                   /evidence="ECO:0000259|PROSITE:PS50507"
FT   REGION          1..77
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          1486..1505
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2317..2337
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   REGION          2360..2413
FT                   /note="Disordered"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        47..69
FT                   /note="Basic and acidic residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2317..2334
FT                   /note="Pro residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
FT   COMPBIAS        2360..2386
FT                   /note="Polar residues"
FT                   /evidence="ECO:0000256|SAM:MobiDB-lite"
SQ   SEQUENCE   3016 AA;  327744 MW;  DC0FF4939C6B9139 CRC64;
     MSTNPKPQRK TKRNTNRRPQ DVRFPGGGQI VGGVYLLPRR GPRLGVRTTR KTSERSQPRG
     RRQPIPKARR PEGRTWAQPG YPWPLYGNEG CGWAGWLLSP HGSRPNWGPT DPRRRSRNLG
     KVIDTLTCGL ADLMGYIPLV GAPLGGVARA LAHGVRVLED GVNYATGNLP GCSFSIFLLA
     LLSCLTVPAS AYQVRNSTGL YHVTNDCPNS SIVYEAADAI LHAPGCVPCV LEGNVSRCWV
     ALTPTVATRD GKLPTTQLRR HIDLLVGSAT LCSALYVGDL CGSVFLVGQM FTFSPRRHWT
     TQDCNCSIYP GHITGHRMAW DMMMNWSPTT ALVVAQLLRI PQAILDMIAG AHWGVLAGIA
     YFSMVGNWAK VLVVLLLFAG VDADHVLGGT YISGGAAGRT AHGLTAIFQT GAKQNIQLIN
     SNGSWHINRT ALNCNESLTT GWLAGLFYYN KFNSSGCPER LASCRRLADF DQGWGPISYA
     NGSGPDHRPY CWHYPPKPCG IVPAHSVCGP VYCFTPSPVV VGTTDKWGAP TYNWGANDTD
     VFVLNNTRPP LGNWFGCTWM NATGYTKVCG APPCVIRGAG NNTLYCPTDC FRKHPETTYS
     RCGSGPWITP RCLVDYPYRL WHYPCTINYT IFKVRMYVGG VEHRLDAACN WTRGERCDLE
     DRDRSELSPL LLSTTQWQVL PCSFTTLPAL STGLIHLHQN IVDVQYLYGV GSSIASWAIK
     WDYVVLLFLL LADARVCSCL WMMLLISQAE AALENLVILN AASLAGTHGL VPFLVFFCFA
     WYLKGKWVPG AVYALYGMWP LLLLLLALPQ RAYALDTEVA ASCGGVVLVG LMALTLSPYY
     KRYISWCLWW LQYFLTRVEA ELHVWVPPLN ARGGRDAVIL LMCAVYPMLL FDITKLLLAV
     FGPLWILQAS LVKVPYFVRV QGLLRICALA RKMAGGHYVQ MAVIKLGALT GTYVYNHLTP
     LRDWAHNSLR DLAVAVEPVV FSRMETKLIT WGADTAACGD IINGLPVSAR KGREILLGPA
     DGMVSKGWRL LAPITAYAQQ TRGLLGCIIT SLTGRDKNQV EGEVQIVSTA AQTFLATCIN
     GVCWTVYHGA GTRTIASPRG PVIQMYTNVD KDLVGWPAPQ GSRSLTPCTC GSSDLYLVTR
     HADVIPVRRR GDGRGSLLSP RPISYLKGSS GGPLLCPAGH AVGIFRAAVC TRGVAKAVDF
     IPVESLETTM RSPVFTDNSS PPVVPQSFQV AHLHAPTGSG KSTKVPAAYA AQGYKVLVLN
     PSVAATLGFG AYMSKAHGID PNLRTGVRTI TTGSPITYST YGKFLADGGC SGGAYDIIIC
     DECHSTDATS ILGIGTVLDQ AETAGARLVV LATATPPGSV TVPHSNIEEV ALSTTGEIPF
     YGKAIPLEVI KGGRHLIFCH SKKKCDELAA KLVTLGINAV AYYRGLDVSV IPTSGDVVVV
     ATDALMTGFT GDFDSVIDCN TCVTQTVDFS LDPTFTIETT TLPQDAVSRT QRRGRTGRGK
     PGTYRFVAPG ERPSGMFDSS VLCECYDAGC AWYELTPAET TVRLRAYMNT PGLPVCQDHL
     EFWEGVFTGL THIDAHFLSQ TKQSGENFPY LVAYQATVCA RAQAPPPSWD QMWKCLIRLK
     PTLHGPTPLL YRLGAVQNEV TLTHPITKYI MTCMSADLEV VTSTWVLVGG VLAALAAYCL
     STGCVVIVGR VVLSGKPAII PDREVLYREF DEMEECSQHS PYIEQGMALA EQFKQKVLGL
     LQTASRQAEV ITPAVQTNWQ KLEAFWAKHM WNFISGIQYL AGLSTLPGNP AIASLMAFTA
     AVTSPLTTSQ TLLFNILGGW VASQLAAPGA ATAFVGAGLA GAAIGSVGLG KVLVDILAGY
     GAGVAGALVA FKIMSGEVPS TEDLVNLLPA ILSPGALVVG VVCAAILRRH VGPGEGAVQW
     MNRLIAFASR GNHVSPTHYV PESDAAARVT AILSSLTVTQ LLRRLHQWIS SECTTPCSGS
     WLRDIWDWIC EVLSDFKTWL KAKLMPQLPG IPFVSCQRGY KGVWRGDGIM HTRCHCGAEI
     TGHVKNGTMR IVGPRTCRNM WSGTFPINAY TTGPCTPLPA PNYTFALLRV SAEEYVEIRQ
     VGDFHYVTGM TADNLKCPCQ VPSPEFFTEL DGVRLHRFAP PCKPLLREEV SFRVGLHEYP
     VGSQLPCEPE PDVSVLTSML TDPSHITAEA AGRRLARGSP PSVASSSASQ LSAPSLKATC
     TANHDSPDAE LIEANLLWRQ EMGGNITRVE SENKVVILDS FDPLVAEEDE REISVPAEIL
     RKSRRFAQAL PVWARPDYNP PLLETWKKPD YEPPVVHGCP LPSPRPPPVP PPRKKRTVVL
     TESTLSTALA ELAIKSFGSS STSGITGDNT TTSSEPASSG CPPDSDAESY SSMPPLEGEP
     GDPDLSDGSW STVSSEAGTE DVVCCSMSYS WTGALVTPCA AEEQKLPINA LSNSLLRHHN
     LVYSTTSRSA CQRQKKVTFD RLQVLDSHYQ DVLKEVKAAA SKVKANLLSV EEACSLTPPH
     SAKSKFGYGA KDVRCHARKA VNHINSVWKD LLEDSVTPID TTIMAKNEVF CVQPEKGGRK
     AARLIVFPDL GVRVCEKMAL YDVVSKLPPA VMGSSYGFQY SPGQRVEFLV QAWKSKKTPM
     GFSYDTRCFD STVTESDIRT EEAIYQCCDL DPQARVAIKS LTERLYVGGP LTNSRGENCG
     YRRCRASGVL TTSCGNTLTC YIKARAACRA AGLQNCTMLV CGDDLVVICE SAGVQEDAAS
     LRAFTEAMTR YSAPPGDPPQ PEYDLELITS CSSNVSVAHD GAGKRVYYLT RDPTTPLARA
     AWETARHTPV NSWLGNIIMF APTLWARMIL MTHFFSVLIA RDQLEQALDC EIYGACYSIE
     PLDLPSIIQR LHGLSAFSLH SYSPGEINRV AACLRKLGVP PLRAWRHRAR SVRARLLSRG
     GRAAICGKYL FNWAVRTKLN LTPIAAAGQL DLSGWFTAGY SGGDIYHSVS RARPRWFWFC
     LLLLAAGVGI YLLPNR
//
DBGET integrated database retrieval system