ID B1PHI7_9ALPC Unreviewed; 4324 AA.
AC B1PHI7;
DT 29-APR-2008, integrated into UniProtKB/TrEMBL.
DT 29-APR-2008, sequence version 1.
DT 08-NOV-2023, entry version 101.
DE SubName: Full=ORF1a polyprotein {ECO:0000313|EMBL:ACA52155.1};
GN Name=ORF1a {ECO:0000313|EMBL:ACA52155.1};
OS Bat coronavirus 1B.
OC Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
OC Nidovirales; Cornidovirineae; Coronaviridae; Orthocoronavirinae;
OC Alphacoronavirus; Minunacovirus; Miniopterus bat coronavirus 1.
OX NCBI_TaxID=393768 {ECO:0000313|EMBL:ACA52155.1, ECO:0000313|Proteomes:UP000147155};
RN [1] {ECO:0000313|EMBL:ACA52155.1, ECO:0000313|Proteomes:UP000147155}
RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
RC STRAIN=AFCD307 {ECO:0000313|EMBL:ACA52155.1};
RX PubMed=18420807; DOI=10.1099/vir.0.83605-0;
RA Chu D.K., Peiris J.S., Chen H., Guan Y., Poon L.L.;
RT "Genomic characterizations of bat coronaviruses (1A, 1B and HKU8) and
RT evidence for co-infections in Miniopterus bats.";
RL J. Gen. Virol. 89:1282-1287(2008).
CC -!- CATALYTIC ACTIVITY:
CC Reaction=Thiol-dependent hydrolysis of ester, thioester, amide, peptide
CC and isopeptide bonds formed by the C-terminal Gly of ubiquitin (a 76-
CC residue protein attached to proteins as an intracellular targeting
CC signal).; EC=3.4.19.12; Evidence={ECO:0000256|ARBA:ARBA00000707};
CC -!- SUBCELLULAR LOCATION: Host cytoplasm, host perinuclear region
CC {ECO:0000256|ARBA:ARBA00004407}. Host membrane
CC {ECO:0000256|ARBA:ARBA00004301}; Multi-pass membrane protein
CC {ECO:0000256|ARBA:ARBA00004301}. Membrane
CC {ECO:0000256|ARBA:ARBA00004141}; Multi-pass membrane protein
CC {ECO:0000256|ARBA:ARBA00004141}.
CC -!- SIMILARITY: Belongs to the coronaviruses polyprotein 1ab family.
CC {ECO:0000256|ARBA:ARBA00008087, ECO:0000256|PROSITE-ProRule:PRU01294}.
CC ---------------------------------------------------------------------------
CC Copyrighted by the UniProt Consortium, see https://www.uniprot.org/terms
CC Distributed under the Creative Commons Attribution (CC BY 4.0) License
CC ---------------------------------------------------------------------------
DR EMBL; EU420137; ACA52155.1; -; Genomic_RNA.
DR Proteomes; UP000147155; Genome.
DR GO; GO:0033644; C:host cell membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0044220; C:host cell perinuclear region of cytoplasm; IEA:UniProtKB-SubCell.
DR GO; GO:0016020; C:membrane; IEA:UniProtKB-SubCell.
DR GO; GO:0004843; F:cysteine-type deubiquitinase activity; IEA:UniProtKB-EC.
DR GO; GO:0004197; F:cysteine-type endopeptidase activity; IEA:InterPro.
DR GO; GO:0008242; F:omega peptidase activity; IEA:InterPro.
DR GO; GO:0003723; F:RNA binding; IEA:UniProtKB-KW.
DR GO; GO:0016740; F:transferase activity; IEA:InterPro.
DR GO; GO:0008270; F:zinc ion binding; IEA:InterPro.
DR GO; GO:0039520; P:induction by virus of host autophagy; IEA:UniProtKB-KW.
DR GO; GO:0039648; P:modulation by symbiont of host protein ubiquitination; IEA:UniProtKB-KW.
DR GO; GO:0006508; P:proteolysis; IEA:UniProtKB-KW.
DR GO; GO:0039548; P:suppression by virus of host viral-induced cytoplasmic pattern recognition receptor signaling pathway via inhibition of IRF3 activity; IEA:UniProtKB-KW.
DR GO; GO:0019079; P:viral genome replication; IEA:InterPro.
DR GO; GO:0019082; P:viral protein processing; IEA:InterPro.
DR CDD; cd21901; alpha_betaCoV_Nsp10; 1.
DR CDD; cd21558; alphaCoV-Nsp6; 1.
DR CDD; cd21514; alphaCoV_Nsp2_HCoV-229E-like; 1.
DR CDD; cd21665; alphaCoV_Nsp5_Mpro; 1.
DR CDD; cd21826; alphaCoV_Nsp7; 1.
DR CDD; cd21830; alphaCoV_Nsp8; 1.
DR CDD; cd21897; alphaCoV_Nsp9; 1.
DR CDD; cd21731; alphaCoV_PLPro; 1.
DR CDD; cd21473; cv_Nsp4_TM; 1.
DR CDD; cd21557; Macro_X_Nsp3-like; 1.
DR CDD; cd21875; PEDV-like_alphaCoV_Nsp1; 1.
DR CDD; cd21712; TM_Y_alphaCoV_Nsp3_C; 1.
DR Gene3D; 1.10.8.1190; -; 2.
DR Gene3D; 6.10.140.2090; -; 1.
DR Gene3D; 1.10.150.420; Coronavirus nonstructural protein 4 C-terminus; 1.
DR Gene3D; 3.40.220.10; Leucine Aminopeptidase, subunit E, domain 1; 1.
DR Gene3D; 1.10.1840.10; main proteinase (3clpro) structure, domain 3; 1.
DR Gene3D; 1.10.8.370; nsp7 replicase; 1.
DR Gene3D; 3.30.70.3540; Nsp8 replicase, head domain; 1.
DR Gene3D; 2.40.10.250; Replicase NSP9; 1.
DR Gene3D; 2.40.10.10; Trypsin-like serine proteases; 2.
DR InterPro; IPR046443; a/bCoV_NSP1_glob.
DR InterPro; IPR043613; CoV_NSP2_C.
DR InterPro; IPR043611; CoV_NSP3_C.
DR InterPro; IPR047566; CoV_NSP3_Y3.
DR InterPro; IPR032505; CoV_NSP4_C.
DR InterPro; IPR043612; CoV_NSP4_N.
DR InterPro; IPR002589; Macro_dom.
DR InterPro; IPR043472; Macro_dom-like.
DR InterPro; IPR044371; Macro_X_NSP3-like.
DR InterPro; IPR036333; NSP10_sf_CoV.
DR InterPro; IPR044385; NSP2_HCoV-229E-like.
DR InterPro; IPR043615; NSP2_N_CoV.
DR InterPro; IPR044357; NSP3_Ubl1_dom_CoV.
DR InterPro; IPR044353; Nsp3_Ubl2_dom_CoV.
DR InterPro; IPR038123; NSP4_C_sf_CoV.
DR InterPro; IPR044309; NSP5_Mpro_alphaCoV.
DR InterPro; IPR044369; NSP6_alphaCoV.
DR InterPro; IPR043610; NSP6_CoV.
DR InterPro; IPR014828; NSP7_CoV.
DR InterPro; IPR037204; NSP7_sf_CoV.
DR InterPro; IPR014829; NSP8_CoV.
DR InterPro; IPR037230; NSP8_sf_CoV.
DR InterPro; IPR014822; NSP9_CoV.
DR InterPro; IPR036499; NSP9_sf_CoV.
DR InterPro; IPR011050; Pectin_lyase_fold/virulence.
DR InterPro; IPR013016; Peptidase_C16_CoV.
DR InterPro; IPR008740; Peptidase_C30_CoV.
DR InterPro; IPR043477; Peptidase_C30_dom3_CoV.
DR InterPro; IPR009003; Peptidase_S1_PA.
DR InterPro; IPR043504; Peptidase_S1_PA_chymotrypsin.
DR InterPro; IPR043178; PLpro_thumb_sf_CoV.
DR InterPro; IPR018995; RNA_synth_NSP10_CoV.
DR Pfam; PF09401; CoV_NSP10; 1.
DR Pfam; PF19212; CoV_NSP2_C; 2.
DR Pfam; PF19211; CoV_NSP2_N; 1.
DR Pfam; PF19218; CoV_NSP3_C; 1.
DR Pfam; PF16348; CoV_NSP4_C; 1.
DR Pfam; PF19217; CoV_NSP4_N; 1.
DR Pfam; PF19213; CoV_NSP6; 1.
DR Pfam; PF08716; CoV_NSP7; 1.
DR Pfam; PF08717; CoV_NSP8; 1.
DR Pfam; PF08710; CoV_NSP9; 1.
DR Pfam; PF08715; CoV_peptidase; 2.
DR Pfam; PF01661; Macro; 1.
DR Pfam; PF05409; Peptidase_C30; 1.
DR SMART; SM00506; A1pp; 1.
DR SUPFAM; SSF144246; Coronavirus NSP10-like; 1.
DR SUPFAM; SSF140367; Coronavirus NSP7-like; 1.
DR SUPFAM; SSF143076; Coronavirus NSP8-like; 1.
DR SUPFAM; SSF52949; Macro domain-like; 1.
DR SUPFAM; SSF51126; Pectin lyase-like; 1.
DR SUPFAM; SSF101816; Replicase NSP9; 1.
DR SUPFAM; SSF50494; Trypsin-like serine proteases; 1.
DR PROSITE; PS51952; COV_EXON_MTASE_COACT; 1.
DR PROSITE; PS51962; COV_NSP1; 1.
DR PROSITE; PS51989; COV_NSP2_N; 1.
DR PROSITE; PS51992; COV_NSP3_Y3; 1.
DR PROSITE; PS51943; COV_NSP3A_UBL; 1.
DR PROSITE; PS51944; COV_NSP3D_UBL; 1.
DR PROSITE; PS51946; COV_NSP4C; 1.
DR PROSITE; PS51949; COV_NSP7; 1.
DR PROSITE; PS51950; COV_NSP8; 1.
DR PROSITE; PS51951; COV_NSP9_SSRNA_BD; 1.
DR PROSITE; PS51442; M_PRO; 1.
DR PROSITE; PS51154; MACRO; 1.
DR PROSITE; PS51124; PEPTIDASE_C16; 2.
PE 3: Inferred from homology;
KW Activation of host autophagy by virus {ECO:0000256|ARBA:ARBA00023050};
KW Disulfide bond {ECO:0000256|ARBA:ARBA00023157};
KW Host cytoplasm {ECO:0000256|ARBA:ARBA00023200};
KW Host membrane {ECO:0000256|ARBA:ARBA00022870};
KW Host-virus interaction {ECO:0000256|ARBA:ARBA00022581};
KW Hydrolase {ECO:0000256|ARBA:ARBA00022801};
KW Inhibition of host innate immune response by virus
KW {ECO:0000256|ARBA:ARBA00022632};
KW Inhibition of host IRF3 by virus {ECO:0000256|ARBA:ARBA00022931};
KW Inhibition of host RLR pathway by virus {ECO:0000256|ARBA:ARBA00022482};
KW Membrane {ECO:0000256|ARBA:ARBA00023136, ECO:0000256|SAM:Phobius};
KW Metal-binding {ECO:0000256|ARBA:ARBA00022723};
KW Modulation of host ubiquitin pathway by viral deubiquitinase
KW {ECO:0000256|ARBA:ARBA00022876};
KW Modulation of host ubiquitin pathway by virus
KW {ECO:0000256|ARBA:ARBA00022662}; Protease {ECO:0000256|ARBA:ARBA00022670};
KW Repeat {ECO:0000256|ARBA:ARBA00022737};
KW Ribosomal frameshifting {ECO:0000256|ARBA:ARBA00022758};
KW RNA-binding {ECO:0000256|ARBA:ARBA00022884, ECO:0000256|PROSITE-
KW ProRule:PRU01296}; Thiol protease {ECO:0000256|ARBA:ARBA00022807};
KW Transmembrane {ECO:0000256|ARBA:ARBA00022692, ECO:0000256|SAM:Phobius};
KW Transmembrane helix {ECO:0000256|ARBA:ARBA00022989,
KW ECO:0000256|SAM:Phobius};
KW Ubl conjugation pathway {ECO:0000256|ARBA:ARBA00022786};
KW Viral immunoevasion {ECO:0000256|ARBA:ARBA00023280};
KW Zinc {ECO:0000256|ARBA:ARBA00022833};
KW Zinc-finger {ECO:0000256|ARBA:ARBA00022771, ECO:0000256|PROSITE-
KW ProRule:PRU00444}.
FT TRANSMEM 2180..2195
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 2239..2259
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 2324..2347
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 2735..2757
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 2993..3014
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 3020..3040
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 3047..3066
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 3078..3100
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 3512..3531
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 3543..3560
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 3567..3586
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 3606..3628
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 3640..3667
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 3673..3691
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT TRANSMEM 3703..3726
FT /note="Helical"
FT /evidence="ECO:0000256|SAM:Phobius"
FT DOMAIN 2..109
FT /note="CoV Nsp1 globular"
FT /evidence="ECO:0000259|PROSITE:PS51962"
FT DOMAIN 112..356
FT /note="CoV Nsp2 N-terminal"
FT /evidence="ECO:0000259|PROSITE:PS51989"
FT DOMAIN 893..989
FT /note="Ubiquitin-like"
FT /evidence="ECO:0000259|PROSITE:PS51943"
FT DOMAIN 1181..1431
FT /note="Peptidase C16"
FT /evidence="ECO:0000259|PROSITE:PS51124"
FT DOMAIN 1433..1600
FT /note="Macro"
FT /evidence="ECO:0000259|PROSITE:PS51154"
FT DOMAIN 1848..1903
FT /note="Ubiquitin-like"
FT /evidence="ECO:0000259|PROSITE:PS51944"
FT DOMAIN 1910..2174
FT /note="Peptidase C16"
FT /evidence="ECO:0000259|PROSITE:PS51124"
FT DOMAIN 2624..2728
FT /note="CoV Nsp3 Y"
FT /evidence="ECO:0000259|PROSITE:PS51992"
FT DOMAIN 3108..3203
FT /note="Nsp4C"
FT /evidence="ECO:0000259|PROSITE:PS51946"
FT DOMAIN 3204..3505
FT /note="Peptidase C30"
FT /evidence="ECO:0000259|PROSITE:PS51442"
FT DOMAIN 3785..3867
FT /note="RdRp Nsp7 cofactor"
FT /evidence="ECO:0000259|PROSITE:PS51949"
FT DOMAIN 3868..4061
FT /note="RdRp Nsp8 cofactor"
FT /evidence="ECO:0000259|PROSITE:PS51950"
FT DOMAIN 4062..4171
FT /note="Nsp9 ssRNA-binding"
FT /evidence="ECO:0000259|PROSITE:PS51951"
FT DOMAIN 4172..4311
FT /note="ExoN/MTase coactivator"
FT /evidence="ECO:0000259|PROSITE:PS51952"
SQ SEQUENCE 4324 AA; 476631 MW; 4C75D27EBA9CF1B6 CRC64;
MQSNLVTLAF ANDSEISAEG FCDVETAVYA FSVSAVNGFA DCRFVAQGLE NCLVGVEADD
YVLCVVGDVQ LKAYIAKFSH RPFNLRGWIV RSNSNYFLET MDLVFGCGAG TSIPVDNYMC
GANGKPVITE DMWYFCDYFG DDGDKITING QEYHKAWNVT RSDVPYQFQN AGTILSIEYV
STEAHVLPDG AIAKTAKPPK FSKNVVLSEK CKALYDACGS PFVTNGTNVL EVVTNPIFAH
GFVQCKCGSK HWTTGDWSGF KSVCCGTPGR VLCTVFGGVT PGSILLTSTR VDATPGATRY
YHGLTLKHIC NVDDVACWRV VKVQAVAGFV VKGSLEECVS TFDTCTHDNF TTVAKAFKLG
MLTGSFSDDV VASVISGSLD VGLSVLDVTT AVTKPWFVLK CGSLLETAWD ALIVAVKQLP
VMASEVLKFF NNLSQVLIVV RDGVIDIIHN VPEAFKSAFE VFKDLVSGVF DLVVDHFKVA
NKKFKRAGNY VLFENALACI VSAKVKGVKQ AGLKKLLYAK AIVGATVKVT VSRIETATVK
LTECKPSKFV KKGNVAVIND IAFFHSDGVY RLMSDSDEVY EDIAFTAEGT SPVKTTVFNC
AKPDGFPDIT STDVEVLVRE VKTALDSFSR VYDKYSCAVK SGDCVVTHKY VFNVPSFVED
KTMFVDLCKD YVVDSGFEAF YVNALAATNA DDFNPVYSAF EIFKTKVECP EELKNIDGGS
IFETFINTVN DAVNFVKSLK IVVTATEVMI NTVKRFKRFA SVLAKLYSEF LTSVKRVISI
GSVTCFHYGF VKPMLVIKDV FYCIEDAIVD TFNVATEAGL NTVKTFIGGD SSITVSRVEI
ASVELEAAQY VKPEDNGYVS VIDGHTFYTC GDYYHPCDQQ NCFSQCFKKV GGSAVTFSEK
VAVKQIDPVY KVKLIFEFED DTISSVCKQA IGKYITFEGN DWSNFEETIH NAMSVVGEFV
DLPDYYIYDE EGGNDLNNSV MISQWPMFDP SALQLLVADL GVNCDFNGKS SIEECLTSIP
DTALCVSLEK SCDCGTFNAI MEGFALDFKP CADIDTCDNC GGLCTTTVLS MTGTGFVRSC
DEPLMPFNVT FEGYGVYKDV CFVNDTVLPP PIDDDLTPIE DVTEEDIIVA EEVVDVATIT
AVEVTDAEPT VITSEEEVND ANEPQAEEEV KPEEVIDVSS DLDDVNKALS FMMPQETKFV
DPFKFDYFDH EGIRVLRQNN NNCWVATTLV QLQLSGLLDD DDSMALFKAG SVSPLVRKCY
NATGAIMGSL GDSSQCLEVL LKDLHSMFIT CDSTCGCGSG TYELSGSVFR FMPTRDSFDY
GACTVCGKVL KLKIKTLVGT GIFCQDPKPF NTARAIVKPI CASIYQGSTT SGHYKTNVYG
KRFCVDGSGV SSICNGNVNT ILLKDCNYGI PAEEPKQKEF EKFVTPDDVV QIAQPKPKPF
TTYDNIEFYQ GDISDLVGLD FDFIVNAANE NLKHAGGVAA AIDKLTGNEL QSLSNKYVKA
NGQVKVGSGV MIRCKKYSVF NVVGPRKGKN APTLLEKCYK TILHENGVPL TPLISVGIFG
IPLATSFDAL LNTSSGRTVR CFCYTDKECN EIKTLVSDRK KQVNAVTVIA ENKPIAEAKA
EKKPIAEAKV EKEPIAEAKV EKEPIAEAKV EKKPTADAKK ADKKLATEKS VVAESKQVSA
VDNKSVAEVK NTPVADEKLI AEVKEPVLKV AGVSYYNIED SFSIGVDNIV ILTNSKLDLG
KLGEYVNEYS GGALKSAVSG YLSKTPNVPA GNVISMHCSS LLTVAFAVVP SDGDVQYVKN
VKRTISKLSK LKGSSVCSFS TLDMHKRLLS VFNKFCVDNI DSVKDIHDTK TTVKVSLDGR
NVVDVDVAAD KTIGEQLNAC TTDNVIISNS VVTDVIDTVV NVAPEADWDT FYGFPNAAEF
HMLDHSAYAF DSDVVDGKRA LVGTDNNCWV NAVCLQLQFA EVDFTSEGLK DMWNEFLVGN
VAKFCHWLYW LVRATKGDAG DAENALNMLA KYVKAHGTIT ITRETDDGCC ANEHRISSFV
VNASVLRSGC TDGYCKHGNA YTACVSKVDG VSVIVNVDRP SVMSDNLLLT GTSYTAFSGP
MDSGHYRVYN PVTSKMFDGA NCVGGDLCNL AVTAVVVKNK VFKMQTSNSN TPVKIVKKLD
DASEKFFSFG DVVSKNICNS IIWFFTMLSI IFKAFKTRDF KVFALAPERT GVILSRSLKY
NVKATQFLLK RKQGYVLKFL KLSVIAYALY ALSFMFVRFS PANEYFCKEH VEGYGNSTFV
KDEYCASTMC KVCLFGYQEL ADLPHTKVVW KYVGFPIFVN WLPFLYLAFL FIFGGIFVKG
LVCYFLAQYV NNFGVYFGMQ ETFWPLQVIP FNVFGDEIVV TFLVYKALMF IKHVCFGCDK
PSCVACSKSA RLTRVPMQTI VNGANKSFYV VANGGSSYCH KHKFFCLNCD SYGPGNTFIN
ETVARELSNV VKTNVQPTGD SFIEVDKVSF ENGFYYLYSG ETFWRYNFDV TDAKYGCKEV
LKNCNVLADF IVYNNTGSNV SQIRNACVYF SQMLCKPIKL VDATLLSTLN VDFNGALHSA
FIQVLNDSFS KDLSSCASMT ECKQALGFDV SDEEFVNAVS NAHRFNVLLS DNSFNNLLTS
YAKPEEQLST HDVATCMRFN AKVVNHNVLI KENVPIVWLA RDFQQLSEEG RKYLVKTTKA
KGVTFLLTFN SNAMNVKLPV ISIVNKKGAG VSSKFIWWVC AAIITFFLCL SISEGLVATS
FADFGFKYIK DGVMHDFDQP LSCVHNVFDN FNSWHEARFG SIPSNMLKCP IVVGTLDDVR
NVPGVPSGIV LVGKTLVFAI KAVFTDAGNC YGLNGLTTAG ACLFNSACTK LEGLGGTHVY
CYKDGLFEGS KRYFDLVPHS NYKMEDGNFV KLPETLVNGF GINIIRTMET TYCRVGECLK
SKAGVCFGAN RFFVYNDDFG SDYICGNGLM SFVKNLFNTF TMSLSVMALS GQVIFNCVVA
AMAIFICFLV VKFKRMFGDL SYGVCSVIAA VTINNLSYVF TQNMLFMFVY ATFYFLAVRN
LNYAWIWHAS YVVAYFNLAP WFIIVWYVVT MLTGLLPSVL KLKISTNLFE GDKFVGTFEN
AAFGTFVIDM HSYEKLVNSI TPEKLKQHAS MFNKYKYYSG SASEADYRCA CFAHLAKAMT
DYAANHQDML YSPPSISYNS TLQAGLRKFA QPSGVIEHCI VRVSYGNMVL NGIWLGDEVI
CPRHVIASST NTTIDYDHEY TMMRLHNFSV SSGNLFIGVV SAKMRGASLV IKVNQNNPHT
PKHVFKTLKA GDAFNILACY DGVPSGVYGT ILRHNKTIRG SFINGACGSP GFNINGDTVE
FVYLHQLELG SGCHVGSNME GAMYGGFEDQ PSLQIEGADC LVTVNVIAFL YGAILNGCTW
FLSNERITAE VFNGWAHANN FTEVGSLDCF NILAAKTGVD IQRVLASIQK LSKGFGGRNI
IGYASLTDEF TVTEVVKQMY GVSLQSKRVP SVFNNVILVS VFWSMFLSEL LYYTSSYWIK
PDLITAVFVL LFGVAVMLTF TIKHKVLFLY TFLIPSVVIS ACYNLAWDLY IRELLAKYFD
YHMSIFSMDI QGCFNILACI FVNAIHTWRF VKTGTATRLT YVLSLCVSVY NYWCCGDFLS
LAMMVLLNIN NNWYIGAFAY RFSVFVVNYM DPSVIRMLGG VKVILFMYVL CGYLCCVYYG
ICYWFNRFFK CTMGLYEFKV SPAEFKYMVA NDLRAPTGVF DSMSLSLKLM GLGGERTIKI
STVQSKLTDI KCTNVVLMGC LSSMNIEANS KKWSYCVDLH NKINLCDDAE KAMEYLLALV
TFFISEHADF NVSELVDSYF GDNSILQSVA STFVNMPSFM AYENARQSYE EAINNGSSPQ
LVKQLKRAMN IAKAELDHES SVQRKLNRMA EQAAAQMYKE ARAVNKKSKV ISSLHTLLFG
MLRKLDMSSV DNILSLARDG VVPLSIIPAA CATKLTIVVS DFESFKRIFQ LGNVQYAGVV
WSLIEVKDND GKPVHVKEIT ATNTALTWPL ILNCERVVKL QNNEVIPGKL NVRPVKGEGD
GGFTADGKAL FNNEGGKTFM YAFIADKPDL KVVKWEFDGG CNVIELEPPC KFAVVDAGGN
NVIKYLYFVK NLNTLRRGAV LGFIGATVRL QAGKQTELAV NSSLLTLCSF AVDPAKCYLY
AVKSGVKPIN NCVKMLSNGS GTGQAVTVGV EATTNQDSYG GASVCLYCRA HVDHPSIDGF
CQFKGRYVQV PVGTVDPIRF CLENQVCKVC HCWLNNGCSC DRTAVVQSMD HAYLNEQGAL
VHLD
//