LOCUS NC_034440 29642 bp RNA linear VRL 20-NOV-2020
DEFINITION Bat coronavirus isolate PREDICT/PDF-2180, complete genome.
ACCESSION NC_034440
VERSION NC_034440.1
DBLINK BioProject: PRJNA485481
KEYWORDS RefSeq.
SOURCE Bat coronavirus
ORGANISM Bat coronavirus
Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
Nidovirales; Cornidovirineae; Coronaviridae; Coronavirinae.
REFERENCE 1 (bases 1 to 29642)
AUTHORS Anthony,S.J., Gilardi,K., Menachery,V.D., Goldstein,T., Ssebide,B.,
Mbabazi,R., Navarrete-Macias,I., Liang,E., Wells,H., Hicks,A.,
Petrosov,A., Byarugaba,D.K., Debbink,K., Dinnon,K.H., Scobey,T.,
Randell,S.H., Yount,B.L., Cranfield,M., Johnson,C.K., Baric,R.S.,
Lipkin,W.I. and Mazet,J.A.
TITLE Further Evidence for Bats as the Evolutionary Source of Middle East
Respiratory Syndrome Coronavirus
JOURNAL MBio 8 (2), e00373-17 (2017)
PUBMED 28377531
REMARK Publication Status: Online-Only
REFERENCE 2 (bases 1 to 29642)
CONSRTM NCBI Genome Project
TITLE Direct Submission
JOURNAL Submitted (05-MAY-2017) National Center for Biotechnology
Information, NIH, Bethesda, MD 20894, USA
REFERENCE 3 (bases 1 to 29642)
AUTHORS Anthony,S.J., Gilardi,K.V., Goldstein,T., Ssebide,B., Mbabazi,R.,
Navarrete-Macias,I., Liang,E., Wells,H.L., Hicks,A.L., Petrosov,A.,
Byarugaba,D., Debbink,K., Yount,B.L., Menachery,V.D., Cranfield,M.,
Johnson,C.K., Baric,R.S., Lipkin,W.I. and Mazet,J.K.
TITLE Direct Submission
JOURNAL Submitted (15-JUL-2016) Center for Infection and Immunity, Columbia
University, 722 West 168th Street, 17th Floor, New York, NY 10032,
USA
COMMENT REVIEWED REFSEQ: This record has been curated by NCBI staff. The
reference sequence is identical to KX574227.
Annotation was based on information found in PMID: 31653070 and
annotation of PMID: 29346682.
##Assembly-Data-START##
Assembly Method :: MIRA v. 4.0
Sequencing Technology :: Illumina
##Assembly-Data-END##
COMPLETENESS: full length.
FEATURES Location/Qualifiers
source 1..29642
/organism="Bat coronavirus"
/mol_type="genomic RNA"
/isolate="PREDICT/PDF-2180"
/isolation_source="rectal swab"
/host="Pipistrellus cf. hesperidus; specimen voucher:
OTBA03-20130220"
/db_xref="taxon:1508220"
/country="Uganda"
/lat_lon="1.12 S 29.68 E"
/collection_date="20-Feb-2013"
/note="USAID PREDICT Consortium"
5'UTR 1..60
gene 60..21290
/gene="ORF1b"
/locus_tag="CAU86_gp01"
/db_xref="GeneID:37627555"
gene 61..21290
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/db_xref="GeneID:37627558"
CDS join(61..13224,13224..21290)
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/ribosomal_slippage
/note="polyprotein pp1ab; ORF1ab polyprotein is cleaved to
yield the RNA-dependent RNA polymerase and other
nonstructural proteins"
/codon_start=1
/product="ORF1ab polyprotein"
/protein_id="YP_009361856.2"
/db_xref="GeneID:37627558"
/translation="MSFVADVTAQGARGTYRAALNSEKHHDHVSLTVPLCGSGDLVSK
LSPWFMDGYDACEAVKVMLSNKEKLLFVPIRLVGYTKHLPGPRVYLVERLINGIYTDP
FMVNQVAYSSSANAGLVGTTLQGKPIGLFFPFDADLVTGDHTFLLRKYGRGGYHYTPF
HYERDATSRPEWMDDLEADPKGKYAQNLLKKLIGGDVTPVDQYMCGVDGKPINDYAGL
MAKEGITKLADIEADVASRVDADGFIVLKNKLYRLVWHVERKDVQYAKQSIFTINSVV
QREGLQDIPPHYFTLGGKIDMLVPRNKWNGVANLPLKQKILYTFYGKESLENHSYIYH
SAFTDCGGCGNGSWLTGNAVQGFSCGCGASYLSNDVEVQSSGLIKPNALFCATCPFAK
GDSCSSSCKHSIAQLVSYLSERCNVIADSKSFTLVFGGVAYAYFGCEEGTMYFVPRAK
SVVSKIGDSIFTGCTGSWTKVTQIANLFLEQTQRSLNFVGEFVVNDVVLAILSGTTTN
VDKLRELLKGITLEKLRDYLADYDVAVTLGPFMDNAVNVGGKGLQYATITAPFLVLTG
LGESFKKVAAIPYKVCKSFKETLSYYADSILYRVFPYDMDSDVSSFTELLFDCVGLSV
ASTYFIVRLLQDKTGDFMSTILSSCQSAVRKLLDTCLEATEATLNFLLELANLFKIFL
RGAYVYTSQGFVVLQGKMSSLVKQVVDLLNKGMQLLHTKVSWAGSKVSAVIYSGRESL
IFPTGTYYCVSTKAKSVQHQFDVILPGDCSKKQLGLLEPTDNSTTVEVTVSSNTVETV
VGQLEQTNMHSPDVIVGDYVIISDKLFVRSKEEDRVVFYPACTNGTAVPTLFKLKGGA
PVKRVAFGDDEIHEVAAVRSVTVEYNIHAVLDALLASSSLRTFVVDKSLSIEEFVDVV
KEQVSDLLAKLLRGMPIPDFDLDDFIDTPCYCFNADGDVSWSSTMIFSLHPVECEDDS
FECDSDQDDDQESVCEPLVEETNVQVQESDDDGWAAAVEEAFPIEELEEPPVQVVPND
SVVRSQVAQPIEIVVQETPVQPLEDVAPAVATPSIQLQEIQTEVLDTPPVYEADIEQT
QIVVSKPKRLRKKRNVDPLFNFEHKVITDCVTMVLGDAIQVAKCYDEAVLVNAANTYL
KHGGGIAGAINAASNGAVQQESDEYILAKGPLQVGDSVLLQGHSLAKNILHVVGPDAR
AKQDVSLLGKCYKAMNAYPLVVTPLVSAGIFGVQPSVSFDYLIREVKTRVLVVVNSQD
IYKSLTTVEVPQGLTFSYDGLRGALRKARDYGFTVFVCTDNSANTKVLRNKGVDYTKK
STTVDGVQYYCYTAKDTLDSIVLEANKASGIISMPLGYVSHGLDLMQAGAIVRRVKVP
YVCLLANKEQEAILMSEDVKLSPSADFVKHVRTNGGYNSWHLVEGELLVRDLTLNKLL
HWSDQTICYKSDKFYVVKNGVALPFETLAACRTYLDSRTAQQLTIEVLVTVDGVNFRT
VVLNNKSSYRSQLGCVFYNGADISDTIPDEKQNGCSLYLADNLTADETKVLKELYGPV
DPTFLHRFYSLKAVVQKWKMVVCDKVRSLKLSDNNCYINVVIMILDLLKDIKFVIPAL
QHAFMKHKGGDSTEFIALIMTYGNCTFGAPDDATRLLHTVLAKAELCCSARMVWREWC
NVCGIKDVVIQGLKACCYVGVQTVEDLHARMTYVCQCGGERHRQLVEHTAPWLLLSGT
PNEKLVTTSTAPDFVAFNVFQGLETAVGHYVHARLKDGLILKFDSGTLSKTSDWKCKV
TDVLFPNQKYSSDCNVVRYSLDGKFRTEVDPDLSAFYVKDGKYFTSEPPVTYSPATVL
AGSVYTNSCLVSSDGQPGGDAISLSFNNLLGFDSSKPVTKKYTYSILPKEDGDVLLAE
FSTYDPIYKNGAMLKGKPVLWVTNASYDATLNKFNRATLRQIYDVAPIEIENKYTPLS
VEPSPVEKVSTVEVALAKPELTIVKCKGLIKPFVKANVSFVSDETGLPVVEYLSKEDL
HTLYVDPKYQVIVLKDNALSTIFRLHTVESGDLNVVAASGSLTRKVKLLFRASFYFKE
LASRTLTATTVVGSCINSVVRHLGVTKGILASLFSFVKMLFVLPLSYFSDSETSTTEV
KVSALKTAGVVTGNVLKQCCTAAVDLSMDKLRRVDWKATLRLLLMLCTTMVLLSSVYH
LYVFNQVLSSDVMFEDAQGLKKFYKEVRAYLGVSSGCDGLAAAYRANSFDVPTFCANR
SVMCNWCLINQDSITHYPALKMVQTHLSHYVLNIDWLWFALEVGLAYILYTSAFNWLL
LAGTLQYFFAQTSIFVDWRSYNYVVSSAFWLFTHIPMPGLVRIYNLLACLWLLRKFYQ
HVINGCKDTACLLCYKRNRLTRVEASTVVCGGKRTFYIAANGGISFCRRHNWNCVDCD
TAGVGNTFICEEVASDLTTTLRRPVNSTDRSHYYVDSVLVKETVVQFNYRRDGQSCYE
RFPLCAFTNLDKLKFKEVCKTTTGIPEYNFIIYDSSDRGQESLARSACVYYSQVLCKS
ILLVDSSLVTSVGNSGEIAIKMFDSFVNSFVSLYNVTRDKLEKLISTARDGVKRGDNF
HSVLTTFIDAARGPAGVESDVETNEIVDSVQYAHKHDIQLTNESYNNYVPSYVKPDSV
STGDLGSLIDCNAASVNQTSMRQANGACIWNAAAYMKLSDVLKRQIRIACRKCNLAFR
LTTSKLRANDNMLSVKFTATKIVGGAPTWFNTLRDFTLKSYVFVTIIVFLCAVLMYFC
LPTFAMAPVEFYEDRILEYKVLDNGIIRDISPDDKCFANKYRSFSQWYHEHVGGSYDN
SISCPLTVAVIAGVAGARIPDVPTTLAWVNRQIVFFVSRVFANSNSVCYTPINEIPYK
SFSDSGCILPSECTMFRDAEGRMSPYCYDPTVLPGAFAYSQMKPHVRYDLYDTNMFIK
FPEVVFESTLRITKTLTTQYCRFGSCEYAQEGVCITTNGSWAIFNDHHLSRPGVYCGS
DYVDIVRRLAVSLFQPITYFQLTTSLVLGIGLCAFLTLLFYYINKVKRAFADYTQCAM
IAVIAAVLNSLCICFVSSIPLCIVPYTALYYYATFYFTNEPAAIMHVSWYIMFGPIVP
MWLTCVYTVAMCFRHFFWVVAYFSKKHVEVFTDGKLNCSFQDAASNIFVVNKDTYAAL
RNAITNDVYSRYLGLFNKYKYYSGAMETAAYREAAACHLAKALQTYSETGSDLLYQPP
NCSITSGVLQSGLVRMSHPSGDVEACMVQVTCGSMTLNGLWLDNTVWCPRHVMCPADQ
LADPNYDALLVSMTNHSFSVNKHIGAPANLRVIGHAMQGTLLKLTVDVANPSTPAYTF
TTVKPGASFSVLACYNGRPTGTFTVVMRPNYTIKGSFLCGSCGSVGYTKEGSVINFCY
MHQMELANGTHTGSAFDGTMYGAFLDKQVHQVQLTDKYCSTNVVAWLYAAILNGCAWF
VKSNRTSIVSFNEWALANQFTEFVGTQSIDMLAVKTGVAIEQLLYAIQQLHTGFQGKQ
ILGSSMLEDEFTPEDVNMQIMGVVMQSGVRKVTYGTAHWLFATFVLSYVVFLQTTKFT
LWNYLFETIPTQLFPLLFVTVACVMLLVKHKHTFLTLFLLPVAICLTYANIVYEPATP
ISSALIAVANWLAPTNVYMRTTHTDIGVYISLSLVLAIVVKRLYNPSLSNFALALCSG
VMWLYTYSVGEVSSPIAYLVFVTTLTSDYTITVFVTVNLAKICTYIIFAYAPQLTLVF
PEVKMILLLYTCFGFMCTCYFGVFSLLNLKLRAPMGVYDFKVSTQEFRFMTANNLTAP
RNSWEAMSLNFKLLGIGGTPCIKVAAIQSKLTDLKCTSVVLLSVLQQLHLEANSKAWA
FCVKCHNDILAATDPSEAFEKFVSLFATLMTFSGNVDLDALASDIFETPSVLQATLSE
FSHLATFAELEAAQRAYQEAMDSGDASPQVLKALQKAVNVAKNAYEKDKAVARKLERM
AEQAMTSMYKQARAEDKKAKIVSAMQTMLFGMIKKLDNDVLNGIISNARNGCIPLSVV
PLCASNKLRVVIPDFTVWNQVVTYPSLNYAGALWDIAVINNVDNEIVKSSDVVENNES
LTWPLVLECTRAASSAIKLQNNEIKPSGLRTMVVSAGQEQTNCNTSSLAYYEPVQGRK
MLMALLSDNAYLKWARVEGQEGFVSVELQPPCKFLIAGPKGPEIRYLYFVKNLNNLHR
GQVLGHIAATVRLQAGSNTEFAANSSVLSLVNFTVDPQKAYIDFVNAGGAPLTNCVKM
LTPKTGTGIAISVKPESTADQETYGGASVCLYCRAHIEHPDVSGVCKYKGKFVQIPSQ
CTRDPVGFCLTNTPCNVCQYWIGYGCNCDSLRQAALPQSKDSNFLNESGVLLVNARIE
PCASGLSTDVVFRAFDICNYKAKVAGIGKYYKTNTCRFVELDDQGHHLDSYFVVKRHT
MENYELEKHCYDLLRDCDSVAPHDFFVFDVDKTKTPHIVRQRLTEYTMMDLVYALRHF
DQNNCEVLKAILVKYDCCDATYFENKLWFDFVENPSVIGVYHKLGERVRQAVLSTVKF
CDHMVKAGLVGVLTLDNQDLNGKWYDFGDFVITQPGSGVAIVDSYYSYLMPVLSMTNC
LAAETHRDCDFNKPLIEWPLTEYDFTDYKVQLFEKYFKYWDQTYHANCVNCTDDRCVL
HCANFNVLFAMTMPKTCFGPIVRKIFVDGVPFVVSCGYHYKELGLVMNMDVSLHRHRL
SLKELMMYAADPAMHIASSNAFLDLRTSCFSVAALTTGLTFQTVRPGNFNQDFYDFVV
SKGFFKEGSSVTLKHFFFAQDGNAAITDYNYYSYNLPTMCDIKQMLFCMEVVNKYFEI
YDGGCLNASEVVVNNLDKSAGHPFNKFGKARVYYESMSYQEQDELFAMTKRNVIPTMT
QMNLKYAISAKNRARTVAGVSILSTMTNRQYHQKMLKSMAATRGSTCVIGTTKFYGGW
DFMLKTLYKDVDNPHLMGWDYPKCDRAMPNMCRIFASLILARKHGTCCTTRDRFYRLA
NECAQVLSEYVLCGGGYYVKPGGTSSGDATTAYANSVFNILQATTANVSALMGANGNK
IVDKEVKDMQFELYVNVYRSTNPDPKFVDRYYAFLNKHFSMMILSDDGVVCYNSDYAA
KGYIAGIQNFKETLYYQNNVFMSEAKCWVETDLKKGPHEFCSQHTLYIKDGDDGYFLP
YPDPSRILSAGCFVDDIVKTDGTLMVERFVSLAIDAYPLTKHEDIEYQNVFWVYLQYI
EKLYKDLTGHMLDSYSVMLCGDNSAKFWEEAFYRELYSSPTTLQAVGSCVVCHSQTSL
RCGTCIRRPFLCCKCCYDHVIATPHKMVLSVSPYVCNAPGCDVADVTKLYLGGMSYFC
VDHRPVCSFPLCTNGLVFGLYKNMCTGSPSIVEFNRLATCDWTESGDYTLANTTTEPL
KLFAAETLRATEEASKQSYAIATIKEIVGDRQLLLVWEAGKSKPPLNRNYVFTGYHIT
KNSKVQLGEYIFERIDYSDAVSYKSSTTYKLTVGDIFILTSHSVATLTAPTIVNQERY
VKITGLYPTITVPEEFASHVANFQKAGYSKYVTVQGPPGTGKSHFAIGLAIYYPTARV
VYTACSHAAVDALCEKAFKYLNIAKCSRIIPAKARVECYDRFKVNETNSQYLFSTINA
LPETSADILVVDEVSMCTNYDLSIINARVKAKHIVYVGDPAQLPAPRTLLTRGTLEPE
NFNSVTRLMCNLGPDIFLSMCYRCPKEIVSTVSALVYNNKLLAKKELSGQCFKMLYKG
NVTHDASSAINRPQLAFVKNFITANPAWSKAVFISPYNSQNAVARSMLGLTTQTVDSS
QGSEYQYVIFCQTADTAHANNINRFNVAITRAQKGILCVMTSQALFDSLEFTELSFTN
YKLQSQIVTGLFKDCSRETSGLSPAYAPTYVSVDDKYKTCDELCVNLNLPANVPYSRV
ISRMGFKLDASVPGYPKLFITREEAVRQVRSWIGFDVEGAHASRNACGTNVPLQLGFS
TGVNFVVQPVGVVDTEWGNMLTGISARPPPGEQFKHLVPLMHKGAAWPIVRRRIVQML
SDTLDKLSDYCTFVCWAHGFELTSASYFCKIGKEQKCCMCNRRAAAYSSPLQSYACWS
HSCGYDYVYNPFFVDVQQWGYVGNLATNHDRYCSVHQGAHVASNDAIMTRCLAIHACF
IEHVDWDIEYPYISHEKKLNSCCRIVERNVVRAALLAGSFDRVYDIGNPKGIPIVDHP
VVEWHYFDAQPLTRKVQQLFYTEDLASRFADGLCLFWNCNVPKYPNNAIVCRFDTRVH
SEFNLPGCDGGSLYVNKHAFHTPAYDVSAFRDLKPLPFFYYSTTPCEVHGTGSMLEDI
DYVPLKSAVCVTACNLGGAVCRKHATEYRDYMEAYNLVSASGFRLWCYKTFDIYNLWS
TFTKVQGLENIAYNVVKQGHFTGVDGELPVAVVNDKIFTKSGVNDICVFENKTTLPTN
VAFELYAKRVVRSHPDFKLLHNLQADICYKFVLWDYERCNIYGTATIGVCKYTDIEVN
SALNICFDIRDNGSLEKFMTTPNAILISDRKIKNYPCMVGPDYAYFNGAIIRDSDTVK
QPVKFYFYKKVNNEFVEFSDCAYTQGRSCSDFEAMSVMETDFLALDSDVFIKKYGLEN
YAFEHVVYGDFSHTTLGGLHLLIGLYKKHLDGHIIMEEMIRESSTIHNYFITETSTAS
FKAVCSVIDLKLDDFVQILKSQDLGVVSKVVKVPIDLTMIEFMLWCKDGQVQTFYPRL
QASADWKPGQAMPSLFKVQNVNLERCELANYKQSIPMPRGVHMNIAKYMQLCQYLNTC
TIAVPANMRVIHFGAGSDKGIAPGTSVLRQWLPTDAIIIDNDLNDFVSDADISLFGDC
VTVRVGQQVDLVISDMYDPSTKNITGSNESKALFFTYLCNFINNNLALGGSVAIKITE
HSWSVDLYEIMGKFAWWTVFCTNANASSSEGFLLGINYLGTIKENIDGGAMHANYIFW
RNSNPMNLSTYSLFDLSKFQLKLKGTPVLQLKESQINELVISLLSQGKLLIRDNDVLS
VSTDVLVNFYRGKR"
misc_feature 127..636
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="non-structural protein 1 from Middle East
respiratory syndrome-related coronavirus and
betacoronavirus in the C lineage; Region:
MERS-CoV-like_Nsp1; cd21878"
/db_xref="CDD:409340"
misc_feature 646..2625
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="betacoronavirus non-structural protein 2 (Nsp2)
similar to MERS-CoV Nsp2, and related proteins from
betacoronaviruses in the C lineage; Region:
betaCoV_Nsp2_MERS-like; cd21517"
/db_xref="CDD:394868"
misc_feature 2686..2949
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="first ubiquitin-like (Ubl) domain located at the
N-terminus of coronavirus SARS-CoV non-structural protein
3 (Nsp3) and related proteins; Region:
Ubl1_cv_Nsp3_N-like; cd21467"
/db_xref="CDD:394822"
misc_feature <3016..3303
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="ribonuclease E; Reviewed; Region: rne; PRK10811"
/db_xref="CDD:236766"
misc_feature 3463..3834
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="X-domain (or Mac1 domain) of viral non-structural
protein 3 and related macrodomains; Region:
Macro_X_Nsp3-like; cd21557"
/db_xref="CDD:438957"
misc_feature order(3481..3489,3499..3519,3742..3747,3751..3765,
3829..3831)
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="ADP-ribose binding site [chemical binding]; other
site"
/db_xref="CDD:438957"
misc_feature 3886..4245
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="SUD-M macrodomain (or Mac3 domain) of the SARS
Unique Domain (SUD) of SARS-CoV non-structural protein 3
and related macrodomains; Region:
Macro_cv_SUD-M_Nsp3-like; cd21563"
/db_xref="CDD:394884"
misc_feature 4255..4482
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="C-terminal SARS-Unique Domain (SUD) of
non-structural protein 3 (Nsp3) from Middle East
respiratory syndrome-related coronavirus and related
betacoronaviruses in the C lineage; Region:
SUD_C_MERS-CoV_Nsp3; cd21523"
/db_xref="CDD:394839"
misc_feature 4495..5427
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="betacoronavirus papain-like protease; Region:
betaCoV_PLPro; cd21732"
/db_xref="CDD:409649"
misc_feature order(4816..4827,4978..4986,4990..4995,5002..5004,
5014..5016,5092..5094,5116..5121,5164..5166,5170..5172,
5239..5241,5296..5298,5317..5328,5413..5415)
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="ubiquitin binding site [polypeptide binding]; other
site"
/db_xref="CDD:409649"
misc_feature order(4978..4986,5236..5241,5296..5298,5305..5307,
5317..5319,5326..5328,5413..5415)
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="polypeptide substrate binding site [polypeptide
binding]; other site"
/db_xref="CDD:409649"
misc_feature 5536..5904
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="nucleic acid binding domain of non-structural
protein 3 from Middle East respiratory syndrome-related
coronavirus and betacoronavirus in the C lineage; Region:
MERS-CoV-like_Nsp3_NAB; cd21823"
/db_xref="CDD:409349"
misc_feature 5962..6324
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="betacoronavirus-specific marker of non-structural
protein 3 from Middle East respiratory syndrome-related
coronavirus and betacoronavirus in the C lineage; Region:
MERS-CoV-like_Nsp3_betaSM; cd21815"
/db_xref="CDD:409630"
misc_feature 6574..8271
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="C-terminus of non-structural protein 3, including
transmembrane and Y domains, from Middle East respiratory
syndrome-related coronavirus and betacoronavirus in the C
lineage; Region: TM_Y_MERS-CoV-like_Nsp3_C; cd21716"
/db_xref="CDD:409664"
misc_feature 6574..6639
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="TM1 [structural motif]; Region: TM1"
/db_xref="CDD:409664"
misc_feature 6961..7029
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="TM2 [structural motif]; Region: TM2"
/db_xref="CDD:409664"
misc_feature 8329..9471
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="coronavirus non-structural protein 4 (Nsp4)
transmembrane domain; Region: cv_Nsp4_TM; cd21473"
/db_xref="CDD:394836"
misc_feature 8329..8397
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="putative TM helix 1 [structural motif]; Region:
putative TM helix 1"
/db_xref="CDD:394836"
misc_feature 9130..9195
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="putative TM helix 2 [structural motif]; Region:
putative TM helix 2"
/db_xref="CDD:394836"
misc_feature 9238..9303
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="putative TM helix 3 [structural motif]; Region:
putative TM helix 3"
/db_xref="CDD:394836"
misc_feature 9382..9450
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="putative TM helix 4 [structural motif]; Region:
putative TM helix 4"
/db_xref="CDD:394836"
misc_feature 9529..9786
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="Coronavirus nonstructural protein 4 C-terminus;
Region: Corona_NSP4_C; pfam16348"
/db_xref="CDD:406690"
misc_feature 9802..10692
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="betacoronavirus non-structural protein 5, also
called Main protease (Mpro); Region: betaCoV_Nsp5_Mpro;
cd21666"
/db_xref="CDD:394887"
misc_feature order(9802..9825,9832..9834,10153..10155,10165..10185,
10210..10224,10297..10299,10315..10317,10648..10650,
10660..10662,10684..10689)
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="homodimer interface [polypeptide binding]; other
site"
/db_xref="CDD:394887"
misc_feature order(9853..9855,9862..9870,9937..9939,9952..9954,
10219..10236,10288..10299,10303..10305,10315..10317,
10360..10362,10366..10374)
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="polypeptide substrate binding site [polypeptide
binding]; other site"
/db_xref="CDD:394887"
misc_feature 10711..11586
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="betacoronavirus non-structural protein 6; Region:
betaCoV-Nsp6; cd21560"
/db_xref="CDD:394846"
misc_feature 11587..11835
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="betacoronavirus non-structural protein 7; Region:
betaCoV_Nsp7; cd21827"
/db_xref="CDD:409253"
misc_feature order(11590..11592,11599..11610,11617..11625,11629..11634,
11641..11643,11668..11670,11677..11679,11695..11697,
11731..11748,11752..11769,11788..11802)
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="oligomer interface [polypeptide binding]; other
site"
/db_xref="CDD:409253"
misc_feature 11845..12432
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="betacoronavirus non-structural protein 8; Region:
betaCoV_Nsp8; cd21831"
/db_xref="CDD:409258"
misc_feature order(12073..12078,12085..12090,12094..12102,12106..12114,
12118..12123,12127..12135,12145..12150,12154..12159,
12163..12171,12175..12204,12211..12213,12217..12231,
12235..12237,12247..12249,12259..12261,12283..12285,
12298..12300,12325..12327,12370..12372,12388..12390)
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="oligomer interface [polypeptide binding]; other
site"
/db_xref="CDD:409258"
misc_feature 12433..12762
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="betacoronavirus non-structural protein 9; Region:
betaCoV_Nsp9; cd21898"
/db_xref="CDD:409331"
misc_feature 12433..12450
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="N-finger; other site"
/db_xref="CDD:409331"
misc_feature order(12439..12447,12451..12459,12640..12645,12709..12714,
12718..12726,12730..12738,12742..12750,12754..12762)
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="homodimer interface [polypeptide binding]; other
site"
/db_xref="CDD:409331"
misc_feature 12763..13155
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="alphacoronavirus and betacoronavirus non-structural
protein 10; Region: alpha_betaCoV_Nsp10; cd21901"
/db_xref="CDD:409326"
misc_feature order(12763..12786,12796..12798,12802..12810,12814..12825,
12835..12840,12847..12852,12859..12861,12880..12897,
12934..12939,12967..12969,12973..12978,12988..12990,
12994..13011,13024..13032,13039..13050)
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="Nsp14 interface [polypeptide binding]; other site"
/db_xref="CDD:409326"
misc_feature order(12802..12810,12814..12822,12835..12837,12880..12882,
12886..12897,12934..12942,12994..13005,13012..13014,
13045..13050,13105..13107)
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="oligomer interface [polypeptide binding]; other
site"
/db_xref="CDD:409326"
misc_feature order(12823..12828,12835..12837,12886..12897,12934..12942,
13012..13014,13045..13050,13105..13107,13117..13119)
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="homotrimer interface [polypeptide binding]; other
site"
/db_xref="CDD:409326"
misc_feature order(12880..12903,12934..12939,12967..12978,12991..12996,
13000..13002,13039..13050)
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="Nsp16 interface [polypeptide binding]; other site"
/db_xref="CDD:409326"
misc_feature join(13192..13224,13224..15983)
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="Middle East respiratory syndrome-related
coronavirus RNA-dependent RNA polymerase, also known as
non-structural protein 12, and similar proteins from
betacoronaviruses in the C lineage: responsible for
replication and transcription of the viral RNA genome;
Region: MERS-CoV-like_RdRp; cd21592"
/db_xref="CDD:394896"
misc_feature order(13992..14006,14154..14159,14163..14165,14169..14183,
14199..14210,14217..14219,14289..14291,14298..14303,
14307..14312,14319..14321,14325..14339,14343..14363,
14373..14375,14379..14381,14391..14396,14406..14408,
14700..14705,14712..14714,14727..14732,14736..14738,
14745..14756,15183..15185)
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="putative Nsp8 interaction site [polypeptide
binding]; other site"
/db_xref="CDD:394896"
misc_feature order(14412..14426,14430..14432,14445..14447,14472..14474,
14478..14480,14505..14522,14835..14837,14841..14843,
15714..15716)
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="putative Nsp7 interaction site [polypeptide
binding]; other site"
/db_xref="CDD:394896"
misc_feature order(14679..14681,14685..14693,14820..14822,14856..14858,
14862..14864,14880..14882,14892..14894,14904..14906,
14916..14918,14955..14957,14961..14963,14967..14969,
15231..15242,15249..15251,15459..15470,15624..15629,
15681..15683,15705..15707,15756..15761,15771..15773,
15777..15782)
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="putative RNA binding site [nucleotide binding];
other site"
/db_xref="CDD:394896"
misc_feature 14685..14726
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="conserved polymerase motif G; other site"
/db_xref="CDD:394896"
misc_feature 14799..14867
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="conserved polymerase motif F; other site"
/db_xref="CDD:394896"
misc_feature order(14832..14834,15225..15233,15246..15248,15258..15260,
15462..15464)
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="inhibitor binding site [chemical binding];
inhibition site"
/db_xref="CDD:394896"
misc_feature 15018..15068
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="conserved polymerase motif A; other site"
/db_xref="CDD:394896"
misc_feature order(15039..15041,15462..15470)
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="catalytic residues [active]"
/db_xref="CDD:394896"
misc_feature 15225..15314
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="conserved polymerase motif B; other site"
/db_xref="CDD:394896"
misc_feature 15444..15488
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="conserved polymerase motif C; other site"
/db_xref="CDD:394896"
misc_feature 15510..15575
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="conserved polymerase motif D; other site"
/db_xref="CDD:394896"
misc_feature 15615..15650
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="conserved polymerase motif E; other site"
/db_xref="CDD:394896"
misc_feature 15984..16268
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="Cys/His rich zinc-binding domain (CH/ZBD) of
coronavirus SARS NSP13 helicase and related proteins;
Region: ZBD_cv_Nsp13-like; cd21401"
/db_xref="CDD:439168"
misc_feature order(16116..16118,16179..16181,16185..16187,16224..16226,
16251..16265)
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="oligomer interface [polypeptide binding]; other
site"
/db_xref="CDD:439168"
misc_feature 16278..16421
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="stalk domain of coronavirus Nsp13 helicase and
related proteins; Region: stalk_CoV_Nsp13-like; cd21689"
/db_xref="CDD:410205"
misc_feature order(16287..16289,16374..16376)
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="key interaction residues; other site"
/db_xref="CDD:410205"
misc_feature 16431..16667
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="1B domain of coronavirus SARS NSP13 helicase and
related proteins; Region: 1B_cv_Nsp13-like; cd21409"
/db_xref="CDD:394817"
misc_feature order(16515..16520,16524..16526,16617..16619)
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="nucleic acid substrate binding site [nucleotide
binding]; other site"
/db_xref="CDD:394817"
misc_feature 16629..16637
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="oligomer interface [polypeptide binding]; other
site"
/db_xref="CDD:394817"
misc_feature 16734..17753
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="helicase domain of betacoronavirus non-structural
protein 13; Region: betaCoV_Nsp13-helicase; cd21722"
/db_xref="CDD:409655"
misc_feature order(16836..16853,17193..17195,17310..17312,17595..17597,
17601..17603,17682..17684)
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="ATP binding site [chemical binding]; other site"
/db_xref="CDD:409655"
misc_feature order(16845..16850,17103..17108,17193..17195,17682..17684)
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="putative active site [active]"
/db_xref="CDD:409655"
misc_feature 17790..19343
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="nonstructural protein 14 of betacoronavirus;
Region: betaCoV_Nsp14; cd21659"
/db_xref="CDD:394958"
misc_feature order(17790..17792,17796..17807,17832..17861,17928..17930,
17940..17942,17955..17978,18078..18083,18147..18149,
18153..18155,18165..18170,18351..18353,18360..18365,
18372..18380,18426..18428)
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="heterodimer interface [polypeptide binding]; other
site"
/db_xref="CDD:394958"
misc_feature order(18045..18047,18051..18053,18348..18350,18579..18581,
18594..18596)
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="ExoN active site [active]"
/db_xref="CDD:394958"
misc_feature order(18651..18653,18693..18695,18702..18707,18714..18716,
18774..18785,18789..18791,18831..18839,18864..18872,
18918..18932,18966..18968,19023..19025,19029..19034,
19041..19043,19047..19049,19284..19286)
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="N7-MTase active site [active]"
/db_xref="CDD:394958"
misc_feature 19350..19532
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="N-terminal domain of alpha- and beta-coronavirus
Nonstructural protein 15 (Nsp15), and related proteins;
Region: NTD_alpha_betaCoV_Nsp15-like; cd21171"
/db_xref="CDD:439163"
misc_feature order(19350..19358,19410..19412,19416..19427,19449..19451,
19464..19466,19491..19493,19497..19508)
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="hexamer interface [polypeptide binding]; other
site"
/db_xref="CDD:439163"
misc_feature order(19374..19391,19428..19439,19443..19445,19452..19454,
19458..19475,19482..19493,19530..19532)
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="trimer interface [polypeptide binding]; other site"
/db_xref="CDD:439163"
misc_feature 19545..19928
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="middle domain of alpha- and beta-coronavirus
Nonstructural protein 15 (Nsp15), and related proteins;
Region: M_alpha_beta_cv_Nsp15-like; cd21167"
/db_xref="CDD:439161"
misc_feature order(19578..19583,19611..19613,19617..19619,19623..19625,
19629..19637,19815..19820,19824..19835)
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="trimer interface [polypeptide binding]; other site"
/db_xref="CDD:439161"
misc_feature 19656..19661
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="hexamer interface [polypeptide binding]; other
site"
/db_xref="CDD:439161"
misc_feature 19920..20372
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="Nidoviral uridylate-specific endoribonuclease
(NendoU) domain of coronavirus Nonstructural Protein 15
(Nsp15) and related proteins; Region:
NendoU_cv_Nsp15-like; cd21161"
/db_xref="CDD:439158"
misc_feature order(19944..19946,19950..19952,20058..20066,20130..20132,
20142..20147,20151..20153,20169..20171,20175..20177,
20181..20183,20187..20189,20193..20198,20202..20204)
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="trimer interface [polypeptide binding]; other site"
/db_xref="CDD:439158"
misc_feature order(20040..20042,20052..20054,20076..20081,20085..20087,
20205..20207,20214..20219,20358..20360,20364..20366)
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="putative active site [active]"
/db_xref="CDD:439158"
misc_feature 20382..21269
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="Coronavirus NSP13; Region: NSP13; pfam06460"
/db_xref="CDD:399456"
CDS 61..13227
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="polyprotein pp1a; ORF1a polyprotein is cleaved to
yield the nonstructural proteins"
/codon_start=1
/product="ORF1a polyprotein"
/protein_id="YP_009361855.1"
/db_xref="GeneID:37627558"
/translation="MSFVADVTAQGARGTYRAALNSEKHHDHVSLTVPLCGSGDLVSK
LSPWFMDGYDACEAVKVMLSNKEKLLFVPIRLVGYTKHLPGPRVYLVERLINGIYTDP
FMVNQVAYSSSANAGLVGTTLQGKPIGLFFPFDADLVTGDHTFLLRKYGRGGYHYTPF
HYERDATSRPEWMDDLEADPKGKYAQNLLKKLIGGDVTPVDQYMCGVDGKPINDYAGL
MAKEGITKLADIEADVASRVDADGFIVLKNKLYRLVWHVERKDVQYAKQSIFTINSVV
QREGLQDIPPHYFTLGGKIDMLVPRNKWNGVANLPLKQKILYTFYGKESLENHSYIYH
SAFTDCGGCGNGSWLTGNAVQGFSCGCGASYLSNDVEVQSSGLIKPNALFCATCPFAK
GDSCSSSCKHSIAQLVSYLSERCNVIADSKSFTLVFGGVAYAYFGCEEGTMYFVPRAK
SVVSKIGDSIFTGCTGSWTKVTQIANLFLEQTQRSLNFVGEFVVNDVVLAILSGTTTN
VDKLRELLKGITLEKLRDYLADYDVAVTLGPFMDNAVNVGGKGLQYATITAPFLVLTG
LGESFKKVAAIPYKVCKSFKETLSYYADSILYRVFPYDMDSDVSSFTELLFDCVGLSV
ASTYFIVRLLQDKTGDFMSTILSSCQSAVRKLLDTCLEATEATLNFLLELANLFKIFL
RGAYVYTSQGFVVLQGKMSSLVKQVVDLLNKGMQLLHTKVSWAGSKVSAVIYSGRESL
IFPTGTYYCVSTKAKSVQHQFDVILPGDCSKKQLGLLEPTDNSTTVEVTVSSNTVETV
VGQLEQTNMHSPDVIVGDYVIISDKLFVRSKEEDRVVFYPACTNGTAVPTLFKLKGGA
PVKRVAFGDDEIHEVAAVRSVTVEYNIHAVLDALLASSSLRTFVVDKSLSIEEFVDVV
KEQVSDLLAKLLRGMPIPDFDLDDFIDTPCYCFNADGDVSWSSTMIFSLHPVECEDDS
FECDSDQDDDQESVCEPLVEETNVQVQESDDDGWAAAVEEAFPIEELEEPPVQVVPND
SVVRSQVAQPIEIVVQETPVQPLEDVAPAVATPSIQLQEIQTEVLDTPPVYEADIEQT
QIVVSKPKRLRKKRNVDPLFNFEHKVITDCVTMVLGDAIQVAKCYDEAVLVNAANTYL
KHGGGIAGAINAASNGAVQQESDEYILAKGPLQVGDSVLLQGHSLAKNILHVVGPDAR
AKQDVSLLGKCYKAMNAYPLVVTPLVSAGIFGVQPSVSFDYLIREVKTRVLVVVNSQD
IYKSLTTVEVPQGLTFSYDGLRGALRKARDYGFTVFVCTDNSANTKVLRNKGVDYTKK
STTVDGVQYYCYTAKDTLDSIVLEANKASGIISMPLGYVSHGLDLMQAGAIVRRVKVP
YVCLLANKEQEAILMSEDVKLSPSADFVKHVRTNGGYNSWHLVEGELLVRDLTLNKLL
HWSDQTICYKSDKFYVVKNGVALPFETLAACRTYLDSRTAQQLTIEVLVTVDGVNFRT
VVLNNKSSYRSQLGCVFYNGADISDTIPDEKQNGCSLYLADNLTADETKVLKELYGPV
DPTFLHRFYSLKAVVQKWKMVVCDKVRSLKLSDNNCYINVVIMILDLLKDIKFVIPAL
QHAFMKHKGGDSTEFIALIMTYGNCTFGAPDDATRLLHTVLAKAELCCSARMVWREWC
NVCGIKDVVIQGLKACCYVGVQTVEDLHARMTYVCQCGGERHRQLVEHTAPWLLLSGT
PNEKLVTTSTAPDFVAFNVFQGLETAVGHYVHARLKDGLILKFDSGTLSKTSDWKCKV
TDVLFPNQKYSSDCNVVRYSLDGKFRTEVDPDLSAFYVKDGKYFTSEPPVTYSPATVL
AGSVYTNSCLVSSDGQPGGDAISLSFNNLLGFDSSKPVTKKYTYSILPKEDGDVLLAE
FSTYDPIYKNGAMLKGKPVLWVTNASYDATLNKFNRATLRQIYDVAPIEIENKYTPLS
VEPSPVEKVSTVEVALAKPELTIVKCKGLIKPFVKANVSFVSDETGLPVVEYLSKEDL
HTLYVDPKYQVIVLKDNALSTIFRLHTVESGDLNVVAASGSLTRKVKLLFRASFYFKE
LASRTLTATTVVGSCINSVVRHLGVTKGILASLFSFVKMLFVLPLSYFSDSETSTTEV
KVSALKTAGVVTGNVLKQCCTAAVDLSMDKLRRVDWKATLRLLLMLCTTMVLLSSVYH
LYVFNQVLSSDVMFEDAQGLKKFYKEVRAYLGVSSGCDGLAAAYRANSFDVPTFCANR
SVMCNWCLINQDSITHYPALKMVQTHLSHYVLNIDWLWFALEVGLAYILYTSAFNWLL
LAGTLQYFFAQTSIFVDWRSYNYVVSSAFWLFTHIPMPGLVRIYNLLACLWLLRKFYQ
HVINGCKDTACLLCYKRNRLTRVEASTVVCGGKRTFYIAANGGISFCRRHNWNCVDCD
TAGVGNTFICEEVASDLTTTLRRPVNSTDRSHYYVDSVLVKETVVQFNYRRDGQSCYE
RFPLCAFTNLDKLKFKEVCKTTTGIPEYNFIIYDSSDRGQESLARSACVYYSQVLCKS
ILLVDSSLVTSVGNSGEIAIKMFDSFVNSFVSLYNVTRDKLEKLISTARDGVKRGDNF
HSVLTTFIDAARGPAGVESDVETNEIVDSVQYAHKHDIQLTNESYNNYVPSYVKPDSV
STGDLGSLIDCNAASVNQTSMRQANGACIWNAAAYMKLSDVLKRQIRIACRKCNLAFR
LTTSKLRANDNMLSVKFTATKIVGGAPTWFNTLRDFTLKSYVFVTIIVFLCAVLMYFC
LPTFAMAPVEFYEDRILEYKVLDNGIIRDISPDDKCFANKYRSFSQWYHEHVGGSYDN
SISCPLTVAVIAGVAGARIPDVPTTLAWVNRQIVFFVSRVFANSNSVCYTPINEIPYK
SFSDSGCILPSECTMFRDAEGRMSPYCYDPTVLPGAFAYSQMKPHVRYDLYDTNMFIK
FPEVVFESTLRITKTLTTQYCRFGSCEYAQEGVCITTNGSWAIFNDHHLSRPGVYCGS
DYVDIVRRLAVSLFQPITYFQLTTSLVLGIGLCAFLTLLFYYINKVKRAFADYTQCAM
IAVIAAVLNSLCICFVSSIPLCIVPYTALYYYATFYFTNEPAAIMHVSWYIMFGPIVP
MWLTCVYTVAMCFRHFFWVVAYFSKKHVEVFTDGKLNCSFQDAASNIFVVNKDTYAAL
RNAITNDVYSRYLGLFNKYKYYSGAMETAAYREAAACHLAKALQTYSETGSDLLYQPP
NCSITSGVLQSGLVRMSHPSGDVEACMVQVTCGSMTLNGLWLDNTVWCPRHVMCPADQ
LADPNYDALLVSMTNHSFSVNKHIGAPANLRVIGHAMQGTLLKLTVDVANPSTPAYTF
TTVKPGASFSVLACYNGRPTGTFTVVMRPNYTIKGSFLCGSCGSVGYTKEGSVINFCY
MHQMELANGTHTGSAFDGTMYGAFLDKQVHQVQLTDKYCSTNVVAWLYAAILNGCAWF
VKSNRTSIVSFNEWALANQFTEFVGTQSIDMLAVKTGVAIEQLLYAIQQLHTGFQGKQ
ILGSSMLEDEFTPEDVNMQIMGVVMQSGVRKVTYGTAHWLFATFVLSYVVFLQTTKFT
LWNYLFETIPTQLFPLLFVTVACVMLLVKHKHTFLTLFLLPVAICLTYANIVYEPATP
ISSALIAVANWLAPTNVYMRTTHTDIGVYISLSLVLAIVVKRLYNPSLSNFALALCSG
VMWLYTYSVGEVSSPIAYLVFVTTLTSDYTITVFVTVNLAKICTYIIFAYAPQLTLVF
PEVKMILLLYTCFGFMCTCYFGVFSLLNLKLRAPMGVYDFKVSTQEFRFMTANNLTAP
RNSWEAMSLNFKLLGIGGTPCIKVAAIQSKLTDLKCTSVVLLSVLQQLHLEANSKAWA
FCVKCHNDILAATDPSEAFEKFVSLFATLMTFSGNVDLDALASDIFETPSVLQATLSE
FSHLATFAELEAAQRAYQEAMDSGDASPQVLKALQKAVNVAKNAYEKDKAVARKLERM
AEQAMTSMYKQARAEDKKAKIVSAMQTMLFGMIKKLDNDVLNGIISNARNGCIPLSVV
PLCASNKLRVVIPDFTVWNQVVTYPSLNYAGALWDIAVINNVDNEIVKSSDVVENNES
LTWPLVLECTRAASSAIKLQNNEIKPSGLRTMVVSAGQEQTNCNTSSLAYYEPVQGRK
MLMALLSDNAYLKWARVEGQEGFVSVELQPPCKFLIAGPKGPEIRYLYFVKNLNNLHR
GQVLGHIAATVRLQAGSNTEFAANSSVLSLVNFTVDPQKAYIDFVNAGGAPLTNCVKM
LTPKTGTGIAISVKPESTADQETYGGASVCLYCRAHIEHPDVSGVCKYKGKFVQIPSQ
CTRDPVGFCLTNTPCNVCQYWIGYGCNCDSLRQAALPQSKDSNFLNESGVLL"
misc_feature 127..636
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="non-structural protein 1 from Middle East
respiratory syndrome-related coronavirus and
betacoronavirus in the C lineage; Region:
MERS-CoV-like_Nsp1; cd21878"
/db_xref="CDD:409340"
misc_feature 646..2625
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="betacoronavirus non-structural protein 2 (Nsp2)
similar to MERS-CoV Nsp2, and related proteins from
betacoronaviruses in the C lineage; Region:
betaCoV_Nsp2_MERS-like; cd21517"
/db_xref="CDD:394868"
misc_feature 2686..2949
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="first ubiquitin-like (Ubl) domain located at the
N-terminus of coronavirus SARS-CoV non-structural protein
3 (Nsp3) and related proteins; Region:
Ubl1_cv_Nsp3_N-like; cd21467"
/db_xref="CDD:394822"
misc_feature <3016..3303
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="ribonuclease E; Reviewed; Region: rne; PRK10811"
/db_xref="CDD:236766"
misc_feature 3463..3834
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="X-domain (or Mac1 domain) of viral non-structural
protein 3 and related macrodomains; Region:
Macro_X_Nsp3-like; cd21557"
/db_xref="CDD:438957"
misc_feature order(3481..3489,3499..3519,3742..3747,3751..3765,
3829..3831)
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="ADP-ribose binding site [chemical binding]; other
site"
/db_xref="CDD:438957"
misc_feature 3886..4245
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="SUD-M macrodomain (or Mac3 domain) of the SARS
Unique Domain (SUD) of SARS-CoV non-structural protein 3
and related macrodomains; Region:
Macro_cv_SUD-M_Nsp3-like; cd21563"
/db_xref="CDD:394884"
misc_feature 4255..4482
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="C-terminal SARS-Unique Domain (SUD) of
non-structural protein 3 (Nsp3) from Middle East
respiratory syndrome-related coronavirus and related
betacoronaviruses in the C lineage; Region:
SUD_C_MERS-CoV_Nsp3; cd21523"
/db_xref="CDD:394839"
misc_feature 4495..5427
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="betacoronavirus papain-like protease; Region:
betaCoV_PLPro; cd21732"
/db_xref="CDD:409649"
misc_feature order(4816..4827,4978..4986,4990..4995,5002..5004,
5014..5016,5092..5094,5116..5121,5164..5166,5170..5172,
5239..5241,5296..5298,5317..5328,5413..5415)
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="ubiquitin binding site [polypeptide binding]; other
site"
/db_xref="CDD:409649"
misc_feature order(4978..4986,5236..5241,5296..5298,5305..5307,
5317..5319,5326..5328,5413..5415)
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="polypeptide substrate binding site [polypeptide
binding]; other site"
/db_xref="CDD:409649"
misc_feature 5536..5904
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="nucleic acid binding domain of non-structural
protein 3 from Middle East respiratory syndrome-related
coronavirus and betacoronavirus in the C lineage; Region:
MERS-CoV-like_Nsp3_NAB; cd21823"
/db_xref="CDD:409349"
misc_feature 5962..6324
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="betacoronavirus-specific marker of non-structural
protein 3 from Middle East respiratory syndrome-related
coronavirus and betacoronavirus in the C lineage; Region:
MERS-CoV-like_Nsp3_betaSM; cd21815"
/db_xref="CDD:409630"
misc_feature 6574..8271
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="C-terminus of non-structural protein 3, including
transmembrane and Y domains, from Middle East respiratory
syndrome-related coronavirus and betacoronavirus in the C
lineage; Region: TM_Y_MERS-CoV-like_Nsp3_C; cd21716"
/db_xref="CDD:409664"
misc_feature 6574..6639
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="TM1 [structural motif]; Region: TM1"
/db_xref="CDD:409664"
misc_feature 6961..7029
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="TM2 [structural motif]; Region: TM2"
/db_xref="CDD:409664"
misc_feature 8329..9471
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="coronavirus non-structural protein 4 (Nsp4)
transmembrane domain; Region: cv_Nsp4_TM; cd21473"
/db_xref="CDD:394836"
misc_feature 8329..8397
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="putative TM helix 1 [structural motif]; Region:
putative TM helix 1"
/db_xref="CDD:394836"
misc_feature 9130..9195
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="putative TM helix 2 [structural motif]; Region:
putative TM helix 2"
/db_xref="CDD:394836"
misc_feature 9238..9303
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="putative TM helix 3 [structural motif]; Region:
putative TM helix 3"
/db_xref="CDD:394836"
misc_feature 9382..9450
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="putative TM helix 4 [structural motif]; Region:
putative TM helix 4"
/db_xref="CDD:394836"
misc_feature 9529..9786
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="Coronavirus nonstructural protein 4 C-terminus;
Region: Corona_NSP4_C; pfam16348"
/db_xref="CDD:406690"
misc_feature 9802..10692
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="betacoronavirus non-structural protein 5, also
called Main protease (Mpro); Region: betaCoV_Nsp5_Mpro;
cd21666"
/db_xref="CDD:394887"
misc_feature order(9802..9825,9832..9834,10153..10155,10165..10185,
10210..10224,10297..10299,10315..10317,10648..10650,
10660..10662,10684..10689)
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="homodimer interface [polypeptide binding]; other
site"
/db_xref="CDD:394887"
misc_feature order(9853..9855,9862..9870,9937..9939,9952..9954,
10219..10236,10288..10299,10303..10305,10315..10317,
10360..10362,10366..10374)
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="polypeptide substrate binding site [polypeptide
binding]; other site"
/db_xref="CDD:394887"
misc_feature 10711..11586
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="betacoronavirus non-structural protein 6; Region:
betaCoV-Nsp6; cd21560"
/db_xref="CDD:394846"
misc_feature 11587..11835
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="betacoronavirus non-structural protein 7; Region:
betaCoV_Nsp7; cd21827"
/db_xref="CDD:409253"
misc_feature order(11590..11592,11599..11610,11617..11625,11629..11634,
11641..11643,11668..11670,11677..11679,11695..11697,
11731..11748,11752..11769,11788..11802)
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="oligomer interface [polypeptide binding]; other
site"
/db_xref="CDD:409253"
misc_feature 11845..12432
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="betacoronavirus non-structural protein 8; Region:
betaCoV_Nsp8; cd21831"
/db_xref="CDD:409258"
misc_feature order(12073..12078,12085..12090,12094..12102,12106..12114,
12118..12123,12127..12135,12145..12150,12154..12159,
12163..12171,12175..12204,12211..12213,12217..12231,
12235..12237,12247..12249,12259..12261,12283..12285,
12298..12300,12325..12327,12370..12372,12388..12390)
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="oligomer interface [polypeptide binding]; other
site"
/db_xref="CDD:409258"
misc_feature 12433..12762
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="betacoronavirus non-structural protein 9; Region:
betaCoV_Nsp9; cd21898"
/db_xref="CDD:409331"
misc_feature 12433..12450
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="N-finger; other site"
/db_xref="CDD:409331"
misc_feature order(12439..12447,12451..12459,12640..12645,12709..12714,
12718..12726,12730..12738,12742..12750,12754..12762)
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="homodimer interface [polypeptide binding]; other
site"
/db_xref="CDD:409331"
misc_feature 12763..13155
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="alphacoronavirus and betacoronavirus non-structural
protein 10; Region: alpha_betaCoV_Nsp10; cd21901"
/db_xref="CDD:409326"
misc_feature order(12763..12786,12796..12798,12802..12810,12814..12825,
12835..12840,12847..12852,12859..12861,12880..12897,
12934..12939,12967..12969,12973..12978,12988..12990,
12994..13011,13024..13032,13039..13050)
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="Nsp14 interface [polypeptide binding]; other site"
/db_xref="CDD:409326"
misc_feature order(12802..12810,12814..12822,12835..12837,12880..12882,
12886..12897,12934..12942,12994..13005,13012..13014,
13045..13050,13105..13107)
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="oligomer interface [polypeptide binding]; other
site"
/db_xref="CDD:409326"
misc_feature order(12823..12828,12835..12837,12886..12897,12934..12942,
13012..13014,13045..13050,13105..13107,13117..13119)
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="homotrimer interface [polypeptide binding]; other
site"
/db_xref="CDD:409326"
misc_feature order(12880..12903,12934..12939,12967..12978,12991..12996,
13000..13002,13039..13050)
/gene="ORF1ab"
/locus_tag="CAU86_gp02"
/note="Nsp16 interface [polypeptide binding]; other site"
/db_xref="CDD:409326"
gene 21232..25269
/gene="S"
/locus_tag="CAU86_gp03"
/db_xref="GeneID:37627554"
CDS 21232..25269
/gene="S"
/locus_tag="CAU86_gp03"
/note="surface glycoprotein"
/codon_start=1
/product="spike protein"
/protein_id="YP_009361857.1"
/db_xref="GeneID:37627554"
/translation="MTYSVSLLMCLLTFIGANAKIVSIPGGVGTGACPQVDMQPSYFI
KHNWPEPIDMNKADGVIYPNGRTYSNITLQTTNLFPRNGDLGTQYVYSASNEKSRTSN
VAFISNYSYYGNPFGDGIVIRIGQNSNKTGSVIVGTAQTTIKKIYPALMLGSSFGNFS
VNNKSGAYFNHTLLILPSKCGTVFQVAYCLLQPRTDSYCPGNANYVSYALIDSPTDCT
SADESKRRNGLEDIKKYFNLVNCTYFEEFNVTADERAEWFGITQDSQGVHLYTSRKNG
FNSNNLFLFASVPIYDKINYYTVIPRSIITPANQRSAWAAFYVYPLHQLSYLLNFDVN
GYITQAADCGYNDYTQLVCSYGDFNMKSGVYSTSYYSAKPVGAYYEAHVYPDCNFTDL
FRENAPTIMQYKRQVFTRCNYNLTLLLSLVQVDEFVCDKITPEALATGCYSSLTVDWF
AFPYAWKSYLAIGSADRIVRFNYNQDYSNPSCRIHSKVNSSVGISYSGLYSYITNCNY
GGFNKDDVVKPGGRASQPCVTGALNSPTNGQVWSFNFGGVPYRTSRLTYTDHLKNPLD
MVYVITVKYEPGAETVCPKQVRPDYSTNITGLLGSCISYDIYGITGTGVFQLCNATGI
PQQKFVYDKFDNIIGFHSDDGNYYCVAPCVSVPVSVIYDDNTNQYATLFGSVACQHIS
TMAAQFSRETRASLVSRNMQNLLQTSVGCVMGFHETNDTVEDCNLSLGQSLCAIPPNT
NLRVGRSTFGLGSLAYNSPLRVDALNSSEFKVSLPLNFTFGVTQEYIETSIQKITVDC
KQYVCNGFAKCEKLLEQYGQFCSKINQALHGANLRQDDFVRNLFESVKTPQTVPLTTG
FGGEFNLTLLEPLSVSTGSSNARSALEELLFDKVTIADPGYMQGYDDCMQQGPASARD
LICAQYVAGYKVLPPLMDVNMEAAYTSSLLGSIAGAGWTAGLSSFAAIPFAQSIFYRL
NGVGITQQVLSENQKIIANKFNQALGAMQTGFTTTNEAFQKVQDAVNTNAQALAKLAS
ELSNTFGAISSSIGDIIQRLDVLEQEVQIDRLINGRLTTLNAFVAQQLVRSESAARSA
QLAKDKVNECVKSQSTRSGFCGQGTHIVSFVINAPNGLYFMHVGYHPSQHIEVVAAYG
LCDAANPTNCIAPVNGYFIKNQTTRGVDDWSYTGSSFYAPEPITTLNTRYVAPQVTFQ
NISTNLPPPLLGNSTGTDFKDELDEFFKNVSTSIPNFGALTQINTTLLDLSDEMLALQ
QVVKALNESYIDLKELGNYTYYNKWPWYIWLGFIAGLLALALCVFFILCCTGCGTSCL
GKLKCNRCCDKYEEYDLEPHKIHIH"
misc_feature 21316..22293
/gene="S"
/locus_tag="CAU86_gp03"
/note="N-terminal domain of the S1 subunit of the Spike
(S) protein from Middle East respiratory syndrome-related
coronavirus and related betacoronaviruses in the C
lineage; Region: MERS-CoV-like_Spike_S1_NTD; cd21626"
/db_xref="CDD:394952"
misc_feature order(21316..21327,21709..21711,21814..21822,21832..21840,
21868..21870,22144..22146)
/gene="S"
/locus_tag="CAU86_gp03"
/note="neutralizing Ab binding site [polypeptide binding];
other site"
/db_xref="CDD:394952"
misc_feature order(21415..21438,21442..21444,21457..21459,21568..21570,
21694..21699,21724..21726,22021..22026,22054..22056,
22078..22080,22102..22104,22108..22110,22117..22119,
22228..22239,22291..22293)
/gene="S"
/locus_tag="CAU86_gp03"
/note="trimer interface [polypeptide binding]; other site"
/db_xref="CDD:394952"
misc_feature 22345..22983
/gene="S"
/locus_tag="CAU86_gp03"
/note="receptor-binding domain of the S1 subunit of the
Spike (S) protein from Middle East respiratory syndrome
coronavirus; Region: MERS-like_CoV_Spike_S1_RBD; cd21479"
/db_xref="CDD:394826"
misc_feature order(22444..22446,22507..22518,22522..22533,22537..22539,
22549..22551,22561..22563,22567..22569,22660..22662,
22669..22671,22789..22797,22852..22854,22951..22956,
22960..22962)
/gene="S"
/locus_tag="CAU86_gp03"
/note="trimer interface [polypeptide binding]; other site"
/db_xref="CDD:394826"
misc_feature order(22693..22704,22717..22761,22777..22800,22822..22854,
22876..22920,22924..22929)
/gene="S"
/locus_tag="CAU86_gp03"
/note="receptor binding motif; other site"
/db_xref="CDD:394826"
misc_feature order(22744..22746,22756..22758,22822..22842,22879..22881,
22885..22887,22891..22893)
/gene="S"
/locus_tag="CAU86_gp03"
/note="receptor binding site [polypeptide binding]; other
site"
/db_xref="CDD:394826"
misc_feature 23050..25089
/gene="S"
/locus_tag="CAU86_gp03"
/note="SD-1 and SD-2 subdomains, the S1/S2 cleavage
region, and the S2 fusion subunit of the spike (S)
glycoprotein from Middle East respiratory syndrome
coronavirus and related betacoronaviruses in the C
lineage; Region: MERS-CoV-like_Spike_SD1-2_S1-S2_S2;
cd22379"
/db_xref="CDD:411966"
misc_feature order(23083..23085,23380..23382,23527..23529,23560..23562,
23815..23817,24733..24735,24880..24882,24928..24930,
24973..24975,25036..25038,25069..25071)
/gene="S"
/locus_tag="CAU86_gp03"
/note="N-linked glycosylation sites [posttranslational
modification]; other site"
/db_xref="CDD:411966"
misc_feature order(23434..23442,23455..23478,23482..23538)
/gene="S"
/locus_tag="CAU86_gp03"
/note="S1/S2 cleavage region; other site"
/db_xref="CDD:411966"
misc_feature 23773..23832
/gene="S"
/locus_tag="CAU86_gp03"
/note="fusion peptide; other site"
/db_xref="CDD:411966"
misc_feature 23869..23922
/gene="S"
/locus_tag="CAU86_gp03"
/note="internal fusion peptide; other site"
/db_xref="CDD:411966"
misc_feature 24181..24378
/gene="S"
/locus_tag="CAU86_gp03"
/note="heptad repeat 1 [structural motif]; Region: heptad
repeat 1"
/db_xref="CDD:411966"
misc_feature 24940..25065
/gene="S"
/locus_tag="CAU86_gp03"
/note="heptad repeat 2 [structural motif]; Region: heptad
repeat 2"
/db_xref="CDD:411966"
misc_feature 25153..25266
/gene="S"
/locus_tag="CAU86_gp03"
/note="Coronavirus spike glycoprotein S2, intravirion;
Region: CoV_S2_C; pfam19214"
/db_xref="CDD:437051"
gene 25284..25589
/gene="ORF3"
/locus_tag="CAU86_gp04"
/db_xref="GeneID:37627553"
CDS 25284..25589
/gene="ORF3"
/locus_tag="CAU86_gp04"
/codon_start=1
/product="ORF3 protein"
/protein_id="YP_009361858.1"
/db_xref="GeneID:37627553"
/translation="MRVQRPPTLLLVVGLTLLALAYSKPLYVPEHCQNYSGRMLRACI
RTAQTDTVGLYTNLVIQTGTATFESAVPVDRGSPSTHADTYELNTSVTLFDVGYSVN"
gene 25598..25927
/gene="ORF4a"
/locus_tag="CAU86_gp05"
/db_xref="GeneID:37627559"
CDS 25598..25927
/gene="ORF4a"
/locus_tag="CAU86_gp05"
/codon_start=1
/product="ORF4a protein"
/protein_id="YP_009361859.1"
/db_xref="GeneID:37627559"
/translation="MDYVSLLNQIWQKYLNLPDTVCLYIPKPASSFKPVAGTSLHPVQ
WECKITFAGYTEVAVNSTKALAKQDAARRIMWLLHRDGGIPDGCSLHMRHSSIFSDVP
EETPFSE"
misc_feature 25670..25822
/gene="ORF4a"
/locus_tag="CAU86_gp05"
/note="double-stranded RNA binding motif (DSRM)
superfamily; Region: DSRM_SF; cd00048"
/db_xref="CDD:380679"
gene 25803..26579
/gene="ORF4b"
/locus_tag="CAU86_gp06"
/db_xref="GeneID:37627560"
CDS 25803..26579
/gene="ORF4b"
/locus_tag="CAU86_gp06"
/codon_start=1
/product="ORF4b protein"
/protein_id="YP_009361860.1"
/db_xref="GeneID:37627560"
/translation="MQPAELCGCSIEMEEYPMDVHSTCVTPASSRMFRKRRHSPSRNL
RYVKRRFSSLRPEDISLVTEPTHYLRVIFHSPNTWYIRSGHDLDSVHKWLKPYGGIPV
NEYHITLALLSLSEQHLAMDISPIAIFLRNVRFELFDFTLLRKTLALKASEICCDNLH
RFQPITRVNMALPLIKEWLRVQGFPIYNSHLPLHMSVSKLHALDDNTCEYVANMSCFK
QYPTQMFVRPIAVELVSIRQSSNAPRCIVHSVPILHAPGF"
misc_feature 25857..26573
/gene="ORF4b"
/locus_tag="CAU86_gp06"
/note="accessory protein ORF4b, also known as
non-structural protein 3c (NS3c) in Middle East
respiratory syndrome (MERS)-related CoV; Region:
ORF4b_MERS-CoV-like; cd21651"
/db_xref="CDD:394925"
gene 26586..27263
/gene="ORF5"
/locus_tag="CAU86_gp07"
/db_xref="GeneID:37627562"
CDS 26586..27263
/gene="ORF5"
/locus_tag="CAU86_gp07"
/codon_start=1
/product="ORF5 protein"
/protein_id="YP_009361861.1"
/db_xref="GeneID:37627562"
/translation="MAFSLALFKPISLVPAFPEAHGGEPAQFANVFTCIPTVGYIAAL
TVNVCILPLLLLIPQDTCRRSIFKTSILYGLFVYNFILAITLINGVYTPTGGTLVAFL
VVLMITWLADRVRFCLLLRSYIPLFDMRSHFIRVSTVSSYGMVPVNQTKPLFIRNFDQ
RCRCSRCFYVHSSHYLECTYISRFTKVSLVAVTDFSLNGITSTVFVPSTRDSVPLHII
APSVLSV"
misc_feature 26592..27254
/gene="ORF5"
/locus_tag="CAU86_gp07"
/note="Non-structural protein ORF5 from Middle East
respiratory syndrome-related coronavirus and related
betacoronaviruses in the C lineage; Region:
MERS-CoV-like_ORF5; cd21645"
/db_xref="CDD:394928"
gene 27342..27590
/gene="E"
/locus_tag="CAU86_gp08"
/db_xref="GeneID:37627561"
CDS 27342..27590
/gene="E"
/locus_tag="CAU86_gp08"
/codon_start=1
/product="small envelope protein"
/protein_id="YP_009361862.1"
/db_xref="GeneID:37627561"
/translation="MLPFVQQQLGSFIVNFFIFTVACAVILLVCMAFLTATRLCVQCI
TGVNTLLVQPAVYMYNTGRSVYVKFQESKPPLPPDEWV"
misc_feature 27345..27584
/gene="E"
/locus_tag="CAU86_gp08"
/note="Middle East respiratory syndrome-related
coronavirus Envelope small membrane protein and similar
proteins; Region: MERS-CoV-like_E; cd21533"
/db_xref="CDD:394860"
misc_feature order(27363..27365,27384..27389,27393..27398,27402..27428,
27432..27434,27480..27488,27510..27512,27519..27524,
27528..27536)
/gene="E"
/locus_tag="CAU86_gp08"
/note="putative homopentameric interface [polypeptide
binding]; other site"
/db_xref="CDD:394860"
gene 27605..28264
/gene="M"
/locus_tag="CAU86_gp09"
/db_xref="GeneID:37627556"
CDS 27605..28264
/gene="M"
/locus_tag="CAU86_gp09"
/codon_start=1
/product="membrane glycoprotein"
/protein_id="YP_009361863.1"
/db_xref="GeneID:37627556"
/translation="MSNMTQLTEQQIISIIKDWNFAWSLIFLLITIVLQYGYPSRSMT
VYVFKMFVLWLLWPSSMALSIFSAVYPIDLASQIISGIIAGVSALMWISYFVQSIRLF
MRTGSWWSFNPETNCLLNVPLGGTTVVRPLVEDSTSVTAVVANGYLKMAGMHFGACDY
DRLPSEVTVAKPNVLIALKMVKRQSYGTNSGVAIYHRYKAGNYRSPPITADSELALLR
A"
misc_feature 27608..28258
/gene="M"
/locus_tag="CAU86_gp09"
/note="Membrane (or Matrix) protein from Middle East
respiratory syndrome-related coronavirus and related
betacoronaviruses in the C lineage; Region:
MERS-like-CoV_M; cd21567"
/db_xref="CDD:394853"
gene 28311..29558
/gene="N"
/locus_tag="CAU86_gp10"
/db_xref="GeneID:37627557"
CDS 28311..29558
/gene="N"
/locus_tag="CAU86_gp10"
/codon_start=1
/product="nucleocapsid phosphoprotein"
/protein_id="YP_009361864.1"
/db_xref="GeneID:37627557"
/translation="MATPAAPRAVSFADNNDNSNNNQSRGRGRNPKPRPAPNNTVSWY
TGLTQHGKVSLSFPPGQGVPLNANSTPAQNAGYWRRQDRKINTGNGTKSLAPRWYFYY
TGTGPEANLPFRAVKDGIIWVHEDGATDAPSTFGTRNPNNDAAIVTQFAPGTKLPKNF
HIEGTGGNSQSSSRASSASRNSSRSNSRGSRSGNSSRGTSPGPSGVGAVGGEMLYLDL
LNRLQALESGKTKQAQPKVITKKDAVAAKNKMRHKRVATKGFNMVQAFGLRGPGDLQG
NFGDLQLNKLGTEDPRWPQIAELAPSASAFIGMSQFKLTHQSNDTDGAPVYFLRYSGA
IKLDPKNPNYNKWLELIEQNVDAYKTFPKKEKKQKAPKEEPSDQMNVQPPKEQRVQGS
ITQRSRTPRPSVQPGPMTDVNTD"
misc_feature 28401..29465
/gene="N"
/locus_tag="CAU86_gp10"
/note="Coronavirus nucleocapsid protein; Region:
Corona_nucleoca; pfam00937"
/db_xref="CDD:425955"
misc_feature order(28431..28448,28599..28601,28605..28607,28611..28613,
28722..28724,28743..28745)
/gene="N"
/locus_tag="CAU86_gp10"
/note="RNA binding site [nucleotide binding]; other site"
/db_xref="CDD:439219"
CDS 28357..28938
/gene="N"
/locus_tag="CAU86_gp10"
/codon_start=1
/product="Orf8b"
/protein_id="YP_009944307.1"
/db_xref="GeneID:37627557"
/translation="MTTPIITSLEEEEETLNLDLHQITLSPGTRGLPNTGKSLFPSHL
DRAYLLMPILPLRKMLGIGGDRTEKLIQEMEPSHWLPGGTSTTLEPDLRPTSLSELSR
TESSGSMRMAPLMLLQLLGRGTLTMMLLLLRNSRPVLSFLKTSTLKGLEAIANHLQER
LVPAETLLDPIPEVPDLVTPPAALPQVHLESEL"
misc_feature 28522..28833
/gene="N"
/locus_tag="CAU86_gp10"
/note="MERS-CoV ORF8b protein and related Merbecovirus
proteins; Region: merbe_CoV_ORF8b-like; cd21661"
/db_xref="CDD:394942"
3'UTR 29559..29642
ORIGIN
1 cttgtacgtc tcggtcacaa tatacggttc catccggtgc gtggcaattc ggggcacatc
61 atgtctttcg tggctgatgt gaccgcgcaa ggtgcgcgcg gtacgtatcg agcagcgctc
121 aactctgaaa aacatcatga ccatgtgtct ctaactgtcc cactctgtgg ttcaggagac
181 ctggtttcaa aactttcacc atggttcatg gatggctatg atgcctgtga agcggtgaag
241 gtcatgttat ctaacaaaga gaagttactc tttgtgccca tccgtctggt tgggtatact
301 aagcatctcc caggccctcg cgtttacctg gttgagaggc tcattaacgg tatttatacc
361 gatcctttta tggttaacca agtggcttat agctctagtg caaatgctgg ccttgttggc
421 acaactttgc agggcaagcc tattggtctc ttcttcccct ttgacgccga tcttgttact
481 ggagatcata cctttctcct gcgcaagtat gggcgtggtg gttatcacta cactcctttt
541 cattatgagc gtgatgccac ttcccgtcct gagtggatgg atgacctcga agcagatcca
601 aagggcaagt atgcccagaa tttgcttaag aagttgattg gtggtgatgt caccccagtc
661 gaccaataca tgtgtggtgt tgatggaaag cccattaacg actatgcagg tttaatggct
721 aaggagggaa taaccaaatt ggctgacatt gaagctgatg tagcatcacg tgttgatgct
781 gatggcttca ttgtgttgaa gaacaagttg tacagattgg tttggcatgt tgagcgtaag
841 gacgttcagt atgccaaaca atcaatcttc actattaata gtgtggttca aagggaaggt
901 ctccaagaca ttccccctca ctactttact cttggtggta agattgacat gcttgttcca
961 cgtaacaagt ggaatggtgt ggctaactta cctcttaaac agaaaattct ttatacattc
1021 tatggtaaag agtctcttga gaaccattct tacatttacc attctgcgtt cactgattgt
1081 ggaggctgcg gtaatggttc atggcttaca gggaacgctg ttcagggttt ctcctgtggt
1141 tgtggggcat catatttgtc taatgatgtc gaagttcaat catctggctt gataaagcca
1201 aatgcccttt tttgtgcgac ttgtcccttt gctaaaggtg acagttgttc ttctagctgc
1261 aaacattcaa ttgctcaatt ggttagttac ctttctgagc gttgtaatgt tattgcagat
1321 tctaaatcct tcacgcttgt ctttggaggc gttgcttacg cttactttgg ctgtgaggaa
1381 ggtactatgt actttgttcc tagagctaag tctgtggtgt cgaagattgg agattccatc
1441 ttcacaggct gtacaggttc ttggaccaaa gttactcaga ttgctaacct gtttttagaa
1501 cagacccagc gttctcttaa ttttgtggga gaattcgtgg tcaacgatgt tgtcctcgca
1561 attctttcag gaacaacaac caatgtggac aagttacgtg agcttcttaa agggatcact
1621 cttgagaagc tacgtgatta ccttgccgat tatgacgttg ctgtcacact cggtcctttt
1681 atggataatg ctgttaatgt tggtggtaag ggtctgcaat acgctaccat tacagcaccc
1741 tttttagttc tcactggttt aggtgagtcc tttaagaaag ttgcagccat accgtataag
1801 gtttgcaaat cttttaagga gactttgtcc tattatgctg atagcatatt gtacagagtc
1861 tttccttatg acatggattc tgatgtgtca tcttttactg agctactgtt tgactgtgtt
1921 ggtctgtcag tggcttccac ctatttcata gttcgcctgt tgcaagataa gacaggtgac
1981 ttcatgtcca ctatactttc atcatgccag tctgctgtac gtaagctcct tgacacttgt
2041 cttgaagcca ctgaagcaac tctcaacttc ttgttggagc tggcaaatct tttcaagatc
2101 tttctccgcg gagcctacgt ctatacgtca cagggctttg tggtgctcca gggcaaaatg
2161 tcttcacttg ttaaacaagt agtggacttg ctcaataagg gtatgcaatt gttgcataca
2221 aaggtctcct gggccggctc taaagtcagt gctgttattt acagtggccg ggaatctttg
2281 attttcccta caggaactta ttattgtgtt agcaccaaag caaaatctgt ccaacatcag
2341 ttcgatgtga tcttgcctgg tgattgttct aagaagcagt taggtctgct tgaacctact
2401 gacaactcta caacggttga ggttactgta tccagtaaca cggttgaaac tgttgtaggt
2461 caacttgaac agactaatat gcatagtcct gatgttatag taggagacta tgttattatt
2521 agtgataaac tgtttgtgcg aagcaaggaa gaagaccgcg ttgtcttcta tcctgcttgt
2581 actaatggta ctgctgtacc taccttgttt aaacttaaag gtggtgcacc tgttaagagg
2641 gtagcttttg gtgatgatga gatccatgaa gttgctgctg taagaagtgt aaccgtcgag
2701 tacaacattc atgctgtatt agacgcactg cttgcttctt ctagtcttag aacttttgtt
2761 gtagataagt ctttgtcaat agaggaattt gttgacgtag taaaagagca agtctctgat
2821 ttgctcgcca aattgctgcg tggaatgcca attcctgatt ttgacttaga cgattttatt
2881 gacacaccat gttactgctt taatgctgat ggtgatgtgt cctggtcctc cactatgatc
2941 ttctcattac accctgtgga gtgtgaagat gatagttttg agtgtgactc tgaccaagat
3001 gatgatcaag agtctgtttg tgaaccattg gttgaggaaa ccaatgttca ggtacaagag
3061 tctgacgatg atgggtgggc tgctgctgtt gaagaggcat tccccataga agagttagaa
3121 gaacctcctg tccaggtcgt gcccaacgat tctgttgtta ggagtcaagt cgcacagcct
3181 atagaaattg ttgtacagga aactcctgtg caacctcttg aggatgttgc gcctgcagtt
3241 gcaacgccta gtattcaact tcaggaaata cagactgaag tgttagatac accccctgtg
3301 tatgaagctg atatagagca aacacagatt gttgtttcaa aacctaagag attgcgcaaa
3361 aagcgtaatg ttgacccttt gtttaatttt gaacataagg tcattacaga ttgtgtcacc
3421 atggttttag gtgatgcaat tcaagtagct aagtgttatg atgaagctgt gttggttaat
3481 gctgccaaca catatcttaa gcatggcggt ggtatcgctg gtgctattaa cgcagcgtca
3541 aatggtgctg tacaacagga gtcagatgaa tacatcttgg ctaaagggcc actacaggta
3601 ggagattcag tcctcctgca gggtcattct ctcgctaaaa atatcttgca tgtcgtaggt
3661 cccgatgccc gcgctaagca ggatgtttct cttcttggta agtgctacaa ggctatgaat
3721 gcatatcctc ttgtagtaac tccacttgtt tcagcaggca tatttggcgt acagccttct
3781 gtgtcttttg attatcttat tagagaggtc aaaactagag tattagttgt tgttaactct
3841 caagatattt ataaaagtct tactacagtg gaagttccgc agggtttaac tttctcctat
3901 gatgggttgc gtggggcgct gcgtaaagcc agagattatg gttttactgt attcgtttgc
3961 actgacaact cagccaacac taaagttctt agaaacaaag gtgttgatta tactaagaag
4021 tccactactg tggatggcgt gcaatattat tgctacaccg ctaaagatac tcttgatagt
4081 attgttctag aggctaataa agcttccgga attatatcta tgcctttggg atatgtatct
4141 catggtttag acttaatgca ggcaggagcc atagtacgta gagtaaaggt accctacgtg
4201 tgcctcctag ctaataaaga gcaagaagct attttaatgt ctgaggacgt taagttaagt
4261 ccttcagctg attttgtgaa gcatgtccgt actaatggag gttataactc ttggcatcta
4321 gtcgagggtg agctattagt acgtgatttg actcttaata agcttctgca ttggtctgat
4381 caaaccatat gctataagtc tgataagttt tatgtggtaa agaacggtgt tgctttgcca
4441 tttgaaactt tggctgcatg tcgtacctat cttgattcac gtacggcaca acagttgaca
4501 atcgaagtgc tcgtcacagt cgatggtgtt aattttagaa ctgtggttct aaataataag
4561 agctcctata gatctcagct tggctgcgtg ttctataatg gtgctgatat ttctgatacc
4621 attcctgatg aaaaacagaa tggttgcagc ttgtatttag cagacaattt gactgctgat
4681 gaaacaaagg tgcttaaaga gttatatggc cctgttgatc ctacttttct acacagattc
4741 tattcactta aggcagtagt ccagaagtgg aagatggttg tgtgtgataa ggtacgttct
4801 ctcaaattga gtgataataa ttgctacatt aatgtggtaa ttatgattct tgatttgttg
4861 aaggacatta aatttgtaat acctgcttta cagcatgctt ttatgaaaca taagggcggt
4921 gattctaccg aattcattgc tctcattatg acttatggca attgcacatt tggtgcccca
4981 gacgatgcta ctcggttact tcacaccgtg cttgccaagg ctgagttatg ctgttcggca
5041 cgcatggttt ggagagagtg gtgcaacgtt tgtggcataa aagatgttgt catacaaggc
5101 cttaaggcat gttgttacgt gggtgtgcaa actgttgaag atctgcacgc acgcatgacg
5161 tatgtatgcc agtgtggtgg tgaaaggcat cgacaattag ttgagcacac cgcaccctgg
5221 ttgctactgt caggtacacc aaatgagaaa ttggtgacaa cctctacggc tcctgacttt
5281 gtagcattta atgtctttca ggggttagag acggctgtag gccattatgt ccatgcccgt
5341 ctgaaggatg gtcttatttt aaaatttgac tctggcactt taagcaagac ttccgattgg
5401 aagtgtaagg tgacagatgt cctatttccc aatcagaagt acagtagcga ctgtaatgtc
5461 gtgcgatact ctcttgatgg taagttcaga acagaggttg atcctgacct ttctgctttc
5521 tatgtcaagg atggtaaata ttttacaagt gagccacccg tgacttattc acctgctact
5581 gttttagcag gtagtgttta tactaatagt tgccttgtat cgtctgatgg acaacctggc
5641 ggtgatgcta ttagtttaag ttttaataac cttttagggt ttgattctag taaaccagtc
5701 acaaagaagt acacatattc cattcttcct aaggaggatg gagatgtttt gttggctgag
5761 tttagtactt atgaccctat ttacaagaac ggcgctatgc ttaaaggcaa acctgttctt
5821 tgggtcacca atgcatctta tgatgcaact cttaataagt tcaatagggc tactttacgt
5881 caaatatatg acgtagcacc cattgaaatt gaaaataaat acactccttt gagtgtagaa
5941 ccttcaccag ttgaaaaagt ttctactgtt gaagttgctt tagctaagcc agaactgaca
6001 attgtcaaat gcaagggttt gatcaaacca tttgtaaaag ccaatgtaag ttttgtttct
6061 gacgagacag gtcttcctgt tgtcgaatat ctgtctaagg aagatttaca tactttgtat
6121 gtcgatccta agtaccaagt cattgtctta aaggacaatg cactttctac tattttcaga
6181 ttgcacactg ttgaatctgg tgatttaaac gttgttgcag cttcaggttc tttaactcgt
6241 aaggttaagc tactttttag agcttccttt tactttaagg aacttgcttc ccgcactctc
6301 actgctacca ctgttgtagg tagttgtatt aacagtgttg tgcggcattt aggtgttact
6361 aaaggtatct tggcaagtct ttttagcttt gttaagatgc tatttgtgct tccactatct
6421 tattttagtg attcagaaac tagcaccact gaggtcaaag tcagtgcttt aaaaacagca
6481 ggcgttgtga cagggaatgt tttaaaacaa tgttgcaccg cagccgttga tttaagtatg
6541 gataagttac gtcgtgtgga ttggaaggca accttacgac tgttacttat gttgtgtaca
6601 actatggtat tgttgtcatc tgtgtatcac ttgtatgtgt ttaatcaagt actatcaagt
6661 gatgttatgt ttgaggatgc ccaaggtttg aaaaagttct acaaagaagt tagagcttac
6721 ctaggtgtgt catcaggttg tgatggtctt gctgcagctt atagagctaa ttcttttgat
6781 gtacctacat tctgcgcaaa tcgttctgtg atgtgtaact ggtgtttgat aaaccaagat
6841 tccataacgc actacccagc tcttaagatg gttcaaacac atcttagcca ctatgtttta
6901 aacatagatt ggttgtggtt tgcacttgag gttggtttag catacatact ctatacctcg
6961 gccttcaatt ggttattgtt ggcaggtaca ttgcagtatt tctttgcaca gacttctata
7021 tttgtggact ggcggtcata caattatgtt gtctctagtg ctttttggtt gttcacccac
7081 attcctatgc cgggtctagt cagaatctat aatttgttgg catgcctctg gcttttacgc
7141 aaattctatc agcatgttat taacggttgt aaggacacgg catgtctgct ttgttataag
7201 aggaatcgac ttactagagt tgaagcttct actgtcgtct gtggtggaaa acgtacgttt
7261 tacattgcag caaatggcgg tatttcattc tgtcgtaggc ataattggaa ttgtgttgat
7321 tgtgacactg caggtgtagg gaataccttc atctgtgaag aagtcgcaag tgatctcact
7381 accaccctac gcaggcctgt taactccacg gatagatcac attattatgt ggattccgtg
7441 ttagttaaag agactgttgt gcagtttaat tatcgtagag acggtcaatc atgctatgag
7501 cggtttcctc tctgcgcttt cacaaattta gataagttga agttcaaaga ggtttgtaaa
7561 actaccactg gtatacctga atacaacttt atcatttatg actcatcaga tcgtggccag
7621 gaaagtttag ctaggtctgc gtgtgtttat tactctcaag tcttgtgtaa atcaattctt
7681 ttggttgatt caagtcttgt gacgtctgtt ggtaattctg gtgaaattgc catcaaaatg
7741 tttgattcct ttgttaatag tttcgtctcg ttgtataacg tcacccgcga caagttggaa
7801 aaacttattt caacagctcg cgatggtgtt aaacgcggcg acaacttcca tagtgtctta
7861 acaacattca ttgatgctgc acgcggcccc gctggtgtgg agtctgatgt tgaaactaat
7921 gaaattgttg attctgtgca gtatgctcat aaacatgaca tacaacttac taatgagagt
7981 tacaataatt atgtaccttc atatgtaaag cctgatagtg tttctaccgg tgatttaggt
8041 agtctcatag attgtaatgc agcttcagtt aaccagacaa gcatgcgcca agctaatggc
8101 gcatgcatct ggaatgctgc tgcatatatg aaactctcgg atgttcttaa gcgacagatt
8161 cgcattgcat gccgtaagtg taatttagct tttcgtctta cgacttctaa gctacgtgct
8221 aatgacaata tgttatctgt taaattcact gccactaaga ttgttggtgg tgctcctaca
8281 tggtttaata cattgcgtga ctttacgttg aagagttacg tttttgttac cattatagtt
8341 tttctgtgtg ctgttcttat gtacttttgt ttacctacat ttgctatggc accagttgag
8401 ttttatgaag accgcatcct agaatataag gttctagata atggtatcat tagggatatt
8461 agtcccgatg ataagtgctt tgctaacaag tacaggtctt ttagtcagtg gtatcatgag
8521 catgtgggtg gtagttatga taattccatc tcttgcccat tgactgttgc ggttatagct
8581 ggtgtagcgg gtgcgcgcat accagatgtc cctacaactt tagcgtgggt taatagacag
8641 attgttttct ttgtctcccg cgtttttgct aattccaata gtgtttgtta tacaccaatt
8701 aatgagatac cttataaaag tttctctgat agtggatgca ttctaccatc tgaatgtact
8761 atgtttaggg atgctgaagg gcgtatgtca ccttattgtt atgatcccac tgtattgcct
8821 ggagctttcg cgtatagtca gatgaagccc catgttcgct atgacttgta tgatactaac
8881 atgtttatta agtttcctga agtggtcttt gagagcaccc tcaggattac taagacactt
8941 actactcagt actgcagatt tggtagctgt gagtacgcac aggaaggtgt ttgtatcact
9001 acaaatggct cttgggctat ttttaatgac caccatctta gtaggccagg tgtctactgt
9061 ggttctgact atgtagacat tgtcagacgt ttagcagtgt cattgttcca acccattact
9121 tatttccaac ttacaacttc gttggtcttg ggtattggtt tgtgtgcttt tctgacactt
9181 ttgttttatt atattaataa agtaaaacgt gcattcgcag actacactca gtgtgctatg
9241 attgccgtta ttgctgctgt tcttaatagc ttgtgcattt gctttgtttc gtctatacct
9301 ttatgtatag tgccttacac tgcattgtac tactatgcta cattctattt tactaatgag
9361 cctgcagcta ttatgcatgt ttcatggtac attatgtttg gtcccatagt acctatgtgg
9421 ttgacctgtg tttatacagt tgcaatgtgc tttagacact tcttctgggt tgttgcttat
9481 ttcagtaaga aacacgtcga ggtttttact gatggtaagc ttaattgtag tttccaagat
9541 gcagcctcta atatttttgt tgttaacaag gatacttatg ctgctttaag aaatgctata
9601 actaatgatg tgtactcgcg gtatcttggc ttgtttaata agtacaagta ttattctggt
9661 gctatggaaa ctgccgctta ccgtgaagct gctgcatgtc atctcgctaa ggccttgcaa
9721 acatacagtg aaactggtag tgacttgttg taccaacctc ctaactgtag catcacctct
9781 ggtgtgttgc agagtggttt ggtcagaatg tcgcatccca gtggtgatgt tgaggcttgt
9841 atggttcaag ttacctgtgg tagcatgact cttaatggcc tctggcttga taacactgtc
9901 tggtgtccac gtcatgttat gtgcccagca gaccagttgg ctgatcctaa ttatgatgct
9961 ctgcttgttt ccatgactaa tcatagtttt agtgtcaata aacatatagg tgctccggca
10021 aatctgcgtg ttattggcca tgctatgcag ggtactcttc tgaagttgac tgtcgatgtt
10081 gctaatccta gcactccagc ctacacgttt actacagtga aacctggtgc ttcatttagc
10141 gtgctagcat gctataatgg acggccaacc ggtactttta ctgttgttat gcgccctaac
10201 tacacaatta aaggttcttt cttgtgtggt tcttgtggta gtgttggtta cacaaaagaa
10261 ggtagtgtga ttaacttctg ttatatgcat caaatggagt tagctaatgg tacacatacc
10321 ggctctgcat ttgatggtac tatgtatggt gcattcttag ataagcaggt gcatcaggta
10381 caattaacag acaaatactg cagtactaat gtggtagctt ggttgtatgc agcaatactt
10441 aatgggtgcg cgtggtttgt aaaatccaat cgcactagta ttgtttcatt taatgaatgg
10501 gctcttgcca accagtttac agagtttgtt ggcacacaat ccattgatat gttagctgtt
10561 aaaacaggcg ttgccattga acagctcctt tatgctatcc aacaattgca tactggattc
10621 cagggcaaac aaatccttgg cagttcaatg ttggaagatg agttcacacc cgaagatgtt
10681 aatatgcaaa ttatgggtgt tgttatgcag agtggtgtaa gaaaggttac gtatggtact
10741 gcgcattggt tgtttgcaac ctttgtgctt tcctatgttg tgttcttaca aaccactaaa
10801 tttacattgt ggaactattt gtttgagacc atacccactc aattgttccc cctcttattt
10861 gtgactgttg catgtgttat gttattggtt aaacataaac acaccttttt aacactcttt
10921 ttgttgcctg tagccatttg tttgacttat gcaaacattg tctatgagcc tgctactccc
10981 atttcttcag cgttgatagc tgtggctaat tggttagccc ctactaatgt ttatatgcgc
11041 actacacata ctgatattgg tgtctacatt agtttgtcac ttgtattagc tatagtagtg
11101 aaacgcttgt acaacccttc actatctaac tttgctcttg cattgtgtag tggtgtgatg
11161 tggttgtata cttatagcgt tggcgaagtt tctagcccca ttgcctatct tgtctttgtt
11221 actacactta ctagtgatta tacgattact gtctttgtga ctgttaatct tgcaaaaatt
11281 tgcacttata ttatctttgc ttatgcacca cagcttacgc ttgtgttccc agaagtgaag
11341 atgattctct tattatacac atgctttggt tttatgtgta catgctattt tggtgtcttc
11401 tctttgttga accttaagtt gcgtgcgcca atgggtgttt acgactttaa ggtctccact
11461 caggagttca ggttcatgac ggctaataat ttaacagctc cgaggaattc ttgggaagcc
11521 atgtctctga actttaagct actaggtatt ggcggtacac cctgtataaa ggttgctgcc
11581 atacaatcta aacttactga tcttaagtgc acttcagtgg tgttgctttc agttttgcaa
11641 caattgcatc ttgaagctaa tagtaaggct tgggcttttt gtgtcaagtg ccataatgac
11701 atattggctg ccacagaccc tagtgaggct tttgaaaaat tcgttagtct ctttgccact
11761 cttatgactt tctctggtaa cgtagatctt gatgcactag ctagtgatat ctttgaaaca
11821 cctagtgttc ttcaagctac tttgtctgaa ttctctcact tggcaacttt tgctgagtta
11881 gaggctgcac agagagccta tcaggaagcc atggactctg gtgatgcatc accccaagtc
11941 cttaaagctt tacagaaggc agttaacgtt gctaagaatg cctatgagaa agataaggct
12001 gtagcacgta agttagaacg tatggccgag caagctatga cgtctatgta taagcaagca
12061 cgtgctgaag acaagaaagc taaaattgtt agtgctatgc aaactatgct ttttggtatg
12121 attaagaagc tcgacaatga tgttcttaat ggtatcattt ctaatgctag gaatggatgt
12181 atacctctta gtgttgtacc actttgtgct tcaaacaaac ttcgtgtagt aattccggac
12241 tttaccgtct ggaatcaagt tgtcacatat ccctcgctta attatgctgg ggctttgtgg
12301 gacattgcag ttataaacaa tgtggataat gaaattgtta agtcttcgga tgttgttgaa
12361 aacaacgaaa gcttgacatg gccacttgtc ttagaatgca ctagagcagc ttcctctgct
12421 attaagttgc aaaataatga gattaaacct tctggtctta gaactatggt tgtttcagct
12481 ggtcaagaac agaccaactg taatacaagc tctttagcat attacgaacc tgttcagggt
12541 cgcaagatgt taatggcact tctttcagac aatgcctacc ttaagtgggc acgtgttgaa
12601 ggacaggaag gttttgtaag tgttgaactg caacctcctt gtaaattttt gattgcggga
12661 cctaagggac ctgaaatccg atacctctat tttgtcaaaa atcttaacaa ccttcatcgt
12721 ggtcaggtgc ttggacacat tgctgccact gttagattac aagccggttc caataccgag
12781 tttgcagcta attcttcagt gttgtcactt gttaatttca ctgttgatcc tcaaaaagct
12841 tacatcgact ttgttaatgc tggcggtgcc ccattgacaa attgtgttaa gatgcttact
12901 cctaaaactg gtacaggtat tgctatatct gttaagccag agagtacagc tgaccaagaa
12961 acttatggtg gtgcgtctgt gtgtctgtat tgccgtgcgc atatagagca cccagacgta
13021 tctggtgttt gtaaatataa gggtaagttc gtccagattc catcgcagtg tactcgtgac
13081 cctgttggtt tttgtttaac gaataccccc tgcaatgtct gtcaatattg gattggctat
13141 gggtgcaatt gtgactcgct tagacaagca gcactgcccc agtccaagga ttctaatttt
13201 ttaaacgagt ccggggttct attgtaaatg cccgaataga accctgtgca agtggtttgt
13261 ccactgatgt cgtttttagg gcatttgaca tctgcaacta taaggctaag gttgctggta
13321 ttggaaaata ctacaagact aatacttgta ggtttgttga attagatgat caaggtcatc
13381 atttagactc ctattttgtc gtcaagagac atactatgga gaattacgag ctagagaagc
13441 actgttacga tttgttacgt gactgtgact ctgtggcacc tcatgatttc ttcgtctttg
13501 acgtcgacaa aactaaaact cctcatattg tgcgtcagcg tttaactgag tacactatga
13561 tggatcttgt ttatgcattg aggcactttg atcaaaataa ttgtgaagtg cttaaagcta
13621 ttttagtaaa gtatgattgt tgtgatgcta catactttga aaataaactc tggtttgatt
13681 ttgttgaaaa tcccagtgtt attggtgttt atcacaaact tggagaacgg gttcgccaag
13741 ctgtgttaag cactgttaaa ttctgtgacc acatggtaaa ggccggttta gtcggtgttt
13801 taacactcga caatcaggac cttaatggta agtggtatga ttttggtgat tttgtaatca
13861 cacaacccgg ttcaggagtg gctatagttg atagctacta ttcttattta atgcctgtgc
13921 tctctatgac caattgtttg gcagctgaga ctcacaggga ttgtgatttt aataagcctc
13981 tcattgagtg gccacttact gagtatgatt ttactgatta caaagtacag ctctttgaga
14041 agtactttaa gtactgggat cagacgtacc atgctaattg cgtgaattgt actgatgatc
14101 gttgtgtgtt acattgtgct aatttcaatg tattatttgc tatgaccatg cctaagacat
14161 gctttggacc tattgtccgg aagatatttg ttgatggtgt gccatttgta gtatcttgtg
14221 gttatcacta caaggaatta ggtttagtca tgaatatgga tgttagtctt cataggcata
14281 gactttctct taaggagcta atgatgtatg cagcagatcc tgctatgcac attgcctctt
14341 caaacgcatt tcttgatttg aggacatcat gttttagtgt cgcagcctta accacaggtc
14401 tgacctttca gaccgtgcgg cccggcaatt ttaaccagga tttctatgat ttcgtggtct
14461 ccaaaggatt ctttaaagag ggttcttctg ttactctcaa acatttcttc tttgcccaag
14521 atggcaatgc tgctattaca gattataatt attactctta taatctgccc actatgtgtg
14581 acatcaagca aatgttgttt tgcatggagg ttgtaaacaa gtacttcgag atctatgacg
14641 gtggttgtct taatgcctct gaagtggttg ttaataatct agacaaaagt gctggccatc
14701 cttttaataa gtttggaaag gctcgtgtct attatgagag catgtcttat caggaacaag
14761 atgaactctt tgccatgaca aagcgtaacg tcattcctac catgactcaa atgaatttaa
14821 agtatgctat tagtgccaag aatagagctc gcactgttgc aggcgtctct atacttagca
14881 ccatgactaa tcgccagtac catcagaaaa tgcttaagtc catggctgca actcgtggtt
14941 cgacttgcgt cattggtact actaagttct atggtggctg ggatttcatg cttaaaacat
15001 tgtataaaga tgttgataat ccacatctta tgggttggga ttaccctaag tgtgatagag
15061 ctatgcccaa tatgtgtaga atttttgctt cactcatatt ggctcgtaag catggaactt
15121 gttgtactac aagggacagg ttttaccgct tagctaatga gtgtgctcaa gtgctaagtg
15181 aatatgttct gtgcggtggt ggttactacg ttaaacctgg tggtaccagt agcggtgatg
15241 ccacaactgc atacgccaat agtgtgttta atattttgca ggccactact gcgaatgtta
15301 gtgcacttat gggtgctaat ggcaacaaaa ttgttgacaa agaagttaaa gacatgcagt
15361 ttgaactgta tgtcaatgtt tacaggagta ccaatcctga tcccaaattt gtagataggt
15421 attatgcttt tcttaacaag cacttttcta tgatgatatt atctgatgat ggtgttgtct
15481 gctataatag tgactatgca gccaaaggtt acattgctgg tatacagaat tttaaggaaa
15541 cgctgtatta ccagaacaat gtctttatgt ctgaagccaa atgctgggtg gaaaccgatc
15601 tgaagaaagg gccacatgaa ttttgttcac agcatacgct ttatattaag gatggtgacg
15661 atggttactt cctgccttat ccagacccct ctaggatctt gtctgccggt tgctttgtag
15721 atgatatcgt caagactgac ggtactctca tggttgagcg gtttgtgtca ttagctatag
15781 acgcataccc tctcacaaag catgaagata tagaatacca aaatgtattc tgggtttatt
15841 tacagtatat tgaaaagctg tataaagacc ttactggcca catgcttgac agttattctg
15901 ttatgttatg tggtgataat tctgctaagt tttgggagga ggcattttat agagaactct
15961 atagctctcc taccaccttg caggctgttg gttcgtgtgt tgtatgccat tcgcagacat
16021 ccctgcgctg tggtacatgc atacgcagac cctttctttg ttgtaagtgc tgctatgatc
16081 acgttatagc aactcctcat aaaatggttc tgtctgtttc accttacgtc tgtaacgcac
16141 ctgggtgtga tgttgctgac gttactaaac tatatttagg tggtatgagc tacttctgtg
16201 tagatcatag gcctgtttgt agttttcctc tttgcactaa tggtcttgta tttggattat
16261 acaagaatat gtgcacaggt agtccctcta tagttgaatt caatagactg gctacatgtg
16321 actggactga aagtggtgac tatacacttg ctaatactac tacagaacca cttaaattgt
16381 ttgctgctga aaccctacgt gccactgaag aggcatctaa gcagtcttat gcaattgcca
16441 ctattaaaga aatagttggt gatagacaat tactacttgt gtgggaggct ggtaaatcca
16501 aaccaccact caatcgtaat tatgttttta ctggttacca tataaccaaa aatagtaagg
16561 tgcagctcgg tgagtatatc ttcgagcgca ttgattacag tgatgctgta tcctacaagt
16621 ccagtacaac gtataaactg actgtaggtg acatctttat acttacctct cattcggtgg
16681 ctaccttgac ggcacccaca attgtgaatc aagagaggta tgttaaaata actggattat
16741 atccaactat tactgttccc gaggagtttg caagccatgt tgccaacttc caaaaagcag
16801 gatatagtaa gtatgtcact gtccagggac cacctggcac tggcaagagt cattttgcta
16861 tagggttagc gatttactac cctacagcac gtgttgttta tacagcttgc tcacatgctg
16921 ctgttgatgc attatgtgag aaagctttta aatatttgaa cattgctaaa tgttcccgta
16981 ttattcctgc caaggcacgt gttgagtgtt atgacaggtt taaggttaat gaaacaaatt
17041 ctcaatattt gtttagtact attaatgctt taccagaaac ttctgctgat attctggttg
17101 ttgatgaagt tagcatgtgc actaattatg atctttctat cattaatgca cgtgttaaag
17161 ctaagcacat tgtctatgta ggtgatccag cacagttgcc agcacctagg actctgctta
17221 ctagaggcac attggaacct gaaaatttca atagtgtcac taggttgatg tgtaacttag
17281 gacctgacat atttttgagt atgtgctaca ggtgtcctaa ggagattgtt agtactgtca
17341 gtgctcttgt ctacaataat aaattgttag ccaagaagga actatcagga cagtgcttta
17401 aaatgctcta taagggcaat gttacgcatg atgctagctc tgccattaat agaccacaac
17461 ttgcatttgt caagaacttt ataactgcta acccagcctg gagtaaggca gtctttattt
17521 caccttataa ttcacagaat gctgtggctc gctctatgtt gggccttaca acccaaactg
17581 ttgattcttc acagggttca gaataccaat atgtcatttt ctgtcaaaca gcagatactg
17641 cgcatgctaa caacattaat agatttaatg ttgccattac acgcgcacag aaaggtattc
17701 tttgtgttat gacatctcaa gcactctttg attctctgga gtttactgag ttgtctttta
17761 ctaattataa acttcagtct cagattgtca ctggactttt taaagattgt tccagagaaa
17821 cctctggcct ttcacctgct tatgcaccaa catatgttag tgttgatgat aagtataaaa
17881 cgtgtgatga gctttgcgtg aacctcaatt tacccgcaaa cgttccatat tcacgtgtta
17941 tttccaggat gggcttcaag cttgatgcta gtgtccctgg ttatcctaaa ctcttcatta
18001 ctcgtgaaga ggctgttagg caggttcgaa gctggatagg cttcgacgtg gaaggtgctc
18061 atgcatcacg taatgcatgc ggtaccaatg tgcctttaca attaggattc tctaccggcg
18121 tgaactttgt tgttcagcct gtaggtgttg tagatactga gtgggggaat atgcttactg
18181 gcatttctgc ccgtcctcca ccaggtgaac agtttaaaca cttagttccc cttatgcata
18241 agggagctgc gtggcctatt gttagacgac gtatagttca aatgttgtca gacactttag
18301 acaaattgtc tgactactgt acgtttgttt gttgggctca tggctttgaa ttgacatctg
18361 catcttattt ttgtaagata ggtaaagagc agaagtgttg tatgtgtaat agacgcgctg
18421 cagcgtactc ttcacctctg caatcttatg cctgctggtc tcattcctgc ggttatgatt
18481 atgtctacaa ccccttcttt gttgatgttc aacagtgggg ttatgtaggc aatcttgcta
18541 ctaatcacga tcgttattgc tcggtgcatc agggtgctca tgtagcctct aacgatgcaa
18601 taatgactcg ttgtttagct attcatgctt gttttattga acatgtagat tgggatattg
18661 agtatcctta tatctcacat gagaaaaagt tgaattcctg ctgccgaatt gttgaaagaa
18721 atgttgtacg tgctgctttg ttggcaggct cctttgatag agtgtacgac ataggcaacc
18781 ctaaaggaat tcctattgtt gatcaccctg tggttgaatg gcattatttt gatgcacagc
18841 ccttgactag gaaagtacaa cagcttttct atactgagga tttagcctca agatttgctg
18901 atgggctctg cttgttttgg aattgtaatg ttccaaaata tcctaataat gcaattgttt
18961 gtaggtttga tactcgtgtg cactcagagt tcaatttgcc aggttgtgat ggtggtagtt
19021 tgtatgttaa caagcacgcc tttcacacac cagcatacga tgtaagtgca tttcgtgatc
19081 tgaaaccttt acctttcttc tattattcta ctactccatg tgaagttcat ggtactggta
19141 gtatgttaga agatatagat tatgtacctc ttaagtctgc agtgtgtgtt actgcctgca
19201 atctaggagg tgctgtttgt aggaaacatg ctacggagta cagagattat atggaagcat
19261 ataaccttgt ctctgcatca ggtttccgtt tatggtgtta taagaccttt gacatttaca
19321 acctttggtc tacttttact aaagttcaag gtttagaaaa cattgcttat aatgttgtta
19381 aacaaggcca ctttactggt gtagatggag agctacctgt agctgtagtc aatgataaaa
19441 tcttcaccaa gagtggcgtt aatgacatat gtgtgtttga gaataaaacc actttgccta
19501 caaatgtagc ttttgaactg tatgctaagc gtgtggtgcg ctcacatcca gactttaagt
19561 tactccataa tttacaagct gacatttgtt acaagttcgt cctttgggat tatgaacgtt
19621 gtaacatcta tggtacagct actattggtg tatgtaagta cactgatata gaagtcaatt
19681 cagccttgaa tatatgtttt gacattcgtg ataatggttc attggaaaag tttatgacta
19741 cacccaatgc cattctcatt tcagatagaa aaatcaagaa ctacccgtgt atggtaggtc
19801 ctgattatgc ttacttcaat ggtgctatta tcagagacag tgatactgta aagcagccag
19861 tgaaatttta tttttataag aaagttaata atgagtttgt cgagttttct gactgtgctt
19921 acacacaggg tcgctcttgt agtgactttg aggctatgtc agttatggag acagactttc
19981 ttgctcttga tagtgatgtt ttcataaaga agtatggttt ggaaaactat gcctttgaac
20041 atgtggtata tggtgatttt tcacatacta cattgggtgg ccttcatctg cttattgggt
20101 tgtacaagaa gcatttggat ggtcatatta ttatggaaga aatgatcaga gaaagttcaa
20161 ctatccataa ctatttcatt actgagacta gcacagcgtc ttttaaggcg gtttgctcag
20221 tcattgattt aaagcttgac gactttgtac agattttaaa gagtcaagac cttggcgttg
20281 tatccaaggt agtcaaggtt cctatagacc taactatgat tgaattcatg ttatggtgta
20341 aagatggcca ggtacagaca ttctatcctc ggctccaagc atctgctgat tggaaaccgg
20401 gccaggccat gccttcatta tttaaagttc aaaatgtgaa ccttgaacgc tgtgagcttg
20461 ctaattacaa gcaatctatt cctatgcctc gcggtgtgca catgaacatc gccaaatata
20521 tgcaattgtg ccagtattta aatacttgca caatagctgt tccagccaat atgagagtta
20581 ttcattttgg tgctggttca gataaaggta tcgcacctgg tacctctgtt ttaagacaat
20641 ggctcccgac tgatgccatt ataattgata atgatctaaa tgattttgtg tcagatgctg
20701 acatatcttt atttggagac tgcgtaactg tacgtgttgg acaacaagtc gatcttgtta
20761 tatctgacat gtatgatcct agtactaaga atattacagg tagtaatgag tctaaggctc
20821 tattctttac ctacctgtgt aattttatta ataataatct tgctcttggt ggttctgttg
20881 ccattaaaat aacagaacac tcatggagcg ttgatcttta tgaaataatg ggaaaatttg
20941 cttggtggac agttttctgt accaatgcaa atgcatcctc ttctgaagga ttcctgttag
21001 gtattaatta cttgggtact attaaagaaa atatagatgg cggtgctatg cacgcaaatt
21061 atatattttg gagaaactcc aaccctatga atctgagtac ttactcactt tttgatttat
21121 ccaagtttca attaaaatta aaaggaacgc cagttcttca attaaaggag agtcagatta
21181 acgaactagt tatatctctc ctgtcgcagg gtaaattact tatccgcgac aatgacgtac
21241 tcagtgtctc tactgatgtg cttgttaact tttatagggg caaacgctaa aattgtgtcg
21301 atacctggtg gtgtcggtac tggagcttgt ccccaggttg atatgcagcc cagttatttt
21361 ataaagcata actggcctga acctattgac atgaataagg cagacggtgt catctaccca
21421 aatggccgca cttattctaa catcacatta cagaccacta atctgtttcc tcgtaatggt
21481 gatttaggca ctcagtatgt ctattcagca tctaatgaga aaagccgcac tagcaatgtg
21541 gcttttatta gtaattattc atactatggc aatccctttg gcgacggcat tgtcatacgt
21601 ataggtcaaa attctaataa gactggtagt gtcattgtgg gcacagcaca gactactatt
21661 aaaaagatct acccagctct tatgcttggt agttcttttg gcaatttctc tgttaataat
21721 aagtcaggtg cttattttaa tcacaccctt cttatcttac ctagcaagtg tggcactgta
21781 tttcaggtgg catactgcct tctacaacca aggactgact cttattgtcc cggtaatgct
21841 aactatgtta gctatgcact cattgattct cctacggatt gtacatctgc ggatgaatct
21901 aaacgtagga atggtcttga ggacattaaa aagtacttca atttggtcaa ttgtacctat
21961 tttgaagagt ttaatgtcac agctgacgag cgtgcagagt ggtttggcat cacccaagat
22021 tctcagggtg tgcacctcta cacctctcgt aaaaatggtt tcaattcaaa taatctcttt
22081 ttatttgctt ctgtgcccat ttatgataag atcaattact acactgtaat tcctcgctca
22141 ataataactc ctgccaacca gcgttctgct tgggcagcat tttatgtata tcctttacac
22201 caacttagct acttgctaaa ttttgatgtt aatggttata ttacgcaagc agcagactgt
22261 ggttacaatg attataccca gcttgtctgc tcgtatggtg attttaatat gaaatctggt
22321 gtttactcta cttcatatta ttcagccaaa cctgtaggtg cgtactatga agctcatgtt
22381 tacccagatt gcaattttac tgatttgttc cgggaaaatg ctcccacaat catgcaatat
22441 aagcgtcaag tttttacgcg ttgtaattac aacctcacac tactgctctc tcttgtgcag
22501 gtggatgagt ttgtctgcga taagattacc cctgaggctc ttgcaacagg gtgttattcg
22561 tctcttaccg tcgattggtt tgcatttccg tatgcttgga agtcatacct agctataggt
22621 tcagcagatc gcattgtgcg gtttaattat aaccaggatt atagcaatcc ctcttgtaga
22681 attcactcca aggtgaattc ttcagttggc atttcttact ctggtttata tagttatatt
22741 actaattgta attacggcgg cttcaacaag gatgacgttg ttaagcctgg tggtcgtgcc
22801 agtcaaccct gtgttactgg cgcactcaat tcacctacta acggtcaagt ctggtctttt
22861 aattttggtg gcgtccctta cagaacctcc cgcctcacct acactgacca tcttaaaaac
22921 cctctagata tggtttatgt catcactgtt aagtatgaac caggcgctga aactgtatgt
22981 cccaaacaag tgcgtcctga ttatagtacc aatattactg gcttattagg ctcttgtatc
23041 agttatgaca tttatggtat aactggtact ggtgttttcc agctgtgtaa tgcaactgga
23101 attcctcaac aaaagtttgt ctatgacaaa tttgataata taattggctt tcactctgat
23161 gacggcaatt attattgtgt ggcaccttgt gtcagtgtgc ctgtttctgt tatatatgat
23221 gacaacacta atcaatacgc cacattgttt ggcagtgttg cttgtcaaca tatatctaca
23281 atggctgctc agttttctcg tgaaactcgt gcttccctcg tttcaagaaa tatgcagaat
23341 cttctacaga cttctgtcgg ttgtgtcatg ggtttccatg aaactaatga caccgttgaa
23401 gactgcaatc tttctttggg acaatcactc tgcgcaatcc cacctaacac caacttgagg
23461 gttggtcgct ccacctttgg attaggttct ttagcctaca acagtccatt gcgtgttgat
23521 gcacttaact cctctgagtt taaggtctcc ttgcctctca attttacatt tggtgttact
23581 caggaatata ttgaaactag catacagaag attacagttg actgtaaaca gtacgtgtgc
23641 aatggttttg ctaagtgtga aaagctgctc gaacaatacg gtcagttttg ctctaaaatt
23701 aaccaggctc tccatggcgc gaatcttcgc caggatgact ttgttcgtaa tctgtttgag
23761 agtgttaaaa caccacagac agttcctctt actacaggtt ttggagggga gtttaatctt
23821 actcttctag agccgctttc tgtttctaca ggttcttcta atgcgcgtag tgctttggaa
23881 gagcttttgt ttgacaaagt cactatagct gatcctggct acatgcaagg ttatgatgac
23941 tgcatgcaac agggccctgc ctcagctcgt gatcttatct gtgctcaata tgttgctggc
24001 tacaaagtgt taccacccct tatggatgtt aacatggaag ctgcatacac ctcttctttg
24061 cttggtagca tagctggtgc tggctggact gctggtttat catcatttgc tgccattcca
24121 tttgcacaga gtatatttta caggttaaat ggtgttggta taacacaaca ggttctatct
24181 gagaatcaaa agattattgc taacaagttt aaccaagctc ttggtgccat gcaaactggt
24241 ttcacaacaa ctaatgaagc ttttcagaaa gttcaagatg ctgtgaacac taatgcacag
24301 gctctagcta agttggctag tgaactatcc aatacttttg gtgctatttc ttcttccatt
24361 ggtgacatca ttcaacgtct tgatgtgctt gaacaggaag tccaaataga cagacttatt
24421 aatggccgtc tgactacact caacgccttt gttgctcagc agcttgttcg ttctgaatct
24481 gctgctcgtt ctgcacaatt ggctaaggat aaagtcaatg agtgtgttaa atcacaatcc
24541 actagatctg gattttgtgg tcaaggcact catatagtgt cctttgttat taacgcccct
24601 aatggcctct actttatgca tgttggttac caccctagcc aacatattga ggttgttgct
24661 gcctatggtc tttgtgacgc ggccaacccc actaattgta tagccccagt taatggctac
24721 tttattaaaa atcaaactac taggggtgtt gatgattggt catatacagg ttcttccttt
24781 tatgctccag aacccatcac cactcttaat actaggtatg tcgcacctca agtgacattc
24841 caaaacattt ctactaacct tcctcctcct ctgttgggca attccactgg aactgacttc
24901 aaagatgagt tggatgaatt tttcaagaat gttagcacca gtataccaaa ttttggtgct
24961 ctaacacaaa ttaatactac tttattggat ctttccgatg aaatgctagc tttacagcaa
25021 gttgtcaaag cgcttaatga gtcatatatt gaccttaaag agctcggcaa ctatacttat
25081 tacaacaaat ggccttggta catttggttg ggtttcattg ctggacttct agccctagcc
25141 ctttgtgttt tctttattct ttgctgcact ggttgcggca caagttgttt aggaaaactt
25201 aaatgtaatc gttgttgtga caaatatgaa gaatacgacc ttgagccgca taaaattcat
25261 attcactaat taacgaactt gtgatgagag tacaacgtcc acccactctt ctcctggtcg
25321 ttggactcac tctcttagct ttagcttatt caaaacctct ttatgtacct gaacattgtc
25381 agaattattc tggtcgtatg cttagggctt gtattaggac tgcccagact gatactgttg
25441 gtctttacac caatcttgtt attcagactg gcactgccac ctttgaatca gcggtacctg
25501 ttgatcgtgg atcaccttca actcacgctg acacttatga gcttaatact agtgtgactc
25561 tttttgacgt tggctactca gttaattaac gaactctatg gattacgtgt ctctgctcaa
25621 ccaaatttgg cagaagtacc ttaatcttcc tgatacagtt tgtttgtaca ttcccaaacc
25681 tgcttctagt tttaaacctg tagccggcac ttccttgcat cccgttcagt gggagtgtaa
25741 gattacattt gctggttaca cagaggttgc agttaattct actaaagctt tagctaagca
25801 ggatgcagcc cgcagaatta tgtggctgct ccatagagat ggaggaatac ccgatggatg
25861 ttcactccac atgcgtcact ccagcatctt ctcggatgtt ccggaagaga cgccattctc
25921 cgagtagaaa tcttcgctat gttaagcgta gattttcttc tctacgccct gaagatatta
25981 gtttggtcac tgaacccact cattacctca gggttatctt tcacagccct aatacttggt
26041 atattaggtc tggtcatgat ttagactctg tccacaaatg gttgaaaccg tatggtggta
26101 ttcctgtgaa cgagtatcat attaccttgg ctttactgtc actttctgaa caacatttag
26161 ccatggatat atctcccatt gcaattttcc ttcgcaatgt gcgttttgag ctctttgatt
26221 ttactctact ccgtaaaaca cttgctctca aagcgtcaga gatttgctgt gataacttac
26281 ataggtttca acctattaca agggttaaca tggctctccc tcttattaag gaatggttgc
26341 gtgttcaggg tttccctatt tacaatagcc accttcctct acacatgtct gtttctaagc
26401 tgcatgcttt agatgataat acttgtgagt atgttgctaa catgtcttgc ttcaaacagt
26461 atcccaccca gatgtttgtg agacctatcg ctgttgaatt ggtttccata cgtcaatctt
26521 ctaatgcacc tcgatgcata gttcattcag ttcccatatt acatgcgcca ggattttaac
26581 gaactatggc tttttcttta gccttgttca agcctatttc tttagtgcct gcatttcctg
26641 aagcgcatgg tggtgaacct gctcaatttg ctaatgtttt cacatgcatt cctactgtag
26701 gctatatagc cgcacttaca gtgaatgtgt gcattttacc attactactc ctcattccac
26761 aggacacttg caggcgtagt attttcaaaa caagcatcct ttatggtttg tttgtttata
26821 attttatatt agccattaca ttaattaatg gtgtttacac tcccactgga ggcacattag
26881 ttgccttcct ggtagtgctt atgattactt ggcttgctga cagagttaga ttctgtctct
26941 tgctgcgttc ctatattcca ctttttgaca tgagatctca ttttatccgt gtcagtacag
27001 tgtcatcata tggcatggtt ccagttaatc aaaccaagcc attatttatt agaaactttg
27061 accagcgttg tcgctgctca cgttgttttt atgtgcattc ttctcattat ctagagtgca
27121 cttatattag ccgttttact aaagtcagtc ttgtagcagt tacagacttt tctttaaacg
27181 gcatcacttc tactgtattc gtgccttcaa cgcgcgattc agttcctctt cacataatcg
27241 caccgagcgt gcttagtgta taagctcgcc tagcgcaact atgggtcccg tgtagaggct
27301 attccattag tctctatctt tggacatttg gaaaacgaac tatgttaccc tttgtccaac
27361 aacaattagg gtcattcata gtaaactttt tcatatttac cgtagcgtgt gctgtcatac
27421 ttttggtgtg catggctttc cttacggcca ctcgattatg tgtgcaatgc ataacaggtg
27481 taaacacact gttagttcag cccgcagtat acatgtataa tactggacgt tcagtctatg
27541 taaaattcca ggagagcaaa ccccctctac ctcctgatga gtgggtttaa cgaactcctt
27601 aattatgtca aatatgacgc agctcactga gcagcagatt attagtataa ttaaagattg
27661 gaactttgca tggtctctga tctttctttt aattactatc gtgctacaat atggttaccc
27721 atctcgtagc atgactgtct atgtctttaa aatgtttgtt ttatggcttt tatggccttc
27781 atcaatggca ctctcaattt tcagtgccgt ttatccaatt gacctagctt cccagattat
27841 ttctggcata atagctggtg tctctgcgct catgtggatt tcctacttcg tgcagagcat
27901 tagactcttt atgcgaacag gctcttggtg gtcattcaat cctgaaacta attgcctttt
27961 gaacgttcca cttggtggca caactgtagt gcgtccacta gtcgaagatt ccactagtgt
28021 tactgctgtt gttgccaatg ggtacctcaa gatggctggt atgcactttg gtgcgtgtga
28081 ctacgatcga ctccctagtg aggtgaccgt ggctaaaccc aacgtgctta tcgcattgaa
28141 aatggtaaag agacaaagct atggaactaa ttctggtgtt gctatttacc atagatataa
28201 ggcaggtaat tacagaagtc cgcccataac ggcggatagt gaacttgcat tgctacgagc
28261 ataagctcct aagtaagagt tgattttaac gaatcttaat tttcattgtt atggccactc
28321 ccgctgcacc tcgtgccgtg tcttttgccg ataacaatga caactccaat aataaccagt
28381 ctcgaggaag aggaagaaac cctaaacctc gacctgcacc aaataacact gtctcctggt
28441 acacggggct tacccaacac gggaaagtct ctctttcctt cccacctgga cagggcgtac
28501 ctcttaatgc caattctacc cctgcgcaaa atgctgggta ttggcggaga caggacagaa
28561 aaattaatac aggaaatgga accaagtcac tggctcccag gtggtacttc tactacactg
28621 gaaccggacc tgaggccaac ctccctttcc gagctgtcaa ggacggaatc atctgggtcc
28681 atgaggatgg cgccactgat gctccttcaa cttttgggac gcggaaccct aacaatgatg
28741 ctgctattgt tacgcaattc gcgcccggta ctaagcttcc taaaaacttc cacattgaag
28801 ggactggagg caatagccaa tcatcttcaa gagcgtctag tgccagcaga aactcttcta
28861 gatccaattc ccgaggttcc agatctggta actcctcccg cggcacttcc ccaggtccat
28921 ctggagtcgg agctgtaggt ggagaaatgc tgtacctcga tttgcttaac agattacagg
28981 ctctggaatc tggcaaaaca aagcaagcac agcctaaagt aataactaaa aaggatgctg
29041 ttgctgctaa aaacaagatg cgccataagc gtgtcgccac caagggtttc aacatggtgc
29101 aagctttcgg tctgcgtggc ccaggcgacc tccagggaaa ctttggtgat ctccaactta
29161 acaaacttgg cactgaggac cctcgctggc cccaaattgc tgagcttgcc ccatcagcca
29221 gtgctttcat tggtatgtct caatttaaac ttacccatca gagcaatgat actgatggtg
29281 cccctgtata ctttcttcga tacagtggtg ccataaaact tgacccaaag aaccctaact
29341 acaataagtg gttggagctc attgagcaga atgttgatgc ctacaaaact ttccctaaaa
29401 aggagaagaa acaaaaggca cctaaagaag aaccatctga ccagatgaat gtgcagccgc
29461 ctaaggagca gcgtgtgcag ggtagtatta cccagcgctc ccgcactcct aggcctagtg
29521 tgcagcctgg tcctatgact gatgttaaca ctgattagtg ttattcaaag taacaagagc
29581 gaggcaaccg tttgtgtttg gtaaccccat ttcaccatcg tttgtccact cttgcacaga
29641 ag
//