GenomeNet

Database: RefSeq
Entry: NC_034440
LinkDB: NC_034440
Original site: NC_034440 
LOCUS       NC_034440              29642 bp    RNA     linear   VRL 20-NOV-2020
DEFINITION  Bat coronavirus isolate PREDICT/PDF-2180, complete genome.
ACCESSION   NC_034440
VERSION     NC_034440.1
DBLINK      BioProject: PRJNA485481
KEYWORDS    RefSeq.
SOURCE      Bat coronavirus
  ORGANISM  Bat coronavirus
            Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
            Nidovirales; Cornidovirineae; Coronaviridae; Coronavirinae.
REFERENCE   1  (bases 1 to 29642)
  AUTHORS   Anthony,S.J., Gilardi,K., Menachery,V.D., Goldstein,T., Ssebide,B.,
            Mbabazi,R., Navarrete-Macias,I., Liang,E., Wells,H., Hicks,A.,
            Petrosov,A., Byarugaba,D.K., Debbink,K., Dinnon,K.H., Scobey,T.,
            Randell,S.H., Yount,B.L., Cranfield,M., Johnson,C.K., Baric,R.S.,
            Lipkin,W.I. and Mazet,J.A.
  TITLE     Further Evidence for Bats as the Evolutionary Source of Middle East
            Respiratory Syndrome Coronavirus
  JOURNAL   MBio 8 (2), e00373-17 (2017)
   PUBMED   28377531
  REMARK    Publication Status: Online-Only
REFERENCE   2  (bases 1 to 29642)
  CONSRTM   NCBI Genome Project
  TITLE     Direct Submission
  JOURNAL   Submitted (05-MAY-2017) National Center for Biotechnology
            Information, NIH, Bethesda, MD 20894, USA
REFERENCE   3  (bases 1 to 29642)
  AUTHORS   Anthony,S.J., Gilardi,K.V., Goldstein,T., Ssebide,B., Mbabazi,R.,
            Navarrete-Macias,I., Liang,E., Wells,H.L., Hicks,A.L., Petrosov,A.,
            Byarugaba,D., Debbink,K., Yount,B.L., Menachery,V.D., Cranfield,M.,
            Johnson,C.K., Baric,R.S., Lipkin,W.I. and Mazet,J.K.
  TITLE     Direct Submission
  JOURNAL   Submitted (15-JUL-2016) Center for Infection and Immunity, Columbia
            University, 722 West 168th Street, 17th Floor, New York, NY 10032,
            USA
COMMENT     REVIEWED REFSEQ: This record has been curated by NCBI staff. The
            reference sequence is identical to KX574227.
            Annotation was based on information found in PMID: 31653070  and
            annotation of PMID: 29346682.
            
            ##Assembly-Data-START##
            Assembly Method       :: MIRA v. 4.0
            Sequencing Technology :: Illumina
            ##Assembly-Data-END##
            COMPLETENESS: full length.
FEATURES             Location/Qualifiers
     source          1..29642
                     /organism="Bat coronavirus"
                     /mol_type="genomic RNA"
                     /isolate="PREDICT/PDF-2180"
                     /isolation_source="rectal swab"
                     /host="Pipistrellus cf. hesperidus; specimen voucher:
                     OTBA03-20130220"
                     /db_xref="taxon:1508220"
                     /country="Uganda"
                     /lat_lon="1.12 S 29.68 E"
                     /collection_date="20-Feb-2013"
                     /note="USAID PREDICT Consortium"
     5'UTR           1..60
     gene            60..21290
                     /gene="ORF1b"
                     /locus_tag="CAU86_gp01"
                     /db_xref="GeneID:37627555"
     gene            61..21290
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /db_xref="GeneID:37627558"
     CDS             join(61..13224,13224..21290)
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /ribosomal_slippage
                     /note="polyprotein pp1ab; ORF1ab polyprotein is cleaved to
                     yield the RNA-dependent RNA polymerase and other
                     nonstructural proteins"
                     /codon_start=1
                     /product="ORF1ab polyprotein"
                     /protein_id="YP_009361856.2"
                     /db_xref="GeneID:37627558"
                     /translation="MSFVADVTAQGARGTYRAALNSEKHHDHVSLTVPLCGSGDLVSK
                     LSPWFMDGYDACEAVKVMLSNKEKLLFVPIRLVGYTKHLPGPRVYLVERLINGIYTDP
                     FMVNQVAYSSSANAGLVGTTLQGKPIGLFFPFDADLVTGDHTFLLRKYGRGGYHYTPF
                     HYERDATSRPEWMDDLEADPKGKYAQNLLKKLIGGDVTPVDQYMCGVDGKPINDYAGL
                     MAKEGITKLADIEADVASRVDADGFIVLKNKLYRLVWHVERKDVQYAKQSIFTINSVV
                     QREGLQDIPPHYFTLGGKIDMLVPRNKWNGVANLPLKQKILYTFYGKESLENHSYIYH
                     SAFTDCGGCGNGSWLTGNAVQGFSCGCGASYLSNDVEVQSSGLIKPNALFCATCPFAK
                     GDSCSSSCKHSIAQLVSYLSERCNVIADSKSFTLVFGGVAYAYFGCEEGTMYFVPRAK
                     SVVSKIGDSIFTGCTGSWTKVTQIANLFLEQTQRSLNFVGEFVVNDVVLAILSGTTTN
                     VDKLRELLKGITLEKLRDYLADYDVAVTLGPFMDNAVNVGGKGLQYATITAPFLVLTG
                     LGESFKKVAAIPYKVCKSFKETLSYYADSILYRVFPYDMDSDVSSFTELLFDCVGLSV
                     ASTYFIVRLLQDKTGDFMSTILSSCQSAVRKLLDTCLEATEATLNFLLELANLFKIFL
                     RGAYVYTSQGFVVLQGKMSSLVKQVVDLLNKGMQLLHTKVSWAGSKVSAVIYSGRESL
                     IFPTGTYYCVSTKAKSVQHQFDVILPGDCSKKQLGLLEPTDNSTTVEVTVSSNTVETV
                     VGQLEQTNMHSPDVIVGDYVIISDKLFVRSKEEDRVVFYPACTNGTAVPTLFKLKGGA
                     PVKRVAFGDDEIHEVAAVRSVTVEYNIHAVLDALLASSSLRTFVVDKSLSIEEFVDVV
                     KEQVSDLLAKLLRGMPIPDFDLDDFIDTPCYCFNADGDVSWSSTMIFSLHPVECEDDS
                     FECDSDQDDDQESVCEPLVEETNVQVQESDDDGWAAAVEEAFPIEELEEPPVQVVPND
                     SVVRSQVAQPIEIVVQETPVQPLEDVAPAVATPSIQLQEIQTEVLDTPPVYEADIEQT
                     QIVVSKPKRLRKKRNVDPLFNFEHKVITDCVTMVLGDAIQVAKCYDEAVLVNAANTYL
                     KHGGGIAGAINAASNGAVQQESDEYILAKGPLQVGDSVLLQGHSLAKNILHVVGPDAR
                     AKQDVSLLGKCYKAMNAYPLVVTPLVSAGIFGVQPSVSFDYLIREVKTRVLVVVNSQD
                     IYKSLTTVEVPQGLTFSYDGLRGALRKARDYGFTVFVCTDNSANTKVLRNKGVDYTKK
                     STTVDGVQYYCYTAKDTLDSIVLEANKASGIISMPLGYVSHGLDLMQAGAIVRRVKVP
                     YVCLLANKEQEAILMSEDVKLSPSADFVKHVRTNGGYNSWHLVEGELLVRDLTLNKLL
                     HWSDQTICYKSDKFYVVKNGVALPFETLAACRTYLDSRTAQQLTIEVLVTVDGVNFRT
                     VVLNNKSSYRSQLGCVFYNGADISDTIPDEKQNGCSLYLADNLTADETKVLKELYGPV
                     DPTFLHRFYSLKAVVQKWKMVVCDKVRSLKLSDNNCYINVVIMILDLLKDIKFVIPAL
                     QHAFMKHKGGDSTEFIALIMTYGNCTFGAPDDATRLLHTVLAKAELCCSARMVWREWC
                     NVCGIKDVVIQGLKACCYVGVQTVEDLHARMTYVCQCGGERHRQLVEHTAPWLLLSGT
                     PNEKLVTTSTAPDFVAFNVFQGLETAVGHYVHARLKDGLILKFDSGTLSKTSDWKCKV
                     TDVLFPNQKYSSDCNVVRYSLDGKFRTEVDPDLSAFYVKDGKYFTSEPPVTYSPATVL
                     AGSVYTNSCLVSSDGQPGGDAISLSFNNLLGFDSSKPVTKKYTYSILPKEDGDVLLAE
                     FSTYDPIYKNGAMLKGKPVLWVTNASYDATLNKFNRATLRQIYDVAPIEIENKYTPLS
                     VEPSPVEKVSTVEVALAKPELTIVKCKGLIKPFVKANVSFVSDETGLPVVEYLSKEDL
                     HTLYVDPKYQVIVLKDNALSTIFRLHTVESGDLNVVAASGSLTRKVKLLFRASFYFKE
                     LASRTLTATTVVGSCINSVVRHLGVTKGILASLFSFVKMLFVLPLSYFSDSETSTTEV
                     KVSALKTAGVVTGNVLKQCCTAAVDLSMDKLRRVDWKATLRLLLMLCTTMVLLSSVYH
                     LYVFNQVLSSDVMFEDAQGLKKFYKEVRAYLGVSSGCDGLAAAYRANSFDVPTFCANR
                     SVMCNWCLINQDSITHYPALKMVQTHLSHYVLNIDWLWFALEVGLAYILYTSAFNWLL
                     LAGTLQYFFAQTSIFVDWRSYNYVVSSAFWLFTHIPMPGLVRIYNLLACLWLLRKFYQ
                     HVINGCKDTACLLCYKRNRLTRVEASTVVCGGKRTFYIAANGGISFCRRHNWNCVDCD
                     TAGVGNTFICEEVASDLTTTLRRPVNSTDRSHYYVDSVLVKETVVQFNYRRDGQSCYE
                     RFPLCAFTNLDKLKFKEVCKTTTGIPEYNFIIYDSSDRGQESLARSACVYYSQVLCKS
                     ILLVDSSLVTSVGNSGEIAIKMFDSFVNSFVSLYNVTRDKLEKLISTARDGVKRGDNF
                     HSVLTTFIDAARGPAGVESDVETNEIVDSVQYAHKHDIQLTNESYNNYVPSYVKPDSV
                     STGDLGSLIDCNAASVNQTSMRQANGACIWNAAAYMKLSDVLKRQIRIACRKCNLAFR
                     LTTSKLRANDNMLSVKFTATKIVGGAPTWFNTLRDFTLKSYVFVTIIVFLCAVLMYFC
                     LPTFAMAPVEFYEDRILEYKVLDNGIIRDISPDDKCFANKYRSFSQWYHEHVGGSYDN
                     SISCPLTVAVIAGVAGARIPDVPTTLAWVNRQIVFFVSRVFANSNSVCYTPINEIPYK
                     SFSDSGCILPSECTMFRDAEGRMSPYCYDPTVLPGAFAYSQMKPHVRYDLYDTNMFIK
                     FPEVVFESTLRITKTLTTQYCRFGSCEYAQEGVCITTNGSWAIFNDHHLSRPGVYCGS
                     DYVDIVRRLAVSLFQPITYFQLTTSLVLGIGLCAFLTLLFYYINKVKRAFADYTQCAM
                     IAVIAAVLNSLCICFVSSIPLCIVPYTALYYYATFYFTNEPAAIMHVSWYIMFGPIVP
                     MWLTCVYTVAMCFRHFFWVVAYFSKKHVEVFTDGKLNCSFQDAASNIFVVNKDTYAAL
                     RNAITNDVYSRYLGLFNKYKYYSGAMETAAYREAAACHLAKALQTYSETGSDLLYQPP
                     NCSITSGVLQSGLVRMSHPSGDVEACMVQVTCGSMTLNGLWLDNTVWCPRHVMCPADQ
                     LADPNYDALLVSMTNHSFSVNKHIGAPANLRVIGHAMQGTLLKLTVDVANPSTPAYTF
                     TTVKPGASFSVLACYNGRPTGTFTVVMRPNYTIKGSFLCGSCGSVGYTKEGSVINFCY
                     MHQMELANGTHTGSAFDGTMYGAFLDKQVHQVQLTDKYCSTNVVAWLYAAILNGCAWF
                     VKSNRTSIVSFNEWALANQFTEFVGTQSIDMLAVKTGVAIEQLLYAIQQLHTGFQGKQ
                     ILGSSMLEDEFTPEDVNMQIMGVVMQSGVRKVTYGTAHWLFATFVLSYVVFLQTTKFT
                     LWNYLFETIPTQLFPLLFVTVACVMLLVKHKHTFLTLFLLPVAICLTYANIVYEPATP
                     ISSALIAVANWLAPTNVYMRTTHTDIGVYISLSLVLAIVVKRLYNPSLSNFALALCSG
                     VMWLYTYSVGEVSSPIAYLVFVTTLTSDYTITVFVTVNLAKICTYIIFAYAPQLTLVF
                     PEVKMILLLYTCFGFMCTCYFGVFSLLNLKLRAPMGVYDFKVSTQEFRFMTANNLTAP
                     RNSWEAMSLNFKLLGIGGTPCIKVAAIQSKLTDLKCTSVVLLSVLQQLHLEANSKAWA
                     FCVKCHNDILAATDPSEAFEKFVSLFATLMTFSGNVDLDALASDIFETPSVLQATLSE
                     FSHLATFAELEAAQRAYQEAMDSGDASPQVLKALQKAVNVAKNAYEKDKAVARKLERM
                     AEQAMTSMYKQARAEDKKAKIVSAMQTMLFGMIKKLDNDVLNGIISNARNGCIPLSVV
                     PLCASNKLRVVIPDFTVWNQVVTYPSLNYAGALWDIAVINNVDNEIVKSSDVVENNES
                     LTWPLVLECTRAASSAIKLQNNEIKPSGLRTMVVSAGQEQTNCNTSSLAYYEPVQGRK
                     MLMALLSDNAYLKWARVEGQEGFVSVELQPPCKFLIAGPKGPEIRYLYFVKNLNNLHR
                     GQVLGHIAATVRLQAGSNTEFAANSSVLSLVNFTVDPQKAYIDFVNAGGAPLTNCVKM
                     LTPKTGTGIAISVKPESTADQETYGGASVCLYCRAHIEHPDVSGVCKYKGKFVQIPSQ
                     CTRDPVGFCLTNTPCNVCQYWIGYGCNCDSLRQAALPQSKDSNFLNESGVLLVNARIE
                     PCASGLSTDVVFRAFDICNYKAKVAGIGKYYKTNTCRFVELDDQGHHLDSYFVVKRHT
                     MENYELEKHCYDLLRDCDSVAPHDFFVFDVDKTKTPHIVRQRLTEYTMMDLVYALRHF
                     DQNNCEVLKAILVKYDCCDATYFENKLWFDFVENPSVIGVYHKLGERVRQAVLSTVKF
                     CDHMVKAGLVGVLTLDNQDLNGKWYDFGDFVITQPGSGVAIVDSYYSYLMPVLSMTNC
                     LAAETHRDCDFNKPLIEWPLTEYDFTDYKVQLFEKYFKYWDQTYHANCVNCTDDRCVL
                     HCANFNVLFAMTMPKTCFGPIVRKIFVDGVPFVVSCGYHYKELGLVMNMDVSLHRHRL
                     SLKELMMYAADPAMHIASSNAFLDLRTSCFSVAALTTGLTFQTVRPGNFNQDFYDFVV
                     SKGFFKEGSSVTLKHFFFAQDGNAAITDYNYYSYNLPTMCDIKQMLFCMEVVNKYFEI
                     YDGGCLNASEVVVNNLDKSAGHPFNKFGKARVYYESMSYQEQDELFAMTKRNVIPTMT
                     QMNLKYAISAKNRARTVAGVSILSTMTNRQYHQKMLKSMAATRGSTCVIGTTKFYGGW
                     DFMLKTLYKDVDNPHLMGWDYPKCDRAMPNMCRIFASLILARKHGTCCTTRDRFYRLA
                     NECAQVLSEYVLCGGGYYVKPGGTSSGDATTAYANSVFNILQATTANVSALMGANGNK
                     IVDKEVKDMQFELYVNVYRSTNPDPKFVDRYYAFLNKHFSMMILSDDGVVCYNSDYAA
                     KGYIAGIQNFKETLYYQNNVFMSEAKCWVETDLKKGPHEFCSQHTLYIKDGDDGYFLP
                     YPDPSRILSAGCFVDDIVKTDGTLMVERFVSLAIDAYPLTKHEDIEYQNVFWVYLQYI
                     EKLYKDLTGHMLDSYSVMLCGDNSAKFWEEAFYRELYSSPTTLQAVGSCVVCHSQTSL
                     RCGTCIRRPFLCCKCCYDHVIATPHKMVLSVSPYVCNAPGCDVADVTKLYLGGMSYFC
                     VDHRPVCSFPLCTNGLVFGLYKNMCTGSPSIVEFNRLATCDWTESGDYTLANTTTEPL
                     KLFAAETLRATEEASKQSYAIATIKEIVGDRQLLLVWEAGKSKPPLNRNYVFTGYHIT
                     KNSKVQLGEYIFERIDYSDAVSYKSSTTYKLTVGDIFILTSHSVATLTAPTIVNQERY
                     VKITGLYPTITVPEEFASHVANFQKAGYSKYVTVQGPPGTGKSHFAIGLAIYYPTARV
                     VYTACSHAAVDALCEKAFKYLNIAKCSRIIPAKARVECYDRFKVNETNSQYLFSTINA
                     LPETSADILVVDEVSMCTNYDLSIINARVKAKHIVYVGDPAQLPAPRTLLTRGTLEPE
                     NFNSVTRLMCNLGPDIFLSMCYRCPKEIVSTVSALVYNNKLLAKKELSGQCFKMLYKG
                     NVTHDASSAINRPQLAFVKNFITANPAWSKAVFISPYNSQNAVARSMLGLTTQTVDSS
                     QGSEYQYVIFCQTADTAHANNINRFNVAITRAQKGILCVMTSQALFDSLEFTELSFTN
                     YKLQSQIVTGLFKDCSRETSGLSPAYAPTYVSVDDKYKTCDELCVNLNLPANVPYSRV
                     ISRMGFKLDASVPGYPKLFITREEAVRQVRSWIGFDVEGAHASRNACGTNVPLQLGFS
                     TGVNFVVQPVGVVDTEWGNMLTGISARPPPGEQFKHLVPLMHKGAAWPIVRRRIVQML
                     SDTLDKLSDYCTFVCWAHGFELTSASYFCKIGKEQKCCMCNRRAAAYSSPLQSYACWS
                     HSCGYDYVYNPFFVDVQQWGYVGNLATNHDRYCSVHQGAHVASNDAIMTRCLAIHACF
                     IEHVDWDIEYPYISHEKKLNSCCRIVERNVVRAALLAGSFDRVYDIGNPKGIPIVDHP
                     VVEWHYFDAQPLTRKVQQLFYTEDLASRFADGLCLFWNCNVPKYPNNAIVCRFDTRVH
                     SEFNLPGCDGGSLYVNKHAFHTPAYDVSAFRDLKPLPFFYYSTTPCEVHGTGSMLEDI
                     DYVPLKSAVCVTACNLGGAVCRKHATEYRDYMEAYNLVSASGFRLWCYKTFDIYNLWS
                     TFTKVQGLENIAYNVVKQGHFTGVDGELPVAVVNDKIFTKSGVNDICVFENKTTLPTN
                     VAFELYAKRVVRSHPDFKLLHNLQADICYKFVLWDYERCNIYGTATIGVCKYTDIEVN
                     SALNICFDIRDNGSLEKFMTTPNAILISDRKIKNYPCMVGPDYAYFNGAIIRDSDTVK
                     QPVKFYFYKKVNNEFVEFSDCAYTQGRSCSDFEAMSVMETDFLALDSDVFIKKYGLEN
                     YAFEHVVYGDFSHTTLGGLHLLIGLYKKHLDGHIIMEEMIRESSTIHNYFITETSTAS
                     FKAVCSVIDLKLDDFVQILKSQDLGVVSKVVKVPIDLTMIEFMLWCKDGQVQTFYPRL
                     QASADWKPGQAMPSLFKVQNVNLERCELANYKQSIPMPRGVHMNIAKYMQLCQYLNTC
                     TIAVPANMRVIHFGAGSDKGIAPGTSVLRQWLPTDAIIIDNDLNDFVSDADISLFGDC
                     VTVRVGQQVDLVISDMYDPSTKNITGSNESKALFFTYLCNFINNNLALGGSVAIKITE
                     HSWSVDLYEIMGKFAWWTVFCTNANASSSEGFLLGINYLGTIKENIDGGAMHANYIFW
                     RNSNPMNLSTYSLFDLSKFQLKLKGTPVLQLKESQINELVISLLSQGKLLIRDNDVLS
                     VSTDVLVNFYRGKR"
     misc_feature    127..636
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="non-structural protein 1 from Middle East
                     respiratory syndrome-related coronavirus and
                     betacoronavirus in the C lineage; Region:
                     MERS-CoV-like_Nsp1; cd21878"
                     /db_xref="CDD:409340"
     misc_feature    646..2625
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="betacoronavirus non-structural protein 2 (Nsp2)
                     similar to MERS-CoV Nsp2, and related proteins from
                     betacoronaviruses in the C lineage; Region:
                     betaCoV_Nsp2_MERS-like; cd21517"
                     /db_xref="CDD:394868"
     misc_feature    2686..2949
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="first ubiquitin-like (Ubl) domain located at the
                     N-terminus of coronavirus SARS-CoV non-structural protein
                     3 (Nsp3) and related proteins; Region:
                     Ubl1_cv_Nsp3_N-like; cd21467"
                     /db_xref="CDD:394822"
     misc_feature    <3016..3303
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="ribonuclease E; Reviewed; Region: rne; PRK10811"
                     /db_xref="CDD:236766"
     misc_feature    3463..3834
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="X-domain (or Mac1 domain) of viral non-structural
                     protein 3 and related macrodomains; Region:
                     Macro_X_Nsp3-like; cd21557"
                     /db_xref="CDD:438957"
     misc_feature    order(3481..3489,3499..3519,3742..3747,3751..3765,
                     3829..3831)
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="ADP-ribose binding site [chemical binding]; other
                     site"
                     /db_xref="CDD:438957"
     misc_feature    3886..4245
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="SUD-M macrodomain (or Mac3 domain) of the SARS
                     Unique Domain (SUD) of SARS-CoV non-structural protein 3
                     and related macrodomains; Region:
                     Macro_cv_SUD-M_Nsp3-like; cd21563"
                     /db_xref="CDD:394884"
     misc_feature    4255..4482
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="C-terminal SARS-Unique Domain (SUD) of
                     non-structural protein 3 (Nsp3) from Middle East
                     respiratory syndrome-related coronavirus and related
                     betacoronaviruses in the C lineage; Region:
                     SUD_C_MERS-CoV_Nsp3; cd21523"
                     /db_xref="CDD:394839"
     misc_feature    4495..5427
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="betacoronavirus papain-like protease; Region:
                     betaCoV_PLPro; cd21732"
                     /db_xref="CDD:409649"
     misc_feature    order(4816..4827,4978..4986,4990..4995,5002..5004,
                     5014..5016,5092..5094,5116..5121,5164..5166,5170..5172,
                     5239..5241,5296..5298,5317..5328,5413..5415)
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="ubiquitin binding site [polypeptide binding]; other
                     site"
                     /db_xref="CDD:409649"
     misc_feature    order(4978..4986,5236..5241,5296..5298,5305..5307,
                     5317..5319,5326..5328,5413..5415)
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="polypeptide substrate binding site [polypeptide
                     binding]; other site"
                     /db_xref="CDD:409649"
     misc_feature    5536..5904
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="nucleic acid binding domain of non-structural
                     protein 3 from Middle East respiratory syndrome-related
                     coronavirus and betacoronavirus in the C lineage; Region:
                     MERS-CoV-like_Nsp3_NAB; cd21823"
                     /db_xref="CDD:409349"
     misc_feature    5962..6324
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="betacoronavirus-specific marker of non-structural
                     protein 3 from Middle East respiratory syndrome-related
                     coronavirus and betacoronavirus in the C lineage; Region:
                     MERS-CoV-like_Nsp3_betaSM; cd21815"
                     /db_xref="CDD:409630"
     misc_feature    6574..8271
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="C-terminus of non-structural protein 3, including
                     transmembrane and Y domains, from Middle East respiratory
                     syndrome-related coronavirus and betacoronavirus in the C
                     lineage; Region: TM_Y_MERS-CoV-like_Nsp3_C; cd21716"
                     /db_xref="CDD:409664"
     misc_feature    6574..6639
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="TM1 [structural motif]; Region: TM1"
                     /db_xref="CDD:409664"
     misc_feature    6961..7029
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="TM2 [structural motif]; Region: TM2"
                     /db_xref="CDD:409664"
     misc_feature    8329..9471
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="coronavirus non-structural protein 4 (Nsp4)
                     transmembrane domain; Region: cv_Nsp4_TM; cd21473"
                     /db_xref="CDD:394836"
     misc_feature    8329..8397
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="putative TM helix 1 [structural motif]; Region:
                     putative TM helix 1"
                     /db_xref="CDD:394836"
     misc_feature    9130..9195
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="putative TM helix 2 [structural motif]; Region:
                     putative TM helix 2"
                     /db_xref="CDD:394836"
     misc_feature    9238..9303
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="putative TM helix 3 [structural motif]; Region:
                     putative TM helix 3"
                     /db_xref="CDD:394836"
     misc_feature    9382..9450
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="putative TM helix 4 [structural motif]; Region:
                     putative TM helix 4"
                     /db_xref="CDD:394836"
     misc_feature    9529..9786
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="Coronavirus nonstructural protein 4 C-terminus;
                     Region: Corona_NSP4_C; pfam16348"
                     /db_xref="CDD:406690"
     misc_feature    9802..10692
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="betacoronavirus non-structural protein 5, also
                     called Main protease (Mpro); Region: betaCoV_Nsp5_Mpro;
                     cd21666"
                     /db_xref="CDD:394887"
     misc_feature    order(9802..9825,9832..9834,10153..10155,10165..10185,
                     10210..10224,10297..10299,10315..10317,10648..10650,
                     10660..10662,10684..10689)
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="homodimer interface [polypeptide binding]; other
                     site"
                     /db_xref="CDD:394887"
     misc_feature    order(9853..9855,9862..9870,9937..9939,9952..9954,
                     10219..10236,10288..10299,10303..10305,10315..10317,
                     10360..10362,10366..10374)
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="polypeptide substrate binding site [polypeptide
                     binding]; other site"
                     /db_xref="CDD:394887"
     misc_feature    10711..11586
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="betacoronavirus non-structural protein 6; Region:
                     betaCoV-Nsp6; cd21560"
                     /db_xref="CDD:394846"
     misc_feature    11587..11835
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="betacoronavirus non-structural protein 7; Region:
                     betaCoV_Nsp7; cd21827"
                     /db_xref="CDD:409253"
     misc_feature    order(11590..11592,11599..11610,11617..11625,11629..11634,
                     11641..11643,11668..11670,11677..11679,11695..11697,
                     11731..11748,11752..11769,11788..11802)
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="oligomer interface [polypeptide binding]; other
                     site"
                     /db_xref="CDD:409253"
     misc_feature    11845..12432
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="betacoronavirus non-structural protein 8; Region:
                     betaCoV_Nsp8; cd21831"
                     /db_xref="CDD:409258"
     misc_feature    order(12073..12078,12085..12090,12094..12102,12106..12114,
                     12118..12123,12127..12135,12145..12150,12154..12159,
                     12163..12171,12175..12204,12211..12213,12217..12231,
                     12235..12237,12247..12249,12259..12261,12283..12285,
                     12298..12300,12325..12327,12370..12372,12388..12390)
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="oligomer interface [polypeptide binding]; other
                     site"
                     /db_xref="CDD:409258"
     misc_feature    12433..12762
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="betacoronavirus non-structural protein 9; Region:
                     betaCoV_Nsp9; cd21898"
                     /db_xref="CDD:409331"
     misc_feature    12433..12450
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="N-finger; other site"
                     /db_xref="CDD:409331"
     misc_feature    order(12439..12447,12451..12459,12640..12645,12709..12714,
                     12718..12726,12730..12738,12742..12750,12754..12762)
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="homodimer interface [polypeptide binding]; other
                     site"
                     /db_xref="CDD:409331"
     misc_feature    12763..13155
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="alphacoronavirus and betacoronavirus non-structural
                     protein 10; Region: alpha_betaCoV_Nsp10; cd21901"
                     /db_xref="CDD:409326"
     misc_feature    order(12763..12786,12796..12798,12802..12810,12814..12825,
                     12835..12840,12847..12852,12859..12861,12880..12897,
                     12934..12939,12967..12969,12973..12978,12988..12990,
                     12994..13011,13024..13032,13039..13050)
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="Nsp14 interface [polypeptide binding]; other site"
                     /db_xref="CDD:409326"
     misc_feature    order(12802..12810,12814..12822,12835..12837,12880..12882,
                     12886..12897,12934..12942,12994..13005,13012..13014,
                     13045..13050,13105..13107)
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="oligomer interface [polypeptide binding]; other
                     site"
                     /db_xref="CDD:409326"
     misc_feature    order(12823..12828,12835..12837,12886..12897,12934..12942,
                     13012..13014,13045..13050,13105..13107,13117..13119)
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="homotrimer interface [polypeptide binding]; other
                     site"
                     /db_xref="CDD:409326"
     misc_feature    order(12880..12903,12934..12939,12967..12978,12991..12996,
                     13000..13002,13039..13050)
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="Nsp16 interface [polypeptide binding]; other site"
                     /db_xref="CDD:409326"
     misc_feature    join(13192..13224,13224..15983)
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="Middle East respiratory syndrome-related
                     coronavirus RNA-dependent RNA polymerase, also known as
                     non-structural protein 12, and similar proteins from
                     betacoronaviruses in the C lineage: responsible for
                     replication and transcription of the viral RNA genome;
                     Region: MERS-CoV-like_RdRp; cd21592"
                     /db_xref="CDD:394896"
     misc_feature    order(13992..14006,14154..14159,14163..14165,14169..14183,
                     14199..14210,14217..14219,14289..14291,14298..14303,
                     14307..14312,14319..14321,14325..14339,14343..14363,
                     14373..14375,14379..14381,14391..14396,14406..14408,
                     14700..14705,14712..14714,14727..14732,14736..14738,
                     14745..14756,15183..15185)
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="putative Nsp8 interaction site [polypeptide
                     binding]; other site"
                     /db_xref="CDD:394896"
     misc_feature    order(14412..14426,14430..14432,14445..14447,14472..14474,
                     14478..14480,14505..14522,14835..14837,14841..14843,
                     15714..15716)
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="putative Nsp7 interaction site [polypeptide
                     binding]; other site"
                     /db_xref="CDD:394896"
     misc_feature    order(14679..14681,14685..14693,14820..14822,14856..14858,
                     14862..14864,14880..14882,14892..14894,14904..14906,
                     14916..14918,14955..14957,14961..14963,14967..14969,
                     15231..15242,15249..15251,15459..15470,15624..15629,
                     15681..15683,15705..15707,15756..15761,15771..15773,
                     15777..15782)
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="putative RNA binding site [nucleotide binding];
                     other site"
                     /db_xref="CDD:394896"
     misc_feature    14685..14726
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="conserved polymerase motif G; other site"
                     /db_xref="CDD:394896"
     misc_feature    14799..14867
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="conserved polymerase motif F; other site"
                     /db_xref="CDD:394896"
     misc_feature    order(14832..14834,15225..15233,15246..15248,15258..15260,
                     15462..15464)
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="inhibitor binding site [chemical binding];
                     inhibition site"
                     /db_xref="CDD:394896"
     misc_feature    15018..15068
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="conserved polymerase motif A; other site"
                     /db_xref="CDD:394896"
     misc_feature    order(15039..15041,15462..15470)
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="catalytic residues [active]"
                     /db_xref="CDD:394896"
     misc_feature    15225..15314
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="conserved polymerase motif B; other site"
                     /db_xref="CDD:394896"
     misc_feature    15444..15488
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="conserved polymerase motif C; other site"
                     /db_xref="CDD:394896"
     misc_feature    15510..15575
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="conserved polymerase motif D; other site"
                     /db_xref="CDD:394896"
     misc_feature    15615..15650
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="conserved polymerase motif E; other site"
                     /db_xref="CDD:394896"
     misc_feature    15984..16268
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="Cys/His rich zinc-binding domain (CH/ZBD) of
                     coronavirus SARS NSP13 helicase and related proteins;
                     Region: ZBD_cv_Nsp13-like; cd21401"
                     /db_xref="CDD:439168"
     misc_feature    order(16116..16118,16179..16181,16185..16187,16224..16226,
                     16251..16265)
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="oligomer interface [polypeptide binding]; other
                     site"
                     /db_xref="CDD:439168"
     misc_feature    16278..16421
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="stalk domain of coronavirus Nsp13 helicase and
                     related proteins; Region: stalk_CoV_Nsp13-like; cd21689"
                     /db_xref="CDD:410205"
     misc_feature    order(16287..16289,16374..16376)
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="key interaction residues; other site"
                     /db_xref="CDD:410205"
     misc_feature    16431..16667
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="1B domain of coronavirus SARS NSP13 helicase and
                     related proteins; Region: 1B_cv_Nsp13-like; cd21409"
                     /db_xref="CDD:394817"
     misc_feature    order(16515..16520,16524..16526,16617..16619)
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="nucleic acid substrate binding site [nucleotide
                     binding]; other site"
                     /db_xref="CDD:394817"
     misc_feature    16629..16637
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="oligomer interface [polypeptide binding]; other
                     site"
                     /db_xref="CDD:394817"
     misc_feature    16734..17753
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="helicase domain of betacoronavirus non-structural
                     protein 13; Region: betaCoV_Nsp13-helicase; cd21722"
                     /db_xref="CDD:409655"
     misc_feature    order(16836..16853,17193..17195,17310..17312,17595..17597,
                     17601..17603,17682..17684)
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="ATP binding site [chemical binding]; other site"
                     /db_xref="CDD:409655"
     misc_feature    order(16845..16850,17103..17108,17193..17195,17682..17684)
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="putative active site [active]"
                     /db_xref="CDD:409655"
     misc_feature    17790..19343
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="nonstructural protein 14 of betacoronavirus;
                     Region: betaCoV_Nsp14; cd21659"
                     /db_xref="CDD:394958"
     misc_feature    order(17790..17792,17796..17807,17832..17861,17928..17930,
                     17940..17942,17955..17978,18078..18083,18147..18149,
                     18153..18155,18165..18170,18351..18353,18360..18365,
                     18372..18380,18426..18428)
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="heterodimer interface [polypeptide binding]; other
                     site"
                     /db_xref="CDD:394958"
     misc_feature    order(18045..18047,18051..18053,18348..18350,18579..18581,
                     18594..18596)
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="ExoN active site [active]"
                     /db_xref="CDD:394958"
     misc_feature    order(18651..18653,18693..18695,18702..18707,18714..18716,
                     18774..18785,18789..18791,18831..18839,18864..18872,
                     18918..18932,18966..18968,19023..19025,19029..19034,
                     19041..19043,19047..19049,19284..19286)
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="N7-MTase active site [active]"
                     /db_xref="CDD:394958"
     misc_feature    19350..19532
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="N-terminal domain of alpha- and beta-coronavirus
                     Nonstructural protein 15 (Nsp15), and related proteins;
                     Region: NTD_alpha_betaCoV_Nsp15-like; cd21171"
                     /db_xref="CDD:439163"
     misc_feature    order(19350..19358,19410..19412,19416..19427,19449..19451,
                     19464..19466,19491..19493,19497..19508)
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="hexamer interface [polypeptide binding]; other
                     site"
                     /db_xref="CDD:439163"
     misc_feature    order(19374..19391,19428..19439,19443..19445,19452..19454,
                     19458..19475,19482..19493,19530..19532)
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="trimer interface [polypeptide binding]; other site"
                     /db_xref="CDD:439163"
     misc_feature    19545..19928
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="middle domain of alpha- and beta-coronavirus
                     Nonstructural protein 15 (Nsp15), and related proteins;
                     Region: M_alpha_beta_cv_Nsp15-like; cd21167"
                     /db_xref="CDD:439161"
     misc_feature    order(19578..19583,19611..19613,19617..19619,19623..19625,
                     19629..19637,19815..19820,19824..19835)
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="trimer interface [polypeptide binding]; other site"
                     /db_xref="CDD:439161"
     misc_feature    19656..19661
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="hexamer interface [polypeptide binding]; other
                     site"
                     /db_xref="CDD:439161"
     misc_feature    19920..20372
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="Nidoviral uridylate-specific endoribonuclease
                     (NendoU) domain of coronavirus Nonstructural Protein 15
                     (Nsp15) and related proteins; Region:
                     NendoU_cv_Nsp15-like; cd21161"
                     /db_xref="CDD:439158"
     misc_feature    order(19944..19946,19950..19952,20058..20066,20130..20132,
                     20142..20147,20151..20153,20169..20171,20175..20177,
                     20181..20183,20187..20189,20193..20198,20202..20204)
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="trimer interface [polypeptide binding]; other site"
                     /db_xref="CDD:439158"
     misc_feature    order(20040..20042,20052..20054,20076..20081,20085..20087,
                     20205..20207,20214..20219,20358..20360,20364..20366)
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="putative active site [active]"
                     /db_xref="CDD:439158"
     misc_feature    20382..21269
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="Coronavirus NSP13; Region: NSP13; pfam06460"
                     /db_xref="CDD:399456"
     CDS             61..13227
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="polyprotein pp1a; ORF1a polyprotein is cleaved to
                     yield the nonstructural proteins"
                     /codon_start=1
                     /product="ORF1a polyprotein"
                     /protein_id="YP_009361855.1"
                     /db_xref="GeneID:37627558"
                     /translation="MSFVADVTAQGARGTYRAALNSEKHHDHVSLTVPLCGSGDLVSK
                     LSPWFMDGYDACEAVKVMLSNKEKLLFVPIRLVGYTKHLPGPRVYLVERLINGIYTDP
                     FMVNQVAYSSSANAGLVGTTLQGKPIGLFFPFDADLVTGDHTFLLRKYGRGGYHYTPF
                     HYERDATSRPEWMDDLEADPKGKYAQNLLKKLIGGDVTPVDQYMCGVDGKPINDYAGL
                     MAKEGITKLADIEADVASRVDADGFIVLKNKLYRLVWHVERKDVQYAKQSIFTINSVV
                     QREGLQDIPPHYFTLGGKIDMLVPRNKWNGVANLPLKQKILYTFYGKESLENHSYIYH
                     SAFTDCGGCGNGSWLTGNAVQGFSCGCGASYLSNDVEVQSSGLIKPNALFCATCPFAK
                     GDSCSSSCKHSIAQLVSYLSERCNVIADSKSFTLVFGGVAYAYFGCEEGTMYFVPRAK
                     SVVSKIGDSIFTGCTGSWTKVTQIANLFLEQTQRSLNFVGEFVVNDVVLAILSGTTTN
                     VDKLRELLKGITLEKLRDYLADYDVAVTLGPFMDNAVNVGGKGLQYATITAPFLVLTG
                     LGESFKKVAAIPYKVCKSFKETLSYYADSILYRVFPYDMDSDVSSFTELLFDCVGLSV
                     ASTYFIVRLLQDKTGDFMSTILSSCQSAVRKLLDTCLEATEATLNFLLELANLFKIFL
                     RGAYVYTSQGFVVLQGKMSSLVKQVVDLLNKGMQLLHTKVSWAGSKVSAVIYSGRESL
                     IFPTGTYYCVSTKAKSVQHQFDVILPGDCSKKQLGLLEPTDNSTTVEVTVSSNTVETV
                     VGQLEQTNMHSPDVIVGDYVIISDKLFVRSKEEDRVVFYPACTNGTAVPTLFKLKGGA
                     PVKRVAFGDDEIHEVAAVRSVTVEYNIHAVLDALLASSSLRTFVVDKSLSIEEFVDVV
                     KEQVSDLLAKLLRGMPIPDFDLDDFIDTPCYCFNADGDVSWSSTMIFSLHPVECEDDS
                     FECDSDQDDDQESVCEPLVEETNVQVQESDDDGWAAAVEEAFPIEELEEPPVQVVPND
                     SVVRSQVAQPIEIVVQETPVQPLEDVAPAVATPSIQLQEIQTEVLDTPPVYEADIEQT
                     QIVVSKPKRLRKKRNVDPLFNFEHKVITDCVTMVLGDAIQVAKCYDEAVLVNAANTYL
                     KHGGGIAGAINAASNGAVQQESDEYILAKGPLQVGDSVLLQGHSLAKNILHVVGPDAR
                     AKQDVSLLGKCYKAMNAYPLVVTPLVSAGIFGVQPSVSFDYLIREVKTRVLVVVNSQD
                     IYKSLTTVEVPQGLTFSYDGLRGALRKARDYGFTVFVCTDNSANTKVLRNKGVDYTKK
                     STTVDGVQYYCYTAKDTLDSIVLEANKASGIISMPLGYVSHGLDLMQAGAIVRRVKVP
                     YVCLLANKEQEAILMSEDVKLSPSADFVKHVRTNGGYNSWHLVEGELLVRDLTLNKLL
                     HWSDQTICYKSDKFYVVKNGVALPFETLAACRTYLDSRTAQQLTIEVLVTVDGVNFRT
                     VVLNNKSSYRSQLGCVFYNGADISDTIPDEKQNGCSLYLADNLTADETKVLKELYGPV
                     DPTFLHRFYSLKAVVQKWKMVVCDKVRSLKLSDNNCYINVVIMILDLLKDIKFVIPAL
                     QHAFMKHKGGDSTEFIALIMTYGNCTFGAPDDATRLLHTVLAKAELCCSARMVWREWC
                     NVCGIKDVVIQGLKACCYVGVQTVEDLHARMTYVCQCGGERHRQLVEHTAPWLLLSGT
                     PNEKLVTTSTAPDFVAFNVFQGLETAVGHYVHARLKDGLILKFDSGTLSKTSDWKCKV
                     TDVLFPNQKYSSDCNVVRYSLDGKFRTEVDPDLSAFYVKDGKYFTSEPPVTYSPATVL
                     AGSVYTNSCLVSSDGQPGGDAISLSFNNLLGFDSSKPVTKKYTYSILPKEDGDVLLAE
                     FSTYDPIYKNGAMLKGKPVLWVTNASYDATLNKFNRATLRQIYDVAPIEIENKYTPLS
                     VEPSPVEKVSTVEVALAKPELTIVKCKGLIKPFVKANVSFVSDETGLPVVEYLSKEDL
                     HTLYVDPKYQVIVLKDNALSTIFRLHTVESGDLNVVAASGSLTRKVKLLFRASFYFKE
                     LASRTLTATTVVGSCINSVVRHLGVTKGILASLFSFVKMLFVLPLSYFSDSETSTTEV
                     KVSALKTAGVVTGNVLKQCCTAAVDLSMDKLRRVDWKATLRLLLMLCTTMVLLSSVYH
                     LYVFNQVLSSDVMFEDAQGLKKFYKEVRAYLGVSSGCDGLAAAYRANSFDVPTFCANR
                     SVMCNWCLINQDSITHYPALKMVQTHLSHYVLNIDWLWFALEVGLAYILYTSAFNWLL
                     LAGTLQYFFAQTSIFVDWRSYNYVVSSAFWLFTHIPMPGLVRIYNLLACLWLLRKFYQ
                     HVINGCKDTACLLCYKRNRLTRVEASTVVCGGKRTFYIAANGGISFCRRHNWNCVDCD
                     TAGVGNTFICEEVASDLTTTLRRPVNSTDRSHYYVDSVLVKETVVQFNYRRDGQSCYE
                     RFPLCAFTNLDKLKFKEVCKTTTGIPEYNFIIYDSSDRGQESLARSACVYYSQVLCKS
                     ILLVDSSLVTSVGNSGEIAIKMFDSFVNSFVSLYNVTRDKLEKLISTARDGVKRGDNF
                     HSVLTTFIDAARGPAGVESDVETNEIVDSVQYAHKHDIQLTNESYNNYVPSYVKPDSV
                     STGDLGSLIDCNAASVNQTSMRQANGACIWNAAAYMKLSDVLKRQIRIACRKCNLAFR
                     LTTSKLRANDNMLSVKFTATKIVGGAPTWFNTLRDFTLKSYVFVTIIVFLCAVLMYFC
                     LPTFAMAPVEFYEDRILEYKVLDNGIIRDISPDDKCFANKYRSFSQWYHEHVGGSYDN
                     SISCPLTVAVIAGVAGARIPDVPTTLAWVNRQIVFFVSRVFANSNSVCYTPINEIPYK
                     SFSDSGCILPSECTMFRDAEGRMSPYCYDPTVLPGAFAYSQMKPHVRYDLYDTNMFIK
                     FPEVVFESTLRITKTLTTQYCRFGSCEYAQEGVCITTNGSWAIFNDHHLSRPGVYCGS
                     DYVDIVRRLAVSLFQPITYFQLTTSLVLGIGLCAFLTLLFYYINKVKRAFADYTQCAM
                     IAVIAAVLNSLCICFVSSIPLCIVPYTALYYYATFYFTNEPAAIMHVSWYIMFGPIVP
                     MWLTCVYTVAMCFRHFFWVVAYFSKKHVEVFTDGKLNCSFQDAASNIFVVNKDTYAAL
                     RNAITNDVYSRYLGLFNKYKYYSGAMETAAYREAAACHLAKALQTYSETGSDLLYQPP
                     NCSITSGVLQSGLVRMSHPSGDVEACMVQVTCGSMTLNGLWLDNTVWCPRHVMCPADQ
                     LADPNYDALLVSMTNHSFSVNKHIGAPANLRVIGHAMQGTLLKLTVDVANPSTPAYTF
                     TTVKPGASFSVLACYNGRPTGTFTVVMRPNYTIKGSFLCGSCGSVGYTKEGSVINFCY
                     MHQMELANGTHTGSAFDGTMYGAFLDKQVHQVQLTDKYCSTNVVAWLYAAILNGCAWF
                     VKSNRTSIVSFNEWALANQFTEFVGTQSIDMLAVKTGVAIEQLLYAIQQLHTGFQGKQ
                     ILGSSMLEDEFTPEDVNMQIMGVVMQSGVRKVTYGTAHWLFATFVLSYVVFLQTTKFT
                     LWNYLFETIPTQLFPLLFVTVACVMLLVKHKHTFLTLFLLPVAICLTYANIVYEPATP
                     ISSALIAVANWLAPTNVYMRTTHTDIGVYISLSLVLAIVVKRLYNPSLSNFALALCSG
                     VMWLYTYSVGEVSSPIAYLVFVTTLTSDYTITVFVTVNLAKICTYIIFAYAPQLTLVF
                     PEVKMILLLYTCFGFMCTCYFGVFSLLNLKLRAPMGVYDFKVSTQEFRFMTANNLTAP
                     RNSWEAMSLNFKLLGIGGTPCIKVAAIQSKLTDLKCTSVVLLSVLQQLHLEANSKAWA
                     FCVKCHNDILAATDPSEAFEKFVSLFATLMTFSGNVDLDALASDIFETPSVLQATLSE
                     FSHLATFAELEAAQRAYQEAMDSGDASPQVLKALQKAVNVAKNAYEKDKAVARKLERM
                     AEQAMTSMYKQARAEDKKAKIVSAMQTMLFGMIKKLDNDVLNGIISNARNGCIPLSVV
                     PLCASNKLRVVIPDFTVWNQVVTYPSLNYAGALWDIAVINNVDNEIVKSSDVVENNES
                     LTWPLVLECTRAASSAIKLQNNEIKPSGLRTMVVSAGQEQTNCNTSSLAYYEPVQGRK
                     MLMALLSDNAYLKWARVEGQEGFVSVELQPPCKFLIAGPKGPEIRYLYFVKNLNNLHR
                     GQVLGHIAATVRLQAGSNTEFAANSSVLSLVNFTVDPQKAYIDFVNAGGAPLTNCVKM
                     LTPKTGTGIAISVKPESTADQETYGGASVCLYCRAHIEHPDVSGVCKYKGKFVQIPSQ
                     CTRDPVGFCLTNTPCNVCQYWIGYGCNCDSLRQAALPQSKDSNFLNESGVLL"
     misc_feature    127..636
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="non-structural protein 1 from Middle East
                     respiratory syndrome-related coronavirus and
                     betacoronavirus in the C lineage; Region:
                     MERS-CoV-like_Nsp1; cd21878"
                     /db_xref="CDD:409340"
     misc_feature    646..2625
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="betacoronavirus non-structural protein 2 (Nsp2)
                     similar to MERS-CoV Nsp2, and related proteins from
                     betacoronaviruses in the C lineage; Region:
                     betaCoV_Nsp2_MERS-like; cd21517"
                     /db_xref="CDD:394868"
     misc_feature    2686..2949
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="first ubiquitin-like (Ubl) domain located at the
                     N-terminus of coronavirus SARS-CoV non-structural protein
                     3 (Nsp3) and related proteins; Region:
                     Ubl1_cv_Nsp3_N-like; cd21467"
                     /db_xref="CDD:394822"
     misc_feature    <3016..3303
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="ribonuclease E; Reviewed; Region: rne; PRK10811"
                     /db_xref="CDD:236766"
     misc_feature    3463..3834
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="X-domain (or Mac1 domain) of viral non-structural
                     protein 3 and related macrodomains; Region:
                     Macro_X_Nsp3-like; cd21557"
                     /db_xref="CDD:438957"
     misc_feature    order(3481..3489,3499..3519,3742..3747,3751..3765,
                     3829..3831)
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="ADP-ribose binding site [chemical binding]; other
                     site"
                     /db_xref="CDD:438957"
     misc_feature    3886..4245
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="SUD-M macrodomain (or Mac3 domain) of the SARS
                     Unique Domain (SUD) of SARS-CoV non-structural protein 3
                     and related macrodomains; Region:
                     Macro_cv_SUD-M_Nsp3-like; cd21563"
                     /db_xref="CDD:394884"
     misc_feature    4255..4482
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="C-terminal SARS-Unique Domain (SUD) of
                     non-structural protein 3 (Nsp3) from Middle East
                     respiratory syndrome-related coronavirus and related
                     betacoronaviruses in the C lineage; Region:
                     SUD_C_MERS-CoV_Nsp3; cd21523"
                     /db_xref="CDD:394839"
     misc_feature    4495..5427
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="betacoronavirus papain-like protease; Region:
                     betaCoV_PLPro; cd21732"
                     /db_xref="CDD:409649"
     misc_feature    order(4816..4827,4978..4986,4990..4995,5002..5004,
                     5014..5016,5092..5094,5116..5121,5164..5166,5170..5172,
                     5239..5241,5296..5298,5317..5328,5413..5415)
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="ubiquitin binding site [polypeptide binding]; other
                     site"
                     /db_xref="CDD:409649"
     misc_feature    order(4978..4986,5236..5241,5296..5298,5305..5307,
                     5317..5319,5326..5328,5413..5415)
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="polypeptide substrate binding site [polypeptide
                     binding]; other site"
                     /db_xref="CDD:409649"
     misc_feature    5536..5904
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="nucleic acid binding domain of non-structural
                     protein 3 from Middle East respiratory syndrome-related
                     coronavirus and betacoronavirus in the C lineage; Region:
                     MERS-CoV-like_Nsp3_NAB; cd21823"
                     /db_xref="CDD:409349"
     misc_feature    5962..6324
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="betacoronavirus-specific marker of non-structural
                     protein 3 from Middle East respiratory syndrome-related
                     coronavirus and betacoronavirus in the C lineage; Region:
                     MERS-CoV-like_Nsp3_betaSM; cd21815"
                     /db_xref="CDD:409630"
     misc_feature    6574..8271
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="C-terminus of non-structural protein 3, including
                     transmembrane and Y domains, from Middle East respiratory
                     syndrome-related coronavirus and betacoronavirus in the C
                     lineage; Region: TM_Y_MERS-CoV-like_Nsp3_C; cd21716"
                     /db_xref="CDD:409664"
     misc_feature    6574..6639
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="TM1 [structural motif]; Region: TM1"
                     /db_xref="CDD:409664"
     misc_feature    6961..7029
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="TM2 [structural motif]; Region: TM2"
                     /db_xref="CDD:409664"
     misc_feature    8329..9471
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="coronavirus non-structural protein 4 (Nsp4)
                     transmembrane domain; Region: cv_Nsp4_TM; cd21473"
                     /db_xref="CDD:394836"
     misc_feature    8329..8397
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="putative TM helix 1 [structural motif]; Region:
                     putative TM helix 1"
                     /db_xref="CDD:394836"
     misc_feature    9130..9195
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="putative TM helix 2 [structural motif]; Region:
                     putative TM helix 2"
                     /db_xref="CDD:394836"
     misc_feature    9238..9303
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="putative TM helix 3 [structural motif]; Region:
                     putative TM helix 3"
                     /db_xref="CDD:394836"
     misc_feature    9382..9450
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="putative TM helix 4 [structural motif]; Region:
                     putative TM helix 4"
                     /db_xref="CDD:394836"
     misc_feature    9529..9786
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="Coronavirus nonstructural protein 4 C-terminus;
                     Region: Corona_NSP4_C; pfam16348"
                     /db_xref="CDD:406690"
     misc_feature    9802..10692
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="betacoronavirus non-structural protein 5, also
                     called Main protease (Mpro); Region: betaCoV_Nsp5_Mpro;
                     cd21666"
                     /db_xref="CDD:394887"
     misc_feature    order(9802..9825,9832..9834,10153..10155,10165..10185,
                     10210..10224,10297..10299,10315..10317,10648..10650,
                     10660..10662,10684..10689)
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="homodimer interface [polypeptide binding]; other
                     site"
                     /db_xref="CDD:394887"
     misc_feature    order(9853..9855,9862..9870,9937..9939,9952..9954,
                     10219..10236,10288..10299,10303..10305,10315..10317,
                     10360..10362,10366..10374)
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="polypeptide substrate binding site [polypeptide
                     binding]; other site"
                     /db_xref="CDD:394887"
     misc_feature    10711..11586
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="betacoronavirus non-structural protein 6; Region:
                     betaCoV-Nsp6; cd21560"
                     /db_xref="CDD:394846"
     misc_feature    11587..11835
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="betacoronavirus non-structural protein 7; Region:
                     betaCoV_Nsp7; cd21827"
                     /db_xref="CDD:409253"
     misc_feature    order(11590..11592,11599..11610,11617..11625,11629..11634,
                     11641..11643,11668..11670,11677..11679,11695..11697,
                     11731..11748,11752..11769,11788..11802)
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="oligomer interface [polypeptide binding]; other
                     site"
                     /db_xref="CDD:409253"
     misc_feature    11845..12432
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="betacoronavirus non-structural protein 8; Region:
                     betaCoV_Nsp8; cd21831"
                     /db_xref="CDD:409258"
     misc_feature    order(12073..12078,12085..12090,12094..12102,12106..12114,
                     12118..12123,12127..12135,12145..12150,12154..12159,
                     12163..12171,12175..12204,12211..12213,12217..12231,
                     12235..12237,12247..12249,12259..12261,12283..12285,
                     12298..12300,12325..12327,12370..12372,12388..12390)
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="oligomer interface [polypeptide binding]; other
                     site"
                     /db_xref="CDD:409258"
     misc_feature    12433..12762
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="betacoronavirus non-structural protein 9; Region:
                     betaCoV_Nsp9; cd21898"
                     /db_xref="CDD:409331"
     misc_feature    12433..12450
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="N-finger; other site"
                     /db_xref="CDD:409331"
     misc_feature    order(12439..12447,12451..12459,12640..12645,12709..12714,
                     12718..12726,12730..12738,12742..12750,12754..12762)
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="homodimer interface [polypeptide binding]; other
                     site"
                     /db_xref="CDD:409331"
     misc_feature    12763..13155
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="alphacoronavirus and betacoronavirus non-structural
                     protein 10; Region: alpha_betaCoV_Nsp10; cd21901"
                     /db_xref="CDD:409326"
     misc_feature    order(12763..12786,12796..12798,12802..12810,12814..12825,
                     12835..12840,12847..12852,12859..12861,12880..12897,
                     12934..12939,12967..12969,12973..12978,12988..12990,
                     12994..13011,13024..13032,13039..13050)
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="Nsp14 interface [polypeptide binding]; other site"
                     /db_xref="CDD:409326"
     misc_feature    order(12802..12810,12814..12822,12835..12837,12880..12882,
                     12886..12897,12934..12942,12994..13005,13012..13014,
                     13045..13050,13105..13107)
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="oligomer interface [polypeptide binding]; other
                     site"
                     /db_xref="CDD:409326"
     misc_feature    order(12823..12828,12835..12837,12886..12897,12934..12942,
                     13012..13014,13045..13050,13105..13107,13117..13119)
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="homotrimer interface [polypeptide binding]; other
                     site"
                     /db_xref="CDD:409326"
     misc_feature    order(12880..12903,12934..12939,12967..12978,12991..12996,
                     13000..13002,13039..13050)
                     /gene="ORF1ab"
                     /locus_tag="CAU86_gp02"
                     /note="Nsp16 interface [polypeptide binding]; other site"
                     /db_xref="CDD:409326"
     gene            21232..25269
                     /gene="S"
                     /locus_tag="CAU86_gp03"
                     /db_xref="GeneID:37627554"
     CDS             21232..25269
                     /gene="S"
                     /locus_tag="CAU86_gp03"
                     /note="surface glycoprotein"
                     /codon_start=1
                     /product="spike protein"
                     /protein_id="YP_009361857.1"
                     /db_xref="GeneID:37627554"
                     /translation="MTYSVSLLMCLLTFIGANAKIVSIPGGVGTGACPQVDMQPSYFI
                     KHNWPEPIDMNKADGVIYPNGRTYSNITLQTTNLFPRNGDLGTQYVYSASNEKSRTSN
                     VAFISNYSYYGNPFGDGIVIRIGQNSNKTGSVIVGTAQTTIKKIYPALMLGSSFGNFS
                     VNNKSGAYFNHTLLILPSKCGTVFQVAYCLLQPRTDSYCPGNANYVSYALIDSPTDCT
                     SADESKRRNGLEDIKKYFNLVNCTYFEEFNVTADERAEWFGITQDSQGVHLYTSRKNG
                     FNSNNLFLFASVPIYDKINYYTVIPRSIITPANQRSAWAAFYVYPLHQLSYLLNFDVN
                     GYITQAADCGYNDYTQLVCSYGDFNMKSGVYSTSYYSAKPVGAYYEAHVYPDCNFTDL
                     FRENAPTIMQYKRQVFTRCNYNLTLLLSLVQVDEFVCDKITPEALATGCYSSLTVDWF
                     AFPYAWKSYLAIGSADRIVRFNYNQDYSNPSCRIHSKVNSSVGISYSGLYSYITNCNY
                     GGFNKDDVVKPGGRASQPCVTGALNSPTNGQVWSFNFGGVPYRTSRLTYTDHLKNPLD
                     MVYVITVKYEPGAETVCPKQVRPDYSTNITGLLGSCISYDIYGITGTGVFQLCNATGI
                     PQQKFVYDKFDNIIGFHSDDGNYYCVAPCVSVPVSVIYDDNTNQYATLFGSVACQHIS
                     TMAAQFSRETRASLVSRNMQNLLQTSVGCVMGFHETNDTVEDCNLSLGQSLCAIPPNT
                     NLRVGRSTFGLGSLAYNSPLRVDALNSSEFKVSLPLNFTFGVTQEYIETSIQKITVDC
                     KQYVCNGFAKCEKLLEQYGQFCSKINQALHGANLRQDDFVRNLFESVKTPQTVPLTTG
                     FGGEFNLTLLEPLSVSTGSSNARSALEELLFDKVTIADPGYMQGYDDCMQQGPASARD
                     LICAQYVAGYKVLPPLMDVNMEAAYTSSLLGSIAGAGWTAGLSSFAAIPFAQSIFYRL
                     NGVGITQQVLSENQKIIANKFNQALGAMQTGFTTTNEAFQKVQDAVNTNAQALAKLAS
                     ELSNTFGAISSSIGDIIQRLDVLEQEVQIDRLINGRLTTLNAFVAQQLVRSESAARSA
                     QLAKDKVNECVKSQSTRSGFCGQGTHIVSFVINAPNGLYFMHVGYHPSQHIEVVAAYG
                     LCDAANPTNCIAPVNGYFIKNQTTRGVDDWSYTGSSFYAPEPITTLNTRYVAPQVTFQ
                     NISTNLPPPLLGNSTGTDFKDELDEFFKNVSTSIPNFGALTQINTTLLDLSDEMLALQ
                     QVVKALNESYIDLKELGNYTYYNKWPWYIWLGFIAGLLALALCVFFILCCTGCGTSCL
                     GKLKCNRCCDKYEEYDLEPHKIHIH"
     misc_feature    21316..22293
                     /gene="S"
                     /locus_tag="CAU86_gp03"
                     /note="N-terminal domain of the S1 subunit of the Spike
                     (S) protein from Middle East respiratory syndrome-related
                     coronavirus and related betacoronaviruses in the C
                     lineage; Region: MERS-CoV-like_Spike_S1_NTD; cd21626"
                     /db_xref="CDD:394952"
     misc_feature    order(21316..21327,21709..21711,21814..21822,21832..21840,
                     21868..21870,22144..22146)
                     /gene="S"
                     /locus_tag="CAU86_gp03"
                     /note="neutralizing Ab binding site [polypeptide binding];
                     other site"
                     /db_xref="CDD:394952"
     misc_feature    order(21415..21438,21442..21444,21457..21459,21568..21570,
                     21694..21699,21724..21726,22021..22026,22054..22056,
                     22078..22080,22102..22104,22108..22110,22117..22119,
                     22228..22239,22291..22293)
                     /gene="S"
                     /locus_tag="CAU86_gp03"
                     /note="trimer interface [polypeptide binding]; other site"
                     /db_xref="CDD:394952"
     misc_feature    22345..22983
                     /gene="S"
                     /locus_tag="CAU86_gp03"
                     /note="receptor-binding domain of the S1 subunit of the
                     Spike (S) protein from Middle East respiratory syndrome
                     coronavirus; Region: MERS-like_CoV_Spike_S1_RBD; cd21479"
                     /db_xref="CDD:394826"
     misc_feature    order(22444..22446,22507..22518,22522..22533,22537..22539,
                     22549..22551,22561..22563,22567..22569,22660..22662,
                     22669..22671,22789..22797,22852..22854,22951..22956,
                     22960..22962)
                     /gene="S"
                     /locus_tag="CAU86_gp03"
                     /note="trimer interface [polypeptide binding]; other site"
                     /db_xref="CDD:394826"
     misc_feature    order(22693..22704,22717..22761,22777..22800,22822..22854,
                     22876..22920,22924..22929)
                     /gene="S"
                     /locus_tag="CAU86_gp03"
                     /note="receptor binding motif; other site"
                     /db_xref="CDD:394826"
     misc_feature    order(22744..22746,22756..22758,22822..22842,22879..22881,
                     22885..22887,22891..22893)
                     /gene="S"
                     /locus_tag="CAU86_gp03"
                     /note="receptor binding site [polypeptide binding]; other
                     site"
                     /db_xref="CDD:394826"
     misc_feature    23050..25089
                     /gene="S"
                     /locus_tag="CAU86_gp03"
                     /note="SD-1 and SD-2 subdomains, the S1/S2 cleavage
                     region, and the S2 fusion subunit of the spike (S)
                     glycoprotein from Middle East respiratory syndrome
                     coronavirus and related betacoronaviruses in the C
                     lineage; Region: MERS-CoV-like_Spike_SD1-2_S1-S2_S2;
                     cd22379"
                     /db_xref="CDD:411966"
     misc_feature    order(23083..23085,23380..23382,23527..23529,23560..23562,
                     23815..23817,24733..24735,24880..24882,24928..24930,
                     24973..24975,25036..25038,25069..25071)
                     /gene="S"
                     /locus_tag="CAU86_gp03"
                     /note="N-linked glycosylation sites [posttranslational
                     modification]; other site"
                     /db_xref="CDD:411966"
     misc_feature    order(23434..23442,23455..23478,23482..23538)
                     /gene="S"
                     /locus_tag="CAU86_gp03"
                     /note="S1/S2 cleavage region; other site"
                     /db_xref="CDD:411966"
     misc_feature    23773..23832
                     /gene="S"
                     /locus_tag="CAU86_gp03"
                     /note="fusion peptide; other site"
                     /db_xref="CDD:411966"
     misc_feature    23869..23922
                     /gene="S"
                     /locus_tag="CAU86_gp03"
                     /note="internal fusion peptide; other site"
                     /db_xref="CDD:411966"
     misc_feature    24181..24378
                     /gene="S"
                     /locus_tag="CAU86_gp03"
                     /note="heptad repeat 1 [structural motif]; Region: heptad
                     repeat 1"
                     /db_xref="CDD:411966"
     misc_feature    24940..25065
                     /gene="S"
                     /locus_tag="CAU86_gp03"
                     /note="heptad repeat 2 [structural motif]; Region: heptad
                     repeat 2"
                     /db_xref="CDD:411966"
     misc_feature    25153..25266
                     /gene="S"
                     /locus_tag="CAU86_gp03"
                     /note="Coronavirus spike glycoprotein S2, intravirion;
                     Region: CoV_S2_C; pfam19214"
                     /db_xref="CDD:437051"
     gene            25284..25589
                     /gene="ORF3"
                     /locus_tag="CAU86_gp04"
                     /db_xref="GeneID:37627553"
     CDS             25284..25589
                     /gene="ORF3"
                     /locus_tag="CAU86_gp04"
                     /codon_start=1
                     /product="ORF3 protein"
                     /protein_id="YP_009361858.1"
                     /db_xref="GeneID:37627553"
                     /translation="MRVQRPPTLLLVVGLTLLALAYSKPLYVPEHCQNYSGRMLRACI
                     RTAQTDTVGLYTNLVIQTGTATFESAVPVDRGSPSTHADTYELNTSVTLFDVGYSVN"
     gene            25598..25927
                     /gene="ORF4a"
                     /locus_tag="CAU86_gp05"
                     /db_xref="GeneID:37627559"
     CDS             25598..25927
                     /gene="ORF4a"
                     /locus_tag="CAU86_gp05"
                     /codon_start=1
                     /product="ORF4a protein"
                     /protein_id="YP_009361859.1"
                     /db_xref="GeneID:37627559"
                     /translation="MDYVSLLNQIWQKYLNLPDTVCLYIPKPASSFKPVAGTSLHPVQ
                     WECKITFAGYTEVAVNSTKALAKQDAARRIMWLLHRDGGIPDGCSLHMRHSSIFSDVP
                     EETPFSE"
     misc_feature    25670..25822
                     /gene="ORF4a"
                     /locus_tag="CAU86_gp05"
                     /note="double-stranded RNA binding motif (DSRM)
                     superfamily; Region: DSRM_SF; cd00048"
                     /db_xref="CDD:380679"
     gene            25803..26579
                     /gene="ORF4b"
                     /locus_tag="CAU86_gp06"
                     /db_xref="GeneID:37627560"
     CDS             25803..26579
                     /gene="ORF4b"
                     /locus_tag="CAU86_gp06"
                     /codon_start=1
                     /product="ORF4b protein"
                     /protein_id="YP_009361860.1"
                     /db_xref="GeneID:37627560"
                     /translation="MQPAELCGCSIEMEEYPMDVHSTCVTPASSRMFRKRRHSPSRNL
                     RYVKRRFSSLRPEDISLVTEPTHYLRVIFHSPNTWYIRSGHDLDSVHKWLKPYGGIPV
                     NEYHITLALLSLSEQHLAMDISPIAIFLRNVRFELFDFTLLRKTLALKASEICCDNLH
                     RFQPITRVNMALPLIKEWLRVQGFPIYNSHLPLHMSVSKLHALDDNTCEYVANMSCFK
                     QYPTQMFVRPIAVELVSIRQSSNAPRCIVHSVPILHAPGF"
     misc_feature    25857..26573
                     /gene="ORF4b"
                     /locus_tag="CAU86_gp06"
                     /note="accessory protein ORF4b, also known as
                     non-structural protein 3c (NS3c) in Middle East
                     respiratory syndrome (MERS)-related CoV; Region:
                     ORF4b_MERS-CoV-like; cd21651"
                     /db_xref="CDD:394925"
     gene            26586..27263
                     /gene="ORF5"
                     /locus_tag="CAU86_gp07"
                     /db_xref="GeneID:37627562"
     CDS             26586..27263
                     /gene="ORF5"
                     /locus_tag="CAU86_gp07"
                     /codon_start=1
                     /product="ORF5 protein"
                     /protein_id="YP_009361861.1"
                     /db_xref="GeneID:37627562"
                     /translation="MAFSLALFKPISLVPAFPEAHGGEPAQFANVFTCIPTVGYIAAL
                     TVNVCILPLLLLIPQDTCRRSIFKTSILYGLFVYNFILAITLINGVYTPTGGTLVAFL
                     VVLMITWLADRVRFCLLLRSYIPLFDMRSHFIRVSTVSSYGMVPVNQTKPLFIRNFDQ
                     RCRCSRCFYVHSSHYLECTYISRFTKVSLVAVTDFSLNGITSTVFVPSTRDSVPLHII
                     APSVLSV"
     misc_feature    26592..27254
                     /gene="ORF5"
                     /locus_tag="CAU86_gp07"
                     /note="Non-structural protein ORF5 from Middle East
                     respiratory syndrome-related coronavirus and related
                     betacoronaviruses in the C lineage; Region:
                     MERS-CoV-like_ORF5; cd21645"
                     /db_xref="CDD:394928"
     gene            27342..27590
                     /gene="E"
                     /locus_tag="CAU86_gp08"
                     /db_xref="GeneID:37627561"
     CDS             27342..27590
                     /gene="E"
                     /locus_tag="CAU86_gp08"
                     /codon_start=1
                     /product="small envelope protein"
                     /protein_id="YP_009361862.1"
                     /db_xref="GeneID:37627561"
                     /translation="MLPFVQQQLGSFIVNFFIFTVACAVILLVCMAFLTATRLCVQCI
                     TGVNTLLVQPAVYMYNTGRSVYVKFQESKPPLPPDEWV"
     misc_feature    27345..27584
                     /gene="E"
                     /locus_tag="CAU86_gp08"
                     /note="Middle East respiratory syndrome-related
                     coronavirus Envelope small membrane protein and similar
                     proteins; Region: MERS-CoV-like_E; cd21533"
                     /db_xref="CDD:394860"
     misc_feature    order(27363..27365,27384..27389,27393..27398,27402..27428,
                     27432..27434,27480..27488,27510..27512,27519..27524,
                     27528..27536)
                     /gene="E"
                     /locus_tag="CAU86_gp08"
                     /note="putative homopentameric interface [polypeptide
                     binding]; other site"
                     /db_xref="CDD:394860"
     gene            27605..28264
                     /gene="M"
                     /locus_tag="CAU86_gp09"
                     /db_xref="GeneID:37627556"
     CDS             27605..28264
                     /gene="M"
                     /locus_tag="CAU86_gp09"
                     /codon_start=1
                     /product="membrane glycoprotein"
                     /protein_id="YP_009361863.1"
                     /db_xref="GeneID:37627556"
                     /translation="MSNMTQLTEQQIISIIKDWNFAWSLIFLLITIVLQYGYPSRSMT
                     VYVFKMFVLWLLWPSSMALSIFSAVYPIDLASQIISGIIAGVSALMWISYFVQSIRLF
                     MRTGSWWSFNPETNCLLNVPLGGTTVVRPLVEDSTSVTAVVANGYLKMAGMHFGACDY
                     DRLPSEVTVAKPNVLIALKMVKRQSYGTNSGVAIYHRYKAGNYRSPPITADSELALLR
                     A"
     misc_feature    27608..28258
                     /gene="M"
                     /locus_tag="CAU86_gp09"
                     /note="Membrane (or Matrix) protein from Middle East
                     respiratory syndrome-related coronavirus and related
                     betacoronaviruses in the C lineage; Region:
                     MERS-like-CoV_M; cd21567"
                     /db_xref="CDD:394853"
     gene            28311..29558
                     /gene="N"
                     /locus_tag="CAU86_gp10"
                     /db_xref="GeneID:37627557"
     CDS             28311..29558
                     /gene="N"
                     /locus_tag="CAU86_gp10"
                     /codon_start=1
                     /product="nucleocapsid phosphoprotein"
                     /protein_id="YP_009361864.1"
                     /db_xref="GeneID:37627557"
                     /translation="MATPAAPRAVSFADNNDNSNNNQSRGRGRNPKPRPAPNNTVSWY
                     TGLTQHGKVSLSFPPGQGVPLNANSTPAQNAGYWRRQDRKINTGNGTKSLAPRWYFYY
                     TGTGPEANLPFRAVKDGIIWVHEDGATDAPSTFGTRNPNNDAAIVTQFAPGTKLPKNF
                     HIEGTGGNSQSSSRASSASRNSSRSNSRGSRSGNSSRGTSPGPSGVGAVGGEMLYLDL
                     LNRLQALESGKTKQAQPKVITKKDAVAAKNKMRHKRVATKGFNMVQAFGLRGPGDLQG
                     NFGDLQLNKLGTEDPRWPQIAELAPSASAFIGMSQFKLTHQSNDTDGAPVYFLRYSGA
                     IKLDPKNPNYNKWLELIEQNVDAYKTFPKKEKKQKAPKEEPSDQMNVQPPKEQRVQGS
                     ITQRSRTPRPSVQPGPMTDVNTD"
     misc_feature    28401..29465
                     /gene="N"
                     /locus_tag="CAU86_gp10"
                     /note="Coronavirus nucleocapsid protein; Region:
                     Corona_nucleoca; pfam00937"
                     /db_xref="CDD:425955"
     misc_feature    order(28431..28448,28599..28601,28605..28607,28611..28613,
                     28722..28724,28743..28745)
                     /gene="N"
                     /locus_tag="CAU86_gp10"
                     /note="RNA binding site [nucleotide binding]; other site"
                     /db_xref="CDD:439219"
     CDS             28357..28938
                     /gene="N"
                     /locus_tag="CAU86_gp10"
                     /codon_start=1
                     /product="Orf8b"
                     /protein_id="YP_009944307.1"
                     /db_xref="GeneID:37627557"
                     /translation="MTTPIITSLEEEEETLNLDLHQITLSPGTRGLPNTGKSLFPSHL
                     DRAYLLMPILPLRKMLGIGGDRTEKLIQEMEPSHWLPGGTSTTLEPDLRPTSLSELSR
                     TESSGSMRMAPLMLLQLLGRGTLTMMLLLLRNSRPVLSFLKTSTLKGLEAIANHLQER
                     LVPAETLLDPIPEVPDLVTPPAALPQVHLESEL"
     misc_feature    28522..28833
                     /gene="N"
                     /locus_tag="CAU86_gp10"
                     /note="MERS-CoV ORF8b protein and related Merbecovirus
                     proteins; Region: merbe_CoV_ORF8b-like; cd21661"
                     /db_xref="CDD:394942"
     3'UTR           29559..29642
ORIGIN      
        1 cttgtacgtc tcggtcacaa tatacggttc catccggtgc gtggcaattc ggggcacatc
       61 atgtctttcg tggctgatgt gaccgcgcaa ggtgcgcgcg gtacgtatcg agcagcgctc
      121 aactctgaaa aacatcatga ccatgtgtct ctaactgtcc cactctgtgg ttcaggagac
      181 ctggtttcaa aactttcacc atggttcatg gatggctatg atgcctgtga agcggtgaag
      241 gtcatgttat ctaacaaaga gaagttactc tttgtgccca tccgtctggt tgggtatact
      301 aagcatctcc caggccctcg cgtttacctg gttgagaggc tcattaacgg tatttatacc
      361 gatcctttta tggttaacca agtggcttat agctctagtg caaatgctgg ccttgttggc
      421 acaactttgc agggcaagcc tattggtctc ttcttcccct ttgacgccga tcttgttact
      481 ggagatcata cctttctcct gcgcaagtat gggcgtggtg gttatcacta cactcctttt
      541 cattatgagc gtgatgccac ttcccgtcct gagtggatgg atgacctcga agcagatcca
      601 aagggcaagt atgcccagaa tttgcttaag aagttgattg gtggtgatgt caccccagtc
      661 gaccaataca tgtgtggtgt tgatggaaag cccattaacg actatgcagg tttaatggct
      721 aaggagggaa taaccaaatt ggctgacatt gaagctgatg tagcatcacg tgttgatgct
      781 gatggcttca ttgtgttgaa gaacaagttg tacagattgg tttggcatgt tgagcgtaag
      841 gacgttcagt atgccaaaca atcaatcttc actattaata gtgtggttca aagggaaggt
      901 ctccaagaca ttccccctca ctactttact cttggtggta agattgacat gcttgttcca
      961 cgtaacaagt ggaatggtgt ggctaactta cctcttaaac agaaaattct ttatacattc
     1021 tatggtaaag agtctcttga gaaccattct tacatttacc attctgcgtt cactgattgt
     1081 ggaggctgcg gtaatggttc atggcttaca gggaacgctg ttcagggttt ctcctgtggt
     1141 tgtggggcat catatttgtc taatgatgtc gaagttcaat catctggctt gataaagcca
     1201 aatgcccttt tttgtgcgac ttgtcccttt gctaaaggtg acagttgttc ttctagctgc
     1261 aaacattcaa ttgctcaatt ggttagttac ctttctgagc gttgtaatgt tattgcagat
     1321 tctaaatcct tcacgcttgt ctttggaggc gttgcttacg cttactttgg ctgtgaggaa
     1381 ggtactatgt actttgttcc tagagctaag tctgtggtgt cgaagattgg agattccatc
     1441 ttcacaggct gtacaggttc ttggaccaaa gttactcaga ttgctaacct gtttttagaa
     1501 cagacccagc gttctcttaa ttttgtggga gaattcgtgg tcaacgatgt tgtcctcgca
     1561 attctttcag gaacaacaac caatgtggac aagttacgtg agcttcttaa agggatcact
     1621 cttgagaagc tacgtgatta ccttgccgat tatgacgttg ctgtcacact cggtcctttt
     1681 atggataatg ctgttaatgt tggtggtaag ggtctgcaat acgctaccat tacagcaccc
     1741 tttttagttc tcactggttt aggtgagtcc tttaagaaag ttgcagccat accgtataag
     1801 gtttgcaaat cttttaagga gactttgtcc tattatgctg atagcatatt gtacagagtc
     1861 tttccttatg acatggattc tgatgtgtca tcttttactg agctactgtt tgactgtgtt
     1921 ggtctgtcag tggcttccac ctatttcata gttcgcctgt tgcaagataa gacaggtgac
     1981 ttcatgtcca ctatactttc atcatgccag tctgctgtac gtaagctcct tgacacttgt
     2041 cttgaagcca ctgaagcaac tctcaacttc ttgttggagc tggcaaatct tttcaagatc
     2101 tttctccgcg gagcctacgt ctatacgtca cagggctttg tggtgctcca gggcaaaatg
     2161 tcttcacttg ttaaacaagt agtggacttg ctcaataagg gtatgcaatt gttgcataca
     2221 aaggtctcct gggccggctc taaagtcagt gctgttattt acagtggccg ggaatctttg
     2281 attttcccta caggaactta ttattgtgtt agcaccaaag caaaatctgt ccaacatcag
     2341 ttcgatgtga tcttgcctgg tgattgttct aagaagcagt taggtctgct tgaacctact
     2401 gacaactcta caacggttga ggttactgta tccagtaaca cggttgaaac tgttgtaggt
     2461 caacttgaac agactaatat gcatagtcct gatgttatag taggagacta tgttattatt
     2521 agtgataaac tgtttgtgcg aagcaaggaa gaagaccgcg ttgtcttcta tcctgcttgt
     2581 actaatggta ctgctgtacc taccttgttt aaacttaaag gtggtgcacc tgttaagagg
     2641 gtagcttttg gtgatgatga gatccatgaa gttgctgctg taagaagtgt aaccgtcgag
     2701 tacaacattc atgctgtatt agacgcactg cttgcttctt ctagtcttag aacttttgtt
     2761 gtagataagt ctttgtcaat agaggaattt gttgacgtag taaaagagca agtctctgat
     2821 ttgctcgcca aattgctgcg tggaatgcca attcctgatt ttgacttaga cgattttatt
     2881 gacacaccat gttactgctt taatgctgat ggtgatgtgt cctggtcctc cactatgatc
     2941 ttctcattac accctgtgga gtgtgaagat gatagttttg agtgtgactc tgaccaagat
     3001 gatgatcaag agtctgtttg tgaaccattg gttgaggaaa ccaatgttca ggtacaagag
     3061 tctgacgatg atgggtgggc tgctgctgtt gaagaggcat tccccataga agagttagaa
     3121 gaacctcctg tccaggtcgt gcccaacgat tctgttgtta ggagtcaagt cgcacagcct
     3181 atagaaattg ttgtacagga aactcctgtg caacctcttg aggatgttgc gcctgcagtt
     3241 gcaacgccta gtattcaact tcaggaaata cagactgaag tgttagatac accccctgtg
     3301 tatgaagctg atatagagca aacacagatt gttgtttcaa aacctaagag attgcgcaaa
     3361 aagcgtaatg ttgacccttt gtttaatttt gaacataagg tcattacaga ttgtgtcacc
     3421 atggttttag gtgatgcaat tcaagtagct aagtgttatg atgaagctgt gttggttaat
     3481 gctgccaaca catatcttaa gcatggcggt ggtatcgctg gtgctattaa cgcagcgtca
     3541 aatggtgctg tacaacagga gtcagatgaa tacatcttgg ctaaagggcc actacaggta
     3601 ggagattcag tcctcctgca gggtcattct ctcgctaaaa atatcttgca tgtcgtaggt
     3661 cccgatgccc gcgctaagca ggatgtttct cttcttggta agtgctacaa ggctatgaat
     3721 gcatatcctc ttgtagtaac tccacttgtt tcagcaggca tatttggcgt acagccttct
     3781 gtgtcttttg attatcttat tagagaggtc aaaactagag tattagttgt tgttaactct
     3841 caagatattt ataaaagtct tactacagtg gaagttccgc agggtttaac tttctcctat
     3901 gatgggttgc gtggggcgct gcgtaaagcc agagattatg gttttactgt attcgtttgc
     3961 actgacaact cagccaacac taaagttctt agaaacaaag gtgttgatta tactaagaag
     4021 tccactactg tggatggcgt gcaatattat tgctacaccg ctaaagatac tcttgatagt
     4081 attgttctag aggctaataa agcttccgga attatatcta tgcctttggg atatgtatct
     4141 catggtttag acttaatgca ggcaggagcc atagtacgta gagtaaaggt accctacgtg
     4201 tgcctcctag ctaataaaga gcaagaagct attttaatgt ctgaggacgt taagttaagt
     4261 ccttcagctg attttgtgaa gcatgtccgt actaatggag gttataactc ttggcatcta
     4321 gtcgagggtg agctattagt acgtgatttg actcttaata agcttctgca ttggtctgat
     4381 caaaccatat gctataagtc tgataagttt tatgtggtaa agaacggtgt tgctttgcca
     4441 tttgaaactt tggctgcatg tcgtacctat cttgattcac gtacggcaca acagttgaca
     4501 atcgaagtgc tcgtcacagt cgatggtgtt aattttagaa ctgtggttct aaataataag
     4561 agctcctata gatctcagct tggctgcgtg ttctataatg gtgctgatat ttctgatacc
     4621 attcctgatg aaaaacagaa tggttgcagc ttgtatttag cagacaattt gactgctgat
     4681 gaaacaaagg tgcttaaaga gttatatggc cctgttgatc ctacttttct acacagattc
     4741 tattcactta aggcagtagt ccagaagtgg aagatggttg tgtgtgataa ggtacgttct
     4801 ctcaaattga gtgataataa ttgctacatt aatgtggtaa ttatgattct tgatttgttg
     4861 aaggacatta aatttgtaat acctgcttta cagcatgctt ttatgaaaca taagggcggt
     4921 gattctaccg aattcattgc tctcattatg acttatggca attgcacatt tggtgcccca
     4981 gacgatgcta ctcggttact tcacaccgtg cttgccaagg ctgagttatg ctgttcggca
     5041 cgcatggttt ggagagagtg gtgcaacgtt tgtggcataa aagatgttgt catacaaggc
     5101 cttaaggcat gttgttacgt gggtgtgcaa actgttgaag atctgcacgc acgcatgacg
     5161 tatgtatgcc agtgtggtgg tgaaaggcat cgacaattag ttgagcacac cgcaccctgg
     5221 ttgctactgt caggtacacc aaatgagaaa ttggtgacaa cctctacggc tcctgacttt
     5281 gtagcattta atgtctttca ggggttagag acggctgtag gccattatgt ccatgcccgt
     5341 ctgaaggatg gtcttatttt aaaatttgac tctggcactt taagcaagac ttccgattgg
     5401 aagtgtaagg tgacagatgt cctatttccc aatcagaagt acagtagcga ctgtaatgtc
     5461 gtgcgatact ctcttgatgg taagttcaga acagaggttg atcctgacct ttctgctttc
     5521 tatgtcaagg atggtaaata ttttacaagt gagccacccg tgacttattc acctgctact
     5581 gttttagcag gtagtgttta tactaatagt tgccttgtat cgtctgatgg acaacctggc
     5641 ggtgatgcta ttagtttaag ttttaataac cttttagggt ttgattctag taaaccagtc
     5701 acaaagaagt acacatattc cattcttcct aaggaggatg gagatgtttt gttggctgag
     5761 tttagtactt atgaccctat ttacaagaac ggcgctatgc ttaaaggcaa acctgttctt
     5821 tgggtcacca atgcatctta tgatgcaact cttaataagt tcaatagggc tactttacgt
     5881 caaatatatg acgtagcacc cattgaaatt gaaaataaat acactccttt gagtgtagaa
     5941 ccttcaccag ttgaaaaagt ttctactgtt gaagttgctt tagctaagcc agaactgaca
     6001 attgtcaaat gcaagggttt gatcaaacca tttgtaaaag ccaatgtaag ttttgtttct
     6061 gacgagacag gtcttcctgt tgtcgaatat ctgtctaagg aagatttaca tactttgtat
     6121 gtcgatccta agtaccaagt cattgtctta aaggacaatg cactttctac tattttcaga
     6181 ttgcacactg ttgaatctgg tgatttaaac gttgttgcag cttcaggttc tttaactcgt
     6241 aaggttaagc tactttttag agcttccttt tactttaagg aacttgcttc ccgcactctc
     6301 actgctacca ctgttgtagg tagttgtatt aacagtgttg tgcggcattt aggtgttact
     6361 aaaggtatct tggcaagtct ttttagcttt gttaagatgc tatttgtgct tccactatct
     6421 tattttagtg attcagaaac tagcaccact gaggtcaaag tcagtgcttt aaaaacagca
     6481 ggcgttgtga cagggaatgt tttaaaacaa tgttgcaccg cagccgttga tttaagtatg
     6541 gataagttac gtcgtgtgga ttggaaggca accttacgac tgttacttat gttgtgtaca
     6601 actatggtat tgttgtcatc tgtgtatcac ttgtatgtgt ttaatcaagt actatcaagt
     6661 gatgttatgt ttgaggatgc ccaaggtttg aaaaagttct acaaagaagt tagagcttac
     6721 ctaggtgtgt catcaggttg tgatggtctt gctgcagctt atagagctaa ttcttttgat
     6781 gtacctacat tctgcgcaaa tcgttctgtg atgtgtaact ggtgtttgat aaaccaagat
     6841 tccataacgc actacccagc tcttaagatg gttcaaacac atcttagcca ctatgtttta
     6901 aacatagatt ggttgtggtt tgcacttgag gttggtttag catacatact ctatacctcg
     6961 gccttcaatt ggttattgtt ggcaggtaca ttgcagtatt tctttgcaca gacttctata
     7021 tttgtggact ggcggtcata caattatgtt gtctctagtg ctttttggtt gttcacccac
     7081 attcctatgc cgggtctagt cagaatctat aatttgttgg catgcctctg gcttttacgc
     7141 aaattctatc agcatgttat taacggttgt aaggacacgg catgtctgct ttgttataag
     7201 aggaatcgac ttactagagt tgaagcttct actgtcgtct gtggtggaaa acgtacgttt
     7261 tacattgcag caaatggcgg tatttcattc tgtcgtaggc ataattggaa ttgtgttgat
     7321 tgtgacactg caggtgtagg gaataccttc atctgtgaag aagtcgcaag tgatctcact
     7381 accaccctac gcaggcctgt taactccacg gatagatcac attattatgt ggattccgtg
     7441 ttagttaaag agactgttgt gcagtttaat tatcgtagag acggtcaatc atgctatgag
     7501 cggtttcctc tctgcgcttt cacaaattta gataagttga agttcaaaga ggtttgtaaa
     7561 actaccactg gtatacctga atacaacttt atcatttatg actcatcaga tcgtggccag
     7621 gaaagtttag ctaggtctgc gtgtgtttat tactctcaag tcttgtgtaa atcaattctt
     7681 ttggttgatt caagtcttgt gacgtctgtt ggtaattctg gtgaaattgc catcaaaatg
     7741 tttgattcct ttgttaatag tttcgtctcg ttgtataacg tcacccgcga caagttggaa
     7801 aaacttattt caacagctcg cgatggtgtt aaacgcggcg acaacttcca tagtgtctta
     7861 acaacattca ttgatgctgc acgcggcccc gctggtgtgg agtctgatgt tgaaactaat
     7921 gaaattgttg attctgtgca gtatgctcat aaacatgaca tacaacttac taatgagagt
     7981 tacaataatt atgtaccttc atatgtaaag cctgatagtg tttctaccgg tgatttaggt
     8041 agtctcatag attgtaatgc agcttcagtt aaccagacaa gcatgcgcca agctaatggc
     8101 gcatgcatct ggaatgctgc tgcatatatg aaactctcgg atgttcttaa gcgacagatt
     8161 cgcattgcat gccgtaagtg taatttagct tttcgtctta cgacttctaa gctacgtgct
     8221 aatgacaata tgttatctgt taaattcact gccactaaga ttgttggtgg tgctcctaca
     8281 tggtttaata cattgcgtga ctttacgttg aagagttacg tttttgttac cattatagtt
     8341 tttctgtgtg ctgttcttat gtacttttgt ttacctacat ttgctatggc accagttgag
     8401 ttttatgaag accgcatcct agaatataag gttctagata atggtatcat tagggatatt
     8461 agtcccgatg ataagtgctt tgctaacaag tacaggtctt ttagtcagtg gtatcatgag
     8521 catgtgggtg gtagttatga taattccatc tcttgcccat tgactgttgc ggttatagct
     8581 ggtgtagcgg gtgcgcgcat accagatgtc cctacaactt tagcgtgggt taatagacag
     8641 attgttttct ttgtctcccg cgtttttgct aattccaata gtgtttgtta tacaccaatt
     8701 aatgagatac cttataaaag tttctctgat agtggatgca ttctaccatc tgaatgtact
     8761 atgtttaggg atgctgaagg gcgtatgtca ccttattgtt atgatcccac tgtattgcct
     8821 ggagctttcg cgtatagtca gatgaagccc catgttcgct atgacttgta tgatactaac
     8881 atgtttatta agtttcctga agtggtcttt gagagcaccc tcaggattac taagacactt
     8941 actactcagt actgcagatt tggtagctgt gagtacgcac aggaaggtgt ttgtatcact
     9001 acaaatggct cttgggctat ttttaatgac caccatctta gtaggccagg tgtctactgt
     9061 ggttctgact atgtagacat tgtcagacgt ttagcagtgt cattgttcca acccattact
     9121 tatttccaac ttacaacttc gttggtcttg ggtattggtt tgtgtgcttt tctgacactt
     9181 ttgttttatt atattaataa agtaaaacgt gcattcgcag actacactca gtgtgctatg
     9241 attgccgtta ttgctgctgt tcttaatagc ttgtgcattt gctttgtttc gtctatacct
     9301 ttatgtatag tgccttacac tgcattgtac tactatgcta cattctattt tactaatgag
     9361 cctgcagcta ttatgcatgt ttcatggtac attatgtttg gtcccatagt acctatgtgg
     9421 ttgacctgtg tttatacagt tgcaatgtgc tttagacact tcttctgggt tgttgcttat
     9481 ttcagtaaga aacacgtcga ggtttttact gatggtaagc ttaattgtag tttccaagat
     9541 gcagcctcta atatttttgt tgttaacaag gatacttatg ctgctttaag aaatgctata
     9601 actaatgatg tgtactcgcg gtatcttggc ttgtttaata agtacaagta ttattctggt
     9661 gctatggaaa ctgccgctta ccgtgaagct gctgcatgtc atctcgctaa ggccttgcaa
     9721 acatacagtg aaactggtag tgacttgttg taccaacctc ctaactgtag catcacctct
     9781 ggtgtgttgc agagtggttt ggtcagaatg tcgcatccca gtggtgatgt tgaggcttgt
     9841 atggttcaag ttacctgtgg tagcatgact cttaatggcc tctggcttga taacactgtc
     9901 tggtgtccac gtcatgttat gtgcccagca gaccagttgg ctgatcctaa ttatgatgct
     9961 ctgcttgttt ccatgactaa tcatagtttt agtgtcaata aacatatagg tgctccggca
    10021 aatctgcgtg ttattggcca tgctatgcag ggtactcttc tgaagttgac tgtcgatgtt
    10081 gctaatccta gcactccagc ctacacgttt actacagtga aacctggtgc ttcatttagc
    10141 gtgctagcat gctataatgg acggccaacc ggtactttta ctgttgttat gcgccctaac
    10201 tacacaatta aaggttcttt cttgtgtggt tcttgtggta gtgttggtta cacaaaagaa
    10261 ggtagtgtga ttaacttctg ttatatgcat caaatggagt tagctaatgg tacacatacc
    10321 ggctctgcat ttgatggtac tatgtatggt gcattcttag ataagcaggt gcatcaggta
    10381 caattaacag acaaatactg cagtactaat gtggtagctt ggttgtatgc agcaatactt
    10441 aatgggtgcg cgtggtttgt aaaatccaat cgcactagta ttgtttcatt taatgaatgg
    10501 gctcttgcca accagtttac agagtttgtt ggcacacaat ccattgatat gttagctgtt
    10561 aaaacaggcg ttgccattga acagctcctt tatgctatcc aacaattgca tactggattc
    10621 cagggcaaac aaatccttgg cagttcaatg ttggaagatg agttcacacc cgaagatgtt
    10681 aatatgcaaa ttatgggtgt tgttatgcag agtggtgtaa gaaaggttac gtatggtact
    10741 gcgcattggt tgtttgcaac ctttgtgctt tcctatgttg tgttcttaca aaccactaaa
    10801 tttacattgt ggaactattt gtttgagacc atacccactc aattgttccc cctcttattt
    10861 gtgactgttg catgtgttat gttattggtt aaacataaac acaccttttt aacactcttt
    10921 ttgttgcctg tagccatttg tttgacttat gcaaacattg tctatgagcc tgctactccc
    10981 atttcttcag cgttgatagc tgtggctaat tggttagccc ctactaatgt ttatatgcgc
    11041 actacacata ctgatattgg tgtctacatt agtttgtcac ttgtattagc tatagtagtg
    11101 aaacgcttgt acaacccttc actatctaac tttgctcttg cattgtgtag tggtgtgatg
    11161 tggttgtata cttatagcgt tggcgaagtt tctagcccca ttgcctatct tgtctttgtt
    11221 actacactta ctagtgatta tacgattact gtctttgtga ctgttaatct tgcaaaaatt
    11281 tgcacttata ttatctttgc ttatgcacca cagcttacgc ttgtgttccc agaagtgaag
    11341 atgattctct tattatacac atgctttggt tttatgtgta catgctattt tggtgtcttc
    11401 tctttgttga accttaagtt gcgtgcgcca atgggtgttt acgactttaa ggtctccact
    11461 caggagttca ggttcatgac ggctaataat ttaacagctc cgaggaattc ttgggaagcc
    11521 atgtctctga actttaagct actaggtatt ggcggtacac cctgtataaa ggttgctgcc
    11581 atacaatcta aacttactga tcttaagtgc acttcagtgg tgttgctttc agttttgcaa
    11641 caattgcatc ttgaagctaa tagtaaggct tgggcttttt gtgtcaagtg ccataatgac
    11701 atattggctg ccacagaccc tagtgaggct tttgaaaaat tcgttagtct ctttgccact
    11761 cttatgactt tctctggtaa cgtagatctt gatgcactag ctagtgatat ctttgaaaca
    11821 cctagtgttc ttcaagctac tttgtctgaa ttctctcact tggcaacttt tgctgagtta
    11881 gaggctgcac agagagccta tcaggaagcc atggactctg gtgatgcatc accccaagtc
    11941 cttaaagctt tacagaaggc agttaacgtt gctaagaatg cctatgagaa agataaggct
    12001 gtagcacgta agttagaacg tatggccgag caagctatga cgtctatgta taagcaagca
    12061 cgtgctgaag acaagaaagc taaaattgtt agtgctatgc aaactatgct ttttggtatg
    12121 attaagaagc tcgacaatga tgttcttaat ggtatcattt ctaatgctag gaatggatgt
    12181 atacctctta gtgttgtacc actttgtgct tcaaacaaac ttcgtgtagt aattccggac
    12241 tttaccgtct ggaatcaagt tgtcacatat ccctcgctta attatgctgg ggctttgtgg
    12301 gacattgcag ttataaacaa tgtggataat gaaattgtta agtcttcgga tgttgttgaa
    12361 aacaacgaaa gcttgacatg gccacttgtc ttagaatgca ctagagcagc ttcctctgct
    12421 attaagttgc aaaataatga gattaaacct tctggtctta gaactatggt tgtttcagct
    12481 ggtcaagaac agaccaactg taatacaagc tctttagcat attacgaacc tgttcagggt
    12541 cgcaagatgt taatggcact tctttcagac aatgcctacc ttaagtgggc acgtgttgaa
    12601 ggacaggaag gttttgtaag tgttgaactg caacctcctt gtaaattttt gattgcggga
    12661 cctaagggac ctgaaatccg atacctctat tttgtcaaaa atcttaacaa ccttcatcgt
    12721 ggtcaggtgc ttggacacat tgctgccact gttagattac aagccggttc caataccgag
    12781 tttgcagcta attcttcagt gttgtcactt gttaatttca ctgttgatcc tcaaaaagct
    12841 tacatcgact ttgttaatgc tggcggtgcc ccattgacaa attgtgttaa gatgcttact
    12901 cctaaaactg gtacaggtat tgctatatct gttaagccag agagtacagc tgaccaagaa
    12961 acttatggtg gtgcgtctgt gtgtctgtat tgccgtgcgc atatagagca cccagacgta
    13021 tctggtgttt gtaaatataa gggtaagttc gtccagattc catcgcagtg tactcgtgac
    13081 cctgttggtt tttgtttaac gaataccccc tgcaatgtct gtcaatattg gattggctat
    13141 gggtgcaatt gtgactcgct tagacaagca gcactgcccc agtccaagga ttctaatttt
    13201 ttaaacgagt ccggggttct attgtaaatg cccgaataga accctgtgca agtggtttgt
    13261 ccactgatgt cgtttttagg gcatttgaca tctgcaacta taaggctaag gttgctggta
    13321 ttggaaaata ctacaagact aatacttgta ggtttgttga attagatgat caaggtcatc
    13381 atttagactc ctattttgtc gtcaagagac atactatgga gaattacgag ctagagaagc
    13441 actgttacga tttgttacgt gactgtgact ctgtggcacc tcatgatttc ttcgtctttg
    13501 acgtcgacaa aactaaaact cctcatattg tgcgtcagcg tttaactgag tacactatga
    13561 tggatcttgt ttatgcattg aggcactttg atcaaaataa ttgtgaagtg cttaaagcta
    13621 ttttagtaaa gtatgattgt tgtgatgcta catactttga aaataaactc tggtttgatt
    13681 ttgttgaaaa tcccagtgtt attggtgttt atcacaaact tggagaacgg gttcgccaag
    13741 ctgtgttaag cactgttaaa ttctgtgacc acatggtaaa ggccggttta gtcggtgttt
    13801 taacactcga caatcaggac cttaatggta agtggtatga ttttggtgat tttgtaatca
    13861 cacaacccgg ttcaggagtg gctatagttg atagctacta ttcttattta atgcctgtgc
    13921 tctctatgac caattgtttg gcagctgaga ctcacaggga ttgtgatttt aataagcctc
    13981 tcattgagtg gccacttact gagtatgatt ttactgatta caaagtacag ctctttgaga
    14041 agtactttaa gtactgggat cagacgtacc atgctaattg cgtgaattgt actgatgatc
    14101 gttgtgtgtt acattgtgct aatttcaatg tattatttgc tatgaccatg cctaagacat
    14161 gctttggacc tattgtccgg aagatatttg ttgatggtgt gccatttgta gtatcttgtg
    14221 gttatcacta caaggaatta ggtttagtca tgaatatgga tgttagtctt cataggcata
    14281 gactttctct taaggagcta atgatgtatg cagcagatcc tgctatgcac attgcctctt
    14341 caaacgcatt tcttgatttg aggacatcat gttttagtgt cgcagcctta accacaggtc
    14401 tgacctttca gaccgtgcgg cccggcaatt ttaaccagga tttctatgat ttcgtggtct
    14461 ccaaaggatt ctttaaagag ggttcttctg ttactctcaa acatttcttc tttgcccaag
    14521 atggcaatgc tgctattaca gattataatt attactctta taatctgccc actatgtgtg
    14581 acatcaagca aatgttgttt tgcatggagg ttgtaaacaa gtacttcgag atctatgacg
    14641 gtggttgtct taatgcctct gaagtggttg ttaataatct agacaaaagt gctggccatc
    14701 cttttaataa gtttggaaag gctcgtgtct attatgagag catgtcttat caggaacaag
    14761 atgaactctt tgccatgaca aagcgtaacg tcattcctac catgactcaa atgaatttaa
    14821 agtatgctat tagtgccaag aatagagctc gcactgttgc aggcgtctct atacttagca
    14881 ccatgactaa tcgccagtac catcagaaaa tgcttaagtc catggctgca actcgtggtt
    14941 cgacttgcgt cattggtact actaagttct atggtggctg ggatttcatg cttaaaacat
    15001 tgtataaaga tgttgataat ccacatctta tgggttggga ttaccctaag tgtgatagag
    15061 ctatgcccaa tatgtgtaga atttttgctt cactcatatt ggctcgtaag catggaactt
    15121 gttgtactac aagggacagg ttttaccgct tagctaatga gtgtgctcaa gtgctaagtg
    15181 aatatgttct gtgcggtggt ggttactacg ttaaacctgg tggtaccagt agcggtgatg
    15241 ccacaactgc atacgccaat agtgtgttta atattttgca ggccactact gcgaatgtta
    15301 gtgcacttat gggtgctaat ggcaacaaaa ttgttgacaa agaagttaaa gacatgcagt
    15361 ttgaactgta tgtcaatgtt tacaggagta ccaatcctga tcccaaattt gtagataggt
    15421 attatgcttt tcttaacaag cacttttcta tgatgatatt atctgatgat ggtgttgtct
    15481 gctataatag tgactatgca gccaaaggtt acattgctgg tatacagaat tttaaggaaa
    15541 cgctgtatta ccagaacaat gtctttatgt ctgaagccaa atgctgggtg gaaaccgatc
    15601 tgaagaaagg gccacatgaa ttttgttcac agcatacgct ttatattaag gatggtgacg
    15661 atggttactt cctgccttat ccagacccct ctaggatctt gtctgccggt tgctttgtag
    15721 atgatatcgt caagactgac ggtactctca tggttgagcg gtttgtgtca ttagctatag
    15781 acgcataccc tctcacaaag catgaagata tagaatacca aaatgtattc tgggtttatt
    15841 tacagtatat tgaaaagctg tataaagacc ttactggcca catgcttgac agttattctg
    15901 ttatgttatg tggtgataat tctgctaagt tttgggagga ggcattttat agagaactct
    15961 atagctctcc taccaccttg caggctgttg gttcgtgtgt tgtatgccat tcgcagacat
    16021 ccctgcgctg tggtacatgc atacgcagac cctttctttg ttgtaagtgc tgctatgatc
    16081 acgttatagc aactcctcat aaaatggttc tgtctgtttc accttacgtc tgtaacgcac
    16141 ctgggtgtga tgttgctgac gttactaaac tatatttagg tggtatgagc tacttctgtg
    16201 tagatcatag gcctgtttgt agttttcctc tttgcactaa tggtcttgta tttggattat
    16261 acaagaatat gtgcacaggt agtccctcta tagttgaatt caatagactg gctacatgtg
    16321 actggactga aagtggtgac tatacacttg ctaatactac tacagaacca cttaaattgt
    16381 ttgctgctga aaccctacgt gccactgaag aggcatctaa gcagtcttat gcaattgcca
    16441 ctattaaaga aatagttggt gatagacaat tactacttgt gtgggaggct ggtaaatcca
    16501 aaccaccact caatcgtaat tatgttttta ctggttacca tataaccaaa aatagtaagg
    16561 tgcagctcgg tgagtatatc ttcgagcgca ttgattacag tgatgctgta tcctacaagt
    16621 ccagtacaac gtataaactg actgtaggtg acatctttat acttacctct cattcggtgg
    16681 ctaccttgac ggcacccaca attgtgaatc aagagaggta tgttaaaata actggattat
    16741 atccaactat tactgttccc gaggagtttg caagccatgt tgccaacttc caaaaagcag
    16801 gatatagtaa gtatgtcact gtccagggac cacctggcac tggcaagagt cattttgcta
    16861 tagggttagc gatttactac cctacagcac gtgttgttta tacagcttgc tcacatgctg
    16921 ctgttgatgc attatgtgag aaagctttta aatatttgaa cattgctaaa tgttcccgta
    16981 ttattcctgc caaggcacgt gttgagtgtt atgacaggtt taaggttaat gaaacaaatt
    17041 ctcaatattt gtttagtact attaatgctt taccagaaac ttctgctgat attctggttg
    17101 ttgatgaagt tagcatgtgc actaattatg atctttctat cattaatgca cgtgttaaag
    17161 ctaagcacat tgtctatgta ggtgatccag cacagttgcc agcacctagg actctgctta
    17221 ctagaggcac attggaacct gaaaatttca atagtgtcac taggttgatg tgtaacttag
    17281 gacctgacat atttttgagt atgtgctaca ggtgtcctaa ggagattgtt agtactgtca
    17341 gtgctcttgt ctacaataat aaattgttag ccaagaagga actatcagga cagtgcttta
    17401 aaatgctcta taagggcaat gttacgcatg atgctagctc tgccattaat agaccacaac
    17461 ttgcatttgt caagaacttt ataactgcta acccagcctg gagtaaggca gtctttattt
    17521 caccttataa ttcacagaat gctgtggctc gctctatgtt gggccttaca acccaaactg
    17581 ttgattcttc acagggttca gaataccaat atgtcatttt ctgtcaaaca gcagatactg
    17641 cgcatgctaa caacattaat agatttaatg ttgccattac acgcgcacag aaaggtattc
    17701 tttgtgttat gacatctcaa gcactctttg attctctgga gtttactgag ttgtctttta
    17761 ctaattataa acttcagtct cagattgtca ctggactttt taaagattgt tccagagaaa
    17821 cctctggcct ttcacctgct tatgcaccaa catatgttag tgttgatgat aagtataaaa
    17881 cgtgtgatga gctttgcgtg aacctcaatt tacccgcaaa cgttccatat tcacgtgtta
    17941 tttccaggat gggcttcaag cttgatgcta gtgtccctgg ttatcctaaa ctcttcatta
    18001 ctcgtgaaga ggctgttagg caggttcgaa gctggatagg cttcgacgtg gaaggtgctc
    18061 atgcatcacg taatgcatgc ggtaccaatg tgcctttaca attaggattc tctaccggcg
    18121 tgaactttgt tgttcagcct gtaggtgttg tagatactga gtgggggaat atgcttactg
    18181 gcatttctgc ccgtcctcca ccaggtgaac agtttaaaca cttagttccc cttatgcata
    18241 agggagctgc gtggcctatt gttagacgac gtatagttca aatgttgtca gacactttag
    18301 acaaattgtc tgactactgt acgtttgttt gttgggctca tggctttgaa ttgacatctg
    18361 catcttattt ttgtaagata ggtaaagagc agaagtgttg tatgtgtaat agacgcgctg
    18421 cagcgtactc ttcacctctg caatcttatg cctgctggtc tcattcctgc ggttatgatt
    18481 atgtctacaa ccccttcttt gttgatgttc aacagtgggg ttatgtaggc aatcttgcta
    18541 ctaatcacga tcgttattgc tcggtgcatc agggtgctca tgtagcctct aacgatgcaa
    18601 taatgactcg ttgtttagct attcatgctt gttttattga acatgtagat tgggatattg
    18661 agtatcctta tatctcacat gagaaaaagt tgaattcctg ctgccgaatt gttgaaagaa
    18721 atgttgtacg tgctgctttg ttggcaggct cctttgatag agtgtacgac ataggcaacc
    18781 ctaaaggaat tcctattgtt gatcaccctg tggttgaatg gcattatttt gatgcacagc
    18841 ccttgactag gaaagtacaa cagcttttct atactgagga tttagcctca agatttgctg
    18901 atgggctctg cttgttttgg aattgtaatg ttccaaaata tcctaataat gcaattgttt
    18961 gtaggtttga tactcgtgtg cactcagagt tcaatttgcc aggttgtgat ggtggtagtt
    19021 tgtatgttaa caagcacgcc tttcacacac cagcatacga tgtaagtgca tttcgtgatc
    19081 tgaaaccttt acctttcttc tattattcta ctactccatg tgaagttcat ggtactggta
    19141 gtatgttaga agatatagat tatgtacctc ttaagtctgc agtgtgtgtt actgcctgca
    19201 atctaggagg tgctgtttgt aggaaacatg ctacggagta cagagattat atggaagcat
    19261 ataaccttgt ctctgcatca ggtttccgtt tatggtgtta taagaccttt gacatttaca
    19321 acctttggtc tacttttact aaagttcaag gtttagaaaa cattgcttat aatgttgtta
    19381 aacaaggcca ctttactggt gtagatggag agctacctgt agctgtagtc aatgataaaa
    19441 tcttcaccaa gagtggcgtt aatgacatat gtgtgtttga gaataaaacc actttgccta
    19501 caaatgtagc ttttgaactg tatgctaagc gtgtggtgcg ctcacatcca gactttaagt
    19561 tactccataa tttacaagct gacatttgtt acaagttcgt cctttgggat tatgaacgtt
    19621 gtaacatcta tggtacagct actattggtg tatgtaagta cactgatata gaagtcaatt
    19681 cagccttgaa tatatgtttt gacattcgtg ataatggttc attggaaaag tttatgacta
    19741 cacccaatgc cattctcatt tcagatagaa aaatcaagaa ctacccgtgt atggtaggtc
    19801 ctgattatgc ttacttcaat ggtgctatta tcagagacag tgatactgta aagcagccag
    19861 tgaaatttta tttttataag aaagttaata atgagtttgt cgagttttct gactgtgctt
    19921 acacacaggg tcgctcttgt agtgactttg aggctatgtc agttatggag acagactttc
    19981 ttgctcttga tagtgatgtt ttcataaaga agtatggttt ggaaaactat gcctttgaac
    20041 atgtggtata tggtgatttt tcacatacta cattgggtgg ccttcatctg cttattgggt
    20101 tgtacaagaa gcatttggat ggtcatatta ttatggaaga aatgatcaga gaaagttcaa
    20161 ctatccataa ctatttcatt actgagacta gcacagcgtc ttttaaggcg gtttgctcag
    20221 tcattgattt aaagcttgac gactttgtac agattttaaa gagtcaagac cttggcgttg
    20281 tatccaaggt agtcaaggtt cctatagacc taactatgat tgaattcatg ttatggtgta
    20341 aagatggcca ggtacagaca ttctatcctc ggctccaagc atctgctgat tggaaaccgg
    20401 gccaggccat gccttcatta tttaaagttc aaaatgtgaa ccttgaacgc tgtgagcttg
    20461 ctaattacaa gcaatctatt cctatgcctc gcggtgtgca catgaacatc gccaaatata
    20521 tgcaattgtg ccagtattta aatacttgca caatagctgt tccagccaat atgagagtta
    20581 ttcattttgg tgctggttca gataaaggta tcgcacctgg tacctctgtt ttaagacaat
    20641 ggctcccgac tgatgccatt ataattgata atgatctaaa tgattttgtg tcagatgctg
    20701 acatatcttt atttggagac tgcgtaactg tacgtgttgg acaacaagtc gatcttgtta
    20761 tatctgacat gtatgatcct agtactaaga atattacagg tagtaatgag tctaaggctc
    20821 tattctttac ctacctgtgt aattttatta ataataatct tgctcttggt ggttctgttg
    20881 ccattaaaat aacagaacac tcatggagcg ttgatcttta tgaaataatg ggaaaatttg
    20941 cttggtggac agttttctgt accaatgcaa atgcatcctc ttctgaagga ttcctgttag
    21001 gtattaatta cttgggtact attaaagaaa atatagatgg cggtgctatg cacgcaaatt
    21061 atatattttg gagaaactcc aaccctatga atctgagtac ttactcactt tttgatttat
    21121 ccaagtttca attaaaatta aaaggaacgc cagttcttca attaaaggag agtcagatta
    21181 acgaactagt tatatctctc ctgtcgcagg gtaaattact tatccgcgac aatgacgtac
    21241 tcagtgtctc tactgatgtg cttgttaact tttatagggg caaacgctaa aattgtgtcg
    21301 atacctggtg gtgtcggtac tggagcttgt ccccaggttg atatgcagcc cagttatttt
    21361 ataaagcata actggcctga acctattgac atgaataagg cagacggtgt catctaccca
    21421 aatggccgca cttattctaa catcacatta cagaccacta atctgtttcc tcgtaatggt
    21481 gatttaggca ctcagtatgt ctattcagca tctaatgaga aaagccgcac tagcaatgtg
    21541 gcttttatta gtaattattc atactatggc aatccctttg gcgacggcat tgtcatacgt
    21601 ataggtcaaa attctaataa gactggtagt gtcattgtgg gcacagcaca gactactatt
    21661 aaaaagatct acccagctct tatgcttggt agttcttttg gcaatttctc tgttaataat
    21721 aagtcaggtg cttattttaa tcacaccctt cttatcttac ctagcaagtg tggcactgta
    21781 tttcaggtgg catactgcct tctacaacca aggactgact cttattgtcc cggtaatgct
    21841 aactatgtta gctatgcact cattgattct cctacggatt gtacatctgc ggatgaatct
    21901 aaacgtagga atggtcttga ggacattaaa aagtacttca atttggtcaa ttgtacctat
    21961 tttgaagagt ttaatgtcac agctgacgag cgtgcagagt ggtttggcat cacccaagat
    22021 tctcagggtg tgcacctcta cacctctcgt aaaaatggtt tcaattcaaa taatctcttt
    22081 ttatttgctt ctgtgcccat ttatgataag atcaattact acactgtaat tcctcgctca
    22141 ataataactc ctgccaacca gcgttctgct tgggcagcat tttatgtata tcctttacac
    22201 caacttagct acttgctaaa ttttgatgtt aatggttata ttacgcaagc agcagactgt
    22261 ggttacaatg attataccca gcttgtctgc tcgtatggtg attttaatat gaaatctggt
    22321 gtttactcta cttcatatta ttcagccaaa cctgtaggtg cgtactatga agctcatgtt
    22381 tacccagatt gcaattttac tgatttgttc cgggaaaatg ctcccacaat catgcaatat
    22441 aagcgtcaag tttttacgcg ttgtaattac aacctcacac tactgctctc tcttgtgcag
    22501 gtggatgagt ttgtctgcga taagattacc cctgaggctc ttgcaacagg gtgttattcg
    22561 tctcttaccg tcgattggtt tgcatttccg tatgcttgga agtcatacct agctataggt
    22621 tcagcagatc gcattgtgcg gtttaattat aaccaggatt atagcaatcc ctcttgtaga
    22681 attcactcca aggtgaattc ttcagttggc atttcttact ctggtttata tagttatatt
    22741 actaattgta attacggcgg cttcaacaag gatgacgttg ttaagcctgg tggtcgtgcc
    22801 agtcaaccct gtgttactgg cgcactcaat tcacctacta acggtcaagt ctggtctttt
    22861 aattttggtg gcgtccctta cagaacctcc cgcctcacct acactgacca tcttaaaaac
    22921 cctctagata tggtttatgt catcactgtt aagtatgaac caggcgctga aactgtatgt
    22981 cccaaacaag tgcgtcctga ttatagtacc aatattactg gcttattagg ctcttgtatc
    23041 agttatgaca tttatggtat aactggtact ggtgttttcc agctgtgtaa tgcaactgga
    23101 attcctcaac aaaagtttgt ctatgacaaa tttgataata taattggctt tcactctgat
    23161 gacggcaatt attattgtgt ggcaccttgt gtcagtgtgc ctgtttctgt tatatatgat
    23221 gacaacacta atcaatacgc cacattgttt ggcagtgttg cttgtcaaca tatatctaca
    23281 atggctgctc agttttctcg tgaaactcgt gcttccctcg tttcaagaaa tatgcagaat
    23341 cttctacaga cttctgtcgg ttgtgtcatg ggtttccatg aaactaatga caccgttgaa
    23401 gactgcaatc tttctttggg acaatcactc tgcgcaatcc cacctaacac caacttgagg
    23461 gttggtcgct ccacctttgg attaggttct ttagcctaca acagtccatt gcgtgttgat
    23521 gcacttaact cctctgagtt taaggtctcc ttgcctctca attttacatt tggtgttact
    23581 caggaatata ttgaaactag catacagaag attacagttg actgtaaaca gtacgtgtgc
    23641 aatggttttg ctaagtgtga aaagctgctc gaacaatacg gtcagttttg ctctaaaatt
    23701 aaccaggctc tccatggcgc gaatcttcgc caggatgact ttgttcgtaa tctgtttgag
    23761 agtgttaaaa caccacagac agttcctctt actacaggtt ttggagggga gtttaatctt
    23821 actcttctag agccgctttc tgtttctaca ggttcttcta atgcgcgtag tgctttggaa
    23881 gagcttttgt ttgacaaagt cactatagct gatcctggct acatgcaagg ttatgatgac
    23941 tgcatgcaac agggccctgc ctcagctcgt gatcttatct gtgctcaata tgttgctggc
    24001 tacaaagtgt taccacccct tatggatgtt aacatggaag ctgcatacac ctcttctttg
    24061 cttggtagca tagctggtgc tggctggact gctggtttat catcatttgc tgccattcca
    24121 tttgcacaga gtatatttta caggttaaat ggtgttggta taacacaaca ggttctatct
    24181 gagaatcaaa agattattgc taacaagttt aaccaagctc ttggtgccat gcaaactggt
    24241 ttcacaacaa ctaatgaagc ttttcagaaa gttcaagatg ctgtgaacac taatgcacag
    24301 gctctagcta agttggctag tgaactatcc aatacttttg gtgctatttc ttcttccatt
    24361 ggtgacatca ttcaacgtct tgatgtgctt gaacaggaag tccaaataga cagacttatt
    24421 aatggccgtc tgactacact caacgccttt gttgctcagc agcttgttcg ttctgaatct
    24481 gctgctcgtt ctgcacaatt ggctaaggat aaagtcaatg agtgtgttaa atcacaatcc
    24541 actagatctg gattttgtgg tcaaggcact catatagtgt cctttgttat taacgcccct
    24601 aatggcctct actttatgca tgttggttac caccctagcc aacatattga ggttgttgct
    24661 gcctatggtc tttgtgacgc ggccaacccc actaattgta tagccccagt taatggctac
    24721 tttattaaaa atcaaactac taggggtgtt gatgattggt catatacagg ttcttccttt
    24781 tatgctccag aacccatcac cactcttaat actaggtatg tcgcacctca agtgacattc
    24841 caaaacattt ctactaacct tcctcctcct ctgttgggca attccactgg aactgacttc
    24901 aaagatgagt tggatgaatt tttcaagaat gttagcacca gtataccaaa ttttggtgct
    24961 ctaacacaaa ttaatactac tttattggat ctttccgatg aaatgctagc tttacagcaa
    25021 gttgtcaaag cgcttaatga gtcatatatt gaccttaaag agctcggcaa ctatacttat
    25081 tacaacaaat ggccttggta catttggttg ggtttcattg ctggacttct agccctagcc
    25141 ctttgtgttt tctttattct ttgctgcact ggttgcggca caagttgttt aggaaaactt
    25201 aaatgtaatc gttgttgtga caaatatgaa gaatacgacc ttgagccgca taaaattcat
    25261 attcactaat taacgaactt gtgatgagag tacaacgtcc acccactctt ctcctggtcg
    25321 ttggactcac tctcttagct ttagcttatt caaaacctct ttatgtacct gaacattgtc
    25381 agaattattc tggtcgtatg cttagggctt gtattaggac tgcccagact gatactgttg
    25441 gtctttacac caatcttgtt attcagactg gcactgccac ctttgaatca gcggtacctg
    25501 ttgatcgtgg atcaccttca actcacgctg acacttatga gcttaatact agtgtgactc
    25561 tttttgacgt tggctactca gttaattaac gaactctatg gattacgtgt ctctgctcaa
    25621 ccaaatttgg cagaagtacc ttaatcttcc tgatacagtt tgtttgtaca ttcccaaacc
    25681 tgcttctagt tttaaacctg tagccggcac ttccttgcat cccgttcagt gggagtgtaa
    25741 gattacattt gctggttaca cagaggttgc agttaattct actaaagctt tagctaagca
    25801 ggatgcagcc cgcagaatta tgtggctgct ccatagagat ggaggaatac ccgatggatg
    25861 ttcactccac atgcgtcact ccagcatctt ctcggatgtt ccggaagaga cgccattctc
    25921 cgagtagaaa tcttcgctat gttaagcgta gattttcttc tctacgccct gaagatatta
    25981 gtttggtcac tgaacccact cattacctca gggttatctt tcacagccct aatacttggt
    26041 atattaggtc tggtcatgat ttagactctg tccacaaatg gttgaaaccg tatggtggta
    26101 ttcctgtgaa cgagtatcat attaccttgg ctttactgtc actttctgaa caacatttag
    26161 ccatggatat atctcccatt gcaattttcc ttcgcaatgt gcgttttgag ctctttgatt
    26221 ttactctact ccgtaaaaca cttgctctca aagcgtcaga gatttgctgt gataacttac
    26281 ataggtttca acctattaca agggttaaca tggctctccc tcttattaag gaatggttgc
    26341 gtgttcaggg tttccctatt tacaatagcc accttcctct acacatgtct gtttctaagc
    26401 tgcatgcttt agatgataat acttgtgagt atgttgctaa catgtcttgc ttcaaacagt
    26461 atcccaccca gatgtttgtg agacctatcg ctgttgaatt ggtttccata cgtcaatctt
    26521 ctaatgcacc tcgatgcata gttcattcag ttcccatatt acatgcgcca ggattttaac
    26581 gaactatggc tttttcttta gccttgttca agcctatttc tttagtgcct gcatttcctg
    26641 aagcgcatgg tggtgaacct gctcaatttg ctaatgtttt cacatgcatt cctactgtag
    26701 gctatatagc cgcacttaca gtgaatgtgt gcattttacc attactactc ctcattccac
    26761 aggacacttg caggcgtagt attttcaaaa caagcatcct ttatggtttg tttgtttata
    26821 attttatatt agccattaca ttaattaatg gtgtttacac tcccactgga ggcacattag
    26881 ttgccttcct ggtagtgctt atgattactt ggcttgctga cagagttaga ttctgtctct
    26941 tgctgcgttc ctatattcca ctttttgaca tgagatctca ttttatccgt gtcagtacag
    27001 tgtcatcata tggcatggtt ccagttaatc aaaccaagcc attatttatt agaaactttg
    27061 accagcgttg tcgctgctca cgttgttttt atgtgcattc ttctcattat ctagagtgca
    27121 cttatattag ccgttttact aaagtcagtc ttgtagcagt tacagacttt tctttaaacg
    27181 gcatcacttc tactgtattc gtgccttcaa cgcgcgattc agttcctctt cacataatcg
    27241 caccgagcgt gcttagtgta taagctcgcc tagcgcaact atgggtcccg tgtagaggct
    27301 attccattag tctctatctt tggacatttg gaaaacgaac tatgttaccc tttgtccaac
    27361 aacaattagg gtcattcata gtaaactttt tcatatttac cgtagcgtgt gctgtcatac
    27421 ttttggtgtg catggctttc cttacggcca ctcgattatg tgtgcaatgc ataacaggtg
    27481 taaacacact gttagttcag cccgcagtat acatgtataa tactggacgt tcagtctatg
    27541 taaaattcca ggagagcaaa ccccctctac ctcctgatga gtgggtttaa cgaactcctt
    27601 aattatgtca aatatgacgc agctcactga gcagcagatt attagtataa ttaaagattg
    27661 gaactttgca tggtctctga tctttctttt aattactatc gtgctacaat atggttaccc
    27721 atctcgtagc atgactgtct atgtctttaa aatgtttgtt ttatggcttt tatggccttc
    27781 atcaatggca ctctcaattt tcagtgccgt ttatccaatt gacctagctt cccagattat
    27841 ttctggcata atagctggtg tctctgcgct catgtggatt tcctacttcg tgcagagcat
    27901 tagactcttt atgcgaacag gctcttggtg gtcattcaat cctgaaacta attgcctttt
    27961 gaacgttcca cttggtggca caactgtagt gcgtccacta gtcgaagatt ccactagtgt
    28021 tactgctgtt gttgccaatg ggtacctcaa gatggctggt atgcactttg gtgcgtgtga
    28081 ctacgatcga ctccctagtg aggtgaccgt ggctaaaccc aacgtgctta tcgcattgaa
    28141 aatggtaaag agacaaagct atggaactaa ttctggtgtt gctatttacc atagatataa
    28201 ggcaggtaat tacagaagtc cgcccataac ggcggatagt gaacttgcat tgctacgagc
    28261 ataagctcct aagtaagagt tgattttaac gaatcttaat tttcattgtt atggccactc
    28321 ccgctgcacc tcgtgccgtg tcttttgccg ataacaatga caactccaat aataaccagt
    28381 ctcgaggaag aggaagaaac cctaaacctc gacctgcacc aaataacact gtctcctggt
    28441 acacggggct tacccaacac gggaaagtct ctctttcctt cccacctgga cagggcgtac
    28501 ctcttaatgc caattctacc cctgcgcaaa atgctgggta ttggcggaga caggacagaa
    28561 aaattaatac aggaaatgga accaagtcac tggctcccag gtggtacttc tactacactg
    28621 gaaccggacc tgaggccaac ctccctttcc gagctgtcaa ggacggaatc atctgggtcc
    28681 atgaggatgg cgccactgat gctccttcaa cttttgggac gcggaaccct aacaatgatg
    28741 ctgctattgt tacgcaattc gcgcccggta ctaagcttcc taaaaacttc cacattgaag
    28801 ggactggagg caatagccaa tcatcttcaa gagcgtctag tgccagcaga aactcttcta
    28861 gatccaattc ccgaggttcc agatctggta actcctcccg cggcacttcc ccaggtccat
    28921 ctggagtcgg agctgtaggt ggagaaatgc tgtacctcga tttgcttaac agattacagg
    28981 ctctggaatc tggcaaaaca aagcaagcac agcctaaagt aataactaaa aaggatgctg
    29041 ttgctgctaa aaacaagatg cgccataagc gtgtcgccac caagggtttc aacatggtgc
    29101 aagctttcgg tctgcgtggc ccaggcgacc tccagggaaa ctttggtgat ctccaactta
    29161 acaaacttgg cactgaggac cctcgctggc cccaaattgc tgagcttgcc ccatcagcca
    29221 gtgctttcat tggtatgtct caatttaaac ttacccatca gagcaatgat actgatggtg
    29281 cccctgtata ctttcttcga tacagtggtg ccataaaact tgacccaaag aaccctaact
    29341 acaataagtg gttggagctc attgagcaga atgttgatgc ctacaaaact ttccctaaaa
    29401 aggagaagaa acaaaaggca cctaaagaag aaccatctga ccagatgaat gtgcagccgc
    29461 ctaaggagca gcgtgtgcag ggtagtatta cccagcgctc ccgcactcct aggcctagtg
    29521 tgcagcctgg tcctatgact gatgttaaca ctgattagtg ttattcaaag taacaagagc
    29581 gaggcaaccg tttgtgtttg gtaaccccat ttcaccatcg tttgtccact cttgcacaga
    29641 ag
//
DBGET integrated database retrieval system