GenomeNet

Database: RefSeq
Entry: NC_004718
LinkDB: NC_004718
Original site: NC_004718 
LOCUS       NC_004718              29751 bp    RNA     linear   VRL 20-NOV-2020
DEFINITION  SARS coronavirus Tor2, complete genome.
ACCESSION   NC_004718
VERSION     NC_004718.3
DBLINK      BioProject: PRJNA485481
KEYWORDS    RefSeq.
SOURCE      SARS coronavirus Tor2
  ORGANISM  SARS coronavirus Tor2
            Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
            Nidovirales; Cornidovirineae; Coronaviridae; Orthocoronavirinae;
            Betacoronavirus; Sarbecovirus.
REFERENCE   1  (bases 1 to 29751)
  AUTHORS   He,R., Dobie,F., Ballantine,M., Leeson,A., Li,Y., Bastien,N.,
            Cutts,T., Andonov,A., Cao,J., Booth,T.F., Plummer,F.A., Tyler,S.,
            Baker,L. and Li,X.
  CONSRTM   BCCA Genome Sciences Centre, British Columbia Centre for Disease
            Control and National Microbiology Laboratory Canada
  TITLE     Analysis of multimerization of the SARS coronavirus nucleocapsid
            protein
  JOURNAL   Biochem. Biophys. Res. Commun. 316 (2), 476-483 (2004)
   PUBMED   15020242
REFERENCE   2  (bases 1 to 29751)
  CONSRTM   NCBI Genome Project
  TITLE     Direct Submission
  JOURNAL   Submitted (27-MAY-2020) National Center for Biotechnology
            Information, NIH, Bethesda, MD 20894, USA
REFERENCE   3  (bases 1 to 29751)
  CONSRTM   BCCA Genome Sciences Centre, British Columbia Centre for Disease
            Control and National Microbiology Laboratory Canada
  TITLE     Direct Submission
  JOURNAL   Submitted (30-APR-2003) Genome Sciences Centre, British Columbia
            Cancer Research Centre, 600 West 10th Avenue, Vancouver, BC V5Z
            4E6, Canada
  REMARK    Sequence update by submitter
REFERENCE   4  (bases 1 to 29751)
  CONSRTM   BCCA Genome Sciences Centre, British Columbia Centre for Disease
            Control and National Microbiology Laboratory Canada
  TITLE     Direct Submission
  JOURNAL   Submitted (23-APR-2003) Genome Sciences Centre, British Columbia
            Cancer Research Centre, 600 West 10th Avenue, Vancouver, BC V5Z
            4E6, Canada
  REMARK    Sequence update by submitter
REFERENCE   5  (bases 1 to 29751)
  CONSRTM   BCCA Genome Sciences Centre, British Columbia Centre for Disease
            Control and National Microbiology Laboratory Canada
  TITLE     Direct Submission
  JOURNAL   Submitted (13-APR-2003) Genome Sciences Centre, British Columbia
            Cancer Research Centre, 600 West 10th Avenue, Vancouver, BC V5Z
            4E6, Canada
COMMENT     REVIEWED REFSEQ: This record has been curated by NCBI staff. The
            reference sequence is identical to AY274119.
            On or before Mar 28, 2016 this sequence version replaced
            NC_028858.1, NC_028866.1, NC_028884.1, NC_028893.1, NC_028873.1,
            NC_028845.1, NC_009696.1, NC_009695.1, NC_013664.1, NC_009693.1,
            NC_009694.1, NC_004718.2.
            Annotation based on that found in PMID: 31987001, PMID: 31967327
            and the annotation of P0C6X7.1.
            COMPLETENESS: full length.
FEATURES             Location/Qualifiers
     source          1..29751
                     /organism="SARS coronavirus Tor2"
                     /mol_type="genomic RNA"
                     /isolate="Tor2"
                     /host="Homo sapiens; patient #2 with severe acute
                     respiratory syndrome (SARS)"
                     /db_xref="taxon:227984"
                     /country="Canada: Toronto"
     5'UTR           1..264
     gene            265..21485
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /db_xref="GeneID:1489680"
     CDS             join(265..13392,13392..21485)
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /ribosomal_slippage
                     /note="ORF1ab polyprotein is cleaved to yield the
                     RNA-dependent RNA polymerase and other nonstructural
                     proteins; polyprotein pp1ab; replicase 1AB"
                     /codon_start=1
                     /product="ORF1ab polyprotein"
                     /protein_id="NP_828849.7"
                     /db_xref="GeneID:1489680"
                     /translation="MESLVLGVNEKTHVQLSLPVLQVRDVLVRGFGDSVEEALSEARE
                     HLKNGTCGLVELEKGVLPQLEQPYVFIKRSDALSTNHGHKVVELVAEMDGIQYGRSGI
                     TLGVLVPHVGETPIAYRNVLLRKNGNKGAGGHSYGIDLKSYDLGDELGTDPIEDYEQN
                     WNTKHGSGALRELTRELNGGAVTRYVDNNFCGPDGYPLDCIKDFLARAGKSMCTLSEQ
                     LDYIESKRGVYCCRDHEHEIAWFTERSDKSYEHQTPFEIKSAKKFDTFKGECPKFVFP
                     LNSKVKVIQPRVEKKKTEGFMGRIRSVYPVASPQECNNMHLSTLMKCNHCDEVSWQTC
                     DFLKATCEHCGTENLVIEGPTTCGYLPTNAVVKMPCPACQDPEIGPEHSVADYHNHSN
                     IETRLRKGGRTRCFGGCVFAYVGCYNKRAYWVPRASADIGSGHTGITGDNVETLNEDL
                     LEILSRERVNINIVGDFHLNEEVAIILASFSASTSAFIDTIKSLDYKSFKTIVESCGN
                     YKVTKGKPVKGAWNIGQQRSVLTPLCGFPSQAAGVIRSIFARTLDAANHSIPDLQRAA
                     VTILDGISEQSLRLVDAMVYTSDLLTNSVIIMAYVTGGLVQQTSQWLSNLLGTTVEKL
                     RPIFEWIEAKLSAGVEFLKDAWEILKFLITGVFDIVKGQIQVASDNIKDCVKCFIDVV
                     NKALEMCIDQVTIAGAKLRSLNLGEVFIAQSKGLYRQCIRGKEQLQLLMPLKAPKEVT
                     FLEGDSHDTVLTSEEVVLKNGELEALETPVDSFTNGAIVGTPVCVNGLMLLEIKDKEQ
                     YCALSPGLLATNNVFRLKGGAPIKGVTFGEDTVWEVQGYKNVRITFELDERVDKVLNE
                     KCSVYTVESGTEVTEFACVVAEAVVKTLQPVSDLLTNMGIDLDEWSVATFYLFDDAGE
                     ENFSSRMYCSFYPPDEEEEDDAECEEEEIDETCEHEYGTEDDYQGLPLEFGASAETVR
                     VEEEEEEDWLDDTTEQSEIEPEPEPTPEEPVNQFTGYLKLTDNVAIKCVDIVKEAQSA
                     NPMVIVNAANIHLKHGGGVAGALNKATNGAMQKESDDYIKLNGPLTVGGSCLLSGHNL
                     AKKCLHVVGPNLNAGEDIQLLKAAYENFNSQDILLAPLLSAGIFGAKPLQSLQVCVQT
                     VRTQVYIAVNDKALYEQVVMDYLDNLKPRVEAPKQEEPPNTEDSKTEEKSVVQKPVDV
                     KPKIKACIDEVTTTLEETKFLTNKLLLFADINGKLYHDSQNMLRGEDMSFLEKDAPYM
                     VGDVITSGDITCVVIPSKKAGGTTEMLSRALKKVPVDEYITTYPGQGCAGYTLEEAKT
                     ALKKCKSAFYVLPSEAPNAKEEILGTVSWNLREMLAHAEETRKLMPICMDVRAIMATI
                     QRKYKGIKIQEGIVDYGVRFFFYTSKEPVASIITKLNSLNEPLVTMPIGYVTHGFNLE
                     EAARCMRSLKAPAVVSVSSPDAVTTYNGYLTSSSKTSEEHFVETVSLAGSYRDWSYSG
                     QRTELGVEFLKRGDKIVYHTLESPVEFHLDGEVLSLDKLKSLLSLREVKTIKVFTTVD
                     NTNLHTQLVDMSMTYGQQFGPTYLDGADVTKIKPHVNHEGKTFFVLPSDDTLRSEAFE
                     YYHTLDESFLGRYMSALNHTKKWKFPQVGGLTSIKWADNNCYLSSVLLALQQLEVKFN
                     APALQEAYYRARAGDAANFCALILAYSNKTVGELGDVRETMTHLLQHANLESAKRVLN
                     VVCKHCGQKTTTLTGVEAVMYMGTLSYDNLKTGVSIPCVCGRDATQYLVQQESSFVMM
                     SAPPAEYKLQQGTFLCANEYTGNYQCGHYTHITAKETLYRIDGAHLTKMSEYKGPVTD
                     VFYKETSYTTTIKPVSYKLDGVTYTEIEPKLDGYYKKDNAYYTEQPIDLVPTQPLPNA
                     SFDNFKLTCSNTKFADDLNQMTGFTKPASRELSVTFFPDLNGDVVAIDYRHYSASFKK
                     GAKLLHKPIVWHINQATTKTTFKPNTWCLRCLWSTKPVDTSNSFEVLAVEDTQGMDNL
                     ACESQQPTSEEVVENPTIQKEVIECDVKTTEVVGNVILKPSDEGVKVTQELGHEDLMA
                     AYVENTSITIKKPNELSLALGLKTIATHGIAAINSVPWSKILAYVKPFLGQAAITTSN
                     CAKRLAQRVFNNYMPYVFTLLFQLCTFTKSTNSRIRASLPTTIAKNSVKSVAKLCLDA
                     GINYVKSPKFSKLFTIAMWLLLLSICLGSLICVTAAFGVLLSNFGAPSYCNGVRELYL
                     NSSNVTTMDFCEGSFPCSICLSGLDSLDSYPALETIQVTISSYKLDLTILGLAAEWVL
                     AYMLFTKFFYLLGLSAIMQVFFGYFASHFISNSWLMWFIISIVQMAPVSAMVRMYIFF
                     ASFYYIWKSYVHIMDGCTSSTCMMCYKRNRATRVECTTIVNGMKRSFYVYANGGRGFC
                     KTHNWNCLNCDTFCTGSTFISDEVARDLSLQFKRPINPTDQSSYIVDSVAVKNGALHL
                     YFDKAGQKTYERHPLSHFVNLDNLRANNTKGSLPINVIVFDGKSKCDESASKSASVYY
                     SQLMCQPILLLDQALVSDVGDSTEVSVKMFDAYVDTFSATFSVPMEKLKALVATAHSE
                     LAKGVALDGVLSTFVSAARQGVVDTDVDTKDVIECLKLSHHSDLEVTGDSCNNFMLTY
                     NKVENMTPRDLGACIDCNARHINAQVAKSHNVSLIWNVKDYMSLSEQLRKQIRSAAKK
                     NNIPFRLTCATTRQVVNVITTKISLKGGKIVSTCFKLMLKATLLCVLAALVCYIVMPV
                     HTLSIHDGYTNEIIGYKAIQDGVTRDIISTDDCFANKHAGFDAWFSQRGGSYKNDKSC
                     PVVAAIITREIGFIVPGLPGTVLRAINGDFLHFLPRVFSAVGNICYTPSKLIEYSDFA
                     TSACVLAAECTIFKDAMGKPVPYCYDTNLLEGSISYSELRPDTRYVLMDGSIIQFPNT
                     YLEGSVRVVTTFDAEYCRHGTCERSEVGICLSTSGRWVLNNEHYRALSGVFCGVDAMN
                     LIANIFTPLVQPVGALDVSASVVAGGIIAILVTCAAYYFMKFRRVFGEYNHVVAANAL
                     LFLMSFTILCLVPAYSFLPGVYSVFYLYLTFYFTNDVSFLAHLQWFAMFSPIVPFWIT
                     AIYVFCISLKHCHWFFNNYLRKRVMFNGVTFSTFEEAALCTFLLNKEMYLKLRSETLL
                     PLTQYNRYLALYNKYKYFSGALDTTSYREAACCHLAKALNDFSNSGADVLYQPPQTSI
                     TSAVLQSGFRKMAFPSGKVEGCMVQVTCGTTTLNGLWLDDTVYCPRHVICTAEDMLNP
                     NYEDLLIRKSNHSFLVQAGNVQLRVIGHSMQNCLLRLKVDTSNPKTPKYKFVRIQPGQ
                     TFSVLACYNGSPSGVYQCAMRPNHTIKGSFLNGSCGSVGFNIDYDCVSFCYMHHMELP
                     TGVHAGTDLEGKFYGPFVDRQTAQAAGTDTTITLNVLAWLYAAVINGDRWFLNRFTTT
                     LNDFNLVAMKYNYEPLTQDHVDILGPLSAQTGIAVLDMCAALKELLQNGMNGRTILGS
                     TILEDEFTPFDVVRQCSGVTFQGKFKKIVKGTHHWMLLTFLTSLLILVQSTQWSLFFF
                     VYENAFLPFTLGIMAIAACAMLLVKHKHAFLCLFLLPSLATVAYFNMVYMPASWVMRI
                     MTWLELADTSLSGYRLKDCVMYASALVLLILMTARTVYDDAARRVWTLMNVITLVYKV
                     YYGNALDQAISMWALVISVTSNYSGVVTTIMFLARAIVFVCVEYYPLLFITGNTLQCI
                     MLVYCFLGYCCCCYFGLFCLLNRYFRLTLGVYDYLVSTQEFRYMNSQGLLPPKSSIDA
                     FKLNIKLLGIGGKPCIKVATVQSKMSDVKCTSVVLLSVLQQLRVESSSKLWAQCVQLH
                     NDILLAKDTTEAFEKMVSLLSVLLSMQGAVDINRLCEEMLDNRATLQAIASEFSSLPS
                     YAAYATAQEAYEQAVANGDSEVVLKKLKKSLNVAKSEFDRDAAMQRKLEKMADQAMTQ
                     MYKQARSEDKRAKVTSAMQTMLFTMLRKLDNDALNNIINNARDGCVPLNIIPLTTAAK
                     LMVVVPDYGTYKNTCDGNTFTYASALWEIQQVVDADSKIVQLSEINMDNSPNLAWPLI
                     VTALRANSAVKLQNNELSPVALRQMSCAAGTTQTACTDDNALAYYNNSKGGRFVLALL
                     SDHQDLKWARFPKSDGTGTIYTELEPPCRFVTDTPKGPKVKYLYFIKGLNNLNRGMVL
                     GSLAATVRLQAGNATEVPANSTVLSFCAFAVDPAKAYKDYLASGGQPITNCVKMLCTH
                     TGTGQAITVTPEANMDQESFGGASCCLYCRCHIDHPNPKGFCDLKGKYVQIPTTCAND
                     PVGFTLRNTVCTVCGMWKGYGCSCDQLREPLMQSADASTFFKRVCGVSAARLTPCGTG
                     TSTDVVYRAFDIYNEKVAGFAKFLKTNCCRFQEKDEEGNLLDSYFVVKRHTMSNYQHE
                     ETIYNLVKDCPAVAVHDFFKFRVDGDMVPHISRQRLTKYTMADLVYALRHFDEGNCDT
                     LKEILVTYNCCDDDYFNKKDWYDFVENPDILRVYANLGERVRQSLLKTVQFCDAMRDA
                     GIVGVLTLDNQDLNGNWYDFGDFVQVAPGCGVPIVDSYYSLLMPILTLTRALAAESHM
                     DADLAKPLIKWDLLKYDFTEERLCLFDRYFKYWDQTYHPNCINCLDDRCILHCANFNV
                     LFSTVFPPTSFGPLVRKIFVDGVPFVVSTGYHFRELGVVHNQDVNLHSSRLSFKELLV
                     YAADPAMHAASGNLLLDKRTTCFSVAALTNNVAFQTVKPGNFNKDFYDFAVSKGFFKE
                     GSSVELKHFFFAQDGNAAISDYDYYRYNLPTMCDIRQLLFVVEVVDKYFDCYDGGCIN
                     ANQVIVNNLDKSAGFPFNKWGKARLYYDSMSYEDQDALFAYTKRNVIPTITQMNLKYA
                     ISAKNRARTVAGVSICSTMTNRQFHQKLLKSIAATRGATVVIGTSKFYGGWHNMLKTV
                     YSDVETPHLMGWDYPKCDRAMPNMLRIMASLVLARKHNTCCNLSHRFYRLANECAQVL
                     SEMVMCGGSLYVKPGGTSSGDATTAYANSVFNICQAVTANVNALLSTDGNKIADKYVR
                     NLQHRLYECLYRNRDVDHEFVDEFYAYLRKHFSMMILSDDAVVCYNSNYAAQGLVASI
                     KNFKAVLYYQNNVFMSEAKCWTETDLTKGPHEFCSQHTMLVKQGDDYVYLPYPDPSRI
                     LGAGCFVDDIVKTDGTLMIERFVSLAIDAYPLTKHPNQEYADVFHLYLQYIRKLHDEL
                     TGHMLDMYSVMLTNDNTSRYWEPEFYEAMYTPHTVLQAVGACVLCNSQTSLRCGACIR
                     RPFLCCKCCYDHVISTSHKLVLSVNPYVCNAPGCDVTDVTQLYLGGMSYYCKSHKPPI
                     SFPLCANGQVFGLYKNTCVGSDNVTDFNAIATCDWTNAGDYILANTCTERLKLFAAET
                     LKATEETFKLSYGIATVREVLSDRELHLSWEVGKPRPPLNRNYVFTGYRVTKNSKVQI
                     GEYTFEKGDYGDAVVYRGTTTYKLNVGDYFVLTSHTVMPLSAPTLVPQEHYVRITGLY
                     PTLNISDEFSSNVANYQKVGMQKYSTLQGPPGTGKSHFAIGLALYYPSARIVYTACSH
                     AAVDALCEKALKYLPIDKCSRIIPARARVECFDKFKVNSTLEQYVFCTVNALPETTAD
                     IVVFDEISMATNYDLSVVNARLRAKHYVYIGDPAQLPAPRTLLTKGTLEPEYFNSVCR
                     LMKTIGPDMFLGTCRRCPAEIVDTVSALVYDNKLKAHKDKSAQCFKMFYKGVITHDVS
                     SAINRPQIGVVREFLTRNPAWRKAVFISPYNSQNAVASKILGLPTQTVDSSQGSEYDY
                     VIFTQTTETAHSCNVNRFNVAITRAKIGILCIMSDRDLYDKLQFTSLEIPRRNVATLQ
                     AENVTGLFKDCSKIITGLHPTQAPTHLSVDIKFKTEGLCVDIPGIPKDMTYRRLISMM
                     GFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVGTNLPLQLGFSTGVN
                     LVAVPTGYVDTENNTEFTRVNAKPPPGDQFKHLIPLMYKGLPWNVVRIKIVQMLSDTL
                     KGLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACWNHSVG
                     FDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMTRCLAVHECFVKRV
                     DWSVEYPIIGDELRVNSACRKVQHMVVKSALLADKFPVLHDIGNPKAIKCVPQAEVEW
                     KFYDAQPCSDKAYKIEELFYSYATHHDKFTDGVCLFWNCNVDRYPANAIVCRFDTRVL
                     SNLNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQVVSDID
                     YVPLKSATCITRCNLGGAVCRHHANEYRQYLDAYNMMISAGFSLWIYKQFDTYNLWNT
                     FTRLQSLENVAYNVVNKGHFDGHAGEAPVSIINNAVYTKVDGIDVEIFENKTTLPVNV
                     AFELWAKRNIKPVPEIKILNNLGVDIAANTVIWDYKREAPAHVSTIGVCTMTDIAKKP
                     TESACSSLTVLFDGRVEGQVDLFRNARNGVLITEGSVKGLTPSKGPAQASVNGVTLIG
                     ESVKTQFNYFKKVDGIIQQLPETYFTQSRDLEDFKPRSQMETDFLELAMDEFIQRYKL
                     EGYAFEHIVYGDFSHGQLGGLHLMIGLAKRSQDSPLKLEDFIPMDSTVKNYFITDAQT
                     GSSKCVCSVIDLLLDDFVEIIKSQDLSVISKVVKVTIDYAEISFMLWCKDGHVETFYP
                     KLQASQAWQPGVAMPNLYKMQRMLLEKCDLQNYGENAVIPKGIMMNVAKYTQLCQYLN
                     TLTLAVPYNMRVIHFGAGSDKGVAPGTAVLRQWLPTGTLLVDSDLNDFVSDADSTLIG
                     DCATVHTANKWDLIISDMYDPRTKHVTKENDSKEGFFTYLCGFIKQKLALGGSIAVKI
                     TEHSWNADLYKLMGHFSWWTAFVTNVNASSSEAFLIGANYLGKPKEQIDGYTMHANYI
                     FWRNTNPIQLSSYSLFDMSKFPLKLRGTAVMSLKENQINDMIYSLLEKGRLIIRENNR
                     VVVSSDILVNN"
     mat_peptide     265..804
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /product="nsp1"
                     /protein_id="NP_828860.2"
     misc_feature    301..645
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="N-terminal domain of non-structural protein 1 from
                     Severe acute respiratory syndrome-related coronavirus and
                     betacoronavirus in the B lineage; Region:
                     SARS-CoV-like_Nsp1_N; cd21796"
                     /db_xref="CDD:409335"
     mat_peptide     805..2718
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /product="nsp2"
                     /protein_id="NP_828861.2"
     misc_feature    808..2718
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="betacoronavirus non-structural protein 2 (Nsp2)
                     similar to SARS-CoV Nsp2, and related proteins from
                     betacoronaviruses in the B lineage; Region:
                     cv_beta_Nsp2_SARS-like; cd21516"
                     /db_xref="CDD:394867"
     mat_peptide     2719..8484
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /product="nsp3"
                     /note="papain-like proteinase"
                     /protein_id="NP_828862.2"
     misc_feature    2905..3351
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="Protein of unknown function (DUF3655); Region:
                     DUF3655; pfam12379"
                     /db_xref="CDD:403549"
     misc_feature    3364..3729
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="X-domain of viral non-structural protein 3 and
                     related macrodomains; Region: Macro_X_Nsp3-like; cd21557"
                     /db_xref="CDD:394882"
     misc_feature    order(3376..3378,3412..3414,3418..3420,3553..3555,
                     3637..3660)
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="putative ADP-ribose binding site [chemical
                     binding]; other site"
                     /db_xref="CDD:394882"
     misc_feature    3889..4266
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="SUD-N macrodomain of the SARS Unique Domain (SUD)
                     of SARS-CoV non-structural protein 3 and related
                     macrodomains; Region: Macro_cv_SUD-N_Nsp3-like; cd21562"
                     /db_xref="CDD:394883"
     misc_feature    order(3961..3963,3994..3996,4000..4002,4099..4101,
                     4174..4197)
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="putative ADP-ribose binding site [chemical
                     binding]; other site"
                     /db_xref="CDD:394883"
     misc_feature    4243..4671
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="Single-stranded poly(A) binding domain; Region:
                     SUD-M; pfam11633"
                     /db_xref="CDD:314498"
     misc_feature    4678..4878
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="C-terminal SARS-Unique Domain (SUD) of
                     non-structural protein 3 (Nsp3) from Severe Acute
                     Respiratory Syndrome coronavirus and related
                     betacoronaviruses in the B lineage; Region:
                     SUD_C_SARS-CoV_Nsp3; cd21525"
                     /db_xref="CDD:394841"
     misc_feature    4891..5799
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="betacoronavirus papain-like protease; Region:
                     betaCoV_PLPro; cd21732"
                     /db_xref="CDD:409649"
     misc_feature    order(5209..5220,5368..5376,5380..5385,5392..5394,
                     5479..5481,5548..5553,5557..5559,5626..5628,5674..5676,
                     5695..5703,5785..5787)
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="ubiquitin binding site [polypeptide binding]; other
                     site"
                     /db_xref="CDD:409649"
     misc_feature    order(5368..5376,5623..5628,5674..5676,5683..5685,
                     5701..5703,5785..5787)
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="polypeptide substrate binding site [polypeptide
                     binding]; other site"
                     /db_xref="CDD:409649"
     misc_feature    5932..6252
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="nucleic acid binding domain of non-structural
                     protein 3 from Severe acute respiratory syndrome-related
                     coronavirus and betacoronavirus in the B lineage; Region:
                     SARS-CoV-like_Nsp3_NAB; cd21822"
                     /db_xref="CDD:409348"
     misc_feature    6325..6672
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="betacoronavirus-specific marker of non-structural
                     protein 3 from Severe acute respiratory syndrome-related
                     coronavirus and betacoronavirus in the B lineage; Region:
                     SARS-CoV-like_Nsp3_betaSM; cd21814"
                     /db_xref="CDD:409629"
     misc_feature    6889..8481
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="C-terminus of non-structural protein 3, including
                     transmembrane and Y domains, from Severe acute respiratory
                     syndrome-related coronavirus and betacoronavirus in the B
                     lineage; Region: TM_Y_SARS-CoV-like_Nsp3_C; cd21717"
                     /db_xref="CDD:409665"
     misc_feature    6889..6954
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="TM1 [structural motif]; Region: TM1"
                     /db_xref="CDD:409665"
     misc_feature    7204..7272
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="TM2 [structural motif]; Region: TM2"
                     /db_xref="CDD:409665"
     mat_peptide     8485..9984
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /product="nsp4"
                     /protein_id="NP_904322.1"
     misc_feature    8524..9666
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="coronavirus non-structural protein 4 (Nsp4)
                     transmembrane domain; Region: cv_Nsp4_TM; cd21473"
                     /db_xref="CDD:394836"
     misc_feature    8524..8592
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="putative TM helix 1 [structural motif]; Region:
                     putative TM helix 1"
                     /db_xref="CDD:394836"
     misc_feature    9322..9387
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="putative TM helix 2 [structural motif]; Region:
                     putative TM helix 2"
                     /db_xref="CDD:394836"
     misc_feature    9430..9495
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="putative TM helix 3 [structural motif]; Region:
                     putative TM helix 3"
                     /db_xref="CDD:394836"
     misc_feature    9574..9642
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="putative TM helix 4 [structural motif]; Region:
                     putative TM helix 4"
                     /db_xref="CDD:394836"
     misc_feature    9700..9978
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="Coronavirus nonstructural protein 4 C-terminus;
                     Region: Corona_NSP4_C; pfam16348"
                     /db_xref="CDD:406690"
     mat_peptide     9985..10902
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /product="3C-like protease"
                     /note="3CLp; nsp5"
                     /protein_id="NP_828863.1"
     misc_feature    9994..10884
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="betacoronavirus non-structural protein 5, also
                     called Main protease (Mpro); Region: betaCoV_Nsp5_Mpro;
                     cd21666"
                     /db_xref="CDD:394887"
     misc_feature    order(9994..10017,10024..10026,10336..10338,10348..10368,
                     10393..10407,10480..10482,10498..10500,10840..10842,
                     10852..10854,10876..10881)
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="homodimer interface [polypeptide binding]; other
                     site"
                     /db_xref="CDD:394887"
     misc_feature    order(10057..10065,10105..10107,10402..10419,10471..10482,
                     10498..10500,10543..10560)
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="polypeptide substrate binding site [polypeptide
                     binding]; other site"
                     /db_xref="CDD:394887"
     mat_peptide     10903..11772
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /product="nsp6"
                     /protein_id="NP_828864.1"
     misc_feature    10903..11772
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="betacoronavirus non-structural protein 6; Region:
                     betaCoV-Nsp6; cd21560"
                     /db_xref="CDD:394846"
     mat_peptide     11773..12021
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /product="nsp7"
                     /protein_id="NP_828865.1"
     misc_feature    11773..12021
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="betacoronavirus non-structural protein 7; Region:
                     betaCoV_Nsp7; cd21827"
                     /db_xref="CDD:409253"
     misc_feature    order(11776..11778,11785..11796,11803..11811,11815..11820,
                     11827..11829,11854..11856,11863..11865,11881..11883,
                     11917..11934,11938..11955,11974..11988)
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="oligomer interface [polypeptide binding]; other
                     site"
                     /db_xref="CDD:409253"
     mat_peptide     12022..12615
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /product="nsp8"
                     /protein_id="NP_828866.1"
     misc_feature    12022..12612
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="nsp8 replicase; Region: nsp8; pfam08717"
                     /db_xref="CDD:400866"
     mat_peptide     12616..12954
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /product="nsp9"
                     /protein_id="NP_828867.1"
     misc_feature    12616..12954
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="betacoronavirus non-structural protein 9; Region:
                     betaCoV_Nsp9; cd21898"
                     /db_xref="CDD:409331"
     misc_feature    12616..12633
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="N-finger; other site"
                     /db_xref="CDD:409331"
     misc_feature    order(12622..12627,12634..12639,12832..12837,12901..12906,
                     12910..12918,12922..12930,12934..12939)
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="homodimer interface [polypeptide binding]; other
                     site"
                     /db_xref="CDD:409331"
     mat_peptide     12955..13371
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /product="nsp10"
                     /protein_id="NP_828868.1"
     misc_feature    12955..13347
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="coronavirus non-structural protein 10; Region:
                     CoV_Nsp10; cd21872"
                     /db_xref="CDD:409325"
     misc_feature    order(12955..12978,12988..12990,12994..13002,13006..13014,
                     13027..13032,13039..13044,13051..13053,13072..13089,
                     13126..13131,13159..13161,13165..13170,13180..13182,
                     13186..13203,13216..13224,13231..13242)
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="Nsp14 interface [polypeptide binding]; other site"
                     /db_xref="CDD:409325"
     misc_feature    order(12994..13002,13006..13014,13027..13029,13072..13074,
                     13078..13089,13126..13134,13186..13197,13204..13206,
                     13237..13242,13297..13299)
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="oligomer interface [polypeptide binding]; other
                     site"
                     /db_xref="CDD:409325"
     misc_feature    order(13027..13029,13078..13089,13126..13134,13204..13206,
                     13237..13242,13297..13299)
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="homotrimer interface [polypeptide binding]; other
                     site"
                     /db_xref="CDD:409325"
     misc_feature    order(13072..13095,13123..13131,13159..13170,13183..13188,
                     13192..13194,13231..13242)
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="Nsp16 interface [polypeptide binding]; other site"
                     /db_xref="CDD:409325"
     mat_peptide     join(13372..13392,13392..13394)
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /product="RNA-dependent RNA polymerase"
                     /note="RdRp; nsp12"
                     /protein_id="YP_009924301.1"
     misc_feature    join(13387..13392,13392..16166)
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="Severe acute respiratory syndrome coronavirus
                     RNA-dependent RNA polymerase, also known as non-structural
                     protein 12, and similar proteins from betacoronaviruses in
                     the B lineage: responsible for replication and
                     transcription of the viral RNA genome; Region:
                     SARS-CoV-like_RdRp; cd21591"
                     /db_xref="CDD:394895"
     misc_feature    order(14175..14189,14337..14342,14346..14348,14352..14366,
                     14382..14393,14400..14402,14472..14474,14481..14486,
                     14490..14495,14502..14504,14508..14522,14526..14546,
                     14556..14558,14562..14564,14574..14579,14589..14591,
                     14883..14888,14895..14897,14910..14915,14919..14921,
                     14928..14939,15366..15368)
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="Nsp8 interaction site [polypeptide binding]; other
                     site"
                     /db_xref="CDD:394895"
     misc_feature    order(14595..14609,14613..14615,14628..14630,14655..14657,
                     14661..14663,14688..14705,15018..15020,15024..15026,
                     15897..15899)
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="Nsp7 interaction site [polypeptide binding]; other
                     site"
                     /db_xref="CDD:394895"
     misc_feature    order(14724..14726,15003..15005,15033..15035,15222..15227,
                     15231..15239,15414..15416,15429..15431,15441..15443,
                     15645..15653)
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="putative inhibitor binding site [chemical binding];
                     other site"
                     /db_xref="CDD:394895"
     misc_feature    order(14862..14864,14868..14876,15003..15005,15039..15041,
                     15045..15047,15063..15065,15075..15077,15087..15089,
                     15099..15101,15138..15140,15144..15146,15150..15152,
                     15414..15425,15432..15434,15642..15653,15807..15812,
                     15864..15866,15888..15890,15939..15944,15954..15956,
                     15960..15965)
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="putative RNA binding site [nucleotide binding];
                     other site"
                     /db_xref="CDD:394895"
     misc_feature    14868..14909
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="conserved polymerase motif G; other site"
                     /db_xref="CDD:394895"
     misc_feature    14982..15050
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="conserved polymerase motif F; other site"
                     /db_xref="CDD:394895"
     misc_feature    15201..15251
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="conserved polymerase motif A; other site"
                     /db_xref="CDD:394895"
     misc_feature    order(15408..15458,15462..15497)
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="conserved polymerase motif B; other site"
                     /db_xref="CDD:394895"
     misc_feature    15627..15671
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="conserved polymerase motif C; other site"
                     /db_xref="CDD:394895"
     misc_feature    order(15693..15701,15705..15758)
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="conserved polymerase motif D; other site"
                     /db_xref="CDD:394895"
     misc_feature    15798..15833
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="conserved polymerase motif E; other site"
                     /db_xref="CDD:394895"
     mat_peptide     16167..17969
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /product="helicase/NTPase"
                     /note="nsp13"
                     /protein_id="NP_828870.1"
     misc_feature    16167..16451
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="Cys/His rich zinc-binding domain (CH/ZBD) of
                     coronavirus SARS NSP13 helicase and related proteins;
                     Region: ZBD_cv_Nsp13-like; cd21401"
                     /db_xref="CDD:394808"
     misc_feature    16461..16604
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="stalk domain of coronavirus Nsp13 helicase and
                     related proteins; Region: stalk_CoV_Nsp13-like; cd21689"
                     /db_xref="CDD:410205"
     misc_feature    order(16470..16472,16557..16559)
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="key interaction residues; other site"
                     /db_xref="CDD:410205"
     misc_feature    16614..16850
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="1B domain of coronavirus SARS NSP13 helicase and
                     related proteins; Region: 1B_cv_Nsp13-like; cd21409"
                     /db_xref="CDD:394817"
     misc_feature    order(16701..16703,16707..16709,16767..16769)
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="nucleic acid substrate binding site [nucleotide
                     binding]; other site"
                     /db_xref="CDD:394817"
     misc_feature    16917..17936
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="helicase domain of betacoronavirus non-structural
                     protein 13; Region: betaCoV_Nsp13-helicase; cd21722"
                     /db_xref="CDD:409655"
     misc_feature    order(17019..17036,17376..17378,17493..17495,17778..17780,
                     17784..17786,17865..17867)
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="ATP binding site [chemical binding]; other site"
                     /db_xref="CDD:409655"
     misc_feature    order(17028..17033,17286..17291,17376..17378,17865..17867)
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="putative active site [active]"
                     /db_xref="CDD:409655"
     mat_peptide     17970..19550
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /product="3' to 5' exonuclease"
                     /note="ExoN; nsp14"
                     /protein_id="NP_828871.1"
     misc_feature    17982..19544
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="nonstructural protein 14 of betacoronavirus;
                     Region: betaCoV_Nsp14; cd21659"
                     /db_xref="CDD:394958"
     misc_feature    order(17982..17984,17988..17999,18024..18053,18120..18122,
                     18132..18134,18147..18170,18270..18275,18339..18341,
                     18345..18347,18357..18362,18543..18545,18552..18557,
                     18564..18572,18618..18620)
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="heterodimer interface [polypeptide binding]; other
                     site"
                     /db_xref="CDD:394958"
     misc_feature    order(18237..18239,18243..18245,18540..18542,18771..18773,
                     18786..18788)
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="ExoN active site [active]"
                     /db_xref="CDD:394958"
     misc_feature    order(18843..18845,18885..18887,18894..18899,18906..18908,
                     18966..18977,18981..18983,19023..19031,19065..19073,
                     19122..19136,19170..19172,19227..19229,19233..19238,
                     19245..19247,19251..19253,19485..19487)
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="N7-MTase active site [active]"
                     /db_xref="CDD:394958"
     mat_peptide     19551..20588
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /product="endoribonuclease"
                     /note="nsp15"
                     /protein_id="NP_828872.1"
     misc_feature    19551..19733
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="N-terminal domain of alpha- and beta-coronavirus
                     Nonstructural protein 15 (Nsp15), and related proteins;
                     Region: NTD_alpha_beta_cv_Nsp15-like; cd21171"
                     /db_xref="CDD:394900"
     misc_feature    19743..20138
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="middle domain of alpha- and beta-coronavirus
                     Nonstructural protein 15 (Nsp15), and related proteins;
                     Region: M_alpha_beta_cv_Nsp15-like; cd21167"
                     /db_xref="CDD:394905"
     misc_feature    20130..20582
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="Nidoviral uridylate-specific endoribonuclease
                     (NendoU) domain of coronavirus Nonstructural Protein 15
                     (Nsp15) and related proteins; Region:
                     NendoU_cv_Nsp15-like; cd21161"
                     /db_xref="CDD:394912"
     misc_feature    order(20250..20252,20262..20264,20289..20291,20295..20297,
                     20415..20417,20427..20429,20568..20570)
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="putative active site [active]"
                     /db_xref="CDD:394912"
     mat_peptide     20589..21482
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /product="2'-O-MTase"
                     /note="2'-O-methyltransferase; nsp16"
                     /protein_id="NP_828873.2"
     misc_feature    20592..21479
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="Coronavirus NSP13; Region: NSP13; pfam06460"
                     /db_xref="CDD:399456"
     CDS             265..13413
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="ORF1a polyprotein is cleaved to yield nonstructural
                     proteins; polyprotein pp1a"
                     /codon_start=1
                     /product="ORF1a polyprotein"
                     /protein_id="YP_009944365.1"
                     /db_xref="GeneID:1489680"
                     /translation="MESLVLGVNEKTHVQLSLPVLQVRDVLVRGFGDSVEEALSEARE
                     HLKNGTCGLVELEKGVLPQLEQPYVFIKRSDALSTNHGHKVVELVAEMDGIQYGRSGI
                     TLGVLVPHVGETPIAYRNVLLRKNGNKGAGGHSYGIDLKSYDLGDELGTDPIEDYEQN
                     WNTKHGSGALRELTRELNGGAVTRYVDNNFCGPDGYPLDCIKDFLARAGKSMCTLSEQ
                     LDYIESKRGVYCCRDHEHEIAWFTERSDKSYEHQTPFEIKSAKKFDTFKGECPKFVFP
                     LNSKVKVIQPRVEKKKTEGFMGRIRSVYPVASPQECNNMHLSTLMKCNHCDEVSWQTC
                     DFLKATCEHCGTENLVIEGPTTCGYLPTNAVVKMPCPACQDPEIGPEHSVADYHNHSN
                     IETRLRKGGRTRCFGGCVFAYVGCYNKRAYWVPRASADIGSGHTGITGDNVETLNEDL
                     LEILSRERVNINIVGDFHLNEEVAIILASFSASTSAFIDTIKSLDYKSFKTIVESCGN
                     YKVTKGKPVKGAWNIGQQRSVLTPLCGFPSQAAGVIRSIFARTLDAANHSIPDLQRAA
                     VTILDGISEQSLRLVDAMVYTSDLLTNSVIIMAYVTGGLVQQTSQWLSNLLGTTVEKL
                     RPIFEWIEAKLSAGVEFLKDAWEILKFLITGVFDIVKGQIQVASDNIKDCVKCFIDVV
                     NKALEMCIDQVTIAGAKLRSLNLGEVFIAQSKGLYRQCIRGKEQLQLLMPLKAPKEVT
                     FLEGDSHDTVLTSEEVVLKNGELEALETPVDSFTNGAIVGTPVCVNGLMLLEIKDKEQ
                     YCALSPGLLATNNVFRLKGGAPIKGVTFGEDTVWEVQGYKNVRITFELDERVDKVLNE
                     KCSVYTVESGTEVTEFACVVAEAVVKTLQPVSDLLTNMGIDLDEWSVATFYLFDDAGE
                     ENFSSRMYCSFYPPDEEEEDDAECEEEEIDETCEHEYGTEDDYQGLPLEFGASAETVR
                     VEEEEEEDWLDDTTEQSEIEPEPEPTPEEPVNQFTGYLKLTDNVAIKCVDIVKEAQSA
                     NPMVIVNAANIHLKHGGGVAGALNKATNGAMQKESDDYIKLNGPLTVGGSCLLSGHNL
                     AKKCLHVVGPNLNAGEDIQLLKAAYENFNSQDILLAPLLSAGIFGAKPLQSLQVCVQT
                     VRTQVYIAVNDKALYEQVVMDYLDNLKPRVEAPKQEEPPNTEDSKTEEKSVVQKPVDV
                     KPKIKACIDEVTTTLEETKFLTNKLLLFADINGKLYHDSQNMLRGEDMSFLEKDAPYM
                     VGDVITSGDITCVVIPSKKAGGTTEMLSRALKKVPVDEYITTYPGQGCAGYTLEEAKT
                     ALKKCKSAFYVLPSEAPNAKEEILGTVSWNLREMLAHAEETRKLMPICMDVRAIMATI
                     QRKYKGIKIQEGIVDYGVRFFFYTSKEPVASIITKLNSLNEPLVTMPIGYVTHGFNLE
                     EAARCMRSLKAPAVVSVSSPDAVTTYNGYLTSSSKTSEEHFVETVSLAGSYRDWSYSG
                     QRTELGVEFLKRGDKIVYHTLESPVEFHLDGEVLSLDKLKSLLSLREVKTIKVFTTVD
                     NTNLHTQLVDMSMTYGQQFGPTYLDGADVTKIKPHVNHEGKTFFVLPSDDTLRSEAFE
                     YYHTLDESFLGRYMSALNHTKKWKFPQVGGLTSIKWADNNCYLSSVLLALQQLEVKFN
                     APALQEAYYRARAGDAANFCALILAYSNKTVGELGDVRETMTHLLQHANLESAKRVLN
                     VVCKHCGQKTTTLTGVEAVMYMGTLSYDNLKTGVSIPCVCGRDATQYLVQQESSFVMM
                     SAPPAEYKLQQGTFLCANEYTGNYQCGHYTHITAKETLYRIDGAHLTKMSEYKGPVTD
                     VFYKETSYTTTIKPVSYKLDGVTYTEIEPKLDGYYKKDNAYYTEQPIDLVPTQPLPNA
                     SFDNFKLTCSNTKFADDLNQMTGFTKPASRELSVTFFPDLNGDVVAIDYRHYSASFKK
                     GAKLLHKPIVWHINQATTKTTFKPNTWCLRCLWSTKPVDTSNSFEVLAVEDTQGMDNL
                     ACESQQPTSEEVVENPTIQKEVIECDVKTTEVVGNVILKPSDEGVKVTQELGHEDLMA
                     AYVENTSITIKKPNELSLALGLKTIATHGIAAINSVPWSKILAYVKPFLGQAAITTSN
                     CAKRLAQRVFNNYMPYVFTLLFQLCTFTKSTNSRIRASLPTTIAKNSVKSVAKLCLDA
                     GINYVKSPKFSKLFTIAMWLLLLSICLGSLICVTAAFGVLLSNFGAPSYCNGVRELYL
                     NSSNVTTMDFCEGSFPCSICLSGLDSLDSYPALETIQVTISSYKLDLTILGLAAEWVL
                     AYMLFTKFFYLLGLSAIMQVFFGYFASHFISNSWLMWFIISIVQMAPVSAMVRMYIFF
                     ASFYYIWKSYVHIMDGCTSSTCMMCYKRNRATRVECTTIVNGMKRSFYVYANGGRGFC
                     KTHNWNCLNCDTFCTGSTFISDEVARDLSLQFKRPINPTDQSSYIVDSVAVKNGALHL
                     YFDKAGQKTYERHPLSHFVNLDNLRANNTKGSLPINVIVFDGKSKCDESASKSASVYY
                     SQLMCQPILLLDQALVSDVGDSTEVSVKMFDAYVDTFSATFSVPMEKLKALVATAHSE
                     LAKGVALDGVLSTFVSAARQGVVDTDVDTKDVIECLKLSHHSDLEVTGDSCNNFMLTY
                     NKVENMTPRDLGACIDCNARHINAQVAKSHNVSLIWNVKDYMSLSEQLRKQIRSAAKK
                     NNIPFRLTCATTRQVVNVITTKISLKGGKIVSTCFKLMLKATLLCVLAALVCYIVMPV
                     HTLSIHDGYTNEIIGYKAIQDGVTRDIISTDDCFANKHAGFDAWFSQRGGSYKNDKSC
                     PVVAAIITREIGFIVPGLPGTVLRAINGDFLHFLPRVFSAVGNICYTPSKLIEYSDFA
                     TSACVLAAECTIFKDAMGKPVPYCYDTNLLEGSISYSELRPDTRYVLMDGSIIQFPNT
                     YLEGSVRVVTTFDAEYCRHGTCERSEVGICLSTSGRWVLNNEHYRALSGVFCGVDAMN
                     LIANIFTPLVQPVGALDVSASVVAGGIIAILVTCAAYYFMKFRRVFGEYNHVVAANAL
                     LFLMSFTILCLVPAYSFLPGVYSVFYLYLTFYFTNDVSFLAHLQWFAMFSPIVPFWIT
                     AIYVFCISLKHCHWFFNNYLRKRVMFNGVTFSTFEEAALCTFLLNKEMYLKLRSETLL
                     PLTQYNRYLALYNKYKYFSGALDTTSYREAACCHLAKALNDFSNSGADVLYQPPQTSI
                     TSAVLQSGFRKMAFPSGKVEGCMVQVTCGTTTLNGLWLDDTVYCPRHVICTAEDMLNP
                     NYEDLLIRKSNHSFLVQAGNVQLRVIGHSMQNCLLRLKVDTSNPKTPKYKFVRIQPGQ
                     TFSVLACYNGSPSGVYQCAMRPNHTIKGSFLNGSCGSVGFNIDYDCVSFCYMHHMELP
                     TGVHAGTDLEGKFYGPFVDRQTAQAAGTDTTITLNVLAWLYAAVINGDRWFLNRFTTT
                     LNDFNLVAMKYNYEPLTQDHVDILGPLSAQTGIAVLDMCAALKELLQNGMNGRTILGS
                     TILEDEFTPFDVVRQCSGVTFQGKFKKIVKGTHHWMLLTFLTSLLILVQSTQWSLFFF
                     VYENAFLPFTLGIMAIAACAMLLVKHKHAFLCLFLLPSLATVAYFNMVYMPASWVMRI
                     MTWLELADTSLSGYRLKDCVMYASALVLLILMTARTVYDDAARRVWTLMNVITLVYKV
                     YYGNALDQAISMWALVISVTSNYSGVVTTIMFLARAIVFVCVEYYPLLFITGNTLQCI
                     MLVYCFLGYCCCCYFGLFCLLNRYFRLTLGVYDYLVSTQEFRYMNSQGLLPPKSSIDA
                     FKLNIKLLGIGGKPCIKVATVQSKMSDVKCTSVVLLSVLQQLRVESSSKLWAQCVQLH
                     NDILLAKDTTEAFEKMVSLLSVLLSMQGAVDINRLCEEMLDNRATLQAIASEFSSLPS
                     YAAYATAQEAYEQAVANGDSEVVLKKLKKSLNVAKSEFDRDAAMQRKLEKMADQAMTQ
                     MYKQARSEDKRAKVTSAMQTMLFTMLRKLDNDALNNIINNARDGCVPLNIIPLTTAAK
                     LMVVVPDYGTYKNTCDGNTFTYASALWEIQQVVDADSKIVQLSEINMDNSPNLAWPLI
                     VTALRANSAVKLQNNELSPVALRQMSCAAGTTQTACTDDNALAYYNNSKGGRFVLALL
                     SDHQDLKWARFPKSDGTGTIYTELEPPCRFVTDTPKGPKVKYLYFIKGLNNLNRGMVL
                     GSLAATVRLQAGNATEVPANSTVLSFCAFAVDPAKAYKDYLASGGQPITNCVKMLCTH
                     TGTGQAITVTPEANMDQESFGGASCCLYCRCHIDHPNPKGFCDLKGKYVQIPTTCAND
                     PVGFTLRNTVCTVCGMWKGYGCSCDQLREPLMQSADASTFLNGFAV"
     mat_peptide     265..804
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /product="nsp1"
                     /protein_id="YP_009944366.1"
     misc_feature    301..645
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="N-terminal domain of non-structural protein 1 from
                     Severe acute respiratory syndrome-related coronavirus and
                     betacoronavirus in the B lineage; Region:
                     SARS-CoV-like_Nsp1_N; cd21796"
                     /db_xref="CDD:409335"
     mat_peptide     805..2718
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /product="nsp2"
                     /protein_id="YP_009944367.1"
     misc_feature    808..2718
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="betacoronavirus non-structural protein 2 (Nsp2)
                     similar to SARS-CoV Nsp2, and related proteins from
                     betacoronaviruses in the B lineage; Region:
                     cv_beta_Nsp2_SARS-like; cd21516"
                     /db_xref="CDD:394867"
     mat_peptide     2719..8484
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /product="nsp3"
                     /note="papain-like proteinase"
                     /protein_id="YP_009944368.1"
     misc_feature    2905..3351
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="Protein of unknown function (DUF3655); Region:
                     DUF3655; pfam12379"
                     /db_xref="CDD:403549"
     misc_feature    3364..3729
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="X-domain of viral non-structural protein 3 and
                     related macrodomains; Region: Macro_X_Nsp3-like; cd21557"
                     /db_xref="CDD:394882"
     misc_feature    order(3376..3378,3412..3414,3418..3420,3553..3555,
                     3637..3660)
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="putative ADP-ribose binding site [chemical
                     binding]; other site"
                     /db_xref="CDD:394882"
     misc_feature    3889..4266
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="SUD-N macrodomain of the SARS Unique Domain (SUD)
                     of SARS-CoV non-structural protein 3 and related
                     macrodomains; Region: Macro_cv_SUD-N_Nsp3-like; cd21562"
                     /db_xref="CDD:394883"
     misc_feature    order(3961..3963,3994..3996,4000..4002,4099..4101,
                     4174..4197)
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="putative ADP-ribose binding site [chemical
                     binding]; other site"
                     /db_xref="CDD:394883"
     misc_feature    4243..4671
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="Single-stranded poly(A) binding domain; Region:
                     SUD-M; pfam11633"
                     /db_xref="CDD:314498"
     misc_feature    4678..4878
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="C-terminal SARS-Unique Domain (SUD) of
                     non-structural protein 3 (Nsp3) from Severe Acute
                     Respiratory Syndrome coronavirus and related
                     betacoronaviruses in the B lineage; Region:
                     SUD_C_SARS-CoV_Nsp3; cd21525"
                     /db_xref="CDD:394841"
     misc_feature    4891..5799
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="betacoronavirus papain-like protease; Region:
                     betaCoV_PLPro; cd21732"
                     /db_xref="CDD:409649"
     misc_feature    order(5209..5220,5368..5376,5380..5385,5392..5394,
                     5479..5481,5548..5553,5557..5559,5626..5628,5674..5676,
                     5695..5703,5785..5787)
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="ubiquitin binding site [polypeptide binding]; other
                     site"
                     /db_xref="CDD:409649"
     misc_feature    order(5368..5376,5623..5628,5674..5676,5683..5685,
                     5701..5703,5785..5787)
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="polypeptide substrate binding site [polypeptide
                     binding]; other site"
                     /db_xref="CDD:409649"
     misc_feature    5932..6252
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="nucleic acid binding domain of non-structural
                     protein 3 from Severe acute respiratory syndrome-related
                     coronavirus and betacoronavirus in the B lineage; Region:
                     SARS-CoV-like_Nsp3_NAB; cd21822"
                     /db_xref="CDD:409348"
     misc_feature    6325..6672
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="betacoronavirus-specific marker of non-structural
                     protein 3 from Severe acute respiratory syndrome-related
                     coronavirus and betacoronavirus in the B lineage; Region:
                     SARS-CoV-like_Nsp3_betaSM; cd21814"
                     /db_xref="CDD:409629"
     misc_feature    6889..8481
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="C-terminus of non-structural protein 3, including
                     transmembrane and Y domains, from Severe acute respiratory
                     syndrome-related coronavirus and betacoronavirus in the B
                     lineage; Region: TM_Y_SARS-CoV-like_Nsp3_C; cd21717"
                     /db_xref="CDD:409665"
     misc_feature    6889..6954
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="TM1 [structural motif]; Region: TM1"
                     /db_xref="CDD:409665"
     misc_feature    7204..7272
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="TM2 [structural motif]; Region: TM2"
                     /db_xref="CDD:409665"
     mat_peptide     8485..9984
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /product="nsp4"
                     /protein_id="YP_009944369.1"
     misc_feature    8524..9666
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="coronavirus non-structural protein 4 (Nsp4)
                     transmembrane domain; Region: cv_Nsp4_TM; cd21473"
                     /db_xref="CDD:394836"
     misc_feature    8524..8592
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="putative TM helix 1 [structural motif]; Region:
                     putative TM helix 1"
                     /db_xref="CDD:394836"
     misc_feature    9322..9387
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="putative TM helix 2 [structural motif]; Region:
                     putative TM helix 2"
                     /db_xref="CDD:394836"
     misc_feature    9430..9495
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="putative TM helix 3 [structural motif]; Region:
                     putative TM helix 3"
                     /db_xref="CDD:394836"
     misc_feature    9574..9642
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="putative TM helix 4 [structural motif]; Region:
                     putative TM helix 4"
                     /db_xref="CDD:394836"
     misc_feature    9700..9978
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="Coronavirus nonstructural protein 4 C-terminus;
                     Region: Corona_NSP4_C; pfam16348"
                     /db_xref="CDD:406690"
     mat_peptide     9985..10902
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /product="3C-like protease"
                     /note="3CLp; nsp5"
                     /protein_id="YP_009944370.1"
     misc_feature    9994..10884
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="betacoronavirus non-structural protein 5, also
                     called Main protease (Mpro); Region: betaCoV_Nsp5_Mpro;
                     cd21666"
                     /db_xref="CDD:394887"
     misc_feature    order(9994..10017,10024..10026,10336..10338,10348..10368,
                     10393..10407,10480..10482,10498..10500,10840..10842,
                     10852..10854,10876..10881)
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="homodimer interface [polypeptide binding]; other
                     site"
                     /db_xref="CDD:394887"
     misc_feature    order(10057..10065,10105..10107,10402..10419,10471..10482,
                     10498..10500,10543..10560)
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="polypeptide substrate binding site [polypeptide
                     binding]; other site"
                     /db_xref="CDD:394887"
     mat_peptide     10903..11772
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /product="nsp6"
                     /protein_id="YP_009944371.1"
     misc_feature    10903..11772
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="betacoronavirus non-structural protein 6; Region:
                     betaCoV-Nsp6; cd21560"
                     /db_xref="CDD:394846"
     mat_peptide     11773..12021
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /product="nsp7"
                     /protein_id="YP_009944372.1"
     misc_feature    11773..12021
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="betacoronavirus non-structural protein 7; Region:
                     betaCoV_Nsp7; cd21827"
                     /db_xref="CDD:409253"
     misc_feature    order(11776..11778,11785..11796,11803..11811,11815..11820,
                     11827..11829,11854..11856,11863..11865,11881..11883,
                     11917..11934,11938..11955,11974..11988)
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="oligomer interface [polypeptide binding]; other
                     site"
                     /db_xref="CDD:409253"
     mat_peptide     12022..12615
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /product="nsp8"
                     /protein_id="YP_009944373.1"
     misc_feature    12022..12612
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="nsp8 replicase; Region: nsp8; pfam08717"
                     /db_xref="CDD:400866"
     mat_peptide     12616..12954
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /product="nsp9"
                     /protein_id="YP_009944374.1"
     misc_feature    12616..12954
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="betacoronavirus non-structural protein 9; Region:
                     betaCoV_Nsp9; cd21898"
                     /db_xref="CDD:409331"
     misc_feature    12616..12633
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="N-finger; other site"
                     /db_xref="CDD:409331"
     misc_feature    order(12622..12627,12634..12639,12832..12837,12901..12906,
                     12910..12918,12922..12930,12934..12939)
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="homodimer interface [polypeptide binding]; other
                     site"
                     /db_xref="CDD:409331"
     mat_peptide     12955..13371
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /product="nsp10"
                     /protein_id="YP_009944375.1"
     misc_feature    12955..13347
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="coronavirus non-structural protein 10; Region:
                     CoV_Nsp10; cd21872"
                     /db_xref="CDD:409325"
     misc_feature    order(12955..12978,12988..12990,12994..13002,13006..13014,
                     13027..13032,13039..13044,13051..13053,13072..13089,
                     13126..13131,13159..13161,13165..13170,13180..13182,
                     13186..13203,13216..13224,13231..13242)
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="Nsp14 interface [polypeptide binding]; other site"
                     /db_xref="CDD:409325"
     misc_feature    order(12994..13002,13006..13014,13027..13029,13072..13074,
                     13078..13089,13126..13134,13186..13197,13204..13206,
                     13237..13242,13297..13299)
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="oligomer interface [polypeptide binding]; other
                     site"
                     /db_xref="CDD:409325"
     misc_feature    order(13027..13029,13078..13089,13126..13134,13204..13206,
                     13237..13242,13297..13299)
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="homotrimer interface [polypeptide binding]; other
                     site"
                     /db_xref="CDD:409325"
     misc_feature    order(13072..13095,13123..13131,13159..13170,13183..13188,
                     13192..13194,13231..13242)
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /note="Nsp16 interface [polypeptide binding]; other site"
                     /db_xref="CDD:409325"
     mat_peptide     13372..13410
                     /gene="ORF1ab"
                     /locus_tag="sarsp1"
                     /product="ndp11"
                     /protein_id="YP_009944376.1"
     gene            21492..25259
                     /gene="S"
                     /locus_tag="sars2"
                     /gene_synonym="E2"
                     /db_xref="GeneID:1489668"
     CDS             21492..25259
                     /gene="S"
                     /locus_tag="sars2"
                     /gene_synonym="E2"
                     /codon_start=1
                     /product="spike glycoprotein"
                     /protein_id="YP_009825051.1"
                     /db_xref="GeneID:1489668"
                     /translation="MFIFLLFLTLTSGSDLDRCTTFDDVQAPNYTQHTSSMRGVYYPD
                     EIFRSDTLYLTQDLFLPFYSNVTGFHTINHTFGNPVIPFKDGIYFAATEKSNVVRGWV
                     FGSTMNNKSQSVIIINNSTNVVIRACNFELCDNPFFAVSKPMGTQTHTMIFDNAFNCT
                     FEYISDAFSLDVSEKSGNFKHLREFVFKNKDGFLYVYKGYQPIDVVRDLPSGFNTLKP
                     IFKLPLGINITNFRAILTAFSPAQDIWGTSAAAYFVGYLKPTTFMLKYDENGTITDAV
                     DCSQNPLAELKCSVKSFEIDKGIYQTSNFRVVPSGDVVRFPNITNLCPFGEVFNATKF
                     PSVYAWERKKISNCVADYSVLYNSTFFSTFKCYGVSATKLNDLCFSNVYADSFVVKGD
                     DVRQIAPGQTGVIADYNYKLPDDFMGCVLAWNTRNIDATSTGNYNYKYRYLRHGKLRP
                     FERDISNVPFSPDGKPCTPPALNCYWPLNDYGFYTTTGIGYQPYRVVVLSFELLNAPA
                     TVCGPKLSTDLIKNQCVNFNFNGLTGTGVLTPSSKRFQPFQQFGRDVSDFTDSVRDPK
                     TSEILDISPCAFGGVSVITPGTNASSEVAVLYQDVNCTDVSTAIHADQLTPAWRIYST
                     GNNVFQTQAGCLIGAEHVDTSYECDIPIGAGICASYHTVSLLRSTSQKSIVAYTMSLG
                     ADSSIAYSNNTIAIPTNFSISITTEVMPVSMAKTSVDCNMYICGDSTECANLLLQYGS
                     FCTQLNRALSGIAAEQDRNTREVFAQVKQMYKTPTLKYFGGFNFSQILPDPLKPTKRS
                     FIEDLLFNKVTLADAGFMKQYGECLGDINARDLICAQKFNGLTVLPPLLTDDMIAAYT
                     AALVSGTATAGWTFGAGAALQIPFAMQMAYRFNGIGVTQNVLYENQKQIANQFNKAIS
                     QIQESLTTTSTALGKLQDVVNQNAQALNTLVKQLSSNFGAISSVLNDILSRLDKVEAE
                     VQIDRLITGRLQSLQTYVTQQLIRAAEIRASANLAATKMSECVLGQSKRVDFCGKGYH
                     LMSFPQAAPHGVVFLHVTYVPSQERNFTTAPAICHEGKAYFPREGVFVFNGTSWFITQ
                     RNFFSPQIITTDNTFVSGNCDVVIGIINNTVYDPLQPELDSFKEELDKYFKNHTSPDV
                     DLGDISGINASVVNIQKEIDRLNEVAKNLNESLIDLQELGKYEQYIKWPWYVWLGFIA
                     GLIAIVMVTILLCCMTSCCSCLKGACSCGSCCKFDEDDSEPVLKGVKLHYT"
     misc_feature    21540..22364
                     /gene="S"
                     /locus_tag="sars2"
                     /gene_synonym="E2"
                     /note="N-terminal domain of the S1 subunit of the Spike
                     (S) protein from Severe acute respiratory syndrome
                     coronavirus and related betacoronaviruses in the B
                     lineage; Region: SARS-CoV-like_Spike_S1_NTD; cd21624"
                     /db_xref="CDD:394950"
     misc_feature    order(21615..21617,21624..21638,21642..21644,21969..21971,
                     22062..22067,22140..22142,22158..22160,22164..22166,
                     22170..22172,22296..22298)
                     /gene="S"
                     /locus_tag="sars2"
                     /gene_synonym="E2"
                     /note="trimer interface [polypeptide binding]; other site"
                     /db_xref="CDD:394950"
     misc_feature    22407..23072
                     /gene="S"
                     /locus_tag="sars2"
                     /gene_synonym="E2"
                     /note="receptor-binding domain of the S1 subunit of severe
                     acute respiratory syndrome-related coronavirus Spike (S)
                     protein; Region: SARS-CoV_Spike_S1_RBD; cd21481"
                     /db_xref="CDD:394828"
     misc_feature    order(22515..22517,22572..22583,22593..22604,22608..22610,
                     22620..22622,22638..22640,22665..22670,22674..22676,
                     22692..22697,22701..22703,22722..22724,22731..22733,
                     22989..22991,22995..23000,23004..23006)
                     /gene="S"
                     /locus_tag="sars2"
                     /gene_synonym="E2"
                     /note="trimer interface [polypeptide binding]; other site"
                     /db_xref="CDD:394828"
     misc_feature    order(22668..22670,22677..22679,22701..22703,22797..22799,
                     22809..22811,22815..22820,22914..22916,22920..22922,
                     22926..22928)
                     /gene="S"
                     /locus_tag="sars2"
                     /gene_synonym="E2"
                     /note="putative receptor binding site [polypeptide
                     binding]; other site"
                     /db_xref="CDD:394828"
     misc_feature    order(22764..22766,22791..22820,22914..22934,22968..22973)
                     /gene="S"
                     /locus_tag="sars2"
                     /gene_synonym="E2"
                     /note="receptor binding motif; other site"
                     /db_xref="CDD:394828"
     misc_feature    23076..25061
                     /gene="S"
                     /locus_tag="sars2"
                     /gene_synonym="E2"
                     /note="SD-1 and SD-2 subdomains, the S1/S2 cleavage
                     region, and the S2 fusion subunit of the spike (S)
                     glycoprotein from SARS-CoV-2 (COVID-19) and related
                     betacoronaviruses in the B lineage; Region:
                     SARS-CoV-like_Spike_SD1-2_S1-S2_S2; cd22378"
                     /db_xref="CDD:411965"
     misc_feature    order(23463..23465,23517..23534,23562..23564)
                     /gene="S"
                     /locus_tag="sars2"
                     /gene_synonym="E2"
                     /note="S1/S2 cleavage region; other site"
                     /db_xref="CDD:411965"
     misc_feature    23829..23855
                     /gene="S"
                     /locus_tag="sars2"
                     /gene_synonym="E2"
                     /note="fusion peptide; other site"
                     /db_xref="CDD:411965"
     misc_feature    23883..23936
                     /gene="S"
                     /locus_tag="sars2"
                     /gene_synonym="E2"
                     /note="internal fusion peptide; other site"
                     /db_xref="CDD:411965"
     misc_feature    24189..24386
                     /gene="S"
                     /locus_tag="sars2"
                     /gene_synonym="E2"
                     /note="heptad repeat 1 [structural motif]; Region: heptad
                     repeat 1"
                     /db_xref="CDD:411965"
     misc_feature    24921..25046
                     /gene="S"
                     /locus_tag="sars2"
                     /gene_synonym="E2"
                     /note="heptad repeat 2 [structural motif]; Region: heptad
                     repeat 2"
                     /db_xref="CDD:411965"
     misc_feature    25134..25256
                     /gene="S"
                     /locus_tag="sars2"
                     /gene_synonym="E2"
                     /note="Coronavirus spike glycoprotein S2, intravirion;
                     Region: CoV_S2_C; pfam19214"
                     /db_xref="CDD:408983"
     gene            25268..26092
                     /gene="ORF3a"
                     /locus_tag="sars3a"
                     /db_xref="GeneID:1489669"
     CDS             25268..26092
                     /gene="ORF3a"
                     /locus_tag="sars3a"
                     /codon_start=1
                     /product="ORF3a protein"
                     /protein_id="YP_009825052.1"
                     /db_xref="GeneID:1489669"
                     /translation="MDLFMRFFTLRSITAQPVKIDNASPASTVHATATIPLQASLPFG
                     WLVIGVAFLAVFQSATKIIALNKRWQLALYKGFQFICNLLLLFVTIYSHLLLVAAGME
                     AQFLYLYALIYFLQCINACRIIMRCWLCWKCKSKNPLLYDANYFVCWHTHNYDYCIPY
                     NSVTDTIVVTEGDGISTPKLKEDYQIGGYSEDRHSGVKDYVVVHGYFTEVYYQLESTQ
                     ITTDTGIENATFFIFNKLVKDPPNVQIHTIDGSSGVANPAMDPIYDEPTTTTSVPL"
     misc_feature    25268..26086
                     /gene="ORF3a"
                     /locus_tag="sars3a"
                     /note="Coronavirus accessory protein 3a; Region:
                     APA3_viroporin; pfam11289"
                     /db_xref="CDD:402744"
     misc_feature    25382..25444
                     /gene="ORF3a"
                     /locus_tag="sars3a"
                     /note="putative TM segment [structural motif]; Region:
                     putative TM segment"
                     /db_xref="CDD:394922"
     misc_feature    25502..25564
                     /gene="ORF3a"
                     /locus_tag="sars3a"
                     /note="putative TM segment [structural motif]; Region:
                     putative TM segment"
                     /db_xref="CDD:394922"
     misc_feature    25580..25642
                     /gene="ORF3a"
                     /locus_tag="sars3a"
                     /note="putative TM segment [structural motif]; Region:
                     putative TM segment"
                     /db_xref="CDD:394922"
     gene            25689..26153
                     /gene="ORF3b"
                     /locus_tag="sars3b"
                     /db_xref="GeneID:1489670"
     CDS             25689..26153
                     /gene="ORF3b"
                     /locus_tag="sars3b"
                     /note="ORF4"
                     /codon_start=1
                     /product="ORF3b protein"
                     /protein_id="YP_009825053.1"
                     /db_xref="GeneID:1489670"
                     /translation="MMPTTLFAGTHITMTTVYHITVSQIQLSLLKVTAFQHQNSKKTT
                     KLVVILRIGTQVLKTMSLYMAISPKFTTSLSLHKLLQTLVLKMLHSSSLTSLLKTHRM
                     CKYTQSTALQELLIQQWIQFMMSRRRLLACLCKHKKVSTNLCTHSFRKKQVR"
     misc_feature    25689..26147
                     /gene="ORF3b"
                     /locus_tag="sars3b"
                     /note="accessory protein ORF3b of severe acute respiratory
                     syndrome-associated coronavirus; Region: SARS-CoV_ORF3b;
                     cl40696"
                     /db_xref="CDD:424327"
     gene            26117..26347
                     /gene="E"
                     /locus_tag="sars4"
                     /db_xref="GeneID:1489671"
     CDS             26117..26347
                     /gene="E"
                     /locus_tag="sars4"
                     /codon_start=1
                     /product="small envelope protein"
                     /protein_id="YP_009825054.1"
                     /db_xref="GeneID:1489671"
                     /translation="MYSFVSEETGTLIVNSVLLFLAFVVFLLVTLAILTALRLCAYCC
                     NIVNVSLVKPTVYVYSRVKNLNSSEGVPDLLV"
     misc_feature    26120..26344
                     /gene="E"
                     /locus_tag="sars4"
                     /note="Severe acute respiratory syndrome coronavirus 2
                     Envelope small membrane protein; Region: SARS-CoV-2_E;
                     cd21536"
                     /db_xref="CDD:394862"
     misc_feature    order(26138..26140,26159..26164,26168..26173,26177..26203,
                     26207..26209,26255..26263,26285..26287,26294..26299,
                     26303..26311)
                     /gene="E"
                     /locus_tag="sars4"
                     /note="homopentameric interface [polypeptide binding];
                     other site"
                     /db_xref="CDD:394862"
     gene            26398..27063
                     /gene="M"
                     /locus_tag="sars5"
                     /db_xref="GeneID:1489672"
     CDS             26398..27063
                     /gene="M"
                     /locus_tag="sars5"
                     /codon_start=1
                     /product="membrane glycoprotein M"
                     /protein_id="YP_009825055.1"
                     /db_xref="GeneID:1489672"
                     /translation="MADNGTITVEELKQLLEQWNLVIGFLFLAWIMLLQFAYSNRNRF
                     LYIIKLVFLWLLWPVTLACFVLAAVYRINWVTGGIAIAMACIVGLMWLSYFVASFRLF
                     ARTRSMWSFNPETNILLNVPLRGTIVTRPLMESELVIGAVIIRGHLRMAGHSLGRCDI
                     KDLPKEITVATSRTLSYYKLGASQRVGTDSGFAAYNRYRIGNYKLNTDHAGSNDNIAL
                     LVQ"
     misc_feature    26407..27060
                     /gene="M"
                     /locus_tag="sars5"
                     /note="Membrane (or Matrix) protein from Severe acute
                     respiratory syndrome (SARS) coronavirus, SARS-CoV-2, and
                     related betacoronaviruses in the B lineage; Region:
                     SARS-like-CoV_M; cd21569"
                     /db_xref="CDD:394855"
     gene            26913..27265
                     /gene="ORF6"
                     /locus_tag="sars6"
                     /db_xref="GeneID:1489673"
     CDS             27074..27265
                     /gene="ORF6"
                     /locus_tag="sars6"
                     /note="ORF7"
                     /codon_start=1
                     /product="ORF6 protein"
                     /protein_id="YP_009825056.1"
                     /db_xref="GeneID:1489673"
                     /translation="MFHLVDFQVTIAEILIIIMRTFRIAIWNLDVIISSIVRQLFKPL
                     TKKNYSELDDEEPMELDYP"
     misc_feature    27074..27259
                     /gene="ORF6"
                     /locus_tag="sars6"
                     /note="Open reading frame 6 from SARS coronavirus; Region:
                     Sars6; pfam12133"
                     /db_xref="CDD:403379"
     gene            27273..27641
                     /gene="ORF7a"
                     /locus_tag="sars7a"
                     /db_xref="GeneID:1489674"
     CDS             27273..27641
                     /gene="ORF7a"
                     /locus_tag="sars7a"
                     /note="ORF8"
                     /codon_start=1
                     /product="ORF7a protein"
                     /protein_id="YP_009825057.1"
                     /db_xref="GeneID:1489674"
                     /translation="MKIILFLTLIVFTSCELYHYQECVRGTTVLLKEPCPSGTYEGNS
                     PFHPLADNKFALTCTSTHFAFACADGTRHTYQLRARSVSPKLFIRQEEVQQELYSPLF
                     LIVAALVFLILCFTIKRKTE"
     misc_feature    27318..27638
                     /gene="ORF7a"
                     /locus_tag="sars7a"
                     /note="SARS coronavirus X4 like; Region: SARS_X4;
                     pfam08779"
                     /db_xref="CDD:400915"
     misc_feature    27318..27344
                     /gene="ORF7a"
                     /locus_tag="sars7a"
                     /note="Ig strand A [structural motif]; Region: Ig strand
                     A"
                     /db_xref="CDD:394934"
     misc_feature    27348..27371
                     /gene="ORF7a"
                     /locus_tag="sars7a"
                     /note="Ig strand B [structural motif]; Region: Ig strand
                     B"
                     /db_xref="CDD:394934"
     misc_feature    27384..27398
                     /gene="ORF7a"
                     /locus_tag="sars7a"
                     /note="Ig strand C [structural motif]; Region: Ig strand
                     C"
                     /db_xref="CDD:394934"
     misc_feature    27411..27422
                     /gene="ORF7a"
                     /locus_tag="sars7a"
                     /note="Ig strand D [structural motif]; Region: Ig strand
                     D"
                     /db_xref="CDD:394934"
     misc_feature    27426..27446
                     /gene="ORF7a"
                     /locus_tag="sars7a"
                     /note="Ig strand E [structural motif]; Region: Ig strand
                     E"
                     /db_xref="CDD:394934"
     misc_feature    27450..27476
                     /gene="ORF7a"
                     /locus_tag="sars7a"
                     /note="Ig strand F [structural motif]; Region: Ig strand
                     F"
                     /db_xref="CDD:394934"
     misc_feature    27480..27512
                     /gene="ORF7a"
                     /locus_tag="sars7a"
                     /note="Ig strand G [structural motif]; Region: Ig strand
                     G"
                     /db_xref="CDD:394934"
     gene            27638..27772
                     /gene="ORF7b"
                     /locus_tag="sars7b"
                     /db_xref="GeneID:1489675"
     CDS             27638..27772
                     /gene="ORF7b"
                     /locus_tag="sars7b"
                     /note="ORF9"
                     /codon_start=1
                     /product="ORF7b protein"
                     /protein_id="YP_009825058.1"
                     /db_xref="GeneID:1489675"
                     /translation="MNELTLIDFYLCFLAFLLFLVLIMLIIFWFSLEIQDLEEPCTKV
                     "
     misc_feature    27638..27769
                     /gene="ORF7b"
                     /locus_tag="sars7b"
                     /note="Severe Acute Respiratory Syndrome coronavirus
                     structural accessory protein ORF7b and related proteins;
                     Region: ORF7b_SARS-CoV-like; cd21635"
                     /db_xref="CDD:394939"
     gene            27779..27898
                     /gene="ORF8a"
                     /locus_tag="sars8a"
                     /db_xref="GeneID:1489676"
     CDS             27779..27898
                     /gene="ORF8a"
                     /locus_tag="sars8a"
                     /note="ORF10"
                     /codon_start=1
                     /product="ORF8a protein"
                     /protein_id="YP_009825059.1"
                     /db_xref="GeneID:1489676"
                     /translation="MKLLIVLTCISLCSCICTVVQRCASNKPHVLEDPCKVQH"
     misc_feature    27782..>27883
                     /gene="ORF8a"
                     /locus_tag="sars8a"
                     /note="SARS-CoV-2 ORF8 immunoglobulin (Ig) domain protein
                     and related proteins; Region: ORF8-Ig_SARS-CoV-2-like;
                     cl40466"
                     /db_xref="CDD:424097"
     misc_feature    27827..27847
                     /gene="ORF8a"
                     /locus_tag="sars8a"
                     /note="Ig strand A [structural motif]; Region: Ig strand
                     A"
                     /db_xref="CDD:394944"
     misc_feature    27860..27883
                     /gene="ORF8a"
                     /locus_tag="sars8a"
                     /note="Ig strand B [structural motif]; Region: Ig strand
                     B"
                     /db_xref="CDD:394944"
     gene            27864..28118
                     /gene="ORF8b"
                     /locus_tag="sars8b"
                     /db_xref="GeneID:1489677"
     CDS             27864..28118
                     /gene="ORF8b"
                     /locus_tag="sars8b"
                     /note="ORF11"
                     /codon_start=1
                     /product="ORF8b protein"
                     /protein_id="YP_009825060.1"
                     /db_xref="GeneID:1489677"
                     /translation="MCLKILVRYNTRGNTYSTAWLCALGKVLPFHRWHTMVQTCTPNV
                     TINCQDPAGGALIARCWYLHEGHQTAAFRDVLVVLNKRTN"
     misc_feature    <27882..28109
                     /gene="ORF8b"
                     /locus_tag="sars8b"
                     /note="SARS-CoV-2 ORF8 immunoglobulin (Ig) domain protein
                     and related proteins; Region: ORF8-Ig_SARS-CoV-2-like;
                     cl40466"
                     /db_xref="CDD:424097"
     misc_feature    27897..27911
                     /gene="ORF8b"
                     /locus_tag="sars8b"
                     /note="Ig strand C' [structural motif]; Region: Ig strand
                     C'"
                     /db_xref="CDD:394944"
     misc_feature    27915..27929
                     /gene="ORF8b"
                     /locus_tag="sars8b"
                     /note="Ig strand C' [structural motif]; Region: Ig strand
                     C'"
                     /db_xref="CDD:394944"
     misc_feature    27942..27956
                     /gene="ORF8b"
                     /locus_tag="sars8b"
                     /note="Ig strand D [structural motif]; Region: Ig strand
                     D"
                     /db_xref="CDD:394944"
     misc_feature    27960..27971
                     /gene="ORF8b"
                     /locus_tag="sars8b"
                     /note="Ig strand E [structural motif]; Region: Ig strand
                     E"
                     /db_xref="CDD:394944"
     misc_feature    27993..28010
                     /gene="ORF8b"
                     /locus_tag="sars8b"
                     /note="Ig strand F [structural motif]; Region: Ig strand
                     F"
                     /db_xref="CDD:394944"
     misc_feature    28023..28043
                     /gene="ORF8b"
                     /locus_tag="sars8b"
                     /note="Ig strand G [structural motif]; Region: Ig strand
                     G"
                     /db_xref="CDD:394944"
     gene            28120..29388
                     /gene="N"
                     /locus_tag="sars9a"
                     /db_xref="GeneID:1489678"
     CDS             28120..29388
                     /gene="N"
                     /locus_tag="sars9a"
                     /codon_start=1
                     /product="nucleocapsid protein"
                     /protein_id="YP_009825061.1"
                     /db_xref="GeneID:1489678"
                     /translation="MSDNGPQSNQRSAPRITFGGPTDSTDNNQNGGRNGARPKQRRPQ
                     GLPNNTASWFTALTQHGKEELRFPRGQGVPINTNSGPDDQIGYYRRATRRVRGGDGKM
                     KELSPRWYFYYLGTGPEASLPYGANKEGIVWVATEGALNTPKDHIGTRNPNNNAATVL
                     QLPQGTTLPKGFYAEGSRGGSQASSRSSSRSRGNSRNSTPGSSRGNSPARMASGGGET
                     ALALLLLDRLNQLESKVSGKGQQQQGQTVTKKSAAEASKKPRQKRTATKQYNVTQAFG
                     RRGPEQTQGNFGDQDLIRQGTDYKHWPQIAQFAPSASAFFGMSRIGMEVTPSGTWLTY
                     HGAIKLDDKDPQFKDNVILLNKHIDAYKTFPPTEPKKDKKKKTDEAQPLPQRQKKQPT
                     VTLLPAADMDDFSRQLQNSMSGASADSTQA"
     misc_feature    28240..29295
                     /gene="N"
                     /locus_tag="sars9a"
                     /note="Coronavirus nucleocapsid protein; Region:
                     Corona_nucleoca; pfam00937"
                     /db_xref="CDD:395751"
     gene            28130..28426
                     /locus_tag="sars9b"
                     /db_xref="GeneID:1489679"
     CDS             28130..28426
                     /locus_tag="sars9b"
                     /note="ORF13"
                     /codon_start=1
                     /product="ORF9b protein"
                     /protein_id="YP_009825062.1"
                     /db_xref="GeneID:1489679"
                     /translation="MDPNQTNVVPPALHLVDPQIQLTITRMEDAMGQGQNSADPKVYP
                     IILRLGSQLSLSMARRNLDSLEARAFQSTPIVVQMTKLATTEELPDEFVVVTAK"
     misc_feature    28130..28423
                     /locus_tag="sars9b"
                     /note="SARS lipid binding protein; Region:
                     SARS_lipid_bind; pfam09399"
                     /db_xref="CDD:401376"
     misc_feature    order(28184..28204,28256..28258,28280..28309,28313..28315,
                     28337..28339,28382..28387,28391..28417,28421..28423)
                     /locus_tag="sars9b"
                     /note="homodimer interface [polypeptide binding]; other
                     site"
                     /db_xref="CDD:409250"
     misc_feature    order(28193..28195,28262..28264,28268..28270,28286..28294,
                     28358..28360,28412..28414)
                     /locus_tag="sars9b"
                     /note="lipid binding cavity [chemical binding];
                     lipid-binding site"
                     /db_xref="CDD:409250"
     CDS             28583..28795
                     /gene="N"
                     /locus_tag="sars9a"
                     /note="ORF14"
                     /codon_start=1
                     /product="ORF9a protein"
                     /protein_id="YP_009825063.1"
                     /db_xref="GeneID:1489678"
                     /translation="MLPPCYNFLKEQHCQKASTQREAEAAVKPLLAPHHVVAVIQEIQ
                     LLAAVGEILLLEWLAEVVKLPSRYCC"
     misc_feature    28583..28792
                     /gene="N"
                     /locus_tag="sars9a"
                     /note="accessory protein ORF9c (also referred to as ORF14)
                     from Severe acute respiratory syndrome-associated
                     coronavirus and related coronaviruses; Region:
                     SARS-CoV_ORF9c; cl38891"
                     /db_xref="CDD:422948"
     3'UTR           28796..29751
ORIGIN      
        1 atattaggtt tttacctacc caggaaaagc caaccaacct cgatctcttg tagatctgtt
       61 ctctaaacga actttaaaat ctgtgtagct gtcgctcggc tgcatgccta gtgcacctac
      121 gcagtataaa caataataaa ttttactgtc gttgacaaga aacgagtaac tcgtccctct
      181 tctgcagact gcttacggtt tcgtccgtgt tgcagtcgat catcagcata cctaggtttc
      241 gtccgggtgt gaccgaaagg taagatggag agccttgttc ttggtgtcaa cgagaaaaca
      301 cacgtccaac tcagtttgcc tgtccttcag gttagagacg tgctagtgcg tggcttcggg
      361 gactctgtgg aagaggccct atcggaggca cgtgaacacc tcaaaaatgg cacttgtggt
      421 ctagtagagc tggaaaaagg cgtactgccc cagcttgaac agccctatgt gttcattaaa
      481 cgttctgatg ccttaagcac caatcacggc cacaaggtcg ttgagctggt tgcagaaatg
      541 gacggcattc agtacggtcg tagcggtata acactgggag tactcgtgcc acatgtgggc
      601 gaaaccccaa ttgcataccg caatgttctt cttcgtaaga acggtaataa gggagccggt
      661 ggtcatagct atggcatcga tctaaagtct tatgacttag gtgacgagct tggcactgat
      721 cccattgaag attatgaaca aaactggaac actaagcatg gcagtggtgc actccgtgaa
      781 ctcactcgtg agctcaatgg aggtgcagtc actcgctatg tcgacaacaa tttctgtggc
      841 ccagatgggt accctcttga ttgcatcaaa gattttctcg cacgcgcggg caagtcaatg
      901 tgcactcttt ccgaacaact tgattacatc gagtcgaaga gaggtgtcta ctgctgccgt
      961 gaccatgagc atgaaattgc ctggttcact gagcgctctg ataagagcta cgagcaccag
     1021 acacccttcg aaattaagag tgccaagaaa tttgacactt tcaaagggga atgcccaaag
     1081 tttgtgtttc ctcttaactc aaaagtcaaa gtcattcaac cacgtgttga aaagaaaaag
     1141 actgagggtt tcatggggcg tatacgctct gtgtaccctg ttgcatctcc acaggagtgt
     1201 aacaatatgc acttgtctac cttgatgaaa tgtaatcatt gcgatgaagt ttcatggcag
     1261 acgtgcgact ttctgaaagc cacttgtgaa cattgtggca ctgaaaattt agttattgaa
     1321 ggacctacta catgtgggta cctacctact aatgctgtag tgaaaatgcc atgtcctgcc
     1381 tgtcaagacc cagagattgg acctgagcat agtgttgcag attatcacaa ccactcaaac
     1441 attgaaactc gactccgcaa gggaggtagg actagatgtt ttggaggctg tgtgtttgcc
     1501 tatgttggct gctataataa gcgtgcctac tgggttcctc gtgctagtgc tgatattggc
     1561 tcaggccata ctggcattac tggtgacaat gtggagacct tgaatgagga tctccttgag
     1621 atactgagtc gtgaacgtgt taacattaac attgttggcg attttcattt gaatgaagag
     1681 gttgccatca ttttggcatc tttctctgct tctacaagtg cctttattga cactataaag
     1741 agtcttgatt acaagtcttt caaaaccatt gttgagtcct gcggtaacta taaagttacc
     1801 aagggaaagc ccgtaaaagg tgcttggaac attggacaac agagatcagt tttaacacca
     1861 ctgtgtggtt ttccctcaca ggctgctggt gttatcagat caatttttgc gcgcacactt
     1921 gatgcagcaa accactcaat tcctgatttg caaagagcag ctgtcaccat acttgatggt
     1981 atttctgaac agtcattacg tcttgtcgac gccatggttt atacttcaga cctgctcacc
     2041 aacagtgtca ttattatggc atatgtaact ggtggtcttg tacaacagac ttctcagtgg
     2101 ttgtctaatc ttttgggcac tactgttgaa aaactcaggc ctatctttga atggattgag
     2161 gcgaaactta gtgcaggagt tgaatttctc aaggatgctt gggagattct caaatttctc
     2221 attacaggtg tttttgacat cgtcaagggt caaatacagg ttgcttcaga taacatcaag
     2281 gattgtgtaa aatgcttcat tgatgttgtt aacaaggcac tcgaaatgtg cattgatcaa
     2341 gtcactatcg ctggcgcaaa gttgcgatca ctcaacttag gtgaagtctt catcgctcaa
     2401 agcaagggac tttaccgtca gtgtatacgt ggcaaggagc agctgcaact actcatgcct
     2461 cttaaggcac caaaagaagt aacctttctt gaaggtgatt cacatgacac agtacttacc
     2521 tctgaggagg ttgttctcaa gaacggtgaa ctcgaagcac tcgagacgcc cgttgatagc
     2581 ttcacaaatg gagctatcgt tggcacacca gtctgtgtaa atggcctcat gctcttagag
     2641 attaaggaca aagaacaata ctgcgcattg tctcctggtt tactggctac aaacaatgtc
     2701 tttcgcttaa aagggggtgc accaattaaa ggtgtaacct ttggagaaga tactgtttgg
     2761 gaagttcaag gttacaagaa tgtgagaatc acatttgagc ttgatgaacg tgttgacaaa
     2821 gtgcttaatg aaaagtgctc tgtctacact gttgaatccg gtaccgaagt tactgagttt
     2881 gcatgtgttg tagcagaggc tgttgtgaag actttacaac cagtttctga tctccttacc
     2941 aacatgggta ttgatcttga tgagtggagt gtagctacat tctacttatt tgatgatgct
     3001 ggtgaagaaa acttttcatc acgtatgtat tgttcctttt accctccaga tgaggaagaa
     3061 gaggacgatg cagagtgtga ggaagaagaa attgatgaaa cctgtgaaca tgagtacggt
     3121 acagaggatg attatcaagg tctccctctg gaatttggtg cctcagctga aacagttcga
     3181 gttgaggaag aagaagagga agactggctg gatgatacta ctgagcaatc agagattgag
     3241 ccagaaccag aacctacacc tgaagaacca gttaatcagt ttactggtta tttaaaactt
     3301 actgacaatg ttgccattaa atgtgttgac atcgttaagg aggcacaaag tgctaatcct
     3361 atggtgattg taaatgctgc taacatacac ctgaaacatg gtggtggtgt agcaggtgca
     3421 ctcaacaagg caaccaatgg tgccatgcaa aaggagagtg atgattacat taagctaaat
     3481 ggccctctta cagtaggagg gtcttgtttg ctttctggac ataatcttgc taagaagtgt
     3541 ctgcatgttg ttggacctaa cctaaatgca ggtgaggaca tccagcttct taaggcagca
     3601 tatgaaaatt tcaattcaca ggacatctta cttgcaccat tgttgtcagc aggcatattt
     3661 ggtgctaaac cacttcagtc tttacaagtg tgcgtgcaga cggttcgtac acaggtttat
     3721 attgcagtca atgacaaagc tctttatgag caggttgtca tggattatct tgataacctg
     3781 aagcctagag tggaagcacc taaacaagag gagccaccaa acacagaaga ttccaaaact
     3841 gaggagaaat ctgtcgtaca gaagcctgtc gatgtgaagc caaaaattaa ggcctgcatt
     3901 gatgaggtta ccacaacact ggaagaaact aagtttctta ccaataagtt actcttgttt
     3961 gctgatatca atggtaagct ttaccatgat tctcagaaca tgcttagagg tgaagatatg
     4021 tctttccttg agaaggatgc accttacatg gtaggtgatg ttatcactag tggtgatatc
     4081 acttgtgttg taataccctc caaaaaggct ggtggcacta ctgagatgct ctcaagagct
     4141 ttgaagaaag tgccagttga tgagtatata accacgtacc ctggacaagg atgtgctggt
     4201 tatacacttg aggaagctaa gactgctctt aagaaatgca aatctgcatt ttatgtacta
     4261 ccttcagaag cacctaatgc taaggaagag attctaggaa ctgtatcctg gaatttgaga
     4321 gaaatgcttg ctcatgctga agagacaaga aaattaatgc ctatatgcat ggatgttaga
     4381 gccataatgg caaccatcca acgtaagtat aaaggaatta aaattcaaga gggcatcgtt
     4441 gactatggtg tccgattctt cttttatact agtaaagagc ctgtagcttc tattattacg
     4501 aagctgaact ctctaaatga gccgcttgtc acaatgccaa ttggttatgt gacacatggt
     4561 tttaatcttg aagaggctgc gcgctgtatg cgttctctta aagctcctgc cgtagtgtca
     4621 gtatcatcac cagatgctgt tactacatat aatggatacc tcacttcgtc atcaaagaca
     4681 tctgaggagc actttgtaga aacagtttct ttggctggct cttacagaga ttggtcctat
     4741 tcaggacagc gtacagagtt aggtgttgaa tttcttaagc gtggtgacaa aattgtgtac
     4801 cacactctgg agagccccgt cgagtttcat cttgacggtg aggttctttc acttgacaaa
     4861 ctaaagagtc tcttatccct gcgggaggtt aagactataa aagtgttcac aactgtggac
     4921 aacactaatc tccacacaca gcttgtggat atgtctatga catatggaca gcagtttggt
     4981 ccaacatact tggatggtgc tgatgttaca aaaattaaac ctcatgtaaa tcatgagggt
     5041 aagactttct ttgtactacc tagtgatgac acactacgta gtgaagcttt cgagtactac
     5101 catactcttg atgagagttt tcttggtagg tacatgtctg ctttaaacca cacaaagaaa
     5161 tggaaatttc ctcaagttgg tggtttaact tcaattaaat gggctgataa caattgttat
     5221 ttgtctagtg ttttattagc acttcaacag cttgaagtca aattcaatgc accagcactt
     5281 caagaggctt attatagagc ccgtgctggt gatgctgcta acttttgtgc actcatactc
     5341 gcttacagta ataaaactgt tggcgagctt ggtgatgtca gagaaactat gacccatctt
     5401 ctacagcatg ctaatttgga atctgcaaag cgagttctta atgtggtgtg taaacattgt
     5461 ggtcagaaaa ctactacctt aacgggtgta gaagctgtga tgtatatggg tactctatct
     5521 tatgataatc ttaagacagg tgtttccatt ccatgtgtgt gtggtcgtga tgctacacaa
     5581 tatctagtac aacaagagtc ttcttttgtt atgatgtctg caccacctgc tgagtataaa
     5641 ttacagcaag gtacattctt atgtgcgaat gagtacactg gtaactatca gtgtggtcat
     5701 tacactcata taactgctaa ggagaccctc tatcgtattg acggagctca ccttacaaag
     5761 atgtcagagt acaaaggacc agtgactgat gttttctaca aggaaacatc ttacactaca
     5821 accatcaagc ctgtgtcgta taaactcgat ggagttactt acacagagat tgaaccaaaa
     5881 ttggatgggt attataaaaa ggataatgct tactatacag agcagcctat agaccttgta
     5941 ccaactcaac cattaccaaa tgcgagtttt gataatttca aactcacatg ttctaacaca
     6001 aaatttgctg atgatttaaa tcaaatgaca ggcttcacaa agccagcttc acgagagcta
     6061 tctgtcacat tcttcccaga cttgaatggc gatgtagtgg ctattgacta tagacactat
     6121 tcagcgagtt tcaagaaagg tgctaaatta ctgcataagc caattgtttg gcacattaac
     6181 caggctacaa ccaagacaac gttcaaacca aacacttggt gtttacgttg tctttggagt
     6241 acaaagccag tagatacttc aaattcattt gaagttctgg cagtagaaga cacacaagga
     6301 atggacaatc ttgcttgtga aagtcaacaa cccacctctg aagaagtagt ggaaaatcct
     6361 accatacaga aggaagtcat agagtgtgac gtgaaaacta ccgaagttgt aggcaatgtc
     6421 atacttaaac catcagatga aggtgttaaa gtaacacaag agttaggtca tgaggatctt
     6481 atggctgctt atgtggaaaa cacaagcatt accattaaga aacctaatga gctttcacta
     6541 gccttaggtt taaaaacaat tgccactcat ggtattgctg caattaatag tgttccttgg
     6601 agtaaaattt tggcttatgt caaaccattc ttaggacaag cagcaattac aacatcaaat
     6661 tgcgctaaga gattagcaca acgtgtgttt aacaattata tgccttatgt gtttacatta
     6721 ttgttccaat tgtgtacttt tactaaaagt accaattcta gaattagagc ttcactacct
     6781 acaactattg ctaaaaatag tgttaagagt gttgctaaat tatgtttgga tgccggcatt
     6841 aattatgtga agtcacccaa attttctaaa ttgttcacaa tcgctatgtg gctattgttg
     6901 ttaagtattt gcttaggttc tctaatctgt gtaactgctg cttttggtgt actcttatct
     6961 aattttggtg ctccttctta ttgtaatggc gttagagaat tgtatcttaa ttcgtctaac
     7021 gttactacta tggatttctg tgaaggttct tttccttgca gcatttgttt aagtggatta
     7081 gactcccttg attcttatcc agctcttgaa accattcagg tgacgatttc atcgtacaag
     7141 ctagacttga caattttagg tctggccgct gagtgggttt tggcatatat gttgttcaca
     7201 aaattctttt atttattagg tctttcagct ataatgcagg tgttctttgg ctattttgct
     7261 agtcatttca tcagcaattc ttggctcatg tggtttatca ttagtattgt acaaatggca
     7321 cccgtttctg caatggttag gatgtacatc ttctttgctt ctttctacta catatggaag
     7381 agctatgttc atatcatgga tggttgcacc tcttcgactt gcatgatgtg ctataagcgc
     7441 aatcgtgcca cacgcgttga gtgtacaact attgttaatg gcatgaagag atctttctat
     7501 gtctatgcaa atggaggccg tggcttctgc aagactcaca attggaattg tctcaattgt
     7561 gacacatttt gcactggtag tacattcatt agtgatgaag ttgctcgtga tttgtcactc
     7621 cagtttaaaa gaccaatcaa ccctactgac cagtcatcgt atattgttga tagtgttgct
     7681 gtgaaaaatg gcgcgcttca cctctacttt gacaaggctg gtcaaaagac ctatgagaga
     7741 catccgctct cccattttgt caatttagac aatttgagag ctaacaacac taaaggttca
     7801 ctgcctatta atgtcatagt ttttgatggc aagtccaaat gcgacgagtc tgcttctaag
     7861 tctgcttctg tgtactacag tcagctgatg tgccaaccta ttctgttgct tgaccaagct
     7921 cttgtatcag acgttggaga tagtactgaa gtttccgtta agatgtttga tgcttatgtc
     7981 gacacctttt cagcaacttt tagtgttcct atggaaaaac ttaaggcact tgttgctaca
     8041 gctcacagcg agttagcaaa gggtgtagct ttagatggtg tcctttctac attcgtgtca
     8101 gctgcccgac aaggtgttgt tgataccgat gttgacacaa aggatgttat tgaatgtctc
     8161 aaactttcac atcactctga cttagaagtg acaggtgaca gttgtaacaa tttcatgctc
     8221 acctataata aggttgaaaa catgacgccc agagatcttg gcgcatgtat tgactgtaat
     8281 gcaaggcata tcaatgccca agtagcaaaa agtcacaatg tttcactcat ctggaatgta
     8341 aaagactaca tgtctttatc tgaacagctg cgtaaacaaa ttcgtagtgc tgccaagaag
     8401 aacaacatac cttttagact aacttgtgct acaactagac aggttgtcaa tgtcataact
     8461 actaaaatct cactcaaggg tggtaagatt gttagtactt gttttaaact tatgcttaag
     8521 gccacattat tgtgcgttct tgctgcattg gtttgttata tcgttatgcc agtacataca
     8581 ttgtcaatcc atgatggtta cacaaatgaa atcattggtt acaaagccat tcaggatggt
     8641 gtcactcgtg acatcatttc tactgatgat tgttttgcaa ataaacatgc tggttttgac
     8701 gcatggttta gccagcgtgg tggttcatac aaaaatgaca aaagctgccc tgtagtagct
     8761 gctatcatta caagagagat tggtttcata gtgcctggct taccgggtac tgtgctgaga
     8821 gcaatcaatg gtgacttctt gcattttcta cctcgtgttt ttagtgctgt tggcaacatt
     8881 tgctacacac cttccaaact cattgagtat agtgattttg ctacctctgc ttgcgttctt
     8941 gctgctgagt gtacaatttt taaggatgct atgggcaaac ctgtgccata ttgttatgac
     9001 actaatttgc tagagggttc tatttcttat agtgagcttc gtccagacac tcgttatgtg
     9061 cttatggatg gttccatcat acagtttcct aacacttacc tggagggttc tgttagagta
     9121 gtaacaactt ttgatgctga gtactgtaga catggtacat gcgaaaggtc agaagtaggt
     9181 atttgcctat ctaccagtgg tagatgggtt cttaataatg agcattacag agctctatca
     9241 ggagttttct gtggtgttga tgcgatgaat ctcatagcta acatctttac tcctcttgtg
     9301 caacctgtgg gtgctttaga tgtgtctgct tcagtagtgg ctggtggtat tattgccata
     9361 ttggtgactt gtgctgccta ctactttatg aaattcagac gtgtttttgg tgagtacaac
     9421 catgttgttg ctgctaatgc acttttgttt ttgatgtctt tcactatact ctgtctggta
     9481 ccagcttaca gctttctgcc gggagtctac tcagtctttt acttgtactt gacattctat
     9541 ttcaccaatg atgtttcatt cttggctcac cttcaatggt ttgccatgtt ttctcctatt
     9601 gtgccttttt ggataacagc aatctatgta ttctgtattt ctctgaagca ctgccattgg
     9661 ttctttaaca actatcttag gaaaagagtc atgtttaatg gagttacatt tagtaccttc
     9721 gaggaggctg ctttgtgtac ctttttgctc aacaaggaaa tgtacctaaa attgcgtagc
     9781 gagacactgt tgccacttac acagtataac aggtatcttg ctctatataa caagtacaag
     9841 tatttcagtg gagccttaga tactaccagc tatcgtgaag cagcttgctg ccacttagca
     9901 aaggctctaa atgactttag caactcaggt gctgatgttc tctaccaacc accacagaca
     9961 tcaatcactt ctgctgttct gcagagtggt tttaggaaaa tggcattccc gtcaggcaaa
    10021 gttgaagggt gcatggtaca agtaacctgt ggaactacaa ctcttaatgg attgtggttg
    10081 gatgacacag tatactgtcc aagacatgtc atttgcacag cagaagacat gcttaatcct
    10141 aactatgaag atctgctcat tcgcaaatcc aaccatagct ttcttgttca ggctggcaat
    10201 gttcaacttc gtgttattgg ccattctatg caaaattgtc tgcttaggct taaagttgat
    10261 acttctaacc ctaagacacc caagtataaa tttgtccgta tccaacctgg tcaaacattt
    10321 tcagttctag catgctacaa tggttcacca tctggtgttt atcagtgtgc catgagacct
    10381 aatcatacca ttaaaggttc tttccttaat ggatcatgtg gtagtgttgg ttttaacatt
    10441 gattatgatt gcgtgtcttt ctgctatatg catcatatgg agcttccaac aggagtacac
    10501 gctggtactg acttagaagg taaattctat ggtccatttg ttgacagaca aactgcacag
    10561 gctgcaggta cagacacaac cataacatta aatgttttgg catggctgta tgctgctgtt
    10621 atcaatggtg ataggtggtt tcttaataga ttcaccacta ctttgaatga ctttaacctt
    10681 gtggcaatga agtacaacta tgaacctttg acacaagatc atgttgacat attgggacct
    10741 ctttctgctc aaacaggaat tgccgtctta gatatgtgtg ctgctttgaa agagctgctg
    10801 cagaatggta tgaatggtcg tactatcctt ggtagcacta ttttagaaga tgagtttaca
    10861 ccatttgatg ttgttagaca atgctctggt gttaccttcc aaggtaagtt caagaaaatt
    10921 gttaagggca ctcatcattg gatgctttta actttcttga catcactatt gattcttgtt
    10981 caaagtacac agtggtcact gtttttcttt gtttacgaga atgctttctt gccatttact
    11041 cttggtatta tggcaattgc tgcatgtgct atgctgcttg ttaagcataa gcacgcattc
    11101 ttgtgcttgt ttctgttacc ttctcttgca acagttgctt actttaatat ggtctacatg
    11161 cctgctagct gggtgatgcg tatcatgaca tggcttgaat tggctgacac tagcttgtct
    11221 ggttataggc ttaaggattg tgttatgtat gcttcagctt tagttttgct tattctcatg
    11281 acagctcgca ctgtttatga tgatgctgct agacgtgttt ggacactgat gaatgtcatt
    11341 acacttgttt acaaagtcta ctatggtaat gctttagatc aagctatttc catgtgggcc
    11401 ttagttattt ctgtaacctc taactattct ggtgtcgtta cgactatcat gtttttagct
    11461 agagctatag tgtttgtgtg tgttgagtat tacccattgt tatttattac tggcaacacc
    11521 ttacagtgta tcatgcttgt ttattgtttc ttaggctatt gttgctgctg ctactttggc
    11581 cttttctgtt tactcaaccg ttacttcagg cttactcttg gtgtttatga ctacttggtc
    11641 tctacacaag aatttaggta tatgaactcc caggggcttt tgcctcctaa gagtagtatt
    11701 gatgctttca agcttaacat taagttgttg ggtattggag gtaaaccatg tatcaaggtt
    11761 gctactgtac agtctaaaat gtctgacgta aagtgcacat ctgtggtact gctctcggtt
    11821 cttcaacaac ttagagtaga gtcatcttct aaattgtggg cacaatgtgt acaactccac
    11881 aatgatattc ttcttgcaaa agacacaact gaagctttcg agaagatggt ttctcttttg
    11941 tctgttttgc tatccatgca gggtgctgta gacattaata ggttgtgcga ggaaatgctc
    12001 gataaccgtg ctactcttca ggctattgct tcagaattta gttctttacc atcatatgcc
    12061 gcttatgcca ctgcccagga ggcctatgag caggctgtag ctaatggtga ttctgaagtc
    12121 gttctcaaaa agttaaagaa atctttgaat gtggctaaat ctgagtttga ccgtgatgct
    12181 gccatgcaac gcaagttgga aaagatggca gatcaggcta tgacccaaat gtacaaacag
    12241 gcaagatctg aggacaagag ggcaaaagta actagtgcta tgcaaacaat gctcttcact
    12301 atgcttagga agcttgataa tgatgcactt aacaacatta tcaacaatgc gcgtgatggt
    12361 tgtgttccac tcaacatcat accattgact acagcagcca aactcatggt tgttgtccct
    12421 gattatggta cctacaagaa cacttgtgat ggtaacacct ttacatatgc atctgcactc
    12481 tgggaaatcc agcaagttgt tgatgcggat agcaagattg ttcaacttag tgaaattaac
    12541 atggacaatt caccaaattt ggcttggcct cttattgtta cagctctaag agccaactca
    12601 gctgttaaac tacagaataa tgaactgagt ccagtagcac tacgacagat gtcctgtgcg
    12661 gctggtacca cacaaacagc ttgtactgat gacaatgcac ttgcctacta taacaattcg
    12721 aagggaggta ggtttgtgct ggcattacta tcagaccacc aagatctcaa atgggctaga
    12781 ttccctaaga gtgatggtac aggtacaatt tacacagaac tggaaccacc ttgtaggttt
    12841 gttacagaca caccaaaagg gcctaaagtg aaatacttgt acttcatcaa aggcttaaac
    12901 aacctaaata gaggtatggt gctgggcagt ttagctgcta cagtacgtct tcaggctgga
    12961 aatgctacag aagtacctgc caattcaact gtgctttcct tctgtgcttt tgcagtagac
    13021 cctgctaaag catataagga ttacctagca agtggaggac aaccaatcac caactgtgtg
    13081 aagatgttgt gtacacacac tggtacagga caggcaatta ctgtaacacc agaagctaac
    13141 atggaccaag agtcctttgg tggtgcttca tgttgtctgt attgtagatg ccacattgac
    13201 catccaaatc ctaaaggatt ctgtgacttg aaaggtaagt acgtccaaat acctaccact
    13261 tgtgctaatg acccagtggg ttttacactt agaaacacag tctgtaccgt ctgcggaatg
    13321 tggaaaggtt atggctgtag ttgtgaccaa ctccgcgaac ccttgatgca gtctgcggat
    13381 gcatcaacgt ttttaaacgg gtttgcggtg taagtgcagc ccgtcttaca ccgtgcggca
    13441 caggcactag tactgatgtc gtctacaggg cttttgatat ttacaacgaa aaagttgctg
    13501 gttttgcaaa gttcctaaaa actaattgct gtcgcttcca ggagaaggat gaggaaggca
    13561 atttattaga ctcttacttt gtagttaaga ggcatactat gtctaactac caacatgaag
    13621 agactattta taacttggtt aaagattgtc cagcggttgc tgtccatgac tttttcaagt
    13681 ttagagtaga tggtgacatg gtaccacata tatcacgtca gcgtctaact aaatacacaa
    13741 tggctgattt agtctatgct ctacgtcatt ttgatgaggg taattgtgat acattaaaag
    13801 aaatactcgt cacatacaat tgctgtgatg atgattattt caataagaag gattggtatg
    13861 acttcgtaga gaatcctgac atcttacgcg tatatgctaa cttaggtgag cgtgtacgcc
    13921 aatcattatt aaagactgta caattctgcg atgctatgcg tgatgcaggc attgtaggcg
    13981 tactgacatt agataatcag gatcttaatg ggaactggta cgatttcggt gatttcgtac
    14041 aagtagcacc aggctgcgga gttcctattg tggattcata ttactcattg ctgatgccca
    14101 tcctcacttt gactagggca ttggctgctg agtcccatat ggatgctgat ctcgcaaaac
    14161 cacttattaa gtgggatttg ctgaaatatg attttacgga agagagactt tgtctcttcg
    14221 accgttattt taaatattgg gaccagacat accatcccaa ttgtattaac tgtttggatg
    14281 ataggtgtat ccttcattgt gcaaacttta atgtgttatt ttctactgtg tttccaccta
    14341 caagttttgg accactagta agaaaaatat ttgtagatgg tgttcctttt gttgtttcaa
    14401 ctggatacca ttttcgtgag ttaggagtcg tacataatca ggatgtaaac ttacatagct
    14461 cgcgtctcag tttcaaggaa cttttagtgt atgctgctga tccagctatg catgcagctt
    14521 ctggcaattt attgctagat aaacgcacta catgcttttc agtagctgca ctaacaaaca
    14581 atgttgcttt tcaaactgtc aaacccggta attttaataa agacttttat gactttgctg
    14641 tgtctaaagg tttctttaag gaaggaagtt ctgttgaact aaaacacttc ttctttgctc
    14701 aggatggcaa cgctgctatc agtgattatg actattatcg ttataatctg ccaacaatgt
    14761 gtgatatcag acaactccta ttcgtagttg aagttgttga taaatacttt gattgttacg
    14821 atggtggctg tattaatgcc aaccaagtaa tcgttaacaa tctggataaa tcagctggtt
    14881 tcccatttaa taaatggggt aaggctagac tttattatga ctcaatgagt tatgaggatc
    14941 aagatgcact tttcgcgtat actaagcgta atgtcatccc tactataact caaatgaatc
    15001 ttaagtatgc cattagtgca aagaatagag ctcgcaccgt agctggtgtc tctatctgta
    15061 gtactatgac aaatagacag tttcatcaga aattattgaa gtcaatagcc gccactagag
    15121 gagctactgt ggtaattgga acaagcaagt tttacggtgg ctggcataat atgttaaaaa
    15181 ctgtttacag tgatgtagaa actccacacc ttatgggttg ggattatcca aaatgtgaca
    15241 gagccatgcc taacatgctt aggataatgg cctctcttgt tcttgctcgc aaacataaca
    15301 cttgctgtaa cttatcacac cgtttctaca ggttagctaa cgagtgtgcg caagtattaa
    15361 gtgagatggt catgtgtggc ggctcactat atgttaaacc aggtggaaca tcatccggtg
    15421 atgctacaac tgcttatgct aatagtgtct ttaacatttg tcaagctgtt acagccaatg
    15481 taaatgcact tctttcaact gatggtaata agatagctga caagtatgtc cgcaatctac
    15541 aacacaggct ctatgagtgt ctctatagaa atagggatgt tgatcatgaa ttcgtggatg
    15601 agttttacgc ttacctgcgt aaacatttct ccatgatgat tctttctgat gatgccgttg
    15661 tgtgctataa cagtaactat gcggctcaag gtttagtagc tagcattaag aactttaagg
    15721 cagttcttta ttatcaaaat aatgtgttca tgtctgaggc aaaatgttgg actgagactg
    15781 accttactaa aggacctcac gaattttgct cacagcatac aatgctagtt aaacaaggag
    15841 atgattacgt gtacctgcct tacccagatc catcaagaat attaggcgca ggctgttttg
    15901 tcgatgatat tgtcaaaaca gatggtacac ttatgattga aaggttcgtg tcactggcta
    15961 ttgatgctta cccacttaca aaacatccta atcaggagta tgctgatgtc tttcacttgt
    16021 atttacaata cattagaaag ttacatgatg agcttactgg ccacatgttg gacatgtatt
    16081 ccgtaatgct aactaatgat aacacctcac ggtactggga acctgagttt tatgaggcta
    16141 tgtacacacc acatacagtc ttgcaggctg taggtgcttg tgtattgtgc aattcacaga
    16201 cttcacttcg ttgcggtgcc tgtattagga gaccattcct atgttgcaag tgctgctatg
    16261 accatgtcat ttcaacatca cacaaattag tgttgtctgt taatccctat gtttgcaatg
    16321 ccccaggttg tgatgtcact gatgtgacac aactgtatct aggaggtatg agctattatt
    16381 gcaagtcaca taagcctccc attagttttc cattatgtgc taatggtcag gtttttggtt
    16441 tatacaaaaa cacatgtgta ggcagtgaca atgtcactga cttcaatgcg atagcaacat
    16501 gtgattggac taatgctggc gattacatac ttgccaacac ttgtactgag agactcaagc
    16561 ttttcgcagc agaaacgctc aaagccactg aggaaacatt taagctgtca tatggtattg
    16621 ccactgtacg cgaagtactc tctgacagag aattgcatct ttcatgggag gttggaaaac
    16681 ctagaccacc attgaacaga aactatgtct ttactggtta ccgtgtaact aaaaatagta
    16741 aagtacagat tggagagtac acctttgaaa aaggtgacta tggtgatgct gttgtgtaca
    16801 gaggtactac gacatacaag ttgaatgttg gtgattactt tgtgttgaca tctcacactg
    16861 taatgccact tagtgcacct actctagtgc cacaagagca ctatgtgaga attactggct
    16921 tgtacccaac actcaacatc tcagatgagt tttctagcaa tgttgcaaat tatcaaaagg
    16981 tcggcatgca aaagtactct acactccaag gaccacctgg tactggtaag agtcattttg
    17041 ccatcggact tgctctctat tacccatctg ctcgcatagt gtatacggca tgctctcatg
    17101 cagctgttga tgccctatgt gaaaaggcat taaaatattt gcccatagat aaatgtagta
    17161 gaatcatacc tgcgcgtgcg cgcgtagagt gttttgataa attcaaagtg aattcaacac
    17221 tagaacagta tgttttctgc actgtaaatg cattgccaga aacaactgct gacattgtag
    17281 tctttgatga aatctctatg gctactaatt atgacttgag tgttgtcaat gctagacttc
    17341 gtgcaaaaca ctacgtctat attggcgatc ctgctcaatt accagccccc cgcacattgc
    17401 tgactaaagg cacactagaa ccagaatatt ttaattcagt gtgcagactt atgaaaacaa
    17461 taggtccaga catgttcctt ggaacttgtc gccgttgtcc tgctgaaatt gttgacactg
    17521 tgagtgcttt agtttatgac aataagctaa aagcacacaa ggataagtca gctcaatgct
    17581 tcaaaatgtt ctacaaaggt gttattacac atgatgtttc atctgcaatc aacagacctc
    17641 aaataggcgt tgtaagagaa tttcttacac gcaatcctgc ttggagaaaa gctgttttta
    17701 tctcacctta taattcacag aacgctgtag cttcaaaaat cttaggattg cctacgcaga
    17761 ctgttgattc atcacagggt tctgaatatg actatgtcat attcacacaa actactgaaa
    17821 cagcacactc ttgtaatgtc aaccgcttca atgtggctat cacaagggca aaaattggca
    17881 ttttgtgcat aatgtctgat agagatcttt atgacaaact gcaatttaca agtctagaaa
    17941 taccacgtcg caatgtggct acattacaag cagaaaatgt aactggactt tttaaggact
    18001 gtagtaagat cattactggt cttcatccta cacaggcacc tacacacctc agcgttgata
    18061 taaagttcaa gactgaagga ttatgtgttg acataccagg cataccaaag gacatgacct
    18121 accgtagact catctctatg atgggtttca aaatgaatta ccaagtcaat ggttacccta
    18181 atatgtttat cacccgcgaa gaagctattc gtcacgttcg tgcgtggatt ggctttgatg
    18241 tagagggctg tcatgcaact agagatgctg tgggtactaa cctacctctc cagctaggat
    18301 tttctacagg tgttaactta gtagctgtac cgactggtta tgttgacact gaaaataaca
    18361 cagaattcac cagagttaat gcaaaacctc caccaggtga ccagtttaaa catcttatac
    18421 cactcatgta taaaggcttg ccctggaatg tagtgcgtat taagatagta caaatgctca
    18481 gtgatacact gaaaggattg tcagacagag tcgtgttcgt cctttgggcg catggctttg
    18541 agcttacatc aatgaagtac tttgtcaaga ttggacctga aagaacgtgt tgtctgtgtg
    18601 acaaacgtgc aacttgcttt tctacttcat cagatactta tgcctgctgg aatcattctg
    18661 tgggttttga ctatgtctat aacccattta tgattgatgt tcagcagtgg ggctttacgg
    18721 gtaaccttca gagtaaccat gaccaacatt gccaggtaca tggaaatgca catgtggcta
    18781 gttgtgatgc tatcatgact agatgtttag cagtccatga gtgctttgtt aagcgcgttg
    18841 attggtctgt tgaataccct attataggag atgaactgag ggttaattct gcttgcagaa
    18901 aagtacaaca catggttgtg aagtctgcat tgcttgctga taagtttcca gttcttcatg
    18961 acattggaaa tccaaaggct atcaagtgtg tgcctcaggc tgaagtagaa tggaagttct
    19021 acgatgctca gccatgtagt gacaaagctt acaaaataga ggaactcttc tattcttatg
    19081 ctacacatca cgataaattc actgatggtg tttgtttgtt ttggaattgt aacgttgatc
    19141 gttacccagc caatgcaatt gtgtgtaggt ttgacacaag agtcttgtca aacttgaact
    19201 taccaggctg tgatggtggt agtttgtatg tgaataagca tgcattccac actccagctt
    19261 tcgataaaag tgcatttact aatttaaagc aattgccttt cttttactat tctgatagtc
    19321 cttgtgagtc tcatggcaaa caagtagtgt cggatattga ttatgttcca ctcaaatctg
    19381 ctacgtgtat tacacgatgc aatttaggtg gtgctgtttg cagacaccat gcaaatgagt
    19441 accgacagta cttggatgca tataatatga tgatttctgc tggatttagc ctatggattt
    19501 acaaacaatt tgatacttat aacctgtgga atacatttac caggttacag agtttagaaa
    19561 atgtggctta taatgttgtt aataaaggac actttgatgg acacgccggc gaagcacctg
    19621 tttccatcat taataatgct gtttacacaa aggtagatgg tattgatgtg gagatctttg
    19681 aaaataagac aacacttcct gttaatgttg catttgagct ttgggctaag cgtaacatta
    19741 aaccagtgcc agagattaag atactcaata atttgggtgt tgatatcgct gctaatactg
    19801 taatctggga ctacaaaaga gaagccccag cacatgtatc tacaataggt gtctgcacaa
    19861 tgactgacat tgccaagaaa cctactgaga gtgcttgttc ttcacttact gtcttgtttg
    19921 atggtagagt ggaaggacag gtagaccttt ttagaaacgc ccgtaatggt gttttaataa
    19981 cagaaggttc agtcaaaggt ctaacacctt caaagggacc agcacaagct agcgtcaatg
    20041 gagtcacatt aattggagaa tcagtaaaaa cacagtttaa ctactttaag aaagtagacg
    20101 gcattattca acagttgcct gaaacctact ttactcagag cagagactta gaggatttta
    20161 agcccagatc acaaatggaa actgactttc tcgagctcgc tatggatgaa ttcatacagc
    20221 gatataagct cgagggctat gccttcgaac acatcgttta tggagatttc agtcatggac
    20281 aacttggcgg tcttcattta atgataggct tagccaagcg ctcacaagat tcaccactta
    20341 aattagagga ttttatccct atggacagca cagtgaaaaa ttacttcata acagatgcgc
    20401 aaacaggttc atcaaaatgt gtgtgttctg tgattgatct tttacttgat gactttgtcg
    20461 agataataaa gtcacaagat ttgtcagtga tttcaaaagt ggtcaaggtt acaattgact
    20521 atgctgaaat ttcattcatg ctttggtgta aggatggaca tgttgaaacc ttctacccaa
    20581 aactacaagc aagtcaagcg tggcaaccag gtgttgcgat gcctaacttg tacaagatgc
    20641 aaagaatgct tcttgaaaag tgtgaccttc agaattatgg tgaaaatgct gttataccaa
    20701 aaggaataat gatgaatgtc gcaaagtata ctcaactgtg tcaatactta aatacactta
    20761 ctttagctgt accctacaac atgagagtta ttcactttgg tgctggctct gataaaggag
    20821 ttgcaccagg tacagctgtg ctcagacaat ggttgccaac tggcacacta cttgtcgatt
    20881 cagatcttaa tgacttcgtc tccgacgcag attctacttt aattggagac tgtgcaacag
    20941 tacatacggc taataaatgg gaccttatta ttagcgatat gtatgaccct aggaccaaac
    21001 atgtgacaaa agagaatgac tctaaagaag ggtttttcac ttatctgtgt ggatttataa
    21061 agcaaaaact agccctgggt ggttctatag ctgtaaagat aacagagcat tcttggaatg
    21121 ctgaccttta caagcttatg ggccatttct catggtggac agcttttgtt acaaatgtaa
    21181 atgcatcatc atcggaagca tttttaattg gggctaacta tcttggcaag ccgaaggaac
    21241 aaattgatgg ctataccatg catgctaact acattttctg gaggaacaca aatcctatcc
    21301 agttgtcttc ctattcactc tttgacatga gcaaatttcc tcttaaatta agaggaactg
    21361 ctgtaatgtc tcttaaggag aatcaaatca atgatatgat ttattctctt ctggaaaaag
    21421 gtaggcttat cattagagaa aacaacagag ttgtggtttc aagtgatatt cttgttaaca
    21481 actaaacgaa catgtttatt ttcttattat ttcttactct cactagtggt agtgaccttg
    21541 accggtgcac cacttttgat gatgttcaag ctcctaatta cactcaacat acttcatcta
    21601 tgaggggggt ttactatcct gatgaaattt ttagatcaga cactctttat ttaactcagg
    21661 atttatttct tccattttat tctaatgtta cagggtttca tactattaat catacgtttg
    21721 gcaaccctgt catacctttt aaggatggta tttattttgc tgccacagag aaatcaaatg
    21781 ttgtccgtgg ttgggttttt ggttctacca tgaacaacaa gtcacagtcg gtgattatta
    21841 ttaacaattc tactaatgtt gttatacgag catgtaactt tgaattgtgt gacaaccctt
    21901 tctttgctgt ttctaaaccc atgggtacac agacacatac tatgatattc gataatgcat
    21961 ttaattgcac tttcgagtac atatctgatg ccttttcgct tgatgtttca gaaaagtcag
    22021 gtaattttaa acacttacga gagtttgtgt ttaaaaataa agatgggttt ctctatgttt
    22081 ataagggcta tcaacctata gatgtagttc gtgatctacc ttctggtttt aacactttga
    22141 aacctatttt taagttgcct cttggtatta acattacaaa ttttagagcc attcttacag
    22201 ccttttcacc tgctcaagac atttggggca cgtcagctgc agcctatttt gttggctatt
    22261 taaagccaac tacatttatg ctcaagtatg atgaaaatgg tacaatcaca gatgctgttg
    22321 attgttctca aaatccactt gctgaactca aatgctctgt taagagcttt gagattgaca
    22381 aaggaattta ccagacctct aatttcaggg ttgttccctc aggagatgtt gtgagattcc
    22441 ctaatattac aaacttgtgt ccttttggag aggtttttaa tgctactaaa ttcccttctg
    22501 tctatgcatg ggagagaaaa aaaatttcta attgtgttgc tgattactct gtgctctaca
    22561 actcaacatt tttttcaacc tttaagtgct atggcgtttc tgccactaag ttgaatgatc
    22621 tttgcttctc caatgtctat gcagattctt ttgtagtcaa gggagatgat gtaagacaaa
    22681 tagcgccagg acaaactggt gttattgctg attataatta taaattgcca gatgatttca
    22741 tgggttgtgt ccttgcttgg aatactagga acattgatgc tacttcaact ggtaattata
    22801 attataaata taggtatctt agacatggca agcttaggcc ctttgagaga gacatatcta
    22861 atgtgccttt ctcccctgat ggcaaacctt gcaccccacc tgctcttaat tgttattggc
    22921 cattaaatga ttatggtttt tacaccacta ctggcattgg ctaccaacct tacagagttg
    22981 tagtactttc ttttgaactt ttaaatgcac cggccacggt ttgtggacca aaattatcca
    23041 ctgaccttat taagaaccag tgtgtcaatt ttaattttaa tggactcact ggtactggtg
    23101 tgttaactcc ttcttcaaag agatttcaac catttcaaca atttggccgt gatgtttctg
    23161 atttcactga ttccgttcga gatcctaaaa catctgaaat attagacatt tcaccttgcg
    23221 cttttggggg tgtaagtgta attacacctg gaacaaatgc ttcatctgaa gttgctgttc
    23281 tatatcaaga tgttaactgc actgatgttt ctacagcaat tcatgcagat caactcacac
    23341 cagcttggcg catatattct actggaaaca atgtattcca gactcaagca ggctgtctta
    23401 taggagctga gcatgtcgac acttcttatg agtgcgacat tcctattgga gctggcattt
    23461 gtgctagtta ccatacagtt tctttattac gtagtactag ccaaaaatct attgtggctt
    23521 atactatgtc tttaggtgct gatagttcaa ttgcttactc taataacacc attgctatac
    23581 ctactaactt ttcaattagc attactacag aagtaatgcc tgtttctatg gctaaaacct
    23641 ccgtagattg taatatgtac atctgcggag attctactga atgtgctaat ttgcttctcc
    23701 aatatggtag cttttgcaca caactaaatc gtgcactctc aggtattgct gctgaacagg
    23761 atcgcaacac acgtgaagtg ttcgctcaag tcaaacaaat gtacaaaacc ccaactttga
    23821 aatattttgg tggttttaat ttttcacaaa tattacctga ccctctaaag ccaactaaga
    23881 ggtcttttat tgaggacttg ctctttaata aggtgacact cgctgatgct ggcttcatga
    23941 agcaatatgg cgaatgccta ggtgatatta atgctagaga tctcatttgt gcgcagaagt
    24001 tcaatggact tacagtgttg ccacctctgc tcactgatga tatgattgct gcctacactg
    24061 ctgctctagt tagtggtact gccactgctg gatggacatt tggtgctggc gctgctcttc
    24121 aaataccttt tgctatgcaa atggcatata ggttcaatgg cattggagtt acccaaaatg
    24181 ttctctatga gaaccaaaaa caaatcgcca accaatttaa caaggcgatt agtcaaattc
    24241 aagaatcact tacaacaaca tcaactgcat tgggcaagct gcaagacgtt gttaaccaga
    24301 atgctcaagc attaaacaca cttgttaaac aacttagctc taattttggt gcaatttcaa
    24361 gtgtgctaaa tgatatcctt tcgcgacttg ataaagtcga ggcggaggta caaattgaca
    24421 ggttaattac aggcagactt caaagccttc aaacctatgt aacacaacaa ctaatcaggg
    24481 ctgctgaaat cagggcttct gctaatcttg ctgctactaa aatgtctgag tgtgttcttg
    24541 gacaatcaaa aagagttgac ttttgtggaa agggctacca ccttatgtcc ttcccacaag
    24601 cagccccgca tggtgttgtc ttcctacatg tcacgtatgt gccatcccag gagaggaact
    24661 tcaccacagc gccagcaatt tgtcatgaag gcaaagcata cttccctcgt gaaggtgttt
    24721 ttgtgtttaa tggcacttct tggtttatta cacagaggaa cttcttttct ccacaaataa
    24781 ttactacaga caatacattt gtctcaggaa attgtgatgt cgttattggc atcattaaca
    24841 acacagttta tgatcctctg caacctgagc ttgactcatt caaagaagag ctggacaagt
    24901 acttcaaaaa tcatacatca ccagatgttg atcttggcga catttcaggc attaacgctt
    24961 ctgtcgtcaa cattcaaaaa gaaattgacc gcctcaatga ggtcgctaaa aatttaaatg
    25021 aatcactcat tgaccttcaa gaattgggaa aatatgagca atatattaaa tggccttggt
    25081 atgtttggct cggcttcatt gctggactaa ttgccatcgt catggttaca atcttgcttt
    25141 gttgcatgac tagttgttgc agttgcctca agggtgcatg ctcttgtggt tcttgctgca
    25201 agtttgatga ggatgactct gagccagttc tcaagggtgt caaattacat tacacataaa
    25261 cgaacttatg gatttgttta tgagattttt tactcttaga tcaattactg cacagccagt
    25321 aaaaattgac aatgcttctc ctgcaagtac tgttcatgct acagcaacga taccgctaca
    25381 agcctcactc cctttcggat ggcttgttat tggcgttgca tttcttgctg tttttcagag
    25441 cgctaccaaa ataattgcgc tcaataaaag atggcagcta gccctttata agggcttcca
    25501 gttcatttgc aatttactgc tgctatttgt taccatctat tcacatcttt tgcttgtcgc
    25561 tgcaggtatg gaggcgcaat ttttgtacct ctatgccttg atatattttc tacaatgcat
    25621 caacgcatgt agaattatta tgagatgttg gctttgttgg aagtgcaaat ccaagaaccc
    25681 attactttat gatgccaact actttgtttg ctggcacaca cataactatg actactgtat
    25741 accatataac agtgtcacag atacaattgt cgttactgaa ggtgacggca tttcaacacc
    25801 aaaactcaaa gaagactacc aaattggtgg ttattctgag gataggcact caggtgttaa
    25861 agactatgtc gttgtacatg gctatttcac cgaagtttac taccagcttg agtctacaca
    25921 aattactaca gacactggta ttgaaaatgc tacattcttc atctttaaca agcttgttaa
    25981 agacccaccg aatgtgcaaa tacacacaat cgacggctct tcaggagttg ctaatccagc
    26041 aatggatcca atttatgatg agccgacgac gactactagc gtgcctttgt aagcacaaga
    26101 aagtgagtac gaacttatgt actcattcgt ttcggaagaa acaggtacgt taatagttaa
    26161 tagcgtactt ctttttcttg ctttcgtggt attcttgcta gtcacactag ccatccttac
    26221 tgcgcttcga ttgtgtgcgt actgctgcaa tattgttaac gtgagtttag taaaaccaac
    26281 ggtttacgtc tactcgcgtg ttaaaaatct gaactcttct gaaggagttc ctgatcttct
    26341 ggtctaaacg aactaactat tattattatt ctgtttggaa ctttaacatt gcttatcatg
    26401 gcagacaacg gtactattac cgttgaggag cttaaacaac tcctggaaca atggaaccta
    26461 gtaataggtt tcctattcct agcctggatt atgttactac aatttgccta ttctaatcgg
    26521 aacaggtttt tgtacataat aaagcttgtt ttcctctggc tcttgtggcc agtaacactt
    26581 gcttgttttg tgcttgctgc tgtctacaga attaattggg tgactggcgg gattgcgatt
    26641 gcaatggctt gtattgtagg cttgatgtgg cttagctact tcgttgcttc cttcaggctg
    26701 tttgctcgta cccgctcaat gtggtcattc aacccagaaa caaacattct tctcaatgtg
    26761 cctctccggg ggacaattgt gaccagaccg ctcatggaaa gtgaacttgt cattggtgct
    26821 gtgatcattc gtggtcactt gcgaatggcc ggacactccc tagggcgctg tgacattaag
    26881 gacctgccaa aagagatcac tgtggctaca tcacgaacgc tttcttatta caaattagga
    26941 gcgtcgcagc gtgtaggcac tgattcaggt tttgctgcat acaaccgcta ccgtattgga
    27001 aactataaat taaatacaga ccacgccggt agcaacgaca atattgcttt gctagtacag
    27061 taagtgacaa cagatgtttc atcttgttga cttccaggtt acaatagcag agatattgat
    27121 tatcattatg aggactttca ggattgctat ttggaatctt gacgttataa taagttcaat
    27181 agtgagacaa ttatttaagc ctctaactaa gaagaattat tcggagttag atgatgaaga
    27241 acctatggag ttagattatc cataaaacga acatgaaaat tattctcttc ctgacattga
    27301 ttgtatttac atcttgcgag ctatatcact atcaggagtg tgttagaggt acgactgtac
    27361 tactaaaaga accttgccca tcaggaacat acgagggcaa ttcaccattt caccctcttg
    27421 ctgacaataa atttgcacta acttgcacta gcacacactt tgcttttgct tgtgctgacg
    27481 gtactcgaca tacctatcag ctgcgtgcaa gatcagtttc accaaaactt ttcatcagac
    27541 aagaggaggt tcaacaagag ctctactcgc cactttttct cattgttgct gctctagtat
    27601 ttttaatact ttgcttcacc attaagagaa agacagaatg aatgagctca ctttaattga
    27661 cttctatttg tgctttttag cctttctgct attccttgtt ttaataatgc ttattatatt
    27721 ttggttttca ctcgaaatcc aggatctaga agaaccttgt accaaagtct aaacgaacat
    27781 gaaacttctc attgttttga cttgtatttc tctatgcagt tgcatatgca ctgtagtaca
    27841 gcgctgtgca tctaataaac ctcatgtgct tgaagatcct tgtaaggtac aacactaggg
    27901 gtaatactta tagcactgct tggctttgtg ctctaggaaa ggttttacct tttcatagat
    27961 ggcacactat ggttcaaaca tgcacaccta atgttactat caactgtcaa gatccagctg
    28021 gtggtgcgct tatagctagg tgttggtacc ttcatgaagg tcaccaaact gctgcattta
    28081 gagacgtact tgttgtttta aataaacgaa caaattaaaa tgtctgataa tggaccccaa
    28141 tcaaaccaac gtagtgcccc ccgcattaca tttggtggac ccacagattc aactgacaat
    28201 aaccagaatg gaggacgcaa tggggcaagg ccaaaacagc gccgacccca aggtttaccc
    28261 aataatactg cgtcttggtt cacagctctc actcagcatg gcaaggagga acttagattc
    28321 cctcgaggcc agggcgttcc aatcaacacc aatagtggtc cagatgacca aattggctac
    28381 taccgaagag ctacccgacg agttcgtggt ggtgacggca aaatgaaaga gctcagcccc
    28441 agatggtact tctattacct aggaactggc ccagaagctt cacttcccta cggcgctaac
    28501 aaagaaggca tcgtatgggt tgcaactgag ggagccttga atacacccaa agaccacatt
    28561 ggcacccgca atcctaataa caatgctgcc accgtgctac aacttcctca aggaacaaca
    28621 ttgccaaaag gcttctacgc agagggaagc agaggcggca gtcaagcctc ttctcgctcc
    28681 tcatcacgta gtcgcggtaa ttcaagaaat tcaactcctg gcagcagtag gggaaattct
    28741 cctgctcgaa tggctagcgg aggtggtgaa actgccctcg cgctattgct gctagacaga
    28801 ttgaaccagc ttgagagcaa agtttctggt aaaggccaac aacaacaagg ccaaactgtc
    28861 actaagaaat ctgctgctga ggcatctaaa aagcctcgcc aaaaacgtac tgccacaaaa
    28921 cagtacaacg tcactcaagc atttgggaga cgtggtccag aacaaaccca aggaaatttc
    28981 ggggaccaag acctaatcag acaaggaact gattacaaac attggccgca aattgcacaa
    29041 tttgctccaa gtgcctctgc attctttgga atgtcacgca ttggcatgga agtcacacct
    29101 tcgggaacat ggctgactta tcatggagcc attaaattgg atgacaaaga tccacaattc
    29161 aaagacaacg tcatactgct gaacaagcac attgacgcat acaaaacatt cccaccaaca
    29221 gagcctaaaa aggacaaaaa gaaaaagact gatgaagctc agcctttgcc gcagagacaa
    29281 aagaagcagc ccactgtgac tcttcttcct gcggctgaca tggatgattt ctccagacaa
    29341 cttcaaaatt ccatgagtgg agcttctgct gattcaactc aggcataaac actcatgatg
    29401 accacacaag gcagatgggc tatgtaaacg ttttcgcaat tccgtttacg atacatagtc
    29461 tactcttgtg cagaatgaat tctcgtaact aaacagcaca agtaggttta gttaacttta
    29521 atctcacata gcaatcttta atcaatgtgt aacattaggg aggacttgaa agagccacca
    29581 cattttcatc gaggccacgc ggagtacgat cgagggtaca gtgaataatg ctagggagag
    29641 ctgcctatat ggaagagccc taatgtgtaa aattaatttt agtagtgcta tccccatgtg
    29701 attttaatag cttcttagga gaatgacaaa aaaaaaaaaa aaaaaaaaaa a
//
DBGET integrated database retrieval system