LOCUS NC_004718 29751 bp RNA linear VRL 20-NOV-2020
DEFINITION SARS coronavirus Tor2, complete genome.
ACCESSION NC_004718
VERSION NC_004718.3
DBLINK BioProject: PRJNA485481
KEYWORDS RefSeq.
SOURCE SARS coronavirus Tor2
ORGANISM SARS coronavirus Tor2
Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
Nidovirales; Cornidovirineae; Coronaviridae; Orthocoronavirinae;
Betacoronavirus; Sarbecovirus.
REFERENCE 1 (bases 1 to 29751)
AUTHORS He,R., Dobie,F., Ballantine,M., Leeson,A., Li,Y., Bastien,N.,
Cutts,T., Andonov,A., Cao,J., Booth,T.F., Plummer,F.A., Tyler,S.,
Baker,L. and Li,X.
CONSRTM BCCA Genome Sciences Centre, British Columbia Centre for Disease
Control and National Microbiology Laboratory Canada
TITLE Analysis of multimerization of the SARS coronavirus nucleocapsid
protein
JOURNAL Biochem. Biophys. Res. Commun. 316 (2), 476-483 (2004)
PUBMED 15020242
REFERENCE 2 (bases 1 to 29751)
CONSRTM NCBI Genome Project
TITLE Direct Submission
JOURNAL Submitted (27-MAY-2020) National Center for Biotechnology
Information, NIH, Bethesda, MD 20894, USA
REFERENCE 3 (bases 1 to 29751)
CONSRTM BCCA Genome Sciences Centre, British Columbia Centre for Disease
Control and National Microbiology Laboratory Canada
TITLE Direct Submission
JOURNAL Submitted (30-APR-2003) Genome Sciences Centre, British Columbia
Cancer Research Centre, 600 West 10th Avenue, Vancouver, BC V5Z
4E6, Canada
REMARK Sequence update by submitter
REFERENCE 4 (bases 1 to 29751)
CONSRTM BCCA Genome Sciences Centre, British Columbia Centre for Disease
Control and National Microbiology Laboratory Canada
TITLE Direct Submission
JOURNAL Submitted (23-APR-2003) Genome Sciences Centre, British Columbia
Cancer Research Centre, 600 West 10th Avenue, Vancouver, BC V5Z
4E6, Canada
REMARK Sequence update by submitter
REFERENCE 5 (bases 1 to 29751)
CONSRTM BCCA Genome Sciences Centre, British Columbia Centre for Disease
Control and National Microbiology Laboratory Canada
TITLE Direct Submission
JOURNAL Submitted (13-APR-2003) Genome Sciences Centre, British Columbia
Cancer Research Centre, 600 West 10th Avenue, Vancouver, BC V5Z
4E6, Canada
COMMENT REVIEWED REFSEQ: This record has been curated by NCBI staff. The
reference sequence is identical to AY274119.
On or before Mar 28, 2016 this sequence version replaced
NC_028858.1, NC_028866.1, NC_028884.1, NC_028893.1, NC_028873.1,
NC_028845.1, NC_009696.1, NC_009695.1, NC_013664.1, NC_009693.1,
NC_009694.1, NC_004718.2.
Annotation based on that found in PMID: 31987001, PMID: 31967327
and the annotation of P0C6X7.1.
COMPLETENESS: full length.
FEATURES Location/Qualifiers
source 1..29751
/organism="SARS coronavirus Tor2"
/mol_type="genomic RNA"
/isolate="Tor2"
/host="Homo sapiens; patient #2 with severe acute
respiratory syndrome (SARS)"
/db_xref="taxon:227984"
/country="Canada: Toronto"
5'UTR 1..264
gene 265..21485
/gene="ORF1ab"
/locus_tag="sarsp1"
/db_xref="GeneID:1489680"
CDS join(265..13392,13392..21485)
/gene="ORF1ab"
/locus_tag="sarsp1"
/ribosomal_slippage
/note="ORF1ab polyprotein is cleaved to yield the
RNA-dependent RNA polymerase and other nonstructural
proteins; polyprotein pp1ab; replicase 1AB"
/codon_start=1
/product="ORF1ab polyprotein"
/protein_id="NP_828849.7"
/db_xref="GeneID:1489680"
/translation="MESLVLGVNEKTHVQLSLPVLQVRDVLVRGFGDSVEEALSEARE
HLKNGTCGLVELEKGVLPQLEQPYVFIKRSDALSTNHGHKVVELVAEMDGIQYGRSGI
TLGVLVPHVGETPIAYRNVLLRKNGNKGAGGHSYGIDLKSYDLGDELGTDPIEDYEQN
WNTKHGSGALRELTRELNGGAVTRYVDNNFCGPDGYPLDCIKDFLARAGKSMCTLSEQ
LDYIESKRGVYCCRDHEHEIAWFTERSDKSYEHQTPFEIKSAKKFDTFKGECPKFVFP
LNSKVKVIQPRVEKKKTEGFMGRIRSVYPVASPQECNNMHLSTLMKCNHCDEVSWQTC
DFLKATCEHCGTENLVIEGPTTCGYLPTNAVVKMPCPACQDPEIGPEHSVADYHNHSN
IETRLRKGGRTRCFGGCVFAYVGCYNKRAYWVPRASADIGSGHTGITGDNVETLNEDL
LEILSRERVNINIVGDFHLNEEVAIILASFSASTSAFIDTIKSLDYKSFKTIVESCGN
YKVTKGKPVKGAWNIGQQRSVLTPLCGFPSQAAGVIRSIFARTLDAANHSIPDLQRAA
VTILDGISEQSLRLVDAMVYTSDLLTNSVIIMAYVTGGLVQQTSQWLSNLLGTTVEKL
RPIFEWIEAKLSAGVEFLKDAWEILKFLITGVFDIVKGQIQVASDNIKDCVKCFIDVV
NKALEMCIDQVTIAGAKLRSLNLGEVFIAQSKGLYRQCIRGKEQLQLLMPLKAPKEVT
FLEGDSHDTVLTSEEVVLKNGELEALETPVDSFTNGAIVGTPVCVNGLMLLEIKDKEQ
YCALSPGLLATNNVFRLKGGAPIKGVTFGEDTVWEVQGYKNVRITFELDERVDKVLNE
KCSVYTVESGTEVTEFACVVAEAVVKTLQPVSDLLTNMGIDLDEWSVATFYLFDDAGE
ENFSSRMYCSFYPPDEEEEDDAECEEEEIDETCEHEYGTEDDYQGLPLEFGASAETVR
VEEEEEEDWLDDTTEQSEIEPEPEPTPEEPVNQFTGYLKLTDNVAIKCVDIVKEAQSA
NPMVIVNAANIHLKHGGGVAGALNKATNGAMQKESDDYIKLNGPLTVGGSCLLSGHNL
AKKCLHVVGPNLNAGEDIQLLKAAYENFNSQDILLAPLLSAGIFGAKPLQSLQVCVQT
VRTQVYIAVNDKALYEQVVMDYLDNLKPRVEAPKQEEPPNTEDSKTEEKSVVQKPVDV
KPKIKACIDEVTTTLEETKFLTNKLLLFADINGKLYHDSQNMLRGEDMSFLEKDAPYM
VGDVITSGDITCVVIPSKKAGGTTEMLSRALKKVPVDEYITTYPGQGCAGYTLEEAKT
ALKKCKSAFYVLPSEAPNAKEEILGTVSWNLREMLAHAEETRKLMPICMDVRAIMATI
QRKYKGIKIQEGIVDYGVRFFFYTSKEPVASIITKLNSLNEPLVTMPIGYVTHGFNLE
EAARCMRSLKAPAVVSVSSPDAVTTYNGYLTSSSKTSEEHFVETVSLAGSYRDWSYSG
QRTELGVEFLKRGDKIVYHTLESPVEFHLDGEVLSLDKLKSLLSLREVKTIKVFTTVD
NTNLHTQLVDMSMTYGQQFGPTYLDGADVTKIKPHVNHEGKTFFVLPSDDTLRSEAFE
YYHTLDESFLGRYMSALNHTKKWKFPQVGGLTSIKWADNNCYLSSVLLALQQLEVKFN
APALQEAYYRARAGDAANFCALILAYSNKTVGELGDVRETMTHLLQHANLESAKRVLN
VVCKHCGQKTTTLTGVEAVMYMGTLSYDNLKTGVSIPCVCGRDATQYLVQQESSFVMM
SAPPAEYKLQQGTFLCANEYTGNYQCGHYTHITAKETLYRIDGAHLTKMSEYKGPVTD
VFYKETSYTTTIKPVSYKLDGVTYTEIEPKLDGYYKKDNAYYTEQPIDLVPTQPLPNA
SFDNFKLTCSNTKFADDLNQMTGFTKPASRELSVTFFPDLNGDVVAIDYRHYSASFKK
GAKLLHKPIVWHINQATTKTTFKPNTWCLRCLWSTKPVDTSNSFEVLAVEDTQGMDNL
ACESQQPTSEEVVENPTIQKEVIECDVKTTEVVGNVILKPSDEGVKVTQELGHEDLMA
AYVENTSITIKKPNELSLALGLKTIATHGIAAINSVPWSKILAYVKPFLGQAAITTSN
CAKRLAQRVFNNYMPYVFTLLFQLCTFTKSTNSRIRASLPTTIAKNSVKSVAKLCLDA
GINYVKSPKFSKLFTIAMWLLLLSICLGSLICVTAAFGVLLSNFGAPSYCNGVRELYL
NSSNVTTMDFCEGSFPCSICLSGLDSLDSYPALETIQVTISSYKLDLTILGLAAEWVL
AYMLFTKFFYLLGLSAIMQVFFGYFASHFISNSWLMWFIISIVQMAPVSAMVRMYIFF
ASFYYIWKSYVHIMDGCTSSTCMMCYKRNRATRVECTTIVNGMKRSFYVYANGGRGFC
KTHNWNCLNCDTFCTGSTFISDEVARDLSLQFKRPINPTDQSSYIVDSVAVKNGALHL
YFDKAGQKTYERHPLSHFVNLDNLRANNTKGSLPINVIVFDGKSKCDESASKSASVYY
SQLMCQPILLLDQALVSDVGDSTEVSVKMFDAYVDTFSATFSVPMEKLKALVATAHSE
LAKGVALDGVLSTFVSAARQGVVDTDVDTKDVIECLKLSHHSDLEVTGDSCNNFMLTY
NKVENMTPRDLGACIDCNARHINAQVAKSHNVSLIWNVKDYMSLSEQLRKQIRSAAKK
NNIPFRLTCATTRQVVNVITTKISLKGGKIVSTCFKLMLKATLLCVLAALVCYIVMPV
HTLSIHDGYTNEIIGYKAIQDGVTRDIISTDDCFANKHAGFDAWFSQRGGSYKNDKSC
PVVAAIITREIGFIVPGLPGTVLRAINGDFLHFLPRVFSAVGNICYTPSKLIEYSDFA
TSACVLAAECTIFKDAMGKPVPYCYDTNLLEGSISYSELRPDTRYVLMDGSIIQFPNT
YLEGSVRVVTTFDAEYCRHGTCERSEVGICLSTSGRWVLNNEHYRALSGVFCGVDAMN
LIANIFTPLVQPVGALDVSASVVAGGIIAILVTCAAYYFMKFRRVFGEYNHVVAANAL
LFLMSFTILCLVPAYSFLPGVYSVFYLYLTFYFTNDVSFLAHLQWFAMFSPIVPFWIT
AIYVFCISLKHCHWFFNNYLRKRVMFNGVTFSTFEEAALCTFLLNKEMYLKLRSETLL
PLTQYNRYLALYNKYKYFSGALDTTSYREAACCHLAKALNDFSNSGADVLYQPPQTSI
TSAVLQSGFRKMAFPSGKVEGCMVQVTCGTTTLNGLWLDDTVYCPRHVICTAEDMLNP
NYEDLLIRKSNHSFLVQAGNVQLRVIGHSMQNCLLRLKVDTSNPKTPKYKFVRIQPGQ
TFSVLACYNGSPSGVYQCAMRPNHTIKGSFLNGSCGSVGFNIDYDCVSFCYMHHMELP
TGVHAGTDLEGKFYGPFVDRQTAQAAGTDTTITLNVLAWLYAAVINGDRWFLNRFTTT
LNDFNLVAMKYNYEPLTQDHVDILGPLSAQTGIAVLDMCAALKELLQNGMNGRTILGS
TILEDEFTPFDVVRQCSGVTFQGKFKKIVKGTHHWMLLTFLTSLLILVQSTQWSLFFF
VYENAFLPFTLGIMAIAACAMLLVKHKHAFLCLFLLPSLATVAYFNMVYMPASWVMRI
MTWLELADTSLSGYRLKDCVMYASALVLLILMTARTVYDDAARRVWTLMNVITLVYKV
YYGNALDQAISMWALVISVTSNYSGVVTTIMFLARAIVFVCVEYYPLLFITGNTLQCI
MLVYCFLGYCCCCYFGLFCLLNRYFRLTLGVYDYLVSTQEFRYMNSQGLLPPKSSIDA
FKLNIKLLGIGGKPCIKVATVQSKMSDVKCTSVVLLSVLQQLRVESSSKLWAQCVQLH
NDILLAKDTTEAFEKMVSLLSVLLSMQGAVDINRLCEEMLDNRATLQAIASEFSSLPS
YAAYATAQEAYEQAVANGDSEVVLKKLKKSLNVAKSEFDRDAAMQRKLEKMADQAMTQ
MYKQARSEDKRAKVTSAMQTMLFTMLRKLDNDALNNIINNARDGCVPLNIIPLTTAAK
LMVVVPDYGTYKNTCDGNTFTYASALWEIQQVVDADSKIVQLSEINMDNSPNLAWPLI
VTALRANSAVKLQNNELSPVALRQMSCAAGTTQTACTDDNALAYYNNSKGGRFVLALL
SDHQDLKWARFPKSDGTGTIYTELEPPCRFVTDTPKGPKVKYLYFIKGLNNLNRGMVL
GSLAATVRLQAGNATEVPANSTVLSFCAFAVDPAKAYKDYLASGGQPITNCVKMLCTH
TGTGQAITVTPEANMDQESFGGASCCLYCRCHIDHPNPKGFCDLKGKYVQIPTTCAND
PVGFTLRNTVCTVCGMWKGYGCSCDQLREPLMQSADASTFFKRVCGVSAARLTPCGTG
TSTDVVYRAFDIYNEKVAGFAKFLKTNCCRFQEKDEEGNLLDSYFVVKRHTMSNYQHE
ETIYNLVKDCPAVAVHDFFKFRVDGDMVPHISRQRLTKYTMADLVYALRHFDEGNCDT
LKEILVTYNCCDDDYFNKKDWYDFVENPDILRVYANLGERVRQSLLKTVQFCDAMRDA
GIVGVLTLDNQDLNGNWYDFGDFVQVAPGCGVPIVDSYYSLLMPILTLTRALAAESHM
DADLAKPLIKWDLLKYDFTEERLCLFDRYFKYWDQTYHPNCINCLDDRCILHCANFNV
LFSTVFPPTSFGPLVRKIFVDGVPFVVSTGYHFRELGVVHNQDVNLHSSRLSFKELLV
YAADPAMHAASGNLLLDKRTTCFSVAALTNNVAFQTVKPGNFNKDFYDFAVSKGFFKE
GSSVELKHFFFAQDGNAAISDYDYYRYNLPTMCDIRQLLFVVEVVDKYFDCYDGGCIN
ANQVIVNNLDKSAGFPFNKWGKARLYYDSMSYEDQDALFAYTKRNVIPTITQMNLKYA
ISAKNRARTVAGVSICSTMTNRQFHQKLLKSIAATRGATVVIGTSKFYGGWHNMLKTV
YSDVETPHLMGWDYPKCDRAMPNMLRIMASLVLARKHNTCCNLSHRFYRLANECAQVL
SEMVMCGGSLYVKPGGTSSGDATTAYANSVFNICQAVTANVNALLSTDGNKIADKYVR
NLQHRLYECLYRNRDVDHEFVDEFYAYLRKHFSMMILSDDAVVCYNSNYAAQGLVASI
KNFKAVLYYQNNVFMSEAKCWTETDLTKGPHEFCSQHTMLVKQGDDYVYLPYPDPSRI
LGAGCFVDDIVKTDGTLMIERFVSLAIDAYPLTKHPNQEYADVFHLYLQYIRKLHDEL
TGHMLDMYSVMLTNDNTSRYWEPEFYEAMYTPHTVLQAVGACVLCNSQTSLRCGACIR
RPFLCCKCCYDHVISTSHKLVLSVNPYVCNAPGCDVTDVTQLYLGGMSYYCKSHKPPI
SFPLCANGQVFGLYKNTCVGSDNVTDFNAIATCDWTNAGDYILANTCTERLKLFAAET
LKATEETFKLSYGIATVREVLSDRELHLSWEVGKPRPPLNRNYVFTGYRVTKNSKVQI
GEYTFEKGDYGDAVVYRGTTTYKLNVGDYFVLTSHTVMPLSAPTLVPQEHYVRITGLY
PTLNISDEFSSNVANYQKVGMQKYSTLQGPPGTGKSHFAIGLALYYPSARIVYTACSH
AAVDALCEKALKYLPIDKCSRIIPARARVECFDKFKVNSTLEQYVFCTVNALPETTAD
IVVFDEISMATNYDLSVVNARLRAKHYVYIGDPAQLPAPRTLLTKGTLEPEYFNSVCR
LMKTIGPDMFLGTCRRCPAEIVDTVSALVYDNKLKAHKDKSAQCFKMFYKGVITHDVS
SAINRPQIGVVREFLTRNPAWRKAVFISPYNSQNAVASKILGLPTQTVDSSQGSEYDY
VIFTQTTETAHSCNVNRFNVAITRAKIGILCIMSDRDLYDKLQFTSLEIPRRNVATLQ
AENVTGLFKDCSKIITGLHPTQAPTHLSVDIKFKTEGLCVDIPGIPKDMTYRRLISMM
GFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEGCHATRDAVGTNLPLQLGFSTGVN
LVAVPTGYVDTENNTEFTRVNAKPPPGDQFKHLIPLMYKGLPWNVVRIKIVQMLSDTL
KGLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACWNHSVG
FDYVYNPFMIDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMTRCLAVHECFVKRV
DWSVEYPIIGDELRVNSACRKVQHMVVKSALLADKFPVLHDIGNPKAIKCVPQAEVEW
KFYDAQPCSDKAYKIEELFYSYATHHDKFTDGVCLFWNCNVDRYPANAIVCRFDTRVL
SNLNLPGCDGGSLYVNKHAFHTPAFDKSAFTNLKQLPFFYYSDSPCESHGKQVVSDID
YVPLKSATCITRCNLGGAVCRHHANEYRQYLDAYNMMISAGFSLWIYKQFDTYNLWNT
FTRLQSLENVAYNVVNKGHFDGHAGEAPVSIINNAVYTKVDGIDVEIFENKTTLPVNV
AFELWAKRNIKPVPEIKILNNLGVDIAANTVIWDYKREAPAHVSTIGVCTMTDIAKKP
TESACSSLTVLFDGRVEGQVDLFRNARNGVLITEGSVKGLTPSKGPAQASVNGVTLIG
ESVKTQFNYFKKVDGIIQQLPETYFTQSRDLEDFKPRSQMETDFLELAMDEFIQRYKL
EGYAFEHIVYGDFSHGQLGGLHLMIGLAKRSQDSPLKLEDFIPMDSTVKNYFITDAQT
GSSKCVCSVIDLLLDDFVEIIKSQDLSVISKVVKVTIDYAEISFMLWCKDGHVETFYP
KLQASQAWQPGVAMPNLYKMQRMLLEKCDLQNYGENAVIPKGIMMNVAKYTQLCQYLN
TLTLAVPYNMRVIHFGAGSDKGVAPGTAVLRQWLPTGTLLVDSDLNDFVSDADSTLIG
DCATVHTANKWDLIISDMYDPRTKHVTKENDSKEGFFTYLCGFIKQKLALGGSIAVKI
TEHSWNADLYKLMGHFSWWTAFVTNVNASSSEAFLIGANYLGKPKEQIDGYTMHANYI
FWRNTNPIQLSSYSLFDMSKFPLKLRGTAVMSLKENQINDMIYSLLEKGRLIIRENNR
VVVSSDILVNN"
mat_peptide 265..804
/gene="ORF1ab"
/locus_tag="sarsp1"
/product="nsp1"
/protein_id="NP_828860.2"
misc_feature 301..645
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="N-terminal domain of non-structural protein 1 from
Severe acute respiratory syndrome-related coronavirus and
betacoronavirus in the B lineage; Region:
SARS-CoV-like_Nsp1_N; cd21796"
/db_xref="CDD:439285"
misc_feature 646..804
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="C-terminal domain of non-structural protein 1 from
Severe acute respiratory syndrome-related coronavirus and
betacoronavirus in the B lineage; Region:
SARS-CoV-like_Nsp1_C; cd22662"
/db_xref="CDD:439355"
misc_feature order(706..720,724..747,781..789,793..798,802..804)
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="oligomer interface [polypeptide binding]; other
site"
/db_xref="CDD:439355"
misc_feature order(742..744,754..759,763..768,775..780,787..789)
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="RNA binding site [nucleotide binding]; other site"
/db_xref="CDD:439355"
mat_peptide 805..2718
/gene="ORF1ab"
/locus_tag="sarsp1"
/product="nsp2"
/protein_id="NP_828861.2"
misc_feature 808..2718
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="betacoronavirus non-structural protein 2 (Nsp2)
similar to SARS-CoV Nsp2, and related proteins from
betacoronaviruses in the B lineage; Region:
betaCoV_Nsp2_SARS-like; cd21516"
/db_xref="CDD:439199"
mat_peptide 2719..8484
/gene="ORF1ab"
/locus_tag="sarsp1"
/product="nsp3"
/note="papain-like proteinase"
/protein_id="NP_828862.2"
misc_feature 2905..3351
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="Protein of unknown function (DUF3655); Region:
DUF3655; pfam12379"
/db_xref="CDD:432517"
misc_feature 3364..3729
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="X-domain (or Mac1 domain) of viral non-structural
protein 3 and related macrodomains; Region:
Macro_X_Nsp3-like; cd21557"
/db_xref="CDD:438957"
misc_feature order(3376..3384,3394..3414,3637..3642,3646..3660,
3724..3726)
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="ADP-ribose binding site [chemical binding]; other
site"
/db_xref="CDD:438957"
misc_feature 3889..4266
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="SUD-N macrodomain (or Mac2 domain) of the SARS
Unique Domain (SUD) of SARS-CoV non-structural protein 3
and related macrodomains; Region:
Macro_cv_SUD-N_Nsp3-like; cd21562"
/db_xref="CDD:394883"
misc_feature 4294..4671
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="Single-stranded poly(A) binding domain; Region:
SUD-M; pfam11633"
/db_xref="CDD:431970"
misc_feature 4678..4878
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="C-terminal SARS-Unique Domain (SUD) of
non-structural protein 3 (Nsp3) from Severe Acute
Respiratory Syndrome coronavirus and related
betacoronaviruses in the B lineage; Region:
SUD_C_SARS-CoV_Nsp3; cd21525"
/db_xref="CDD:394841"
misc_feature 4891..5799
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="betacoronavirus papain-like protease; Region:
betaCoV_PLPro; cd21732"
/db_xref="CDD:409649"
misc_feature order(5209..5220,5368..5376,5380..5385,5392..5394,
5404..5406,5479..5481,5503..5508,5551..5553,5557..5559,
5626..5628,5674..5676,5692..5703,5785..5787)
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="ubiquitin binding site [polypeptide binding]; other
site"
/db_xref="CDD:409649"
misc_feature order(5368..5376,5623..5628,5674..5676,5683..5685,
5692..5694,5701..5703,5785..5787)
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="polypeptide substrate binding site [polypeptide
binding]; other site"
/db_xref="CDD:409649"
misc_feature 5932..6252
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="nucleic acid binding domain of non-structural
protein 3 from Severe acute respiratory syndrome-related
coronavirus and betacoronavirus in the B lineage; Region:
SARS-CoV-like_Nsp3_NAB; cd21822"
/db_xref="CDD:409348"
misc_feature 6325..6672
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="betacoronavirus-specific marker of non-structural
protein 3 from Severe acute respiratory syndrome-related
coronavirus and betacoronavirus in the B lineage; Region:
SARS-CoV-like_Nsp3_betaSM; cd21814"
/db_xref="CDD:409629"
misc_feature 6889..8481
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="C-terminus of non-structural protein 3, including
transmembrane and Y domains, from Severe acute respiratory
syndrome-related coronavirus and betacoronavirus in the B
lineage; Region: TM_Y_SARS-CoV-like_Nsp3_C; cd21717"
/db_xref="CDD:409665"
misc_feature 6889..6957
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="TM1 [structural motif]; Region: TM1"
/db_xref="CDD:409665"
misc_feature 7204..7272
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="TM2 [structural motif]; Region: TM2"
/db_xref="CDD:409665"
mat_peptide 8485..9984
/gene="ORF1ab"
/locus_tag="sarsp1"
/product="nsp4"
/protein_id="NP_904322.1"
misc_feature 8524..9666
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="coronavirus non-structural protein 4 (Nsp4)
transmembrane domain; Region: cv_Nsp4_TM; cd21473"
/db_xref="CDD:394836"
misc_feature 8524..8592
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="putative TM helix 1 [structural motif]; Region:
putative TM helix 1"
/db_xref="CDD:394836"
misc_feature 9322..9387
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="putative TM helix 2 [structural motif]; Region:
putative TM helix 2"
/db_xref="CDD:394836"
misc_feature 9430..9495
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="putative TM helix 3 [structural motif]; Region:
putative TM helix 3"
/db_xref="CDD:394836"
misc_feature 9574..9642
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="putative TM helix 4 [structural motif]; Region:
putative TM helix 4"
/db_xref="CDD:394836"
misc_feature 9700..9978
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="Coronavirus nonstructural protein 4 C-terminus;
Region: Corona_NSP4_C; pfam16348"
/db_xref="CDD:406690"
mat_peptide 9985..10902
/gene="ORF1ab"
/locus_tag="sarsp1"
/product="3C-like protease"
/note="3CLp; nsp5"
/protein_id="NP_828863.1"
misc_feature 9994..10884
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="betacoronavirus non-structural protein 5, also
called Main protease (Mpro); Region: betaCoV_Nsp5_Mpro;
cd21666"
/db_xref="CDD:394887"
misc_feature order(9994..10017,10024..10026,10336..10338,10348..10368,
10393..10407,10480..10482,10498..10500,10840..10842,
10852..10854,10876..10881)
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="homodimer interface [polypeptide binding]; other
site"
/db_xref="CDD:394887"
misc_feature order(10045..10047,10054..10062,10129..10131,10144..10146,
10402..10419,10471..10482,10486..10488,10498..10500,
10543..10545,10549..10557)
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="polypeptide substrate binding site [polypeptide
binding]; other site"
/db_xref="CDD:394887"
mat_peptide 10903..11772
/gene="ORF1ab"
/locus_tag="sarsp1"
/product="nsp6"
/protein_id="NP_828864.1"
misc_feature 10903..11772
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="betacoronavirus non-structural protein 6; Region:
betaCoV-Nsp6; cd21560"
/db_xref="CDD:394846"
mat_peptide 11773..12021
/gene="ORF1ab"
/locus_tag="sarsp1"
/product="nsp7"
/protein_id="NP_828865.1"
misc_feature 11773..12021
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="betacoronavirus non-structural protein 7; Region:
betaCoV_Nsp7; cd21827"
/db_xref="CDD:409253"
misc_feature order(11776..11778,11785..11796,11803..11811,11815..11820,
11827..11829,11854..11856,11863..11865,11881..11883,
11917..11934,11938..11955,11974..11988)
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="oligomer interface [polypeptide binding]; other
site"
/db_xref="CDD:409253"
mat_peptide 12022..12615
/gene="ORF1ab"
/locus_tag="sarsp1"
/product="nsp8"
/protein_id="NP_828866.1"
misc_feature 12022..12612
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="nsp8 replicase; Region: nsp8; pfam08717"
/db_xref="CDD:400866"
mat_peptide 12616..12954
/gene="ORF1ab"
/locus_tag="sarsp1"
/product="nsp9"
/protein_id="NP_828867.1"
misc_feature 12616..12954
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="betacoronavirus non-structural protein 9; Region:
betaCoV_Nsp9; cd21898"
/db_xref="CDD:409331"
misc_feature 12616..12633
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="N-finger; other site"
/db_xref="CDD:409331"
misc_feature order(12622..12630,12634..12642,12832..12837,12901..12906,
12910..12918,12922..12930,12934..12942,12946..12954)
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="homodimer interface [polypeptide binding]; other
site"
/db_xref="CDD:409331"
mat_peptide 12955..13371
/gene="ORF1ab"
/locus_tag="sarsp1"
/product="nsp10"
/protein_id="NP_828868.1"
misc_feature 12955..13347
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="coronavirus non-structural protein 10; Region:
CoV_Nsp10; cd21872"
/db_xref="CDD:409325"
misc_feature order(12955..12978,12988..12990,12994..13002,13006..13014,
13027..13032,13039..13044,13051..13053,13072..13089,
13126..13131,13159..13161,13165..13170,13180..13182,
13186..13203,13216..13224,13231..13242)
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="Nsp14 interface [polypeptide binding]; other site"
/db_xref="CDD:409325"
misc_feature order(12994..13002,13006..13014,13027..13029,13072..13074,
13078..13089,13126..13134,13186..13197,13204..13206,
13237..13242,13297..13299)
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="oligomer interface [polypeptide binding]; other
site"
/db_xref="CDD:409325"
misc_feature order(13027..13029,13078..13089,13126..13134,13204..13206,
13237..13242,13297..13299)
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="homotrimer interface [polypeptide binding]; other
site"
/db_xref="CDD:409325"
misc_feature order(13072..13095,13123..13131,13159..13170,13183..13188,
13192..13194,13231..13242)
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="Nsp16 interface [polypeptide binding]; other site"
/db_xref="CDD:409325"
mat_peptide join(13372..13392,13392..16166)
/gene="ORF1ab"
/locus_tag="sarsp1"
/product="RNA-dependent RNA polymerase"
/note="RdRp; nsp12"
/protein_id="YP_009924301.1"
misc_feature join(13387..13392,13392..16166)
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="Severe acute respiratory syndrome coronavirus
RNA-dependent RNA polymerase, also known as non-structural
protein 12, and similar proteins from betacoronaviruses in
the B lineage: responsible for replication and
transcription of the viral RNA genome; Region:
SARS-CoV-like_RdRp; cd21591"
/db_xref="CDD:394895"
misc_feature order(14175..14189,14337..14342,14346..14348,14352..14366,
14382..14393,14400..14402,14472..14474,14481..14486,
14490..14495,14502..14504,14508..14522,14526..14546,
14556..14558,14562..14564,14574..14579,14589..14591,
14883..14888,14895..14897,14910..14915,14919..14921,
14928..14939,15366..15368)
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="Nsp8 interaction site [polypeptide binding]; other
site"
/db_xref="CDD:394895"
misc_feature order(14595..14609,14613..14615,14628..14630,14655..14657,
14661..14663,14688..14705,15018..15020,15024..15026,
15897..15899)
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="Nsp7 interaction site [polypeptide binding]; other
site"
/db_xref="CDD:394895"
misc_feature order(14862..14864,14868..14876,15003..15005,15039..15041,
15045..15047,15063..15065,15075..15077,15087..15089,
15099..15101,15138..15140,15144..15146,15150..15152,
15414..15425,15432..15434,15642..15653,15807..15812,
15864..15866,15888..15890,15939..15944,15954..15956,
15960..15965)
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="putative RNA binding site [nucleotide binding];
other site"
/db_xref="CDD:394895"
misc_feature 14868..14909
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="conserved polymerase motif G; other site"
/db_xref="CDD:394895"
misc_feature 14982..15050
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="conserved polymerase motif F; other site"
/db_xref="CDD:394895"
misc_feature order(15015..15017,15408..15416,15429..15431,15441..15443,
15645..15647)
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="inhibitor binding site [chemical binding];
inhibition site"
/db_xref="CDD:394895"
misc_feature 15201..15251
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="conserved polymerase motif A; other site"
/db_xref="CDD:394895"
misc_feature order(15222..15224,15645..15653)
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="catalytic residues [active]"
/db_xref="CDD:394895"
misc_feature 15408..15497
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="conserved polymerase motif B; other site"
/db_xref="CDD:394895"
misc_feature 15627..15671
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="conserved polymerase motif C; other site"
/db_xref="CDD:394895"
misc_feature 15693..15758
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="conserved polymerase motif D; other site"
/db_xref="CDD:394895"
misc_feature 15798..15833
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="conserved polymerase motif E; other site"
/db_xref="CDD:394895"
mat_peptide 16167..17969
/gene="ORF1ab"
/locus_tag="sarsp1"
/product="helicase/NTPase"
/note="nsp13"
/protein_id="NP_828870.1"
misc_feature 16167..16451
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="Cys/His rich zinc-binding domain (CH/ZBD) of
coronavirus SARS NSP13 helicase and related proteins;
Region: ZBD_cv_Nsp13-like; cd21401"
/db_xref="CDD:439168"
misc_feature order(16299..16301,16362..16364,16368..16370,16407..16409,
16434..16448)
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="oligomer interface [polypeptide binding]; other
site"
/db_xref="CDD:439168"
misc_feature 16461..16604
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="stalk domain of coronavirus Nsp13 helicase and
related proteins; Region: stalk_CoV_Nsp13-like; cd21689"
/db_xref="CDD:410205"
misc_feature order(16470..16472,16557..16559)
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="key interaction residues; other site"
/db_xref="CDD:410205"
misc_feature 16614..16850
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="1B domain of coronavirus SARS NSP13 helicase and
related proteins; Region: 1B_cv_Nsp13-like; cd21409"
/db_xref="CDD:394817"
misc_feature order(16698..16703,16707..16709,16800..16802)
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="nucleic acid substrate binding site [nucleotide
binding]; other site"
/db_xref="CDD:394817"
misc_feature 16812..16820
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="oligomer interface [polypeptide binding]; other
site"
/db_xref="CDD:394817"
misc_feature 16917..17936
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="helicase domain of betacoronavirus non-structural
protein 13; Region: betaCoV_Nsp13-helicase; cd21722"
/db_xref="CDD:409655"
misc_feature order(17019..17036,17376..17378,17493..17495,17778..17780,
17784..17786,17865..17867)
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="ATP binding site [chemical binding]; other site"
/db_xref="CDD:409655"
misc_feature order(17028..17033,17286..17291,17376..17378,17865..17867)
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="putative active site [active]"
/db_xref="CDD:409655"
mat_peptide 17970..19550
/gene="ORF1ab"
/locus_tag="sarsp1"
/product="3' to 5' exonuclease"
/note="ExoN; nsp14"
/protein_id="NP_828871.1"
misc_feature 17982..19544
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="nonstructural protein 14 of betacoronavirus;
Region: betaCoV_Nsp14; cd21659"
/db_xref="CDD:394958"
misc_feature order(17982..17984,17988..17999,18024..18053,18120..18122,
18132..18134,18147..18170,18270..18275,18339..18341,
18345..18347,18357..18362,18543..18545,18552..18557,
18564..18572,18618..18620)
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="heterodimer interface [polypeptide binding]; other
site"
/db_xref="CDD:394958"
misc_feature order(18237..18239,18243..18245,18540..18542,18771..18773,
18786..18788)
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="ExoN active site [active]"
/db_xref="CDD:394958"
misc_feature order(18843..18845,18885..18887,18894..18899,18906..18908,
18966..18977,18981..18983,19023..19031,19065..19073,
19122..19136,19170..19172,19227..19229,19233..19238,
19245..19247,19251..19253,19485..19487)
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="N7-MTase active site [active]"
/db_xref="CDD:394958"
mat_peptide 19551..20588
/gene="ORF1ab"
/locus_tag="sarsp1"
/product="endoribonuclease"
/note="nsp15"
/protein_id="NP_828872.1"
misc_feature 19551..19733
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="N-terminal domain of alpha- and beta-coronavirus
Nonstructural protein 15 (Nsp15), and related proteins;
Region: NTD_alpha_betaCoV_Nsp15-like; cd21171"
/db_xref="CDD:439163"
misc_feature order(19551..19559,19611..19613,19617..19628,19650..19652,
19665..19667,19692..19694,19698..19709)
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="hexamer interface [polypeptide binding]; other
site"
/db_xref="CDD:439163"
misc_feature order(19575..19592,19629..19640,19644..19646,19653..19655,
19659..19676,19683..19694,19731..19733)
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="trimer interface [polypeptide binding]; other site"
/db_xref="CDD:439163"
misc_feature 19743..20138
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="middle domain of alpha- and beta-coronavirus
Nonstructural protein 15 (Nsp15), and related proteins;
Region: M_alpha_beta_cv_Nsp15-like; cd21167"
/db_xref="CDD:439161"
misc_feature order(19779..19784,19812..19814,19818..19820,19824..19826,
19830..19838,20034..20039,20043..20054)
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="trimer interface [polypeptide binding]; other site"
/db_xref="CDD:439161"
misc_feature 19857..19862
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="hexamer interface [polypeptide binding]; other
site"
/db_xref="CDD:439161"
misc_feature 20130..20582
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="Nidoviral uridylate-specific endoribonuclease
(NendoU) domain of coronavirus Nonstructural Protein 15
(Nsp15) and related proteins; Region:
NendoU_cv_Nsp15-like; cd21161"
/db_xref="CDD:439158"
misc_feature order(20154..20156,20160..20162,20268..20276,20340..20342,
20352..20357,20361..20363,20379..20381,20385..20387,
20391..20393,20397..20399,20403..20408,20412..20414)
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="trimer interface [polypeptide binding]; other site"
/db_xref="CDD:439158"
misc_feature order(20250..20252,20262..20264,20286..20291,20295..20297,
20415..20417,20424..20429,20568..20570,20574..20576)
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="putative active site [active]"
/db_xref="CDD:439158"
mat_peptide 20589..21482
/gene="ORF1ab"
/locus_tag="sarsp1"
/product="2'-O-MTase"
/note="2'-O-methyltransferase; nsp16"
/protein_id="NP_828873.2"
misc_feature 20592..21479
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="Coronavirus NSP13; Region: NSP13; pfam06460"
/db_xref="CDD:399456"
CDS 265..13413
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="ORF1a polyprotein is cleaved to yield nonstructural
proteins; polyprotein pp1a"
/codon_start=1
/product="ORF1a polyprotein"
/protein_id="YP_009944365.1"
/db_xref="GeneID:1489680"
/translation="MESLVLGVNEKTHVQLSLPVLQVRDVLVRGFGDSVEEALSEARE
HLKNGTCGLVELEKGVLPQLEQPYVFIKRSDALSTNHGHKVVELVAEMDGIQYGRSGI
TLGVLVPHVGETPIAYRNVLLRKNGNKGAGGHSYGIDLKSYDLGDELGTDPIEDYEQN
WNTKHGSGALRELTRELNGGAVTRYVDNNFCGPDGYPLDCIKDFLARAGKSMCTLSEQ
LDYIESKRGVYCCRDHEHEIAWFTERSDKSYEHQTPFEIKSAKKFDTFKGECPKFVFP
LNSKVKVIQPRVEKKKTEGFMGRIRSVYPVASPQECNNMHLSTLMKCNHCDEVSWQTC
DFLKATCEHCGTENLVIEGPTTCGYLPTNAVVKMPCPACQDPEIGPEHSVADYHNHSN
IETRLRKGGRTRCFGGCVFAYVGCYNKRAYWVPRASADIGSGHTGITGDNVETLNEDL
LEILSRERVNINIVGDFHLNEEVAIILASFSASTSAFIDTIKSLDYKSFKTIVESCGN
YKVTKGKPVKGAWNIGQQRSVLTPLCGFPSQAAGVIRSIFARTLDAANHSIPDLQRAA
VTILDGISEQSLRLVDAMVYTSDLLTNSVIIMAYVTGGLVQQTSQWLSNLLGTTVEKL
RPIFEWIEAKLSAGVEFLKDAWEILKFLITGVFDIVKGQIQVASDNIKDCVKCFIDVV
NKALEMCIDQVTIAGAKLRSLNLGEVFIAQSKGLYRQCIRGKEQLQLLMPLKAPKEVT
FLEGDSHDTVLTSEEVVLKNGELEALETPVDSFTNGAIVGTPVCVNGLMLLEIKDKEQ
YCALSPGLLATNNVFRLKGGAPIKGVTFGEDTVWEVQGYKNVRITFELDERVDKVLNE
KCSVYTVESGTEVTEFACVVAEAVVKTLQPVSDLLTNMGIDLDEWSVATFYLFDDAGE
ENFSSRMYCSFYPPDEEEEDDAECEEEEIDETCEHEYGTEDDYQGLPLEFGASAETVR
VEEEEEEDWLDDTTEQSEIEPEPEPTPEEPVNQFTGYLKLTDNVAIKCVDIVKEAQSA
NPMVIVNAANIHLKHGGGVAGALNKATNGAMQKESDDYIKLNGPLTVGGSCLLSGHNL
AKKCLHVVGPNLNAGEDIQLLKAAYENFNSQDILLAPLLSAGIFGAKPLQSLQVCVQT
VRTQVYIAVNDKALYEQVVMDYLDNLKPRVEAPKQEEPPNTEDSKTEEKSVVQKPVDV
KPKIKACIDEVTTTLEETKFLTNKLLLFADINGKLYHDSQNMLRGEDMSFLEKDAPYM
VGDVITSGDITCVVIPSKKAGGTTEMLSRALKKVPVDEYITTYPGQGCAGYTLEEAKT
ALKKCKSAFYVLPSEAPNAKEEILGTVSWNLREMLAHAEETRKLMPICMDVRAIMATI
QRKYKGIKIQEGIVDYGVRFFFYTSKEPVASIITKLNSLNEPLVTMPIGYVTHGFNLE
EAARCMRSLKAPAVVSVSSPDAVTTYNGYLTSSSKTSEEHFVETVSLAGSYRDWSYSG
QRTELGVEFLKRGDKIVYHTLESPVEFHLDGEVLSLDKLKSLLSLREVKTIKVFTTVD
NTNLHTQLVDMSMTYGQQFGPTYLDGADVTKIKPHVNHEGKTFFVLPSDDTLRSEAFE
YYHTLDESFLGRYMSALNHTKKWKFPQVGGLTSIKWADNNCYLSSVLLALQQLEVKFN
APALQEAYYRARAGDAANFCALILAYSNKTVGELGDVRETMTHLLQHANLESAKRVLN
VVCKHCGQKTTTLTGVEAVMYMGTLSYDNLKTGVSIPCVCGRDATQYLVQQESSFVMM
SAPPAEYKLQQGTFLCANEYTGNYQCGHYTHITAKETLYRIDGAHLTKMSEYKGPVTD
VFYKETSYTTTIKPVSYKLDGVTYTEIEPKLDGYYKKDNAYYTEQPIDLVPTQPLPNA
SFDNFKLTCSNTKFADDLNQMTGFTKPASRELSVTFFPDLNGDVVAIDYRHYSASFKK
GAKLLHKPIVWHINQATTKTTFKPNTWCLRCLWSTKPVDTSNSFEVLAVEDTQGMDNL
ACESQQPTSEEVVENPTIQKEVIECDVKTTEVVGNVILKPSDEGVKVTQELGHEDLMA
AYVENTSITIKKPNELSLALGLKTIATHGIAAINSVPWSKILAYVKPFLGQAAITTSN
CAKRLAQRVFNNYMPYVFTLLFQLCTFTKSTNSRIRASLPTTIAKNSVKSVAKLCLDA
GINYVKSPKFSKLFTIAMWLLLLSICLGSLICVTAAFGVLLSNFGAPSYCNGVRELYL
NSSNVTTMDFCEGSFPCSICLSGLDSLDSYPALETIQVTISSYKLDLTILGLAAEWVL
AYMLFTKFFYLLGLSAIMQVFFGYFASHFISNSWLMWFIISIVQMAPVSAMVRMYIFF
ASFYYIWKSYVHIMDGCTSSTCMMCYKRNRATRVECTTIVNGMKRSFYVYANGGRGFC
KTHNWNCLNCDTFCTGSTFISDEVARDLSLQFKRPINPTDQSSYIVDSVAVKNGALHL
YFDKAGQKTYERHPLSHFVNLDNLRANNTKGSLPINVIVFDGKSKCDESASKSASVYY
SQLMCQPILLLDQALVSDVGDSTEVSVKMFDAYVDTFSATFSVPMEKLKALVATAHSE
LAKGVALDGVLSTFVSAARQGVVDTDVDTKDVIECLKLSHHSDLEVTGDSCNNFMLTY
NKVENMTPRDLGACIDCNARHINAQVAKSHNVSLIWNVKDYMSLSEQLRKQIRSAAKK
NNIPFRLTCATTRQVVNVITTKISLKGGKIVSTCFKLMLKATLLCVLAALVCYIVMPV
HTLSIHDGYTNEIIGYKAIQDGVTRDIISTDDCFANKHAGFDAWFSQRGGSYKNDKSC
PVVAAIITREIGFIVPGLPGTVLRAINGDFLHFLPRVFSAVGNICYTPSKLIEYSDFA
TSACVLAAECTIFKDAMGKPVPYCYDTNLLEGSISYSELRPDTRYVLMDGSIIQFPNT
YLEGSVRVVTTFDAEYCRHGTCERSEVGICLSTSGRWVLNNEHYRALSGVFCGVDAMN
LIANIFTPLVQPVGALDVSASVVAGGIIAILVTCAAYYFMKFRRVFGEYNHVVAANAL
LFLMSFTILCLVPAYSFLPGVYSVFYLYLTFYFTNDVSFLAHLQWFAMFSPIVPFWIT
AIYVFCISLKHCHWFFNNYLRKRVMFNGVTFSTFEEAALCTFLLNKEMYLKLRSETLL
PLTQYNRYLALYNKYKYFSGALDTTSYREAACCHLAKALNDFSNSGADVLYQPPQTSI
TSAVLQSGFRKMAFPSGKVEGCMVQVTCGTTTLNGLWLDDTVYCPRHVICTAEDMLNP
NYEDLLIRKSNHSFLVQAGNVQLRVIGHSMQNCLLRLKVDTSNPKTPKYKFVRIQPGQ
TFSVLACYNGSPSGVYQCAMRPNHTIKGSFLNGSCGSVGFNIDYDCVSFCYMHHMELP
TGVHAGTDLEGKFYGPFVDRQTAQAAGTDTTITLNVLAWLYAAVINGDRWFLNRFTTT
LNDFNLVAMKYNYEPLTQDHVDILGPLSAQTGIAVLDMCAALKELLQNGMNGRTILGS
TILEDEFTPFDVVRQCSGVTFQGKFKKIVKGTHHWMLLTFLTSLLILVQSTQWSLFFF
VYENAFLPFTLGIMAIAACAMLLVKHKHAFLCLFLLPSLATVAYFNMVYMPASWVMRI
MTWLELADTSLSGYRLKDCVMYASALVLLILMTARTVYDDAARRVWTLMNVITLVYKV
YYGNALDQAISMWALVISVTSNYSGVVTTIMFLARAIVFVCVEYYPLLFITGNTLQCI
MLVYCFLGYCCCCYFGLFCLLNRYFRLTLGVYDYLVSTQEFRYMNSQGLLPPKSSIDA
FKLNIKLLGIGGKPCIKVATVQSKMSDVKCTSVVLLSVLQQLRVESSSKLWAQCVQLH
NDILLAKDTTEAFEKMVSLLSVLLSMQGAVDINRLCEEMLDNRATLQAIASEFSSLPS
YAAYATAQEAYEQAVANGDSEVVLKKLKKSLNVAKSEFDRDAAMQRKLEKMADQAMTQ
MYKQARSEDKRAKVTSAMQTMLFTMLRKLDNDALNNIINNARDGCVPLNIIPLTTAAK
LMVVVPDYGTYKNTCDGNTFTYASALWEIQQVVDADSKIVQLSEINMDNSPNLAWPLI
VTALRANSAVKLQNNELSPVALRQMSCAAGTTQTACTDDNALAYYNNSKGGRFVLALL
SDHQDLKWARFPKSDGTGTIYTELEPPCRFVTDTPKGPKVKYLYFIKGLNNLNRGMVL
GSLAATVRLQAGNATEVPANSTVLSFCAFAVDPAKAYKDYLASGGQPITNCVKMLCTH
TGTGQAITVTPEANMDQESFGGASCCLYCRCHIDHPNPKGFCDLKGKYVQIPTTCAND
PVGFTLRNTVCTVCGMWKGYGCSCDQLREPLMQSADASTFLNGFAV"
mat_peptide 265..804
/gene="ORF1ab"
/locus_tag="sarsp1"
/product="nsp1"
/protein_id="YP_009944366.1"
misc_feature 301..645
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="N-terminal domain of non-structural protein 1 from
Severe acute respiratory syndrome-related coronavirus and
betacoronavirus in the B lineage; Region:
SARS-CoV-like_Nsp1_N; cd21796"
/db_xref="CDD:439285"
misc_feature 646..804
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="C-terminal domain of non-structural protein 1 from
Severe acute respiratory syndrome-related coronavirus and
betacoronavirus in the B lineage; Region:
SARS-CoV-like_Nsp1_C; cd22662"
/db_xref="CDD:439355"
misc_feature order(706..720,724..747,781..789,793..798,802..804)
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="oligomer interface [polypeptide binding]; other
site"
/db_xref="CDD:439355"
misc_feature order(742..744,754..759,763..768,775..780,787..789)
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="RNA binding site [nucleotide binding]; other site"
/db_xref="CDD:439355"
mat_peptide 805..2718
/gene="ORF1ab"
/locus_tag="sarsp1"
/product="nsp2"
/protein_id="YP_009944367.1"
misc_feature 808..2718
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="betacoronavirus non-structural protein 2 (Nsp2)
similar to SARS-CoV Nsp2, and related proteins from
betacoronaviruses in the B lineage; Region:
betaCoV_Nsp2_SARS-like; cd21516"
/db_xref="CDD:439199"
mat_peptide 2719..8484
/gene="ORF1ab"
/locus_tag="sarsp1"
/product="nsp3"
/note="papain-like proteinase"
/protein_id="YP_009944368.1"
misc_feature 2905..3351
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="Protein of unknown function (DUF3655); Region:
DUF3655; pfam12379"
/db_xref="CDD:432517"
misc_feature 3358..3729
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="X-domain (or Mac1 domain) of viral non-structural
protein 3 and related macrodomains; Region:
Macro_X_Nsp3-like; cd21557"
/db_xref="CDD:438957"
misc_feature order(3376..3384,3394..3414,3637..3642,3646..3660,
3724..3726)
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="ADP-ribose binding site [chemical binding]; other
site"
/db_xref="CDD:438957"
misc_feature 3889..4266
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="SUD-N macrodomain (or Mac2 domain) of the SARS
Unique Domain (SUD) of SARS-CoV non-structural protein 3
and related macrodomains; Region:
Macro_cv_SUD-N_Nsp3-like; cd21562"
/db_xref="CDD:394883"
misc_feature 4294..4671
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="Single-stranded poly(A) binding domain; Region:
SUD-M; pfam11633"
/db_xref="CDD:431970"
misc_feature 4678..4878
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="C-terminal SARS-Unique Domain (SUD) of
non-structural protein 3 (Nsp3) from Severe Acute
Respiratory Syndrome coronavirus and related
betacoronaviruses in the B lineage; Region:
SUD_C_SARS-CoV_Nsp3; cd21525"
/db_xref="CDD:394841"
misc_feature 4891..5799
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="betacoronavirus papain-like protease; Region:
betaCoV_PLPro; cd21732"
/db_xref="CDD:409649"
misc_feature order(5209..5220,5368..5376,5380..5385,5392..5394,
5404..5406,5479..5481,5503..5508,5551..5553,5557..5559,
5626..5628,5674..5676,5692..5703,5785..5787)
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="ubiquitin binding site [polypeptide binding]; other
site"
/db_xref="CDD:409649"
misc_feature order(5368..5376,5623..5628,5674..5676,5683..5685,
5692..5694,5701..5703,5785..5787)
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="polypeptide substrate binding site [polypeptide
binding]; other site"
/db_xref="CDD:409649"
misc_feature 5932..6252
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="nucleic acid binding domain of non-structural
protein 3 from Severe acute respiratory syndrome-related
coronavirus and betacoronavirus in the B lineage; Region:
SARS-CoV-like_Nsp3_NAB; cd21822"
/db_xref="CDD:409348"
misc_feature 6325..6672
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="betacoronavirus-specific marker of non-structural
protein 3 from Severe acute respiratory syndrome-related
coronavirus and betacoronavirus in the B lineage; Region:
SARS-CoV-like_Nsp3_betaSM; cd21814"
/db_xref="CDD:409629"
misc_feature 6889..8481
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="C-terminus of non-structural protein 3, including
transmembrane and Y domains, from Severe acute respiratory
syndrome-related coronavirus and betacoronavirus in the B
lineage; Region: TM_Y_SARS-CoV-like_Nsp3_C; cd21717"
/db_xref="CDD:409665"
misc_feature 6889..6957
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="TM1 [structural motif]; Region: TM1"
/db_xref="CDD:409665"
misc_feature 7204..7272
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="TM2 [structural motif]; Region: TM2"
/db_xref="CDD:409665"
mat_peptide 8485..9984
/gene="ORF1ab"
/locus_tag="sarsp1"
/product="nsp4"
/protein_id="YP_009944369.1"
misc_feature 8524..9666
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="coronavirus non-structural protein 4 (Nsp4)
transmembrane domain; Region: cv_Nsp4_TM; cd21473"
/db_xref="CDD:394836"
misc_feature 8524..8592
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="putative TM helix 1 [structural motif]; Region:
putative TM helix 1"
/db_xref="CDD:394836"
misc_feature 9322..9387
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="putative TM helix 2 [structural motif]; Region:
putative TM helix 2"
/db_xref="CDD:394836"
misc_feature 9430..9495
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="putative TM helix 3 [structural motif]; Region:
putative TM helix 3"
/db_xref="CDD:394836"
misc_feature 9574..9642
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="putative TM helix 4 [structural motif]; Region:
putative TM helix 4"
/db_xref="CDD:394836"
misc_feature 9700..9978
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="Coronavirus nonstructural protein 4 C-terminus;
Region: Corona_NSP4_C; pfam16348"
/db_xref="CDD:406690"
mat_peptide 9985..10902
/gene="ORF1ab"
/locus_tag="sarsp1"
/product="3C-like protease"
/note="3CLp; nsp5"
/protein_id="YP_009944370.1"
misc_feature 9994..10884
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="betacoronavirus non-structural protein 5, also
called Main protease (Mpro); Region: betaCoV_Nsp5_Mpro;
cd21666"
/db_xref="CDD:394887"
misc_feature order(9994..10017,10024..10026,10336..10338,10348..10368,
10393..10407,10480..10482,10498..10500,10840..10842,
10852..10854,10876..10881)
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="homodimer interface [polypeptide binding]; other
site"
/db_xref="CDD:394887"
misc_feature order(10045..10047,10054..10062,10129..10131,10144..10146,
10402..10419,10471..10482,10486..10488,10498..10500,
10543..10545,10549..10557)
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="polypeptide substrate binding site [polypeptide
binding]; other site"
/db_xref="CDD:394887"
mat_peptide 10903..11772
/gene="ORF1ab"
/locus_tag="sarsp1"
/product="nsp6"
/protein_id="YP_009944371.1"
misc_feature 10903..11772
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="betacoronavirus non-structural protein 6; Region:
betaCoV-Nsp6; cd21560"
/db_xref="CDD:394846"
mat_peptide 11773..12021
/gene="ORF1ab"
/locus_tag="sarsp1"
/product="nsp7"
/protein_id="YP_009944372.1"
misc_feature 11773..12021
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="betacoronavirus non-structural protein 7; Region:
betaCoV_Nsp7; cd21827"
/db_xref="CDD:409253"
misc_feature order(11776..11778,11785..11796,11803..11811,11815..11820,
11827..11829,11854..11856,11863..11865,11881..11883,
11917..11934,11938..11955,11974..11988)
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="oligomer interface [polypeptide binding]; other
site"
/db_xref="CDD:409253"
mat_peptide 12022..12615
/gene="ORF1ab"
/locus_tag="sarsp1"
/product="nsp8"
/protein_id="YP_009944373.1"
misc_feature 12022..12612
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="nsp8 replicase; Region: nsp8; pfam08717"
/db_xref="CDD:400866"
mat_peptide 12616..12954
/gene="ORF1ab"
/locus_tag="sarsp1"
/product="nsp9"
/protein_id="YP_009944374.1"
misc_feature 12616..12954
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="betacoronavirus non-structural protein 9; Region:
betaCoV_Nsp9; cd21898"
/db_xref="CDD:409331"
misc_feature 12616..12633
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="N-finger; other site"
/db_xref="CDD:409331"
misc_feature order(12622..12630,12634..12642,12832..12837,12901..12906,
12910..12918,12922..12930,12934..12942,12946..12954)
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="homodimer interface [polypeptide binding]; other
site"
/db_xref="CDD:409331"
mat_peptide 12955..13371
/gene="ORF1ab"
/locus_tag="sarsp1"
/product="nsp10"
/protein_id="YP_009944375.1"
misc_feature 12955..13347
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="coronavirus non-structural protein 10; Region:
CoV_Nsp10; cd21872"
/db_xref="CDD:409325"
misc_feature order(12955..12978,12988..12990,12994..13002,13006..13014,
13027..13032,13039..13044,13051..13053,13072..13089,
13126..13131,13159..13161,13165..13170,13180..13182,
13186..13203,13216..13224,13231..13242)
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="Nsp14 interface [polypeptide binding]; other site"
/db_xref="CDD:409325"
misc_feature order(12994..13002,13006..13014,13027..13029,13072..13074,
13078..13089,13126..13134,13186..13197,13204..13206,
13237..13242,13297..13299)
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="oligomer interface [polypeptide binding]; other
site"
/db_xref="CDD:409325"
misc_feature order(13027..13029,13078..13089,13126..13134,13204..13206,
13237..13242,13297..13299)
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="homotrimer interface [polypeptide binding]; other
site"
/db_xref="CDD:409325"
misc_feature order(13072..13095,13123..13131,13159..13170,13183..13188,
13192..13194,13231..13242)
/gene="ORF1ab"
/locus_tag="sarsp1"
/note="Nsp16 interface [polypeptide binding]; other site"
/db_xref="CDD:409325"
mat_peptide 13372..13410
/gene="ORF1ab"
/locus_tag="sarsp1"
/product="nsp11"
/protein_id="YP_009944376.1"
gene 21492..25259
/gene="S"
/locus_tag="sars2"
/gene_synonym="E2"
/db_xref="GeneID:1489668"
CDS 21492..25259
/gene="S"
/locus_tag="sars2"
/gene_synonym="E2"
/codon_start=1
/product="spike glycoprotein"
/protein_id="YP_009825051.1"
/db_xref="GeneID:1489668"
/translation="MFIFLLFLTLTSGSDLDRCTTFDDVQAPNYTQHTSSMRGVYYPD
EIFRSDTLYLTQDLFLPFYSNVTGFHTINHTFGNPVIPFKDGIYFAATEKSNVVRGWV
FGSTMNNKSQSVIIINNSTNVVIRACNFELCDNPFFAVSKPMGTQTHTMIFDNAFNCT
FEYISDAFSLDVSEKSGNFKHLREFVFKNKDGFLYVYKGYQPIDVVRDLPSGFNTLKP
IFKLPLGINITNFRAILTAFSPAQDIWGTSAAAYFVGYLKPTTFMLKYDENGTITDAV
DCSQNPLAELKCSVKSFEIDKGIYQTSNFRVVPSGDVVRFPNITNLCPFGEVFNATKF
PSVYAWERKKISNCVADYSVLYNSTFFSTFKCYGVSATKLNDLCFSNVYADSFVVKGD
DVRQIAPGQTGVIADYNYKLPDDFMGCVLAWNTRNIDATSTGNYNYKYRYLRHGKLRP
FERDISNVPFSPDGKPCTPPALNCYWPLNDYGFYTTTGIGYQPYRVVVLSFELLNAPA
TVCGPKLSTDLIKNQCVNFNFNGLTGTGVLTPSSKRFQPFQQFGRDVSDFTDSVRDPK
TSEILDISPCAFGGVSVITPGTNASSEVAVLYQDVNCTDVSTAIHADQLTPAWRIYST
GNNVFQTQAGCLIGAEHVDTSYECDIPIGAGICASYHTVSLLRSTSQKSIVAYTMSLG
ADSSIAYSNNTIAIPTNFSISITTEVMPVSMAKTSVDCNMYICGDSTECANLLLQYGS
FCTQLNRALSGIAAEQDRNTREVFAQVKQMYKTPTLKYFGGFNFSQILPDPLKPTKRS
FIEDLLFNKVTLADAGFMKQYGECLGDINARDLICAQKFNGLTVLPPLLTDDMIAAYT
AALVSGTATAGWTFGAGAALQIPFAMQMAYRFNGIGVTQNVLYENQKQIANQFNKAIS
QIQESLTTTSTALGKLQDVVNQNAQALNTLVKQLSSNFGAISSVLNDILSRLDKVEAE
VQIDRLITGRLQSLQTYVTQQLIRAAEIRASANLAATKMSECVLGQSKRVDFCGKGYH
LMSFPQAAPHGVVFLHVTYVPSQERNFTTAPAICHEGKAYFPREGVFVFNGTSWFITQ
RNFFSPQIITTDNTFVSGNCDVVIGIINNTVYDPLQPELDSFKEELDKYFKNHTSPDV
DLGDISGINASVVNIQKEIDRLNEVAKNLNESLIDLQELGKYEQYIKWPWYVWLGFIA
GLIAIVMVTILLCCMTSCCSCLKGACSCGSCCKFDEDDSEPVLKGVKLHYT"
misc_feature 21540..22364
/gene="S"
/locus_tag="sars2"
/gene_synonym="E2"
/note="N-terminal domain of the S1 subunit of the Spike
(S) protein from Severe acute respiratory syndrome
coronavirus and related betacoronaviruses in the B
lineage; Region: SARS-CoV-like_Spike_S1_NTD; cd21624"
/db_xref="CDD:394950"
misc_feature order(21615..21617,21624..21635,21819..21821,21825..21827,
21969..21971,22062..22070,22143..22145,22152..22154,
22158..22172,22293..22301)
/gene="S"
/locus_tag="sars2"
/gene_synonym="E2"
/note="trimer interface [polypeptide binding]; other site"
/db_xref="CDD:394950"
misc_feature 22407..23072
/gene="S"
/locus_tag="sars2"
/gene_synonym="E2"
/note="receptor-binding domain of the S1 subunit of severe
acute respiratory syndrome-related coronavirus Spike (S)
protein; Region: SARS-CoV_Spike_S1_RBD; cd21481"
/db_xref="CDD:394828"
misc_feature order(22407..22409,22515..22517,22593..22601,22605..22610,
22617..22622,22638..22640,22821..22823,22836..22850,
22854..22859,22863..22865,22995..23000,23004..23006)
/gene="S"
/locus_tag="sars2"
/gene_synonym="E2"
/note="trimer interface [polypeptide binding]; other site"
/db_xref="CDD:394828"
misc_feature order(22557..22562,22566..22568,22572..22574,22578..22586,
22590..22592,22596..22598,22602..22610,22617..22622,
22626..22628,22734..22742,22992..22994,22998..23000,
23004..23006)
/gene="S"
/locus_tag="sars2"
/gene_synonym="E2"
/note="cryptic epitope [polypeptide binding]; other site"
/db_xref="CDD:394828"
misc_feature 22764..22973
/gene="S"
/locus_tag="sars2"
/gene_synonym="E2"
/note="receptor binding motif; other site"
/db_xref="CDD:394828"
misc_feature order(22767..22769,22797..22799,22809..22811,22815..22820,
22869..22871,22875..22880,22905..22910,22914..22916,
22926..22928,22935..22937,22941..22955,22962..22964)
/gene="S"
/locus_tag="sars2"
/gene_synonym="E2"
/note="receptor binding site [polypeptide binding]; other
site"
/db_xref="CDD:394828"
misc_feature 23076..25061
/gene="S"
/locus_tag="sars2"
/gene_synonym="E2"
/note="SD-1 and SD-2 subdomains, the S1/S2 cleavage
region, and the S2 fusion subunit of the spike (S)
glycoprotein from SARS-CoV-2 (COVID-19) and related
betacoronaviruses in the B lineage; Region:
SARS-CoV-like_Spike_SD1-2_S1-S2_S2; cd22378"
/db_xref="CDD:411965"
misc_feature order(23256..23258,23295..23297,23418..23420,23562..23564,
23586..23588,23838..23840,24657..24659,24729..24731,
24837..24839,24909..24911,24954..24956,25017..25019)
/gene="S"
/locus_tag="sars2"
/gene_synonym="E2"
/note="N-linked glycosylation sites [posttranslational
modification]; other site"
/db_xref="CDD:411965"
misc_feature order(23463..23477,23481..23501,23505..23564)
/gene="S"
/locus_tag="sars2"
/gene_synonym="E2"
/note="S1/S2 cleavage region; other site"
/db_xref="CDD:411965"
misc_feature 23799..23855
/gene="S"
/locus_tag="sars2"
/gene_synonym="E2"
/note="fusion peptide; other site"
/db_xref="CDD:411965"
misc_feature 23883..23936
/gene="S"
/locus_tag="sars2"
/gene_synonym="E2"
/note="internal fusion peptide; other site"
/db_xref="CDD:411965"
misc_feature 24189..24386
/gene="S"
/locus_tag="sars2"
/gene_synonym="E2"
/note="heptad repeat 1 [structural motif]; Region: heptad
repeat 1"
/db_xref="CDD:411965"
misc_feature 24921..25046
/gene="S"
/locus_tag="sars2"
/gene_synonym="E2"
/note="heptad repeat 2 [structural motif]; Region: heptad
repeat 2"
/db_xref="CDD:411965"
misc_feature 25137..25253
/gene="S"
/locus_tag="sars2"
/gene_synonym="E2"
/note="Coronavirus spike glycoprotein S2, intravirion;
Region: CoV_S2_C; pfam19214"
/db_xref="CDD:437051"
gene 25268..26092
/gene="ORF3a"
/locus_tag="sars3a"
/db_xref="GeneID:1489669"
CDS 25268..26092
/gene="ORF3a"
/locus_tag="sars3a"
/codon_start=1
/product="ORF3a protein"
/protein_id="YP_009825052.1"
/db_xref="GeneID:1489669"
/translation="MDLFMRFFTLRSITAQPVKIDNASPASTVHATATIPLQASLPFG
WLVIGVAFLAVFQSATKIIALNKRWQLALYKGFQFICNLLLLFVTIYSHLLLVAAGME
AQFLYLYALIYFLQCINACRIIMRCWLCWKCKSKNPLLYDANYFVCWHTHNYDYCIPY
NSVTDTIVVTEGDGISTPKLKEDYQIGGYSEDRHSGVKDYVVVHGYFTEVYYQLESTQ
ITTDTGIENATFFIFNKLVKDPPNVQIHTIDGSSGVANPAMDPIYDEPTTTTSVPL"
misc_feature 25268..26086
/gene="ORF3a"
/locus_tag="sars3a"
/note="Coronavirus accessory protein 3a; Region:
APA3_viroporin; pfam11289"
/db_xref="CDD:431787"
misc_feature 25382..25444
/gene="ORF3a"
/locus_tag="sars3a"
/note="putative TM segment [structural motif]; Region:
putative TM segment"
/db_xref="CDD:439223"
misc_feature order(25403..25408,25415..25420,25427..25432,25436..25444,
25448..25453,25460..25465,25520..25522,25532..25534,
25589..25594,25601..25603,25610..25615,25619..25624,
25631..25633,25697..25702,25745..25750,25757..25759,
25769..25771,25820..25822,25826..25834,25910..25915,
25919..25921,25931..25936,25940..25942,25955..25957,
25961..25963,25967..25969)
/gene="ORF3a"
/locus_tag="sars3a"
/note="dimer interface [polypeptide binding]; other site"
/db_xref="CDD:439223"
misc_feature 25502..25564
/gene="ORF3a"
/locus_tag="sars3a"
/note="putative TM segment [structural motif]; Region:
putative TM segment"
/db_xref="CDD:439223"
misc_feature 25580..25642
/gene="ORF3a"
/locus_tag="sars3a"
/note="putative TM segment [structural motif]; Region:
putative TM segment"
/db_xref="CDD:439223"
misc_feature order(25658..25660,25667..25669,25673..25675,25715..25726,
25730..25732)
/gene="ORF3a"
/locus_tag="sars3a"
/note="putative tetramer interface [polypeptide binding];
other site"
/db_xref="CDD:439223"
gene 25689..26153
/gene="ORF3b"
/locus_tag="sars3b"
/db_xref="GeneID:1489670"
CDS 25689..26153
/gene="ORF3b"
/locus_tag="sars3b"
/note="ORF4"
/codon_start=1
/product="ORF3b protein"
/protein_id="YP_009825053.1"
/db_xref="GeneID:1489670"
/translation="MMPTTLFAGTHITMTTVYHITVSQIQLSLLKVTAFQHQNSKKTT
KLVVILRIGTQVLKTMSLYMAISPKFTTSLSLHKLLQTLVLKMLHSSSLTSLLKTHRM
CKYTQSTALQELLIQQWIQFMMSRRRLLACLCKHKKVSTNLCTHSFRKKQVR"
misc_feature 25689..26147
/gene="ORF3b"
/locus_tag="sars3b"
/note="accessory protein ORF3b of severe acute respiratory
syndrome-associated coronavirus; Region: SARS-CoV_ORF3b;
cl40696"
/db_xref="CDD:424327"
gene 26117..26347
/gene="E"
/locus_tag="sars4"
/db_xref="GeneID:1489671"
CDS 26117..26347
/gene="E"
/locus_tag="sars4"
/codon_start=1
/product="small envelope protein"
/protein_id="YP_009825054.1"
/db_xref="GeneID:1489671"
/translation="MYSFVSEETGTLIVNSVLLFLAFVVFLLVTLAILTALRLCAYCC
NIVNVSLVKPTVYVYSRVKNLNSSEGVPDLLV"
misc_feature 26120..26344
/gene="E"
/locus_tag="sars4"
/note="Severe acute respiratory syndrome coronavirus 2
Envelope small membrane protein; Region: SARS-CoV-2_E;
cd21536"
/db_xref="CDD:394862"
misc_feature order(26138..26140,26159..26164,26168..26173,26177..26203,
26207..26209,26255..26263,26285..26287,26294..26299,
26303..26311)
/gene="E"
/locus_tag="sars4"
/note="homopentameric interface [polypeptide binding];
other site"
/db_xref="CDD:394862"
misc_feature 26333..26344
/gene="E"
/locus_tag="sars4"
/note="PDZ binding motif; other site"
/db_xref="CDD:394862"
gene 26398..27063
/gene="M"
/locus_tag="sars5"
/db_xref="GeneID:1489672"
CDS 26398..27063
/gene="M"
/locus_tag="sars5"
/codon_start=1
/product="membrane glycoprotein M"
/protein_id="YP_009825055.1"
/db_xref="GeneID:1489672"
/translation="MADNGTITVEELKQLLEQWNLVIGFLFLAWIMLLQFAYSNRNRF
LYIIKLVFLWLLWPVTLACFVLAAVYRINWVTGGIAIAMACIVGLMWLSYFVASFRLF
ARTRSMWSFNPETNILLNVPLRGTIVTRPLMESELVIGAVIIRGHLRMAGHSLGRCDI
KDLPKEITVATSRTLSYYKLGASQRVGTDSGFAAYNRYRIGNYKLNTDHAGSNDNIAL
LVQ"
misc_feature 26407..27060
/gene="M"
/locus_tag="sars5"
/note="Membrane (or Matrix) protein from Severe acute
respiratory syndrome (SARS) coronavirus, SARS-CoV-2, and
related betacoronaviruses in the B lineage; Region:
SARS-like-CoV_M; cd21569"
/db_xref="CDD:394855"
gene 26913..27265
/gene="ORF6"
/locus_tag="sars6"
/db_xref="GeneID:1489673"
CDS 27074..27265
/gene="ORF6"
/locus_tag="sars6"
/note="ORF7"
/codon_start=1
/product="ORF6 protein"
/protein_id="YP_009825056.1"
/db_xref="GeneID:1489673"
/translation="MFHLVDFQVTIAEILIIIMRTFRIAIWNLDVIISSIVRQLFKPL
TKKNYSELDDEEPMELDYP"
misc_feature 27074..27259
/gene="ORF6"
/locus_tag="sars6"
/note="Open reading frame 6 from SARS coronavirus; Region:
Sars6; pfam12133"
/db_xref="CDD:432352"
gene 27273..27641
/gene="ORF7a"
/locus_tag="sars7a"
/db_xref="GeneID:1489674"
CDS 27273..27641
/gene="ORF7a"
/locus_tag="sars7a"
/note="ORF8"
/codon_start=1
/product="ORF7a protein"
/protein_id="YP_009825057.1"
/db_xref="GeneID:1489674"
/translation="MKIILFLTLIVFTSCELYHYQECVRGTTVLLKEPCPSGTYEGNS
PFHPLADNKFALTCTSTHFAFACADGTRHTYQLRARSVSPKLFIRQEEVQQELYSPLF
LIVAALVFLILCFTIKRKTE"
misc_feature 27318..27638
/gene="ORF7a"
/locus_tag="sars7a"
/note="SARS coronavirus X4 like; Region: SARS_X4;
pfam08779"
/db_xref="CDD:400915"
gene 27638..27772
/gene="ORF7b"
/locus_tag="sars7b"
/db_xref="GeneID:1489675"
CDS 27638..27772
/gene="ORF7b"
/locus_tag="sars7b"
/note="ORF9"
/codon_start=1
/product="ORF7b protein"
/protein_id="YP_009825058.1"
/db_xref="GeneID:1489675"
/translation="MNELTLIDFYLCFLAFLLFLVLIMLIIFWFSLEIQDLEEPCTKV
"
misc_feature 27638..27766
/gene="ORF7b"
/locus_tag="sars7b"
/note="Protein of unknown function (DUF2873); Region:
DUF2873; pfam11395"
/db_xref="CDD:431866"
gene 27779..27898
/gene="ORF8a"
/locus_tag="sars8a"
/db_xref="GeneID:1489676"
CDS 27779..27898
/gene="ORF8a"
/locus_tag="sars8a"
/note="ORF10"
/codon_start=1
/product="ORF8a protein"
/protein_id="YP_009825059.1"
/db_xref="GeneID:1489676"
/translation="MKLLIVLTCISLCSCICTVVQRCASNKPHVLEDPCKVQH"
misc_feature 27782..>27883
/gene="ORF8a"
/locus_tag="sars8a"
/note="SARS-CoV-2 ORF8 immunoglobulin (Ig) domain protein
and related proteins; Region: ORF8-Ig_SARS-CoV-2-like;
cl40466"
/db_xref="CDD:454761"
misc_feature 27827..27847
/gene="ORF8a"
/locus_tag="sars8a"
/note="Ig strand A [structural motif]; Region: Ig strand
A"
/db_xref="CDD:439221"
misc_feature 27860..27877
/gene="ORF8a"
/locus_tag="sars8a"
/note="Ig strand B [structural motif]; Region: Ig strand
B"
/db_xref="CDD:439221"
gene 27864..28118
/gene="ORF8b"
/locus_tag="sars8b"
/db_xref="GeneID:1489677"
CDS 27864..28118
/gene="ORF8b"
/locus_tag="sars8b"
/note="ORF11"
/codon_start=1
/product="ORF8b protein"
/protein_id="YP_009825060.1"
/db_xref="GeneID:1489677"
/translation="MCLKILVRYNTRGNTYSTAWLCALGKVLPFHRWHTMVQTCTPNV
TINCQDPAGGALIARCWYLHEGHQTAAFRDVLVVLNKRTN"
misc_feature <27882..28109
/gene="ORF8b"
/locus_tag="sars8b"
/note="SARS-CoV-2 ORF8 immunoglobulin (Ig) domain protein
and related proteins; Region: ORF8-Ig_SARS-CoV-2-like;
cl40466"
/db_xref="CDD:454761"
misc_feature 27915..27926
/gene="ORF8b"
/locus_tag="sars8b"
/note="Ig strand C' [structural motif]; Region: Ig strand
C'"
/db_xref="CDD:439221"
misc_feature 27936..27950
/gene="ORF8b"
/locus_tag="sars8b"
/note="Ig strand D [structural motif]; Region: Ig strand
D"
/db_xref="CDD:439221"
misc_feature 27993..28007
/gene="ORF8b"
/locus_tag="sars8b"
/note="Ig strand E [structural motif]; Region: Ig strand
E"
/db_xref="CDD:439221"
misc_feature 28023..28049
/gene="ORF8b"
/locus_tag="sars8b"
/note="Ig strand F [structural motif]; Region: Ig strand
F"
/db_xref="CDD:439221"
misc_feature 28071..28097
/gene="ORF8b"
/locus_tag="sars8b"
/note="Ig strand G [structural motif]; Region: Ig strand
G"
/db_xref="CDD:439221"
gene 28120..29388
/gene="N"
/locus_tag="sars9a"
/db_xref="GeneID:1489678"
CDS 28120..29388
/gene="N"
/locus_tag="sars9a"
/codon_start=1
/product="nucleocapsid protein"
/protein_id="YP_009825061.1"
/db_xref="GeneID:1489678"
/translation="MSDNGPQSNQRSAPRITFGGPTDSTDNNQNGGRNGARPKQRRPQ
GLPNNTASWFTALTQHGKEELRFPRGQGVPINTNSGPDDQIGYYRRATRRVRGGDGKM
KELSPRWYFYYLGTGPEASLPYGANKEGIVWVATEGALNTPKDHIGTRNPNNNAATVL
QLPQGTTLPKGFYAEGSRGGSQASSRSSSRSRGNSRNSTPGSSRGNSPARMASGGGET
ALALLLLDRLNQLESKVSGKGQQQQGQTVTKKSAAEASKKPRQKRTATKQYNVTQAFG
RRGPEQTQGNFGDQDLIRQGTDYKHWPQIAQFAPSASAFFGMSRIGMEVTPSGTWLTY
HGAIKLDDKDPQFKDNVILLNKHIDAYKTFPPTEPKKDKKKKTDEAQPLPQRQKKQPT
VTLLPAADMDDFSRQLQNSMSGASADSTQA"
misc_feature 28240..29295
/gene="N"
/locus_tag="sars9a"
/note="Coronavirus nucleocapsid protein; Region:
Corona_nucleoca; pfam00937"
/db_xref="CDD:425955"
misc_feature order(28270..28287,28441..28443,28447..28449,28453..28455,
28567..28569,28588..28590)
/gene="N"
/locus_tag="sars9a"
/note="RNA binding site [nucleotide binding]; other site"
/db_xref="CDD:439219"
gene 28130..28426
/locus_tag="sars9b"
/db_xref="GeneID:1489679"
CDS 28130..28426
/locus_tag="sars9b"
/note="ORF13"
/codon_start=1
/product="ORF9b protein"
/protein_id="YP_009825062.1"
/db_xref="GeneID:1489679"
/translation="MDPNQTNVVPPALHLVDPQIQLTITRMEDAMGQGQNSADPKVYP
IILRLGSQLSLSMARRNLDSLEARAFQSTPIVVQMTKLATTEELPDEFVVVTAK"
misc_feature 28130..28423
/locus_tag="sars9b"
/note="SARS lipid binding protein; Region:
SARS_lipid_bind; pfam09399"
/db_xref="CDD:430584"
misc_feature order(28184..28204,28256..28258,28280..28309,28313..28315,
28337..28339,28382..28387,28391..28417,28421..28423)
/locus_tag="sars9b"
/note="homodimer interface [polypeptide binding]; other
site"
/db_xref="CDD:409250"
misc_feature order(28193..28195,28262..28264,28268..28270,28286..28294,
28358..28360,28412..28414)
/locus_tag="sars9b"
/note="lipid binding cavity [chemical binding];
lipid-binding site"
/db_xref="CDD:409250"
CDS 28583..28795
/gene="N"
/locus_tag="sars9a"
/note="ORF14"
/codon_start=1
/product="ORF9a protein"
/protein_id="YP_009825063.1"
/db_xref="GeneID:1489678"
/translation="MLPPCYNFLKEQHCQKASTQREAEAAVKPLLAPHHVVAVIQEIQ
LLAAVGEILLLEWLAEVVKLPSRYCC"
misc_feature 28583..28792
/gene="N"
/locus_tag="sars9a"
/note="accessory protein ORF9c (also referred to as ORF14)
from Severe acute respiratory syndrome-associated
coronavirus and related coronaviruses; Region:
SARS-CoV_ORF9c; cl38891"
/db_xref="CDD:422948"
3'UTR 28796..29751
ORIGIN
1 atattaggtt tttacctacc caggaaaagc caaccaacct cgatctcttg tagatctgtt
61 ctctaaacga actttaaaat ctgtgtagct gtcgctcggc tgcatgccta gtgcacctac
121 gcagtataaa caataataaa ttttactgtc gttgacaaga aacgagtaac tcgtccctct
181 tctgcagact gcttacggtt tcgtccgtgt tgcagtcgat catcagcata cctaggtttc
241 gtccgggtgt gaccgaaagg taagatggag agccttgttc ttggtgtcaa cgagaaaaca
301 cacgtccaac tcagtttgcc tgtccttcag gttagagacg tgctagtgcg tggcttcggg
361 gactctgtgg aagaggccct atcggaggca cgtgaacacc tcaaaaatgg cacttgtggt
421 ctagtagagc tggaaaaagg cgtactgccc cagcttgaac agccctatgt gttcattaaa
481 cgttctgatg ccttaagcac caatcacggc cacaaggtcg ttgagctggt tgcagaaatg
541 gacggcattc agtacggtcg tagcggtata acactgggag tactcgtgcc acatgtgggc
601 gaaaccccaa ttgcataccg caatgttctt cttcgtaaga acggtaataa gggagccggt
661 ggtcatagct atggcatcga tctaaagtct tatgacttag gtgacgagct tggcactgat
721 cccattgaag attatgaaca aaactggaac actaagcatg gcagtggtgc actccgtgaa
781 ctcactcgtg agctcaatgg aggtgcagtc actcgctatg tcgacaacaa tttctgtggc
841 ccagatgggt accctcttga ttgcatcaaa gattttctcg cacgcgcggg caagtcaatg
901 tgcactcttt ccgaacaact tgattacatc gagtcgaaga gaggtgtcta ctgctgccgt
961 gaccatgagc atgaaattgc ctggttcact gagcgctctg ataagagcta cgagcaccag
1021 acacccttcg aaattaagag tgccaagaaa tttgacactt tcaaagggga atgcccaaag
1081 tttgtgtttc ctcttaactc aaaagtcaaa gtcattcaac cacgtgttga aaagaaaaag
1141 actgagggtt tcatggggcg tatacgctct gtgtaccctg ttgcatctcc acaggagtgt
1201 aacaatatgc acttgtctac cttgatgaaa tgtaatcatt gcgatgaagt ttcatggcag
1261 acgtgcgact ttctgaaagc cacttgtgaa cattgtggca ctgaaaattt agttattgaa
1321 ggacctacta catgtgggta cctacctact aatgctgtag tgaaaatgcc atgtcctgcc
1381 tgtcaagacc cagagattgg acctgagcat agtgttgcag attatcacaa ccactcaaac
1441 attgaaactc gactccgcaa gggaggtagg actagatgtt ttggaggctg tgtgtttgcc
1501 tatgttggct gctataataa gcgtgcctac tgggttcctc gtgctagtgc tgatattggc
1561 tcaggccata ctggcattac tggtgacaat gtggagacct tgaatgagga tctccttgag
1621 atactgagtc gtgaacgtgt taacattaac attgttggcg attttcattt gaatgaagag
1681 gttgccatca ttttggcatc tttctctgct tctacaagtg cctttattga cactataaag
1741 agtcttgatt acaagtcttt caaaaccatt gttgagtcct gcggtaacta taaagttacc
1801 aagggaaagc ccgtaaaagg tgcttggaac attggacaac agagatcagt tttaacacca
1861 ctgtgtggtt ttccctcaca ggctgctggt gttatcagat caatttttgc gcgcacactt
1921 gatgcagcaa accactcaat tcctgatttg caaagagcag ctgtcaccat acttgatggt
1981 atttctgaac agtcattacg tcttgtcgac gccatggttt atacttcaga cctgctcacc
2041 aacagtgtca ttattatggc atatgtaact ggtggtcttg tacaacagac ttctcagtgg
2101 ttgtctaatc ttttgggcac tactgttgaa aaactcaggc ctatctttga atggattgag
2161 gcgaaactta gtgcaggagt tgaatttctc aaggatgctt gggagattct caaatttctc
2221 attacaggtg tttttgacat cgtcaagggt caaatacagg ttgcttcaga taacatcaag
2281 gattgtgtaa aatgcttcat tgatgttgtt aacaaggcac tcgaaatgtg cattgatcaa
2341 gtcactatcg ctggcgcaaa gttgcgatca ctcaacttag gtgaagtctt catcgctcaa
2401 agcaagggac tttaccgtca gtgtatacgt ggcaaggagc agctgcaact actcatgcct
2461 cttaaggcac caaaagaagt aacctttctt gaaggtgatt cacatgacac agtacttacc
2521 tctgaggagg ttgttctcaa gaacggtgaa ctcgaagcac tcgagacgcc cgttgatagc
2581 ttcacaaatg gagctatcgt tggcacacca gtctgtgtaa atggcctcat gctcttagag
2641 attaaggaca aagaacaata ctgcgcattg tctcctggtt tactggctac aaacaatgtc
2701 tttcgcttaa aagggggtgc accaattaaa ggtgtaacct ttggagaaga tactgtttgg
2761 gaagttcaag gttacaagaa tgtgagaatc acatttgagc ttgatgaacg tgttgacaaa
2821 gtgcttaatg aaaagtgctc tgtctacact gttgaatccg gtaccgaagt tactgagttt
2881 gcatgtgttg tagcagaggc tgttgtgaag actttacaac cagtttctga tctccttacc
2941 aacatgggta ttgatcttga tgagtggagt gtagctacat tctacttatt tgatgatgct
3001 ggtgaagaaa acttttcatc acgtatgtat tgttcctttt accctccaga tgaggaagaa
3061 gaggacgatg cagagtgtga ggaagaagaa attgatgaaa cctgtgaaca tgagtacggt
3121 acagaggatg attatcaagg tctccctctg gaatttggtg cctcagctga aacagttcga
3181 gttgaggaag aagaagagga agactggctg gatgatacta ctgagcaatc agagattgag
3241 ccagaaccag aacctacacc tgaagaacca gttaatcagt ttactggtta tttaaaactt
3301 actgacaatg ttgccattaa atgtgttgac atcgttaagg aggcacaaag tgctaatcct
3361 atggtgattg taaatgctgc taacatacac ctgaaacatg gtggtggtgt agcaggtgca
3421 ctcaacaagg caaccaatgg tgccatgcaa aaggagagtg atgattacat taagctaaat
3481 ggccctctta cagtaggagg gtcttgtttg ctttctggac ataatcttgc taagaagtgt
3541 ctgcatgttg ttggacctaa cctaaatgca ggtgaggaca tccagcttct taaggcagca
3601 tatgaaaatt tcaattcaca ggacatctta cttgcaccat tgttgtcagc aggcatattt
3661 ggtgctaaac cacttcagtc tttacaagtg tgcgtgcaga cggttcgtac acaggtttat
3721 attgcagtca atgacaaagc tctttatgag caggttgtca tggattatct tgataacctg
3781 aagcctagag tggaagcacc taaacaagag gagccaccaa acacagaaga ttccaaaact
3841 gaggagaaat ctgtcgtaca gaagcctgtc gatgtgaagc caaaaattaa ggcctgcatt
3901 gatgaggtta ccacaacact ggaagaaact aagtttctta ccaataagtt actcttgttt
3961 gctgatatca atggtaagct ttaccatgat tctcagaaca tgcttagagg tgaagatatg
4021 tctttccttg agaaggatgc accttacatg gtaggtgatg ttatcactag tggtgatatc
4081 acttgtgttg taataccctc caaaaaggct ggtggcacta ctgagatgct ctcaagagct
4141 ttgaagaaag tgccagttga tgagtatata accacgtacc ctggacaagg atgtgctggt
4201 tatacacttg aggaagctaa gactgctctt aagaaatgca aatctgcatt ttatgtacta
4261 ccttcagaag cacctaatgc taaggaagag attctaggaa ctgtatcctg gaatttgaga
4321 gaaatgcttg ctcatgctga agagacaaga aaattaatgc ctatatgcat ggatgttaga
4381 gccataatgg caaccatcca acgtaagtat aaaggaatta aaattcaaga gggcatcgtt
4441 gactatggtg tccgattctt cttttatact agtaaagagc ctgtagcttc tattattacg
4501 aagctgaact ctctaaatga gccgcttgtc acaatgccaa ttggttatgt gacacatggt
4561 tttaatcttg aagaggctgc gcgctgtatg cgttctctta aagctcctgc cgtagtgtca
4621 gtatcatcac cagatgctgt tactacatat aatggatacc tcacttcgtc atcaaagaca
4681 tctgaggagc actttgtaga aacagtttct ttggctggct cttacagaga ttggtcctat
4741 tcaggacagc gtacagagtt aggtgttgaa tttcttaagc gtggtgacaa aattgtgtac
4801 cacactctgg agagccccgt cgagtttcat cttgacggtg aggttctttc acttgacaaa
4861 ctaaagagtc tcttatccct gcgggaggtt aagactataa aagtgttcac aactgtggac
4921 aacactaatc tccacacaca gcttgtggat atgtctatga catatggaca gcagtttggt
4981 ccaacatact tggatggtgc tgatgttaca aaaattaaac ctcatgtaaa tcatgagggt
5041 aagactttct ttgtactacc tagtgatgac acactacgta gtgaagcttt cgagtactac
5101 catactcttg atgagagttt tcttggtagg tacatgtctg ctttaaacca cacaaagaaa
5161 tggaaatttc ctcaagttgg tggtttaact tcaattaaat gggctgataa caattgttat
5221 ttgtctagtg ttttattagc acttcaacag cttgaagtca aattcaatgc accagcactt
5281 caagaggctt attatagagc ccgtgctggt gatgctgcta acttttgtgc actcatactc
5341 gcttacagta ataaaactgt tggcgagctt ggtgatgtca gagaaactat gacccatctt
5401 ctacagcatg ctaatttgga atctgcaaag cgagttctta atgtggtgtg taaacattgt
5461 ggtcagaaaa ctactacctt aacgggtgta gaagctgtga tgtatatggg tactctatct
5521 tatgataatc ttaagacagg tgtttccatt ccatgtgtgt gtggtcgtga tgctacacaa
5581 tatctagtac aacaagagtc ttcttttgtt atgatgtctg caccacctgc tgagtataaa
5641 ttacagcaag gtacattctt atgtgcgaat gagtacactg gtaactatca gtgtggtcat
5701 tacactcata taactgctaa ggagaccctc tatcgtattg acggagctca ccttacaaag
5761 atgtcagagt acaaaggacc agtgactgat gttttctaca aggaaacatc ttacactaca
5821 accatcaagc ctgtgtcgta taaactcgat ggagttactt acacagagat tgaaccaaaa
5881 ttggatgggt attataaaaa ggataatgct tactatacag agcagcctat agaccttgta
5941 ccaactcaac cattaccaaa tgcgagtttt gataatttca aactcacatg ttctaacaca
6001 aaatttgctg atgatttaaa tcaaatgaca ggcttcacaa agccagcttc acgagagcta
6061 tctgtcacat tcttcccaga cttgaatggc gatgtagtgg ctattgacta tagacactat
6121 tcagcgagtt tcaagaaagg tgctaaatta ctgcataagc caattgtttg gcacattaac
6181 caggctacaa ccaagacaac gttcaaacca aacacttggt gtttacgttg tctttggagt
6241 acaaagccag tagatacttc aaattcattt gaagttctgg cagtagaaga cacacaagga
6301 atggacaatc ttgcttgtga aagtcaacaa cccacctctg aagaagtagt ggaaaatcct
6361 accatacaga aggaagtcat agagtgtgac gtgaaaacta ccgaagttgt aggcaatgtc
6421 atacttaaac catcagatga aggtgttaaa gtaacacaag agttaggtca tgaggatctt
6481 atggctgctt atgtggaaaa cacaagcatt accattaaga aacctaatga gctttcacta
6541 gccttaggtt taaaaacaat tgccactcat ggtattgctg caattaatag tgttccttgg
6601 agtaaaattt tggcttatgt caaaccattc ttaggacaag cagcaattac aacatcaaat
6661 tgcgctaaga gattagcaca acgtgtgttt aacaattata tgccttatgt gtttacatta
6721 ttgttccaat tgtgtacttt tactaaaagt accaattcta gaattagagc ttcactacct
6781 acaactattg ctaaaaatag tgttaagagt gttgctaaat tatgtttgga tgccggcatt
6841 aattatgtga agtcacccaa attttctaaa ttgttcacaa tcgctatgtg gctattgttg
6901 ttaagtattt gcttaggttc tctaatctgt gtaactgctg cttttggtgt actcttatct
6961 aattttggtg ctccttctta ttgtaatggc gttagagaat tgtatcttaa ttcgtctaac
7021 gttactacta tggatttctg tgaaggttct tttccttgca gcatttgttt aagtggatta
7081 gactcccttg attcttatcc agctcttgaa accattcagg tgacgatttc atcgtacaag
7141 ctagacttga caattttagg tctggccgct gagtgggttt tggcatatat gttgttcaca
7201 aaattctttt atttattagg tctttcagct ataatgcagg tgttctttgg ctattttgct
7261 agtcatttca tcagcaattc ttggctcatg tggtttatca ttagtattgt acaaatggca
7321 cccgtttctg caatggttag gatgtacatc ttctttgctt ctttctacta catatggaag
7381 agctatgttc atatcatgga tggttgcacc tcttcgactt gcatgatgtg ctataagcgc
7441 aatcgtgcca cacgcgttga gtgtacaact attgttaatg gcatgaagag atctttctat
7501 gtctatgcaa atggaggccg tggcttctgc aagactcaca attggaattg tctcaattgt
7561 gacacatttt gcactggtag tacattcatt agtgatgaag ttgctcgtga tttgtcactc
7621 cagtttaaaa gaccaatcaa ccctactgac cagtcatcgt atattgttga tagtgttgct
7681 gtgaaaaatg gcgcgcttca cctctacttt gacaaggctg gtcaaaagac ctatgagaga
7741 catccgctct cccattttgt caatttagac aatttgagag ctaacaacac taaaggttca
7801 ctgcctatta atgtcatagt ttttgatggc aagtccaaat gcgacgagtc tgcttctaag
7861 tctgcttctg tgtactacag tcagctgatg tgccaaccta ttctgttgct tgaccaagct
7921 cttgtatcag acgttggaga tagtactgaa gtttccgtta agatgtttga tgcttatgtc
7981 gacacctttt cagcaacttt tagtgttcct atggaaaaac ttaaggcact tgttgctaca
8041 gctcacagcg agttagcaaa gggtgtagct ttagatggtg tcctttctac attcgtgtca
8101 gctgcccgac aaggtgttgt tgataccgat gttgacacaa aggatgttat tgaatgtctc
8161 aaactttcac atcactctga cttagaagtg acaggtgaca gttgtaacaa tttcatgctc
8221 acctataata aggttgaaaa catgacgccc agagatcttg gcgcatgtat tgactgtaat
8281 gcaaggcata tcaatgccca agtagcaaaa agtcacaatg tttcactcat ctggaatgta
8341 aaagactaca tgtctttatc tgaacagctg cgtaaacaaa ttcgtagtgc tgccaagaag
8401 aacaacatac cttttagact aacttgtgct acaactagac aggttgtcaa tgtcataact
8461 actaaaatct cactcaaggg tggtaagatt gttagtactt gttttaaact tatgcttaag
8521 gccacattat tgtgcgttct tgctgcattg gtttgttata tcgttatgcc agtacataca
8581 ttgtcaatcc atgatggtta cacaaatgaa atcattggtt acaaagccat tcaggatggt
8641 gtcactcgtg acatcatttc tactgatgat tgttttgcaa ataaacatgc tggttttgac
8701 gcatggttta gccagcgtgg tggttcatac aaaaatgaca aaagctgccc tgtagtagct
8761 gctatcatta caagagagat tggtttcata gtgcctggct taccgggtac tgtgctgaga
8821 gcaatcaatg gtgacttctt gcattttcta cctcgtgttt ttagtgctgt tggcaacatt
8881 tgctacacac cttccaaact cattgagtat agtgattttg ctacctctgc ttgcgttctt
8941 gctgctgagt gtacaatttt taaggatgct atgggcaaac ctgtgccata ttgttatgac
9001 actaatttgc tagagggttc tatttcttat agtgagcttc gtccagacac tcgttatgtg
9061 cttatggatg gttccatcat acagtttcct aacacttacc tggagggttc tgttagagta
9121 gtaacaactt ttgatgctga gtactgtaga catggtacat gcgaaaggtc agaagtaggt
9181 atttgcctat ctaccagtgg tagatgggtt cttaataatg agcattacag agctctatca
9241 ggagttttct gtggtgttga tgcgatgaat ctcatagcta acatctttac tcctcttgtg
9301 caacctgtgg gtgctttaga tgtgtctgct tcagtagtgg ctggtggtat tattgccata
9361 ttggtgactt gtgctgccta ctactttatg aaattcagac gtgtttttgg tgagtacaac
9421 catgttgttg ctgctaatgc acttttgttt ttgatgtctt tcactatact ctgtctggta
9481 ccagcttaca gctttctgcc gggagtctac tcagtctttt acttgtactt gacattctat
9541 ttcaccaatg atgtttcatt cttggctcac cttcaatggt ttgccatgtt ttctcctatt
9601 gtgccttttt ggataacagc aatctatgta ttctgtattt ctctgaagca ctgccattgg
9661 ttctttaaca actatcttag gaaaagagtc atgtttaatg gagttacatt tagtaccttc
9721 gaggaggctg ctttgtgtac ctttttgctc aacaaggaaa tgtacctaaa attgcgtagc
9781 gagacactgt tgccacttac acagtataac aggtatcttg ctctatataa caagtacaag
9841 tatttcagtg gagccttaga tactaccagc tatcgtgaag cagcttgctg ccacttagca
9901 aaggctctaa atgactttag caactcaggt gctgatgttc tctaccaacc accacagaca
9961 tcaatcactt ctgctgttct gcagagtggt tttaggaaaa tggcattccc gtcaggcaaa
10021 gttgaagggt gcatggtaca agtaacctgt ggaactacaa ctcttaatgg attgtggttg
10081 gatgacacag tatactgtcc aagacatgtc atttgcacag cagaagacat gcttaatcct
10141 aactatgaag atctgctcat tcgcaaatcc aaccatagct ttcttgttca ggctggcaat
10201 gttcaacttc gtgttattgg ccattctatg caaaattgtc tgcttaggct taaagttgat
10261 acttctaacc ctaagacacc caagtataaa tttgtccgta tccaacctgg tcaaacattt
10321 tcagttctag catgctacaa tggttcacca tctggtgttt atcagtgtgc catgagacct
10381 aatcatacca ttaaaggttc tttccttaat ggatcatgtg gtagtgttgg ttttaacatt
10441 gattatgatt gcgtgtcttt ctgctatatg catcatatgg agcttccaac aggagtacac
10501 gctggtactg acttagaagg taaattctat ggtccatttg ttgacagaca aactgcacag
10561 gctgcaggta cagacacaac cataacatta aatgttttgg catggctgta tgctgctgtt
10621 atcaatggtg ataggtggtt tcttaataga ttcaccacta ctttgaatga ctttaacctt
10681 gtggcaatga agtacaacta tgaacctttg acacaagatc atgttgacat attgggacct
10741 ctttctgctc aaacaggaat tgccgtctta gatatgtgtg ctgctttgaa agagctgctg
10801 cagaatggta tgaatggtcg tactatcctt ggtagcacta ttttagaaga tgagtttaca
10861 ccatttgatg ttgttagaca atgctctggt gttaccttcc aaggtaagtt caagaaaatt
10921 gttaagggca ctcatcattg gatgctttta actttcttga catcactatt gattcttgtt
10981 caaagtacac agtggtcact gtttttcttt gtttacgaga atgctttctt gccatttact
11041 cttggtatta tggcaattgc tgcatgtgct atgctgcttg ttaagcataa gcacgcattc
11101 ttgtgcttgt ttctgttacc ttctcttgca acagttgctt actttaatat ggtctacatg
11161 cctgctagct gggtgatgcg tatcatgaca tggcttgaat tggctgacac tagcttgtct
11221 ggttataggc ttaaggattg tgttatgtat gcttcagctt tagttttgct tattctcatg
11281 acagctcgca ctgtttatga tgatgctgct agacgtgttt ggacactgat gaatgtcatt
11341 acacttgttt acaaagtcta ctatggtaat gctttagatc aagctatttc catgtgggcc
11401 ttagttattt ctgtaacctc taactattct ggtgtcgtta cgactatcat gtttttagct
11461 agagctatag tgtttgtgtg tgttgagtat tacccattgt tatttattac tggcaacacc
11521 ttacagtgta tcatgcttgt ttattgtttc ttaggctatt gttgctgctg ctactttggc
11581 cttttctgtt tactcaaccg ttacttcagg cttactcttg gtgtttatga ctacttggtc
11641 tctacacaag aatttaggta tatgaactcc caggggcttt tgcctcctaa gagtagtatt
11701 gatgctttca agcttaacat taagttgttg ggtattggag gtaaaccatg tatcaaggtt
11761 gctactgtac agtctaaaat gtctgacgta aagtgcacat ctgtggtact gctctcggtt
11821 cttcaacaac ttagagtaga gtcatcttct aaattgtggg cacaatgtgt acaactccac
11881 aatgatattc ttcttgcaaa agacacaact gaagctttcg agaagatggt ttctcttttg
11941 tctgttttgc tatccatgca gggtgctgta gacattaata ggttgtgcga ggaaatgctc
12001 gataaccgtg ctactcttca ggctattgct tcagaattta gttctttacc atcatatgcc
12061 gcttatgcca ctgcccagga ggcctatgag caggctgtag ctaatggtga ttctgaagtc
12121 gttctcaaaa agttaaagaa atctttgaat gtggctaaat ctgagtttga ccgtgatgct
12181 gccatgcaac gcaagttgga aaagatggca gatcaggcta tgacccaaat gtacaaacag
12241 gcaagatctg aggacaagag ggcaaaagta actagtgcta tgcaaacaat gctcttcact
12301 atgcttagga agcttgataa tgatgcactt aacaacatta tcaacaatgc gcgtgatggt
12361 tgtgttccac tcaacatcat accattgact acagcagcca aactcatggt tgttgtccct
12421 gattatggta cctacaagaa cacttgtgat ggtaacacct ttacatatgc atctgcactc
12481 tgggaaatcc agcaagttgt tgatgcggat agcaagattg ttcaacttag tgaaattaac
12541 atggacaatt caccaaattt ggcttggcct cttattgtta cagctctaag agccaactca
12601 gctgttaaac tacagaataa tgaactgagt ccagtagcac tacgacagat gtcctgtgcg
12661 gctggtacca cacaaacagc ttgtactgat gacaatgcac ttgcctacta taacaattcg
12721 aagggaggta ggtttgtgct ggcattacta tcagaccacc aagatctcaa atgggctaga
12781 ttccctaaga gtgatggtac aggtacaatt tacacagaac tggaaccacc ttgtaggttt
12841 gttacagaca caccaaaagg gcctaaagtg aaatacttgt acttcatcaa aggcttaaac
12901 aacctaaata gaggtatggt gctgggcagt ttagctgcta cagtacgtct tcaggctgga
12961 aatgctacag aagtacctgc caattcaact gtgctttcct tctgtgcttt tgcagtagac
13021 cctgctaaag catataagga ttacctagca agtggaggac aaccaatcac caactgtgtg
13081 aagatgttgt gtacacacac tggtacagga caggcaatta ctgtaacacc agaagctaac
13141 atggaccaag agtcctttgg tggtgcttca tgttgtctgt attgtagatg ccacattgac
13201 catccaaatc ctaaaggatt ctgtgacttg aaaggtaagt acgtccaaat acctaccact
13261 tgtgctaatg acccagtggg ttttacactt agaaacacag tctgtaccgt ctgcggaatg
13321 tggaaaggtt atggctgtag ttgtgaccaa ctccgcgaac ccttgatgca gtctgcggat
13381 gcatcaacgt ttttaaacgg gtttgcggtg taagtgcagc ccgtcttaca ccgtgcggca
13441 caggcactag tactgatgtc gtctacaggg cttttgatat ttacaacgaa aaagttgctg
13501 gttttgcaaa gttcctaaaa actaattgct gtcgcttcca ggagaaggat gaggaaggca
13561 atttattaga ctcttacttt gtagttaaga ggcatactat gtctaactac caacatgaag
13621 agactattta taacttggtt aaagattgtc cagcggttgc tgtccatgac tttttcaagt
13681 ttagagtaga tggtgacatg gtaccacata tatcacgtca gcgtctaact aaatacacaa
13741 tggctgattt agtctatgct ctacgtcatt ttgatgaggg taattgtgat acattaaaag
13801 aaatactcgt cacatacaat tgctgtgatg atgattattt caataagaag gattggtatg
13861 acttcgtaga gaatcctgac atcttacgcg tatatgctaa cttaggtgag cgtgtacgcc
13921 aatcattatt aaagactgta caattctgcg atgctatgcg tgatgcaggc attgtaggcg
13981 tactgacatt agataatcag gatcttaatg ggaactggta cgatttcggt gatttcgtac
14041 aagtagcacc aggctgcgga gttcctattg tggattcata ttactcattg ctgatgccca
14101 tcctcacttt gactagggca ttggctgctg agtcccatat ggatgctgat ctcgcaaaac
14161 cacttattaa gtgggatttg ctgaaatatg attttacgga agagagactt tgtctcttcg
14221 accgttattt taaatattgg gaccagacat accatcccaa ttgtattaac tgtttggatg
14281 ataggtgtat ccttcattgt gcaaacttta atgtgttatt ttctactgtg tttccaccta
14341 caagttttgg accactagta agaaaaatat ttgtagatgg tgttcctttt gttgtttcaa
14401 ctggatacca ttttcgtgag ttaggagtcg tacataatca ggatgtaaac ttacatagct
14461 cgcgtctcag tttcaaggaa cttttagtgt atgctgctga tccagctatg catgcagctt
14521 ctggcaattt attgctagat aaacgcacta catgcttttc agtagctgca ctaacaaaca
14581 atgttgcttt tcaaactgtc aaacccggta attttaataa agacttttat gactttgctg
14641 tgtctaaagg tttctttaag gaaggaagtt ctgttgaact aaaacacttc ttctttgctc
14701 aggatggcaa cgctgctatc agtgattatg actattatcg ttataatctg ccaacaatgt
14761 gtgatatcag acaactccta ttcgtagttg aagttgttga taaatacttt gattgttacg
14821 atggtggctg tattaatgcc aaccaagtaa tcgttaacaa tctggataaa tcagctggtt
14881 tcccatttaa taaatggggt aaggctagac tttattatga ctcaatgagt tatgaggatc
14941 aagatgcact tttcgcgtat actaagcgta atgtcatccc tactataact caaatgaatc
15001 ttaagtatgc cattagtgca aagaatagag ctcgcaccgt agctggtgtc tctatctgta
15061 gtactatgac aaatagacag tttcatcaga aattattgaa gtcaatagcc gccactagag
15121 gagctactgt ggtaattgga acaagcaagt tttacggtgg ctggcataat atgttaaaaa
15181 ctgtttacag tgatgtagaa actccacacc ttatgggttg ggattatcca aaatgtgaca
15241 gagccatgcc taacatgctt aggataatgg cctctcttgt tcttgctcgc aaacataaca
15301 cttgctgtaa cttatcacac cgtttctaca ggttagctaa cgagtgtgcg caagtattaa
15361 gtgagatggt catgtgtggc ggctcactat atgttaaacc aggtggaaca tcatccggtg
15421 atgctacaac tgcttatgct aatagtgtct ttaacatttg tcaagctgtt acagccaatg
15481 taaatgcact tctttcaact gatggtaata agatagctga caagtatgtc cgcaatctac
15541 aacacaggct ctatgagtgt ctctatagaa atagggatgt tgatcatgaa ttcgtggatg
15601 agttttacgc ttacctgcgt aaacatttct ccatgatgat tctttctgat gatgccgttg
15661 tgtgctataa cagtaactat gcggctcaag gtttagtagc tagcattaag aactttaagg
15721 cagttcttta ttatcaaaat aatgtgttca tgtctgaggc aaaatgttgg actgagactg
15781 accttactaa aggacctcac gaattttgct cacagcatac aatgctagtt aaacaaggag
15841 atgattacgt gtacctgcct tacccagatc catcaagaat attaggcgca ggctgttttg
15901 tcgatgatat tgtcaaaaca gatggtacac ttatgattga aaggttcgtg tcactggcta
15961 ttgatgctta cccacttaca aaacatccta atcaggagta tgctgatgtc tttcacttgt
16021 atttacaata cattagaaag ttacatgatg agcttactgg ccacatgttg gacatgtatt
16081 ccgtaatgct aactaatgat aacacctcac ggtactggga acctgagttt tatgaggcta
16141 tgtacacacc acatacagtc ttgcaggctg taggtgcttg tgtattgtgc aattcacaga
16201 cttcacttcg ttgcggtgcc tgtattagga gaccattcct atgttgcaag tgctgctatg
16261 accatgtcat ttcaacatca cacaaattag tgttgtctgt taatccctat gtttgcaatg
16321 ccccaggttg tgatgtcact gatgtgacac aactgtatct aggaggtatg agctattatt
16381 gcaagtcaca taagcctccc attagttttc cattatgtgc taatggtcag gtttttggtt
16441 tatacaaaaa cacatgtgta ggcagtgaca atgtcactga cttcaatgcg atagcaacat
16501 gtgattggac taatgctggc gattacatac ttgccaacac ttgtactgag agactcaagc
16561 ttttcgcagc agaaacgctc aaagccactg aggaaacatt taagctgtca tatggtattg
16621 ccactgtacg cgaagtactc tctgacagag aattgcatct ttcatgggag gttggaaaac
16681 ctagaccacc attgaacaga aactatgtct ttactggtta ccgtgtaact aaaaatagta
16741 aagtacagat tggagagtac acctttgaaa aaggtgacta tggtgatgct gttgtgtaca
16801 gaggtactac gacatacaag ttgaatgttg gtgattactt tgtgttgaca tctcacactg
16861 taatgccact tagtgcacct actctagtgc cacaagagca ctatgtgaga attactggct
16921 tgtacccaac actcaacatc tcagatgagt tttctagcaa tgttgcaaat tatcaaaagg
16981 tcggcatgca aaagtactct acactccaag gaccacctgg tactggtaag agtcattttg
17041 ccatcggact tgctctctat tacccatctg ctcgcatagt gtatacggca tgctctcatg
17101 cagctgttga tgccctatgt gaaaaggcat taaaatattt gcccatagat aaatgtagta
17161 gaatcatacc tgcgcgtgcg cgcgtagagt gttttgataa attcaaagtg aattcaacac
17221 tagaacagta tgttttctgc actgtaaatg cattgccaga aacaactgct gacattgtag
17281 tctttgatga aatctctatg gctactaatt atgacttgag tgttgtcaat gctagacttc
17341 gtgcaaaaca ctacgtctat attggcgatc ctgctcaatt accagccccc cgcacattgc
17401 tgactaaagg cacactagaa ccagaatatt ttaattcagt gtgcagactt atgaaaacaa
17461 taggtccaga catgttcctt ggaacttgtc gccgttgtcc tgctgaaatt gttgacactg
17521 tgagtgcttt agtttatgac aataagctaa aagcacacaa ggataagtca gctcaatgct
17581 tcaaaatgtt ctacaaaggt gttattacac atgatgtttc atctgcaatc aacagacctc
17641 aaataggcgt tgtaagagaa tttcttacac gcaatcctgc ttggagaaaa gctgttttta
17701 tctcacctta taattcacag aacgctgtag cttcaaaaat cttaggattg cctacgcaga
17761 ctgttgattc atcacagggt tctgaatatg actatgtcat attcacacaa actactgaaa
17821 cagcacactc ttgtaatgtc aaccgcttca atgtggctat cacaagggca aaaattggca
17881 ttttgtgcat aatgtctgat agagatcttt atgacaaact gcaatttaca agtctagaaa
17941 taccacgtcg caatgtggct acattacaag cagaaaatgt aactggactt tttaaggact
18001 gtagtaagat cattactggt cttcatccta cacaggcacc tacacacctc agcgttgata
18061 taaagttcaa gactgaagga ttatgtgttg acataccagg cataccaaag gacatgacct
18121 accgtagact catctctatg atgggtttca aaatgaatta ccaagtcaat ggttacccta
18181 atatgtttat cacccgcgaa gaagctattc gtcacgttcg tgcgtggatt ggctttgatg
18241 tagagggctg tcatgcaact agagatgctg tgggtactaa cctacctctc cagctaggat
18301 tttctacagg tgttaactta gtagctgtac cgactggtta tgttgacact gaaaataaca
18361 cagaattcac cagagttaat gcaaaacctc caccaggtga ccagtttaaa catcttatac
18421 cactcatgta taaaggcttg ccctggaatg tagtgcgtat taagatagta caaatgctca
18481 gtgatacact gaaaggattg tcagacagag tcgtgttcgt cctttgggcg catggctttg
18541 agcttacatc aatgaagtac tttgtcaaga ttggacctga aagaacgtgt tgtctgtgtg
18601 acaaacgtgc aacttgcttt tctacttcat cagatactta tgcctgctgg aatcattctg
18661 tgggttttga ctatgtctat aacccattta tgattgatgt tcagcagtgg ggctttacgg
18721 gtaaccttca gagtaaccat gaccaacatt gccaggtaca tggaaatgca catgtggcta
18781 gttgtgatgc tatcatgact agatgtttag cagtccatga gtgctttgtt aagcgcgttg
18841 attggtctgt tgaataccct attataggag atgaactgag ggttaattct gcttgcagaa
18901 aagtacaaca catggttgtg aagtctgcat tgcttgctga taagtttcca gttcttcatg
18961 acattggaaa tccaaaggct atcaagtgtg tgcctcaggc tgaagtagaa tggaagttct
19021 acgatgctca gccatgtagt gacaaagctt acaaaataga ggaactcttc tattcttatg
19081 ctacacatca cgataaattc actgatggtg tttgtttgtt ttggaattgt aacgttgatc
19141 gttacccagc caatgcaatt gtgtgtaggt ttgacacaag agtcttgtca aacttgaact
19201 taccaggctg tgatggtggt agtttgtatg tgaataagca tgcattccac actccagctt
19261 tcgataaaag tgcatttact aatttaaagc aattgccttt cttttactat tctgatagtc
19321 cttgtgagtc tcatggcaaa caagtagtgt cggatattga ttatgttcca ctcaaatctg
19381 ctacgtgtat tacacgatgc aatttaggtg gtgctgtttg cagacaccat gcaaatgagt
19441 accgacagta cttggatgca tataatatga tgatttctgc tggatttagc ctatggattt
19501 acaaacaatt tgatacttat aacctgtgga atacatttac caggttacag agtttagaaa
19561 atgtggctta taatgttgtt aataaaggac actttgatgg acacgccggc gaagcacctg
19621 tttccatcat taataatgct gtttacacaa aggtagatgg tattgatgtg gagatctttg
19681 aaaataagac aacacttcct gttaatgttg catttgagct ttgggctaag cgtaacatta
19741 aaccagtgcc agagattaag atactcaata atttgggtgt tgatatcgct gctaatactg
19801 taatctggga ctacaaaaga gaagccccag cacatgtatc tacaataggt gtctgcacaa
19861 tgactgacat tgccaagaaa cctactgaga gtgcttgttc ttcacttact gtcttgtttg
19921 atggtagagt ggaaggacag gtagaccttt ttagaaacgc ccgtaatggt gttttaataa
19981 cagaaggttc agtcaaaggt ctaacacctt caaagggacc agcacaagct agcgtcaatg
20041 gagtcacatt aattggagaa tcagtaaaaa cacagtttaa ctactttaag aaagtagacg
20101 gcattattca acagttgcct gaaacctact ttactcagag cagagactta gaggatttta
20161 agcccagatc acaaatggaa actgactttc tcgagctcgc tatggatgaa ttcatacagc
20221 gatataagct cgagggctat gccttcgaac acatcgttta tggagatttc agtcatggac
20281 aacttggcgg tcttcattta atgataggct tagccaagcg ctcacaagat tcaccactta
20341 aattagagga ttttatccct atggacagca cagtgaaaaa ttacttcata acagatgcgc
20401 aaacaggttc atcaaaatgt gtgtgttctg tgattgatct tttacttgat gactttgtcg
20461 agataataaa gtcacaagat ttgtcagtga tttcaaaagt ggtcaaggtt acaattgact
20521 atgctgaaat ttcattcatg ctttggtgta aggatggaca tgttgaaacc ttctacccaa
20581 aactacaagc aagtcaagcg tggcaaccag gtgttgcgat gcctaacttg tacaagatgc
20641 aaagaatgct tcttgaaaag tgtgaccttc agaattatgg tgaaaatgct gttataccaa
20701 aaggaataat gatgaatgtc gcaaagtata ctcaactgtg tcaatactta aatacactta
20761 ctttagctgt accctacaac atgagagtta ttcactttgg tgctggctct gataaaggag
20821 ttgcaccagg tacagctgtg ctcagacaat ggttgccaac tggcacacta cttgtcgatt
20881 cagatcttaa tgacttcgtc tccgacgcag attctacttt aattggagac tgtgcaacag
20941 tacatacggc taataaatgg gaccttatta ttagcgatat gtatgaccct aggaccaaac
21001 atgtgacaaa agagaatgac tctaaagaag ggtttttcac ttatctgtgt ggatttataa
21061 agcaaaaact agccctgggt ggttctatag ctgtaaagat aacagagcat tcttggaatg
21121 ctgaccttta caagcttatg ggccatttct catggtggac agcttttgtt acaaatgtaa
21181 atgcatcatc atcggaagca tttttaattg gggctaacta tcttggcaag ccgaaggaac
21241 aaattgatgg ctataccatg catgctaact acattttctg gaggaacaca aatcctatcc
21301 agttgtcttc ctattcactc tttgacatga gcaaatttcc tcttaaatta agaggaactg
21361 ctgtaatgtc tcttaaggag aatcaaatca atgatatgat ttattctctt ctggaaaaag
21421 gtaggcttat cattagagaa aacaacagag ttgtggtttc aagtgatatt cttgttaaca
21481 actaaacgaa catgtttatt ttcttattat ttcttactct cactagtggt agtgaccttg
21541 accggtgcac cacttttgat gatgttcaag ctcctaatta cactcaacat acttcatcta
21601 tgaggggggt ttactatcct gatgaaattt ttagatcaga cactctttat ttaactcagg
21661 atttatttct tccattttat tctaatgtta cagggtttca tactattaat catacgtttg
21721 gcaaccctgt catacctttt aaggatggta tttattttgc tgccacagag aaatcaaatg
21781 ttgtccgtgg ttgggttttt ggttctacca tgaacaacaa gtcacagtcg gtgattatta
21841 ttaacaattc tactaatgtt gttatacgag catgtaactt tgaattgtgt gacaaccctt
21901 tctttgctgt ttctaaaccc atgggtacac agacacatac tatgatattc gataatgcat
21961 ttaattgcac tttcgagtac atatctgatg ccttttcgct tgatgtttca gaaaagtcag
22021 gtaattttaa acacttacga gagtttgtgt ttaaaaataa agatgggttt ctctatgttt
22081 ataagggcta tcaacctata gatgtagttc gtgatctacc ttctggtttt aacactttga
22141 aacctatttt taagttgcct cttggtatta acattacaaa ttttagagcc attcttacag
22201 ccttttcacc tgctcaagac atttggggca cgtcagctgc agcctatttt gttggctatt
22261 taaagccaac tacatttatg ctcaagtatg atgaaaatgg tacaatcaca gatgctgttg
22321 attgttctca aaatccactt gctgaactca aatgctctgt taagagcttt gagattgaca
22381 aaggaattta ccagacctct aatttcaggg ttgttccctc aggagatgtt gtgagattcc
22441 ctaatattac aaacttgtgt ccttttggag aggtttttaa tgctactaaa ttcccttctg
22501 tctatgcatg ggagagaaaa aaaatttcta attgtgttgc tgattactct gtgctctaca
22561 actcaacatt tttttcaacc tttaagtgct atggcgtttc tgccactaag ttgaatgatc
22621 tttgcttctc caatgtctat gcagattctt ttgtagtcaa gggagatgat gtaagacaaa
22681 tagcgccagg acaaactggt gttattgctg attataatta taaattgcca gatgatttca
22741 tgggttgtgt ccttgcttgg aatactagga acattgatgc tacttcaact ggtaattata
22801 attataaata taggtatctt agacatggca agcttaggcc ctttgagaga gacatatcta
22861 atgtgccttt ctcccctgat ggcaaacctt gcaccccacc tgctcttaat tgttattggc
22921 cattaaatga ttatggtttt tacaccacta ctggcattgg ctaccaacct tacagagttg
22981 tagtactttc ttttgaactt ttaaatgcac cggccacggt ttgtggacca aaattatcca
23041 ctgaccttat taagaaccag tgtgtcaatt ttaattttaa tggactcact ggtactggtg
23101 tgttaactcc ttcttcaaag agatttcaac catttcaaca atttggccgt gatgtttctg
23161 atttcactga ttccgttcga gatcctaaaa catctgaaat attagacatt tcaccttgcg
23221 cttttggggg tgtaagtgta attacacctg gaacaaatgc ttcatctgaa gttgctgttc
23281 tatatcaaga tgttaactgc actgatgttt ctacagcaat tcatgcagat caactcacac
23341 cagcttggcg catatattct actggaaaca atgtattcca gactcaagca ggctgtctta
23401 taggagctga gcatgtcgac acttcttatg agtgcgacat tcctattgga gctggcattt
23461 gtgctagtta ccatacagtt tctttattac gtagtactag ccaaaaatct attgtggctt
23521 atactatgtc tttaggtgct gatagttcaa ttgcttactc taataacacc attgctatac
23581 ctactaactt ttcaattagc attactacag aagtaatgcc tgtttctatg gctaaaacct
23641 ccgtagattg taatatgtac atctgcggag attctactga atgtgctaat ttgcttctcc
23701 aatatggtag cttttgcaca caactaaatc gtgcactctc aggtattgct gctgaacagg
23761 atcgcaacac acgtgaagtg ttcgctcaag tcaaacaaat gtacaaaacc ccaactttga
23821 aatattttgg tggttttaat ttttcacaaa tattacctga ccctctaaag ccaactaaga
23881 ggtcttttat tgaggacttg ctctttaata aggtgacact cgctgatgct ggcttcatga
23941 agcaatatgg cgaatgccta ggtgatatta atgctagaga tctcatttgt gcgcagaagt
24001 tcaatggact tacagtgttg ccacctctgc tcactgatga tatgattgct gcctacactg
24061 ctgctctagt tagtggtact gccactgctg gatggacatt tggtgctggc gctgctcttc
24121 aaataccttt tgctatgcaa atggcatata ggttcaatgg cattggagtt acccaaaatg
24181 ttctctatga gaaccaaaaa caaatcgcca accaatttaa caaggcgatt agtcaaattc
24241 aagaatcact tacaacaaca tcaactgcat tgggcaagct gcaagacgtt gttaaccaga
24301 atgctcaagc attaaacaca cttgttaaac aacttagctc taattttggt gcaatttcaa
24361 gtgtgctaaa tgatatcctt tcgcgacttg ataaagtcga ggcggaggta caaattgaca
24421 ggttaattac aggcagactt caaagccttc aaacctatgt aacacaacaa ctaatcaggg
24481 ctgctgaaat cagggcttct gctaatcttg ctgctactaa aatgtctgag tgtgttcttg
24541 gacaatcaaa aagagttgac ttttgtggaa agggctacca ccttatgtcc ttcccacaag
24601 cagccccgca tggtgttgtc ttcctacatg tcacgtatgt gccatcccag gagaggaact
24661 tcaccacagc gccagcaatt tgtcatgaag gcaaagcata cttccctcgt gaaggtgttt
24721 ttgtgtttaa tggcacttct tggtttatta cacagaggaa cttcttttct ccacaaataa
24781 ttactacaga caatacattt gtctcaggaa attgtgatgt cgttattggc atcattaaca
24841 acacagttta tgatcctctg caacctgagc ttgactcatt caaagaagag ctggacaagt
24901 acttcaaaaa tcatacatca ccagatgttg atcttggcga catttcaggc attaacgctt
24961 ctgtcgtcaa cattcaaaaa gaaattgacc gcctcaatga ggtcgctaaa aatttaaatg
25021 aatcactcat tgaccttcaa gaattgggaa aatatgagca atatattaaa tggccttggt
25081 atgtttggct cggcttcatt gctggactaa ttgccatcgt catggttaca atcttgcttt
25141 gttgcatgac tagttgttgc agttgcctca agggtgcatg ctcttgtggt tcttgctgca
25201 agtttgatga ggatgactct gagccagttc tcaagggtgt caaattacat tacacataaa
25261 cgaacttatg gatttgttta tgagattttt tactcttaga tcaattactg cacagccagt
25321 aaaaattgac aatgcttctc ctgcaagtac tgttcatgct acagcaacga taccgctaca
25381 agcctcactc cctttcggat ggcttgttat tggcgttgca tttcttgctg tttttcagag
25441 cgctaccaaa ataattgcgc tcaataaaag atggcagcta gccctttata agggcttcca
25501 gttcatttgc aatttactgc tgctatttgt taccatctat tcacatcttt tgcttgtcgc
25561 tgcaggtatg gaggcgcaat ttttgtacct ctatgccttg atatattttc tacaatgcat
25621 caacgcatgt agaattatta tgagatgttg gctttgttgg aagtgcaaat ccaagaaccc
25681 attactttat gatgccaact actttgtttg ctggcacaca cataactatg actactgtat
25741 accatataac agtgtcacag atacaattgt cgttactgaa ggtgacggca tttcaacacc
25801 aaaactcaaa gaagactacc aaattggtgg ttattctgag gataggcact caggtgttaa
25861 agactatgtc gttgtacatg gctatttcac cgaagtttac taccagcttg agtctacaca
25921 aattactaca gacactggta ttgaaaatgc tacattcttc atctttaaca agcttgttaa
25981 agacccaccg aatgtgcaaa tacacacaat cgacggctct tcaggagttg ctaatccagc
26041 aatggatcca atttatgatg agccgacgac gactactagc gtgcctttgt aagcacaaga
26101 aagtgagtac gaacttatgt actcattcgt ttcggaagaa acaggtacgt taatagttaa
26161 tagcgtactt ctttttcttg ctttcgtggt attcttgcta gtcacactag ccatccttac
26221 tgcgcttcga ttgtgtgcgt actgctgcaa tattgttaac gtgagtttag taaaaccaac
26281 ggtttacgtc tactcgcgtg ttaaaaatct gaactcttct gaaggagttc ctgatcttct
26341 ggtctaaacg aactaactat tattattatt ctgtttggaa ctttaacatt gcttatcatg
26401 gcagacaacg gtactattac cgttgaggag cttaaacaac tcctggaaca atggaaccta
26461 gtaataggtt tcctattcct agcctggatt atgttactac aatttgccta ttctaatcgg
26521 aacaggtttt tgtacataat aaagcttgtt ttcctctggc tcttgtggcc agtaacactt
26581 gcttgttttg tgcttgctgc tgtctacaga attaattggg tgactggcgg gattgcgatt
26641 gcaatggctt gtattgtagg cttgatgtgg cttagctact tcgttgcttc cttcaggctg
26701 tttgctcgta cccgctcaat gtggtcattc aacccagaaa caaacattct tctcaatgtg
26761 cctctccggg ggacaattgt gaccagaccg ctcatggaaa gtgaacttgt cattggtgct
26821 gtgatcattc gtggtcactt gcgaatggcc ggacactccc tagggcgctg tgacattaag
26881 gacctgccaa aagagatcac tgtggctaca tcacgaacgc tttcttatta caaattagga
26941 gcgtcgcagc gtgtaggcac tgattcaggt tttgctgcat acaaccgcta ccgtattgga
27001 aactataaat taaatacaga ccacgccggt agcaacgaca atattgcttt gctagtacag
27061 taagtgacaa cagatgtttc atcttgttga cttccaggtt acaatagcag agatattgat
27121 tatcattatg aggactttca ggattgctat ttggaatctt gacgttataa taagttcaat
27181 agtgagacaa ttatttaagc ctctaactaa gaagaattat tcggagttag atgatgaaga
27241 acctatggag ttagattatc cataaaacga acatgaaaat tattctcttc ctgacattga
27301 ttgtatttac atcttgcgag ctatatcact atcaggagtg tgttagaggt acgactgtac
27361 tactaaaaga accttgccca tcaggaacat acgagggcaa ttcaccattt caccctcttg
27421 ctgacaataa atttgcacta acttgcacta gcacacactt tgcttttgct tgtgctgacg
27481 gtactcgaca tacctatcag ctgcgtgcaa gatcagtttc accaaaactt ttcatcagac
27541 aagaggaggt tcaacaagag ctctactcgc cactttttct cattgttgct gctctagtat
27601 ttttaatact ttgcttcacc attaagagaa agacagaatg aatgagctca ctttaattga
27661 cttctatttg tgctttttag cctttctgct attccttgtt ttaataatgc ttattatatt
27721 ttggttttca ctcgaaatcc aggatctaga agaaccttgt accaaagtct aaacgaacat
27781 gaaacttctc attgttttga cttgtatttc tctatgcagt tgcatatgca ctgtagtaca
27841 gcgctgtgca tctaataaac ctcatgtgct tgaagatcct tgtaaggtac aacactaggg
27901 gtaatactta tagcactgct tggctttgtg ctctaggaaa ggttttacct tttcatagat
27961 ggcacactat ggttcaaaca tgcacaccta atgttactat caactgtcaa gatccagctg
28021 gtggtgcgct tatagctagg tgttggtacc ttcatgaagg tcaccaaact gctgcattta
28081 gagacgtact tgttgtttta aataaacgaa caaattaaaa tgtctgataa tggaccccaa
28141 tcaaaccaac gtagtgcccc ccgcattaca tttggtggac ccacagattc aactgacaat
28201 aaccagaatg gaggacgcaa tggggcaagg ccaaaacagc gccgacccca aggtttaccc
28261 aataatactg cgtcttggtt cacagctctc actcagcatg gcaaggagga acttagattc
28321 cctcgaggcc agggcgttcc aatcaacacc aatagtggtc cagatgacca aattggctac
28381 taccgaagag ctacccgacg agttcgtggt ggtgacggca aaatgaaaga gctcagcccc
28441 agatggtact tctattacct aggaactggc ccagaagctt cacttcccta cggcgctaac
28501 aaagaaggca tcgtatgggt tgcaactgag ggagccttga atacacccaa agaccacatt
28561 ggcacccgca atcctaataa caatgctgcc accgtgctac aacttcctca aggaacaaca
28621 ttgccaaaag gcttctacgc agagggaagc agaggcggca gtcaagcctc ttctcgctcc
28681 tcatcacgta gtcgcggtaa ttcaagaaat tcaactcctg gcagcagtag gggaaattct
28741 cctgctcgaa tggctagcgg aggtggtgaa actgccctcg cgctattgct gctagacaga
28801 ttgaaccagc ttgagagcaa agtttctggt aaaggccaac aacaacaagg ccaaactgtc
28861 actaagaaat ctgctgctga ggcatctaaa aagcctcgcc aaaaacgtac tgccacaaaa
28921 cagtacaacg tcactcaagc atttgggaga cgtggtccag aacaaaccca aggaaatttc
28981 ggggaccaag acctaatcag acaaggaact gattacaaac attggccgca aattgcacaa
29041 tttgctccaa gtgcctctgc attctttgga atgtcacgca ttggcatgga agtcacacct
29101 tcgggaacat ggctgactta tcatggagcc attaaattgg atgacaaaga tccacaattc
29161 aaagacaacg tcatactgct gaacaagcac attgacgcat acaaaacatt cccaccaaca
29221 gagcctaaaa aggacaaaaa gaaaaagact gatgaagctc agcctttgcc gcagagacaa
29281 aagaagcagc ccactgtgac tcttcttcct gcggctgaca tggatgattt ctccagacaa
29341 cttcaaaatt ccatgagtgg agcttctgct gattcaactc aggcataaac actcatgatg
29401 accacacaag gcagatgggc tatgtaaacg ttttcgcaat tccgtttacg atacatagtc
29461 tactcttgtg cagaatgaat tctcgtaact aaacagcaca agtaggttta gttaacttta
29521 atctcacata gcaatcttta atcaatgtgt aacattaggg aggacttgaa agagccacca
29581 cattttcatc gaggccacgc ggagtacgat cgagggtaca gtgaataatg ctagggagag
29641 ctgcctatat ggaagagccc taatgtgtaa aattaatttt agtagtgcta tccccatgtg
29701 attttaatag cttcttagga gaatgacaaa aaaaaaaaaa aaaaaaaaaa a
//