GenomeNet

Database: RefSeq
Entry: NC_045512
LinkDB: NC_045512
Original site: NC_045512 
LOCUS       NC_045512              29903 bp ss-RNA     linear   VRL 18-JUL-2020
DEFINITION  Severe acute respiratory syndrome coronavirus 2 isolate Wuhan-Hu-1,
            complete genome.
ACCESSION   NC_045512
VERSION     NC_045512.2
DBLINK      BioProject: PRJNA485481
KEYWORDS    RefSeq.
SOURCE      Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2)
  ORGANISM  Severe acute respiratory syndrome coronavirus 2
            Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
            Nidovirales; Cornidovirineae; Coronaviridae; Orthocoronavirinae;
            Betacoronavirus; Sarbecovirus.
REFERENCE   1  (bases 1 to 29903)
  AUTHORS   Wu,F., Zhao,S., Yu,B., Chen,Y.M., Wang,W., Song,Z.G., Hu,Y.,
            Tao,Z.W., Tian,J.H., Pei,Y.Y., Yuan,M.L., Zhang,Y.L., Dai,F.H.,
            Liu,Y., Wang,Q.M., Zheng,J.J., Xu,L., Holmes,E.C. and Zhang,Y.Z.
  TITLE     A new coronavirus associated with human respiratory disease in
            China
  JOURNAL   Nature 579 (7798), 265-269 (2020)
   PUBMED   32015508
  REMARK    Erratum:[Nature. 2020 Apr;580(7803):E7. PMID: 32296181]
REFERENCE   2  (bases 13476 to 13503)
  AUTHORS   Baranov,P.V., Henderson,C.M., Anderson,C.B., Gesteland,R.F.,
            Atkins,J.F. and Howard,M.T.
  TITLE     Programmed ribosomal frameshifting in decoding the SARS-CoV genome
  JOURNAL   Virology 332 (2), 498-510 (2005)
   PUBMED   15680415
REFERENCE   3  (bases 29728 to 29768)
  AUTHORS   Robertson,M.P., Igel,H., Baertsch,R., Haussler,D., Ares,M. Jr. and
            Scott,W.G.
  TITLE     The structure of a rigorously conserved RNA element within the SARS
            virus genome
  JOURNAL   PLoS Biol. 3 (1), e5 (2005)
   PUBMED   15630477
REFERENCE   4  (bases 29609 to 29657)
  AUTHORS   Williams,G.D., Chang,R.Y. and Brian,D.A.
  TITLE     A phylogenetically conserved hairpin-type 3' untranslated region
            pseudoknot functions in coronavirus RNA replication
  JOURNAL   J. Virol. 73 (10), 8349-8355 (1999)
   PUBMED   10482585
REFERENCE   5  (bases 1 to 29903)
  CONSRTM   NCBI Genome Project
  TITLE     Direct Submission
  JOURNAL   Submitted (17-JAN-2020) National Center for Biotechnology
            Information, NIH, Bethesda, MD 20894, USA
REFERENCE   6  (bases 1 to 29903)
  AUTHORS   Wu,F., Zhao,S., Yu,B., Chen,Y.-M., Wang,W., Hu,Y., Song,Z.-G.,
            Tao,Z.-W., Tian,J.-H., Pei,Y.-Y., Yuan,M.L., Zhang,Y.-L.,
            Dai,F.-H., Liu,Y., Wang,Q.-M., Zheng,J.-J., Xu,L., Holmes,E.C. and
            Zhang,Y.-Z.
  TITLE     Direct Submission
  JOURNAL   Submitted (05-JAN-2020) Shanghai Public Health Clinical Center &
            School of Public Health, Fudan University, Shanghai, China
COMMENT     REVIEWED REFSEQ: This record has been curated by NCBI staff. The
            reference sequence is identical to MN908947.
            On Jan 17, 2020 this sequence version replaced NC_045512.1.
            Annotation was added using homology to SARSr-CoV NC_004718.3. ###
            Formerly called 'Wuhan seafood market pneumonia virus.' If you have
            questions or suggestions, please email us at info@ncbi.nlm.nih.gov
            and include the accession number NC_045512.### Protein structures
            can be found at
            https://www.ncbi.nlm.nih.gov/structure/?term=sars-cov-2.### Find
            all other Severe acute respiratory syndrome coronavirus 2
            (SARS-CoV-2) sequences at
            https://www.ncbi.nlm.nih.gov/genbank/sars-cov-2-seqs/
            
            ##Assembly-Data-START##
            Assembly Method       :: Megahit v. V1.1.3
            Sequencing Technology :: Illumina
            ##Assembly-Data-END##
            COMPLETENESS: full length.
FEATURES             Location/Qualifiers
     source          1..29903
                     /organism="Severe acute respiratory syndrome coronavirus
                     2"
                     /mol_type="genomic RNA"
                     /isolate="Wuhan-Hu-1"
                     /host="Homo sapiens"
                     /db_xref="taxon:2697049"
                     /country="China"
                     /collection_date="Dec-2019"
     5'UTR           1..265
     gene            266..21555
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /db_xref="GeneID:43740578"
     CDS             join(266..13468,13468..21555)
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /ribosomal_slippage
                     /note="pp1ab; translated by -1 ribosomal frameshift"
                     /codon_start=1
                     /product="ORF1ab polyprotein"
                     /protein_id="YP_009724389.1"
                     /db_xref="GeneID:43740578"
                     /translation="MESLVPGFNEKTHVQLSLPVLQVRDVLVRGFGDSVEEVLSEARQ
                     HLKDGTCGLVEVEKGVLPQLEQPYVFIKRSDARTAPHGHVMVELVAELEGIQYGRSGE
                     TLGVLVPHVGEIPVAYRKVLLRKNGNKGAGGHSYGADLKSFDLGDELGTDPYEDFQEN
                     WNTKHSSGVTRELMRELNGGAYTRYVDNNFCGPDGYPLECIKDLLARAGKASCTLSEQ
                     LDFIDTKRGVYCCREHEHEIAWYTERSEKSYELQTPFEIKLAKKFDTFNGECPNFVFP
                     LNSIIKTIQPRVEKKKLDGFMGRIRSVYPVASPNECNQMCLSTLMKCDHCGETSWQTG
                     DFVKATCEFCGTENLTKEGATTCGYLPQNAVVKIYCPACHNSEVGPEHSLAEYHNESG
                     LKTILRKGGRTIAFGGCVFSYVGCHNKCAYWVPRASANIGCNHTGVVGEGSEGLNDNL
                     LEILQKEKVNINIVGDFKLNEEIAIILASFSASTSAFVETVKGLDYKAFKQIVESCGN
                     FKVTKGKAKKGAWNIGEQKSILSPLYAFASEAARVVRSIFSRTLETAQNSVRVLQKAA
                     ITILDGISQYSLRLIDAMMFTSDLATNNLVVMAYITGGVVQLTSQWLTNIFGTVYEKL
                     KPVLDWLEEKFKEGVEFLRDGWEIVKFISTCACEIVGGQIVTCAKEIKESVQTFFKLV
                     NKFLALCADSIIIGGAKLKALNLGETFVTHSKGLYRKCVKSREETGLLMPLKAPKEII
                     FLEGETLPTEVLTEEVVLKTGDLQPLEQPTSEAVEAPLVGTPVCINGLMLLEIKDTEK
                     YCALAPNMMVTNNTFTLKGGAPTKVTFGDDTVIEVQGYKSVNITFELDERIDKVLNEK
                     CSAYTVELGTEVNEFACVVADAVIKTLQPVSELLTPLGIDLDEWSMATYYLFDESGEF
                     KLASHMYCSFYPPDEDEEEGDCEEEEFEPSTQYEYGTEDDYQGKPLEFGATSAALQPE
                     EEQEEDWLDDDSQQTVGQQDGSEDNQTTTIQTIVEVQPQLEMELTPVVQTIEVNSFSG
                     YLKLTDNVYIKNADIVEEAKKVKPTVVVNAANVYLKHGGGVAGALNKATNNAMQVESD
                     DYIATNGPLKVGGSCVLSGHNLAKHCLHVVGPNVNKGEDIQLLKSAYENFNQHEVLLA
                     PLLSAGIFGADPIHSLRVCVDTVRTNVYLAVFDKNLYDKLVSSFLEMKSEKQVEQKIA
                     EIPKEEVKPFITESKPSVEQRKQDDKKIKACVEEVTTTLEETKFLTENLLLYIDINGN
                     LHPDSATLVSDIDITFLKKDAPYIVGDVVQEGVLTAVVIPTKKAGGTTEMLAKALRKV
                     PTDNYITTYPGQGLNGYTVEEAKTVLKKCKSAFYILPSIISNEKQEILGTVSWNLREM
                     LAHAEETRKLMPVCVETKAIVSTIQRKYKGIKIQEGVVDYGARFYFYTSKTTVASLIN
                     TLNDLNETLVTMPLGYVTHGLNLEEAARYMRSLKVPATVSVSSPDAVTAYNGYLTSSS
                     KTPEEHFIETISLAGSYKDWSYSGQSTQLGIEFLKRGDKSVYYTSNPTTFHLDGEVIT
                     FDNLKTLLSLREVRTIKVFTTVDNINLHTQVVDMSMTYGQQFGPTYLDGADVTKIKPH
                     NSHEGKTFYVLPNDDTLRVEAFEYYHTTDPSFLGRYMSALNHTKKWKYPQVNGLTSIK
                     WADNNCYLATALLTLQQIELKFNPPALQDAYYRARAGEAANFCALILAYCNKTVGELG
                     DVRETMSYLFQHANLDSCKRVLNVVCKTCGQQQTTLKGVEAVMYMGTLSYEQFKKGVQ
                     IPCTCGKQATKYLVQQESPFVMMSAPPAQYELKHGTFTCASEYTGNYQCGHYKHITSK
                     ETLYCIDGALLTKSSEYKGPITDVFYKENSYTTTIKPVTYKLDGVVCTEIDPKLDNYY
                     KKDNSYFTEQPIDLVPNQPYPNASFDNFKFVCDNIKFADDLNQLTGYKKPASRELKVT
                     FFPDLNGDVVAIDYKHYTPSFKKGAKLLHKPIVWHVNNATNKATYKPNTWCIRCLWST
                     KPVETSNSFDVLKSEDAQGMDNLACEDLKPVSEEVVENPTIQKDVLECNVKTTEVVGD
                     IILKPANNSLKITEEVGHTDLMAAYVDNSSLTIKKPNELSRVLGLKTLATHGLAAVNS
                     VPWDTIANYAKPFLNKVVSTTTNIVTRCLNRVCTNYMPYFFTLLLQLCTFTRSTNSRI
                     KASMPTTIAKNTVKSVGKFCLEASFNYLKSPNFSKLINIIIWFLLLSVCLGSLIYSTA
                     ALGVLMSNLGMPSYCTGYREGYLNSTNVTIATYCTGSIPCSVCLSGLDSLDTYPSLET
                     IQITISSFKWDLTAFGLVAEWFLAYILFTRFFYVLGLAAIMQLFFSYFAVHFISNSWL
                     MWLIINLVQMAPISAMVRMYIFFASFYYVWKSYVHVVDGCNSSTCMMCYKRNRATRVE
                     CTTIVNGVRRSFYVYANGGKGFCKLHNWNCVNCDTFCAGSTFISDEVARDLSLQFKRP
                     INPTDQSSYIVDSVTVKNGSIHLYFDKAGQKTYERHSLSHFVNLDNLRANNTKGSLPI
                     NVIVFDGKSKCEESSAKSASVYYSQLMCQPILLLDQALVSDVGDSAEVAVKMFDAYVN
                     TFSSTFNVPMEKLKTLVATAEAELAKNVSLDNVLSTFISAARQGFVDSDVETKDVVEC
                     LKLSHQSDIEVTGDSCNNYMLTYNKVENMTPRDLGACIDCSARHINAQVAKSHNIALI
                     WNVKDFMSLSEQLRKQIRSAAKKNNLPFKLTCATTRQVVNVVTTKIALKGGKIVNNWL
                     KQLIKVTLVFLFVAAIFYLITPVHVMSKHTDFSSEIIGYKAIDGGVTRDIASTDTCFA
                     NKHADFDTWFSQRGGSYTNDKACPLIAAVITREVGFVVPGLPGTILRTTNGDFLHFLP
                     RVFSAVGNICYTPSKLIEYTDFATSACVLAAECTIFKDASGKPVPYCYDTNVLEGSVA
                     YESLRPDTRYVLMDGSIIQFPNTYLEGSVRVVTTFDSEYCRHGTCERSEAGVCVSTSG
                     RWVLNNDYYRSLPGVFCGVDAVNLLTNMFTPLIQPIGALDISASIVAGGIVAIVVTCL
                     AYYFMRFRRAFGEYSHVVAFNTLLFLMSFTVLCLTPVYSFLPGVYSVIYLYLTFYLTN
                     DVSFLAHIQWMVMFTPLVPFWITIAYIICISTKHFYWFFSNYLKRRVVFNGVSFSTFE
                     EAALCTFLLNKEMYLKLRSDVLLPLTQYNRYLALYNKYKYFSGAMDTTSYREAACCHL
                     AKALNDFSNSGSDVLYQPPQTSITSAVLQSGFRKMAFPSGKVEGCMVQVTCGTTTLNG
                     LWLDDVVYCPRHVICTSEDMLNPNYEDLLIRKSNHNFLVQAGNVQLRVIGHSMQNCVL
                     KLKVDTANPKTPKYKFVRIQPGQTFSVLACYNGSPSGVYQCAMRPNFTIKGSFLNGSC
                     GSVGFNIDYDCVSFCYMHHMELPTGVHAGTDLEGNFYGPFVDRQTAQAAGTDTTITVN
                     VLAWLYAAVINGDRWFLNRFTTTLNDFNLVAMKYNYEPLTQDHVDILGPLSAQTGIAV
                     LDMCASLKELLQNGMNGRTILGSALLEDEFTPFDVVRQCSGVTFQSAVKRTIKGTHHW
                     LLLTILTSLLVLVQSTQWSLFFFLYENAFLPFAMGIIAMSAFAMMFVKHKHAFLCLFL
                     LPSLATVAYFNMVYMPASWVMRIMTWLDMVDTSLSGFKLKDCVMYASAVVLLILMTAR
                     TVYDDGARRVWTLMNVLTLVYKVYYGNALDQAISMWALIISVTSNYSGVVTTVMFLAR
                     GIVFMCVEYCPIFFITGNTLQCIMLVYCFLGYFCTCYFGLFCLLNRYFRLTLGVYDYL
                     VSTQEFRYMNSQGLLPPKNSIDAFKLNIKLLGVGGKPCIKVATVQSKMSDVKCTSVVL
                     LSVLQQLRVESSSKLWAQCVQLHNDILLAKDTTEAFEKMVSLLSVLLSMQGAVDINKL
                     CEEMLDNRATLQAIASEFSSLPSYAAFATAQEAYEQAVANGDSEVVLKKLKKSLNVAK
                     SEFDRDAAMQRKLEKMADQAMTQMYKQARSEDKRAKVTSAMQTMLFTMLRKLDNDALN
                     NIINNARDGCVPLNIIPLTTAAKLMVVIPDYNTYKNTCDGTTFTYASALWEIQQVVDA
                     DSKIVQLSEISMDNSPNLAWPLIVTALRANSAVKLQNNELSPVALRQMSCAAGTTQTA
                     CTDDNALAYYNTTKGGRFVLALLSDLQDLKWARFPKSDGTGTIYTELEPPCRFVTDTP
                     KGPKVKYLYFIKGLNNLNRGMVLGSLAATVRLQAGNATEVPANSTVLSFCAFAVDAAK
                     AYKDYLASGGQPITNCVKMLCTHTGTGQAITVTPEANMDQESFGGASCCLYCRCHIDH
                     PNPKGFCDLKGKYVQIPTTCANDPVGFTLKNTVCTVCGMWKGYGCSCDQLREPMLQSA
                     DAQSFLNRVCGVSAARLTPCGTGTSTDVVYRAFDIYNDKVAGFAKFLKTNCCRFQEKD
                     EDDNLIDSYFVVKRHTFSNYQHEETIYNLLKDCPAVAKHDFFKFRIDGDMVPHISRQR
                     LTKYTMADLVYALRHFDEGNCDTLKEILVTYNCCDDDYFNKKDWYDFVENPDILRVYA
                     NLGERVRQALLKTVQFCDAMRNAGIVGVLTLDNQDLNGNWYDFGDFIQTTPGSGVPVV
                     DSYYSLLMPILTLTRALTAESHVDTDLTKPYIKWDLLKYDFTEERLKLFDRYFKYWDQ
                     TYHPNCVNCLDDRCILHCANFNVLFSTVFPPTSFGPLVRKIFVDGVPFVVSTGYHFRE
                     LGVVHNQDVNLHSSRLSFKELLVYAADPAMHAASGNLLLDKRTTCFSVAALTNNVAFQ
                     TVKPGNFNKDFYDFAVSKGFFKEGSSVELKHFFFAQDGNAAISDYDYYRYNLPTMCDI
                     RQLLFVVEVVDKYFDCYDGGCINANQVIVNNLDKSAGFPFNKWGKARLYYDSMSYEDQ
                     DALFAYTKRNVIPTITQMNLKYAISAKNRARTVAGVSICSTMTNRQFHQKLLKSIAAT
                     RGATVVIGTSKFYGGWHNMLKTVYSDVENPHLMGWDYPKCDRAMPNMLRIMASLVLAR
                     KHTTCCSLSHRFYRLANECAQVLSEMVMCGGSLYVKPGGTSSGDATTAYANSVFNICQ
                     AVTANVNALLSTDGNKIADKYVRNLQHRLYECLYRNRDVDTDFVNEFYAYLRKHFSMM
                     ILSDDAVVCFNSTYASQGLVASIKNFKSVLYYQNNVFMSEAKCWTETDLTKGPHEFCS
                     QHTMLVKQGDDYVYLPYPDPSRILGAGCFVDDIVKTDGTLMIERFVSLAIDAYPLTKH
                     PNQEYADVFHLYLQYIRKLHDELTGHMLDMYSVMLTNDNTSRYWEPEFYEAMYTPHTV
                     LQAVGACVLCNSQTSLRCGACIRRPFLCCKCCYDHVISTSHKLVLSVNPYVCNAPGCD
                     VTDVTQLYLGGMSYYCKSHKPPISFPLCANGQVFGLYKNTCVGSDNVTDFNAIATCDW
                     TNAGDYILANTCTERLKLFAAETLKATEETFKLSYGIATVREVLSDRELHLSWEVGKP
                     RPPLNRNYVFTGYRVTKNSKVQIGEYTFEKGDYGDAVVYRGTTTYKLNVGDYFVLTSH
                     TVMPLSAPTLVPQEHYVRITGLYPTLNISDEFSSNVANYQKVGMQKYSTLQGPPGTGK
                     SHFAIGLALYYPSARIVYTACSHAAVDALCEKALKYLPIDKCSRIIPARARVECFDKF
                     KVNSTLEQYVFCTVNALPETTADIVVFDEISMATNYDLSVVNARLRAKHYVYIGDPAQ
                     LPAPRTLLTKGTLEPEYFNSVCRLMKTIGPDMFLGTCRRCPAEIVDTVSALVYDNKLK
                     AHKDKSAQCFKMFYKGVITHDVSSAINRPQIGVVREFLTRNPAWRKAVFISPYNSQNA
                     VASKILGLPTQTVDSSQGSEYDYVIFTQTTETAHSCNVNRFNVAITRAKVGILCIMSD
                     RDLYDKLQFTSLEIPRRNVATLQAENVTGLFKDCSKVITGLHPTQAPTHLSVDTKFKT
                     EGLCVDIPGIPKDMTYRRLISMMGFKMNYQVNGYPNMFITREEAIRHVRAWIGFDVEG
                     CHATREAVGTNLPLQLGFSTGVNLVAVPTGYVDTPNNTDFSRVSAKPPPGDQFKHLIP
                     LMYKGLPWNVVRIKIVQMLSDTLKNLSDRVVFVLWAHGFELTSMKYFVKIGPERTCCL
                     CDRRATCFSTASDTYACWHHSIGFDYVYNPFMIDVQQWGFTGNLQSNHDLYCQVHGNA
                     HVASCDAIMTRCLAVHECFVKRVDWTIEYPIIGDELKINAACRKVQHMVVKAALLADK
                     FPVLHDIGNPKAIKCVPQADVEWKFYDAQPCSDKAYKIEELFYSYATHSDKFTDGVCL
                     FWNCNVDRYPANSIVCRFDTRVLSNLNLPGCDGGSLYVNKHAFHTPAFDKSAFVNLKQ
                     LPFFYYSDSPCESHGKQVVSDIDYVPLKSATCITRCNLGGAVCRHHANEYRLYLDAYN
                     MMISAGFSLWVYKQFDTYNLWNTFTRLQSLENVAFNVVNKGHFDGQQGEVPVSIINNT
                     VYTKVDGVDVELFENKTTLPVNVAFELWAKRNIKPVPEVKILNNLGVDIAANTVIWDY
                     KRDAPAHISTIGVCSMTDIAKKPTETICAPLTVFFDGRVDGQVDLFRNARNGVLITEG
                     SVKGLQPSVGPKQASLNGVTLIGEAVKTQFNYYKKVDGVVQQLPETYFTQSRNLQEFK
                     PRSQMEIDFLELAMDEFIERYKLEGYAFEHIVYGDFSHSQLGGLHLLIGLAKRFKESP
                     FELEDFIPMDSTVKNYFITDAQTGSSKCVCSVIDLLLDDFVEIIKSQDLSVVSKVVKV
                     TIDYTEISFMLWCKDGHVETFYPKLQSSQAWQPGVAMPNLYKMQRMLLEKCDLQNYGD
                     SATLPKGIMMNVAKYTQLCQYLNTLTLAVPYNMRVIHFGAGSDKGVAPGTAVLRQWLP
                     TGTLLVDSDLNDFVSDADSTLIGDCATVHTANKWDLIISDMYDPKTKNVTKENDSKEG
                     FFTYICGFIQQKLALGGSVAIKITEHSWNADLYKLMGHFAWWTAFVTNVNASSSEAFL
                     IGCNYLGKPREQIDGYVMHANYIFWRNTNPIQLSSYSLFDMSKFPLKLRGTAVMSLKE
                     GQINDMILSLLSKGRLIIRENNRVVISSDVLVNN"
     mat_peptide     266..805
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /product="leader protein"
                     /note="nsp1; produced by both pp1a and pp1ab"
                     /protein_id="YP_009725297.1"
     misc_feature    302..646
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="N-terminal domain of non-structural protein 1 from
                     Severe acute respiratory syndrome-related coronavirus and
                     betacoronavirus in the B lineage; Region:
                     SARS-CoV-like_Nsp1_N; cd21796"
                     /db_xref="CDD:409335"
     mat_peptide     806..2719
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /product="nsp2"
                     /note="produced by both pp1a and pp1ab"
                     /protein_id="YP_009725298.1"
     misc_feature    809..2719
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="betacoronavirus non-structural protein 2 (Nsp2)
                     similar to SARS-CoV Nsp2, and related proteins from
                     betacoronaviruses in the B lineage; Region:
                     cv_beta_Nsp2_SARS-like; cd21516"
                     /db_xref="CDD:394867"
     mat_peptide     2720..8554
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /product="nsp3"
                     /note="former nsp1; conserved domains are: N-terminal
                     acidic (Ac), predicted phosphoesterase, papain-like
                     proteinase, Y-domain, transmembrane domain 1 (TM1),
                     adenosine diphosphate-ribose 1''-phosphatase (ADRP);
                     produced by both pp1a and pp1ab"
                     /protein_id="YP_009725299.1"
     misc_feature    2903..3415
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="Protein of unknown function (DUF3655); Region:
                     DUF3655; pfam12379"
                     /db_xref="CDD:403549"
     misc_feature    3425..3796
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="X-domain of viral non-structural protein 3 and
                     related macrodomains; Region: Macro_X_Nsp3-like; cd21557"
                     /db_xref="CDD:394882"
     misc_feature    order(3443..3445,3479..3481,3485..3487,3620..3622,
                     3704..3727)
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="putative ADP-ribose binding site [chemical
                     binding]; other site"
                     /db_xref="CDD:394882"
     misc_feature    3962..4339
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="macrodomain superfamily; Region: Macro_SF; cl00019"
                     /db_xref="CDD:412115"
     misc_feature    4316..4744
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="Single-stranded poly(A) binding domain; Region:
                     SUD-M; pfam11633"
                     /db_xref="CDD:314498"
     misc_feature    4751..4948
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="C-terminal SARS-Unique Domain (SUD) of
                     non-structural protein 3 (Nsp3) from Severe Acute
                     Respiratory Syndrome coronavirus and related
                     betacoronaviruses in the B lineage; Region:
                     SUD_C_SARS-CoV_Nsp3; cd21525"
                     /db_xref="CDD:394841"
     misc_feature    4961..5869
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="betacoronavirus papain-like protease; Region:
                     betaCoV_PLPro; cd21732"
                     /db_xref="CDD:409649"
     misc_feature    order(5279..5290,5438..5446,5450..5455,5462..5464,
                     5549..5551,5618..5623,5627..5629,5696..5698,5744..5746,
                     5765..5773,5855..5857)
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="ubiquitin binding site [polypeptide binding]; other
                     site"
                     /db_xref="CDD:409649"
     misc_feature    order(5438..5446,5693..5698,5744..5746,5753..5755,
                     5771..5773,5855..5857)
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="polypeptide substrate binding site [polypeptide
                     binding]; other site"
                     /db_xref="CDD:409649"
     misc_feature    6002..6322
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="nucleic acid binding domain of non-structural
                     protein 3 from Severe acute respiratory syndrome-related
                     coronavirus and betacoronavirus in the B lineage; Region:
                     SARS-CoV-like_Nsp3_NAB; cd21822"
                     /db_xref="CDD:409348"
     misc_feature    6395..6742
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="betacoronavirus-specific marker of non-structural
                     protein 3 from Severe acute respiratory syndrome-related
                     coronavirus and betacoronavirus in the B lineage; Region:
                     SARS-CoV-like_Nsp3_betaSM; cd21814"
                     /db_xref="CDD:409629"
     misc_feature    6959..8551
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="C-terminus of non-structural protein 3, including
                     transmembrane and Y domains, from Severe acute respiratory
                     syndrome-related coronavirus and betacoronavirus in the B
                     lineage; Region: TM_Y_SARS-CoV-like_Nsp3_C; cd21717"
                     /db_xref="CDD:409665"
     misc_feature    6959..7024
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="TM1 [structural motif]; Region: TM1"
                     /db_xref="CDD:409665"
     misc_feature    7274..7342
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="TM2 [structural motif]; Region: TM2"
                     /db_xref="CDD:409665"
     mat_peptide     8555..10054
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /product="nsp4"
                     /note="nsp4B_TM; contains transmembrane domain 2 (TM2);
                     produced by both pp1a and pp1ab"
                     /protein_id="YP_009725300.1"
     misc_feature    8594..9736
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="coronavirus non-structural protein 4 (Nsp4)
                     transmembrane domain; Region: cv_Nsp4_TM; cd21473"
                     /db_xref="CDD:394836"
     misc_feature    8594..8662
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="putative TM helix 1 [structural motif]; Region:
                     putative TM helix 1"
                     /db_xref="CDD:394836"
     misc_feature    9392..9457
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="putative TM helix 2 [structural motif]; Region:
                     putative TM helix 2"
                     /db_xref="CDD:394836"
     misc_feature    9500..9565
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="putative TM helix 3 [structural motif]; Region:
                     putative TM helix 3"
                     /db_xref="CDD:394836"
     misc_feature    9644..9712
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="putative TM helix 4 [structural motif]; Region:
                     putative TM helix 4"
                     /db_xref="CDD:394836"
     misc_feature    9770..10048
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="Coronavirus nonstructural protein 4 C-terminus;
                     Region: Corona_NSP4_C; pfam16348"
                     /db_xref="CDD:406690"
     mat_peptide     10055..10972
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /product="3C-like proteinase"
                     /note="nsp5A_3CLpro and nsp5B_3CLpro; main proteinase
                     (Mpro); mediates cleavages downstream of nsp4. 3D
                     structure of the SARSr-CoV homolog has been determined
                     (Yang et al., 2003); produced by both pp1a and pp1ab"
                     /protein_id="YP_009725301.1"
     misc_feature    10064..10954
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="betacoronavirus non-structural protein 5, also
                     called Main protease (Mpro); Region: betaCoV_Nsp5_Mpro;
                     cd21666"
                     /db_xref="CDD:394887"
     misc_feature    order(10064..10087,10094..10096,10406..10408,10418..10438,
                     10463..10477,10550..10552,10568..10570,10910..10912,
                     10922..10924,10946..10951)
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="homodimer interface [polypeptide binding]; other
                     site"
                     /db_xref="CDD:394887"
     misc_feature    order(10127..10135,10175..10177,10472..10489,10541..10552,
                     10568..10570,10613..10630)
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="polypeptide substrate binding site [polypeptide
                     binding]; other site"
                     /db_xref="CDD:394887"
     mat_peptide     10973..11842
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /product="nsp6"
                     /note="nsp6_TM; putative transmembrane domain; produced by
                     both pp1a and pp1ab"
                     /protein_id="YP_009725302.1"
     misc_feature    10973..11842
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="betacoronavirus non-structural protein 6; Region:
                     betaCoV-Nsp6; cd21560"
                     /db_xref="CDD:394846"
     mat_peptide     11843..12091
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /product="nsp7"
                     /note="produced by both pp1a and pp1ab"
                     /protein_id="YP_009725303.1"
     misc_feature    11843..12091
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="betacoronavirus non-structural protein 7; Region:
                     betaCoV_Nsp7; cd21827"
                     /db_xref="CDD:409253"
     misc_feature    order(11846..11848,11855..11866,11873..11881,11885..11890,
                     11897..11899,11924..11926,11933..11935,11951..11953,
                     11987..12004,12008..12025,12044..12058)
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="oligomer interface [polypeptide binding]; other
                     site"
                     /db_xref="CDD:409253"
     mat_peptide     12092..12685
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /product="nsp8"
                     /note="produced by both pp1a and pp1ab"
                     /protein_id="YP_009725304.1"
     misc_feature    12092..12682
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="nsp8 replicase; Region: nsp8; pfam08717"
                     /db_xref="CDD:400866"
     mat_peptide     12686..13024
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /product="nsp9"
                     /note="ssRNA-binding protein; produced by both pp1a and
                     pp1ab"
                     /protein_id="YP_009725305.1"
     misc_feature    12686..13024
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="betacoronavirus non-structural protein 9; Region:
                     betaCoV_Nsp9; cd21898"
                     /db_xref="CDD:409331"
     misc_feature    12686..12703
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="N-finger; other site"
                     /db_xref="CDD:409331"
     misc_feature    order(12692..12697,12704..12709,12902..12907,12971..12976,
                     12980..12988,12992..13000,13004..13009)
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="homodimer interface [polypeptide binding]; other
                     site"
                     /db_xref="CDD:409331"
     mat_peptide     13025..13441
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /product="nsp10"
                     /note="nsp10_CysHis; formerly known as growth-factor-like
                     protein (GFL); produced by both pp1a and pp1ab"
                     /protein_id="YP_009725306.1"
     misc_feature    13025..13417
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="alphacoronavirus and betacoronavirus non-structural
                     protein 14; Region: alpha_betaCoV_Nsp10; cd21901"
                     /db_xref="CDD:409326"
     misc_feature    order(13025..13048,13058..13060,13064..13072,13076..13084,
                     13097..13102,13109..13114,13121..13123,13142..13159,
                     13196..13201,13229..13231,13235..13240,13250..13252,
                     13256..13273,13286..13294,13301..13312)
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="Nsp14 interface [polypeptide binding]; other site"
                     /db_xref="CDD:409326"
     misc_feature    order(13064..13072,13076..13084,13097..13099,13142..13144,
                     13148..13159,13196..13204,13256..13267,13274..13276,
                     13307..13312,13367..13369)
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="oligomer interface [polypeptide binding]; other
                     site"
                     /db_xref="CDD:409326"
     misc_feature    order(13097..13099,13148..13159,13196..13204,13274..13276,
                     13307..13312,13367..13369)
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="homotrimer interface [polypeptide binding]; other
                     site"
                     /db_xref="CDD:409326"
     misc_feature    order(13142..13165,13193..13201,13229..13240,13253..13258,
                     13262..13264,13301..13312)
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="Nsp16 interface [polypeptide binding]; other site"
                     /db_xref="CDD:409326"
     mat_peptide     join(13442..13468,13468..16236)
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /product="RNA-dependent RNA polymerase"
                     /note="nsp12; NiRAN and RdRp; produced by pp1ab only"
                     /protein_id="YP_009725307.1"
     misc_feature    join(13454..13468,13468..16236)
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="Severe acute respiratory syndrome coronavirus
                     RNA-dependent RNA polymerase, also known as non-structural
                     protein 12, and similar proteins from betacoronaviruses in
                     the B lineage: responsible for replication and
                     transcription of the viral RNA genome; Region:
                     SARS-CoV-like_RdRp; cd21591"
                     /db_xref="CDD:394895"
     misc_feature    order(14245..14259,14407..14412,14416..14418,14422..14436,
                     14452..14463,14470..14472,14542..14544,14551..14556,
                     14560..14565,14572..14574,14578..14592,14596..14616,
                     14626..14628,14632..14634,14644..14649,14659..14661,
                     14953..14958,14965..14967,14980..14985,14989..14991,
                     14998..15009,15436..15438)
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="Nsp8 interaction site [polypeptide binding]; other
                     site"
                     /db_xref="CDD:394895"
     misc_feature    order(14665..14679,14683..14685,14698..14700,14725..14727,
                     14731..14733,14758..14775,15088..15090,15094..15096,
                     15967..15969)
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="Nsp7 interaction site [polypeptide binding]; other
                     site"
                     /db_xref="CDD:394895"
     misc_feature    order(14794..14796,15073..15075,15103..15105,15292..15297,
                     15301..15309,15484..15486,15499..15501,15511..15513,
                     15715..15723)
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="putative inhibitor binding site [chemical binding];
                     other site"
                     /db_xref="CDD:394895"
     misc_feature    order(14932..14934,14938..14946,15073..15075,15109..15111,
                     15115..15117,15133..15135,15145..15147,15157..15159,
                     15169..15171,15208..15210,15214..15216,15220..15222,
                     15484..15495,15502..15504,15712..15723,15877..15882,
                     15934..15936,15958..15960,16009..16014,16024..16026,
                     16030..16035)
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="putative RNA binding site [nucleotide binding];
                     other site"
                     /db_xref="CDD:394895"
     misc_feature    14938..14979
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="conserved polymerase motif G; other site"
                     /db_xref="CDD:394895"
     misc_feature    15052..15120
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="conserved polymerase motif F; other site"
                     /db_xref="CDD:394895"
     misc_feature    15271..15321
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="conserved polymerase motif A; other site"
                     /db_xref="CDD:394895"
     misc_feature    order(15478..15528,15532..15567)
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="conserved polymerase motif B; other site"
                     /db_xref="CDD:394895"
     misc_feature    15697..15741
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="conserved polymerase motif C; other site"
                     /db_xref="CDD:394895"
     misc_feature    order(15763..15771,15775..15828)
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="conserved polymerase motif D; other site"
                     /db_xref="CDD:394895"
     misc_feature    15868..15903
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="conserved polymerase motif E; other site"
                     /db_xref="CDD:394895"
     mat_peptide     16237..18039
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /product="helicase"
                     /note="nsp13_ZBD, nsp13_TB, and nsp_HEL1core; zinc-binding
                     domain (ZD), NTPase/helicase domain (HEL), RNA
                     5'-triphosphatase; produced by pp1ab only"
                     /protein_id="YP_009725308.1"
     misc_feature    16237..16521
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="Cys/His rich zinc-binding domain (CH/ZBD) of
                     coronavirus SARS NSP13 helicase and related proteins;
                     Region: ZBD_cv_Nsp13-like; cd21401"
                     /db_xref="CDD:394808"
     misc_feature    16531..16674
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="stalk domain of coronavirus Nsp13 helicase and
                     related proteins; Region: stalk_CoV_Nsp13-like; cd21689"
                     /db_xref="CDD:410205"
     misc_feature    order(16540..16542,16627..16629)
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="key interaction residues; other site"
                     /db_xref="CDD:410205"
     misc_feature    16684..16920
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="1B domain of coronavirus SARS NSP13 helicase and
                     related proteins; Region: 1B_cv_Nsp13-like; cd21409"
                     /db_xref="CDD:394817"
     misc_feature    order(16771..16773,16777..16779,16837..16839)
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="nucleic acid substrate binding site [nucleotide
                     binding]; other site"
                     /db_xref="CDD:394817"
     misc_feature    16987..18006
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="helicase domain of betacoronavirus non-structural
                     protein 13; Region: betaCoV_Nsp13-helicase; cd21722"
                     /db_xref="CDD:409655"
     misc_feature    order(17089..17106,17446..17448,17563..17565,17848..17850,
                     17854..17856,17935..17937)
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="ATP binding site [chemical binding]; other site"
                     /db_xref="CDD:409655"
     misc_feature    order(17098..17103,17356..17361,17446..17448,17935..17937)
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="putative active site [active]"
                     /db_xref="CDD:409655"
     mat_peptide     18040..19620
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /product="3'-to-5' exonuclease"
                     /note="nsp14A2_ExoN and nsp14B_NMT; produced by pp1ab
                     only"
                     /protein_id="YP_009725309.1"
     misc_feature    18052..19614
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="nonstructural protein 14 of betacoronavirus;
                     Region: betaCoV_Nsp14; cd21659"
                     /db_xref="CDD:394958"
     misc_feature    order(18052..18054,18058..18069,18094..18123,18190..18192,
                     18202..18204,18217..18240,18340..18345,18409..18411,
                     18415..18417,18427..18432,18613..18615,18622..18627,
                     18634..18642,18688..18690)
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="heterodimer interface [polypeptide binding]; other
                     site"
                     /db_xref="CDD:394958"
     misc_feature    order(18307..18309,18313..18315,18610..18612,18841..18843,
                     18856..18858)
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="ExoN active site [active]"
                     /db_xref="CDD:394958"
     misc_feature    order(18913..18915,18955..18957,18964..18969,18976..18978,
                     19036..19047,19051..19053,19093..19101,19135..19143,
                     19192..19206,19240..19242,19297..19299,19303..19308,
                     19315..19317,19321..19323,19555..19557)
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="N7-MTase active site [active]"
                     /db_xref="CDD:394958"
     mat_peptide     19621..20658
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /product="endoRNAse"
                     /note="nsp15-A1 and nsp15B-NendoU; produced by pp1ab only"
                     /protein_id="YP_009725310.1"
     misc_feature    19621..19803
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="N-terminal domain of alpha- and beta-coronavirus
                     Nonstructural protein 15 (Nsp15), and related proteins;
                     Region: NTD_alpha_beta_cv_Nsp15-like; cd21171"
                     /db_xref="CDD:394900"
     misc_feature    19813..20208
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="middle domain of alpha- and beta-coronavirus
                     Nonstructural protein 15 (Nsp15), and related proteins;
                     Region: M_alpha_beta_cv_Nsp15-like; cd21167"
                     /db_xref="CDD:394905"
     misc_feature    20200..20652
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="Nidoviral uridylate-specific endoribonuclease
                     (NendoU) domain of coronavirus Nonstructural Protein 15
                     (Nsp15) and related proteins; Region:
                     NendoU_cv_Nsp15-like; cd21161"
                     /db_xref="CDD:394912"
     misc_feature    order(20320..20322,20332..20334,20359..20361,20365..20367,
                     20485..20487,20497..20499,20638..20640)
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="putative active site [active]"
                     /db_xref="CDD:394912"
     mat_peptide     20659..21552
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /product="2'-O-ribose methyltransferase"
                     /note="nsp16_OMT; 2'-o-MT; produced by pp1ab only"
                     /protein_id="YP_009725311.1"
     misc_feature    20662..21549
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="Coronavirus NSP13; Region: NSP13; pfam06460"
                     /db_xref="CDD:399456"
     CDS             266..13483
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="pp1a"
                     /codon_start=1
                     /product="ORF1a polyprotein"
                     /protein_id="YP_009725295.1"
                     /db_xref="GeneID:43740578"
                     /translation="MESLVPGFNEKTHVQLSLPVLQVRDVLVRGFGDSVEEVLSEARQ
                     HLKDGTCGLVEVEKGVLPQLEQPYVFIKRSDARTAPHGHVMVELVAELEGIQYGRSGE
                     TLGVLVPHVGEIPVAYRKVLLRKNGNKGAGGHSYGADLKSFDLGDELGTDPYEDFQEN
                     WNTKHSSGVTRELMRELNGGAYTRYVDNNFCGPDGYPLECIKDLLARAGKASCTLSEQ
                     LDFIDTKRGVYCCREHEHEIAWYTERSEKSYELQTPFEIKLAKKFDTFNGECPNFVFP
                     LNSIIKTIQPRVEKKKLDGFMGRIRSVYPVASPNECNQMCLSTLMKCDHCGETSWQTG
                     DFVKATCEFCGTENLTKEGATTCGYLPQNAVVKIYCPACHNSEVGPEHSLAEYHNESG
                     LKTILRKGGRTIAFGGCVFSYVGCHNKCAYWVPRASANIGCNHTGVVGEGSEGLNDNL
                     LEILQKEKVNINIVGDFKLNEEIAIILASFSASTSAFVETVKGLDYKAFKQIVESCGN
                     FKVTKGKAKKGAWNIGEQKSILSPLYAFASEAARVVRSIFSRTLETAQNSVRVLQKAA
                     ITILDGISQYSLRLIDAMMFTSDLATNNLVVMAYITGGVVQLTSQWLTNIFGTVYEKL
                     KPVLDWLEEKFKEGVEFLRDGWEIVKFISTCACEIVGGQIVTCAKEIKESVQTFFKLV
                     NKFLALCADSIIIGGAKLKALNLGETFVTHSKGLYRKCVKSREETGLLMPLKAPKEII
                     FLEGETLPTEVLTEEVVLKTGDLQPLEQPTSEAVEAPLVGTPVCINGLMLLEIKDTEK
                     YCALAPNMMVTNNTFTLKGGAPTKVTFGDDTVIEVQGYKSVNITFELDERIDKVLNEK
                     CSAYTVELGTEVNEFACVVADAVIKTLQPVSELLTPLGIDLDEWSMATYYLFDESGEF
                     KLASHMYCSFYPPDEDEEEGDCEEEEFEPSTQYEYGTEDDYQGKPLEFGATSAALQPE
                     EEQEEDWLDDDSQQTVGQQDGSEDNQTTTIQTIVEVQPQLEMELTPVVQTIEVNSFSG
                     YLKLTDNVYIKNADIVEEAKKVKPTVVVNAANVYLKHGGGVAGALNKATNNAMQVESD
                     DYIATNGPLKVGGSCVLSGHNLAKHCLHVVGPNVNKGEDIQLLKSAYENFNQHEVLLA
                     PLLSAGIFGADPIHSLRVCVDTVRTNVYLAVFDKNLYDKLVSSFLEMKSEKQVEQKIA
                     EIPKEEVKPFITESKPSVEQRKQDDKKIKACVEEVTTTLEETKFLTENLLLYIDINGN
                     LHPDSATLVSDIDITFLKKDAPYIVGDVVQEGVLTAVVIPTKKAGGTTEMLAKALRKV
                     PTDNYITTYPGQGLNGYTVEEAKTVLKKCKSAFYILPSIISNEKQEILGTVSWNLREM
                     LAHAEETRKLMPVCVETKAIVSTIQRKYKGIKIQEGVVDYGARFYFYTSKTTVASLIN
                     TLNDLNETLVTMPLGYVTHGLNLEEAARYMRSLKVPATVSVSSPDAVTAYNGYLTSSS
                     KTPEEHFIETISLAGSYKDWSYSGQSTQLGIEFLKRGDKSVYYTSNPTTFHLDGEVIT
                     FDNLKTLLSLREVRTIKVFTTVDNINLHTQVVDMSMTYGQQFGPTYLDGADVTKIKPH
                     NSHEGKTFYVLPNDDTLRVEAFEYYHTTDPSFLGRYMSALNHTKKWKYPQVNGLTSIK
                     WADNNCYLATALLTLQQIELKFNPPALQDAYYRARAGEAANFCALILAYCNKTVGELG
                     DVRETMSYLFQHANLDSCKRVLNVVCKTCGQQQTTLKGVEAVMYMGTLSYEQFKKGVQ
                     IPCTCGKQATKYLVQQESPFVMMSAPPAQYELKHGTFTCASEYTGNYQCGHYKHITSK
                     ETLYCIDGALLTKSSEYKGPITDVFYKENSYTTTIKPVTYKLDGVVCTEIDPKLDNYY
                     KKDNSYFTEQPIDLVPNQPYPNASFDNFKFVCDNIKFADDLNQLTGYKKPASRELKVT
                     FFPDLNGDVVAIDYKHYTPSFKKGAKLLHKPIVWHVNNATNKATYKPNTWCIRCLWST
                     KPVETSNSFDVLKSEDAQGMDNLACEDLKPVSEEVVENPTIQKDVLECNVKTTEVVGD
                     IILKPANNSLKITEEVGHTDLMAAYVDNSSLTIKKPNELSRVLGLKTLATHGLAAVNS
                     VPWDTIANYAKPFLNKVVSTTTNIVTRCLNRVCTNYMPYFFTLLLQLCTFTRSTNSRI
                     KASMPTTIAKNTVKSVGKFCLEASFNYLKSPNFSKLINIIIWFLLLSVCLGSLIYSTA
                     ALGVLMSNLGMPSYCTGYREGYLNSTNVTIATYCTGSIPCSVCLSGLDSLDTYPSLET
                     IQITISSFKWDLTAFGLVAEWFLAYILFTRFFYVLGLAAIMQLFFSYFAVHFISNSWL
                     MWLIINLVQMAPISAMVRMYIFFASFYYVWKSYVHVVDGCNSSTCMMCYKRNRATRVE
                     CTTIVNGVRRSFYVYANGGKGFCKLHNWNCVNCDTFCAGSTFISDEVARDLSLQFKRP
                     INPTDQSSYIVDSVTVKNGSIHLYFDKAGQKTYERHSLSHFVNLDNLRANNTKGSLPI
                     NVIVFDGKSKCEESSAKSASVYYSQLMCQPILLLDQALVSDVGDSAEVAVKMFDAYVN
                     TFSSTFNVPMEKLKTLVATAEAELAKNVSLDNVLSTFISAARQGFVDSDVETKDVVEC
                     LKLSHQSDIEVTGDSCNNYMLTYNKVENMTPRDLGACIDCSARHINAQVAKSHNIALI
                     WNVKDFMSLSEQLRKQIRSAAKKNNLPFKLTCATTRQVVNVVTTKIALKGGKIVNNWL
                     KQLIKVTLVFLFVAAIFYLITPVHVMSKHTDFSSEIIGYKAIDGGVTRDIASTDTCFA
                     NKHADFDTWFSQRGGSYTNDKACPLIAAVITREVGFVVPGLPGTILRTTNGDFLHFLP
                     RVFSAVGNICYTPSKLIEYTDFATSACVLAAECTIFKDASGKPVPYCYDTNVLEGSVA
                     YESLRPDTRYVLMDGSIIQFPNTYLEGSVRVVTTFDSEYCRHGTCERSEAGVCVSTSG
                     RWVLNNDYYRSLPGVFCGVDAVNLLTNMFTPLIQPIGALDISASIVAGGIVAIVVTCL
                     AYYFMRFRRAFGEYSHVVAFNTLLFLMSFTVLCLTPVYSFLPGVYSVIYLYLTFYLTN
                     DVSFLAHIQWMVMFTPLVPFWITIAYIICISTKHFYWFFSNYLKRRVVFNGVSFSTFE
                     EAALCTFLLNKEMYLKLRSDVLLPLTQYNRYLALYNKYKYFSGAMDTTSYREAACCHL
                     AKALNDFSNSGSDVLYQPPQTSITSAVLQSGFRKMAFPSGKVEGCMVQVTCGTTTLNG
                     LWLDDVVYCPRHVICTSEDMLNPNYEDLLIRKSNHNFLVQAGNVQLRVIGHSMQNCVL
                     KLKVDTANPKTPKYKFVRIQPGQTFSVLACYNGSPSGVYQCAMRPNFTIKGSFLNGSC
                     GSVGFNIDYDCVSFCYMHHMELPTGVHAGTDLEGNFYGPFVDRQTAQAAGTDTTITVN
                     VLAWLYAAVINGDRWFLNRFTTTLNDFNLVAMKYNYEPLTQDHVDILGPLSAQTGIAV
                     LDMCASLKELLQNGMNGRTILGSALLEDEFTPFDVVRQCSGVTFQSAVKRTIKGTHHW
                     LLLTILTSLLVLVQSTQWSLFFFLYENAFLPFAMGIIAMSAFAMMFVKHKHAFLCLFL
                     LPSLATVAYFNMVYMPASWVMRIMTWLDMVDTSLSGFKLKDCVMYASAVVLLILMTAR
                     TVYDDGARRVWTLMNVLTLVYKVYYGNALDQAISMWALIISVTSNYSGVVTTVMFLAR
                     GIVFMCVEYCPIFFITGNTLQCIMLVYCFLGYFCTCYFGLFCLLNRYFRLTLGVYDYL
                     VSTQEFRYMNSQGLLPPKNSIDAFKLNIKLLGVGGKPCIKVATVQSKMSDVKCTSVVL
                     LSVLQQLRVESSSKLWAQCVQLHNDILLAKDTTEAFEKMVSLLSVLLSMQGAVDINKL
                     CEEMLDNRATLQAIASEFSSLPSYAAFATAQEAYEQAVANGDSEVVLKKLKKSLNVAK
                     SEFDRDAAMQRKLEKMADQAMTQMYKQARSEDKRAKVTSAMQTMLFTMLRKLDNDALN
                     NIINNARDGCVPLNIIPLTTAAKLMVVIPDYNTYKNTCDGTTFTYASALWEIQQVVDA
                     DSKIVQLSEISMDNSPNLAWPLIVTALRANSAVKLQNNELSPVALRQMSCAAGTTQTA
                     CTDDNALAYYNTTKGGRFVLALLSDLQDLKWARFPKSDGTGTIYTELEPPCRFVTDTP
                     KGPKVKYLYFIKGLNNLNRGMVLGSLAATVRLQAGNATEVPANSTVLSFCAFAVDAAK
                     AYKDYLASGGQPITNCVKMLCTHTGTGQAITVTPEANMDQESFGGASCCLYCRCHIDH
                     PNPKGFCDLKGKYVQIPTTCANDPVGFTLKNTVCTVCGMWKGYGCSCDQLREPMLQSA
                     DAQSFLNGFAV"
     mat_peptide     266..805
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /product="leader protein"
                     /note="nsp1; produced by both pp1a and pp1ab"
                     /protein_id="YP_009742608.1"
     misc_feature    302..646
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="N-terminal domain of non-structural protein 1 from
                     Severe acute respiratory syndrome-related coronavirus and
                     betacoronavirus in the B lineage; Region:
                     SARS-CoV-like_Nsp1_N; cd21796"
                     /db_xref="CDD:409335"
     mat_peptide     806..2719
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /product="nsp2"
                     /note="produced by both pp1a and pp1ab"
                     /protein_id="YP_009742609.1"
     misc_feature    809..2719
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="betacoronavirus non-structural protein 2 (Nsp2)
                     similar to SARS-CoV Nsp2, and related proteins from
                     betacoronaviruses in the B lineage; Region:
                     cv_beta_Nsp2_SARS-like; cd21516"
                     /db_xref="CDD:394867"
     mat_peptide     2720..8554
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /product="nsp3"
                     /note="former nsp1; conserved domains are: N-terminal
                     acidic (Ac), predicted phosphoesterase, papain-like
                     proteinase, Y-domain, transmembrane domain 1 (TM1),
                     adenosine diphosphate-ribose 1''-phosphatase (ADRP);
                     produced by both pp1a and pp1ab"
                     /protein_id="YP_009742610.1"
     misc_feature    2903..3415
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="Protein of unknown function (DUF3655); Region:
                     DUF3655; pfam12379"
                     /db_xref="CDD:403549"
     misc_feature    3425..3796
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="X-domain of viral non-structural protein 3 and
                     related macrodomains; Region: Macro_X_Nsp3-like; cd21557"
                     /db_xref="CDD:394882"
     misc_feature    order(3443..3445,3479..3481,3485..3487,3620..3622,
                     3704..3727)
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="putative ADP-ribose binding site [chemical
                     binding]; other site"
                     /db_xref="CDD:394882"
     misc_feature    3962..4339
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="macrodomain superfamily; Region: Macro_SF; cl00019"
                     /db_xref="CDD:412115"
     misc_feature    4316..4744
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="Single-stranded poly(A) binding domain; Region:
                     SUD-M; pfam11633"
                     /db_xref="CDD:314498"
     misc_feature    4751..4948
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="C-terminal SARS-Unique Domain (SUD) of
                     non-structural protein 3 (Nsp3) from Severe Acute
                     Respiratory Syndrome coronavirus and related
                     betacoronaviruses in the B lineage; Region:
                     SUD_C_SARS-CoV_Nsp3; cd21525"
                     /db_xref="CDD:394841"
     misc_feature    4961..5869
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="betacoronavirus papain-like protease; Region:
                     betaCoV_PLPro; cd21732"
                     /db_xref="CDD:409649"
     misc_feature    order(5279..5290,5438..5446,5450..5455,5462..5464,
                     5549..5551,5618..5623,5627..5629,5696..5698,5744..5746,
                     5765..5773,5855..5857)
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="ubiquitin binding site [polypeptide binding]; other
                     site"
                     /db_xref="CDD:409649"
     misc_feature    order(5438..5446,5693..5698,5744..5746,5753..5755,
                     5771..5773,5855..5857)
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="polypeptide substrate binding site [polypeptide
                     binding]; other site"
                     /db_xref="CDD:409649"
     misc_feature    6002..6322
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="nucleic acid binding domain of non-structural
                     protein 3 from Severe acute respiratory syndrome-related
                     coronavirus and betacoronavirus in the B lineage; Region:
                     SARS-CoV-like_Nsp3_NAB; cd21822"
                     /db_xref="CDD:409348"
     misc_feature    6395..6742
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="betacoronavirus-specific marker of betacoronavirus
                     non-structural protein 3; Region: betaCoV_Nsp3_betaSM;
                     cl41743"
                     /db_xref="CDD:425374"
     misc_feature    6959..8551
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="C-terminus of non-structural protein 3, including
                     transmembrane and Y domains, from Severe acute respiratory
                     syndrome-related coronavirus and betacoronavirus in the B
                     lineage; Region: TM_Y_SARS-CoV-like_Nsp3_C; cd21717"
                     /db_xref="CDD:409665"
     misc_feature    6959..7024
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="TM1 [structural motif]; Region: TM1"
                     /db_xref="CDD:409665"
     misc_feature    7274..7342
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="TM2 [structural motif]; Region: TM2"
                     /db_xref="CDD:409665"
     mat_peptide     8555..10054
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /product="nsp4"
                     /note="nsp4B_TM; contains transmembrane domain 2 (TM2);
                     produced by both pp1a and pp1ab"
                     /protein_id="YP_009742611.1"
     misc_feature    8594..9736
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="coronavirus non-structural protein 4 (Nsp4)
                     transmembrane domain; Region: cv_Nsp4_TM; cd21473"
                     /db_xref="CDD:394836"
     misc_feature    8594..8662
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="putative TM helix 1 [structural motif]; Region:
                     putative TM helix 1"
                     /db_xref="CDD:394836"
     misc_feature    9392..9457
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="putative TM helix 2 [structural motif]; Region:
                     putative TM helix 2"
                     /db_xref="CDD:394836"
     misc_feature    9500..9565
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="putative TM helix 3 [structural motif]; Region:
                     putative TM helix 3"
                     /db_xref="CDD:394836"
     misc_feature    9644..9712
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="putative TM helix 4 [structural motif]; Region:
                     putative TM helix 4"
                     /db_xref="CDD:394836"
     misc_feature    9770..10048
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="Coronavirus nonstructural protein 4 C-terminus;
                     Region: Corona_NSP4_C; pfam16348"
                     /db_xref="CDD:406690"
     mat_peptide     10055..10972
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /product="3C-like proteinase"
                     /note="nsp5A_3CLpro and nsp5B_3CLpro; main proteinase
                     (Mpro); mediates cleavages downstream of nsp4. 3D
                     structure of the SARSr-CoV homolog has been determined
                     (Yang et al., 2003); produced by both pp1a and pp1ab"
                     /protein_id="YP_009742612.1"
     misc_feature    10064..10954
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="betacoronavirus non-structural protein 5, also
                     called Main protease (Mpro); Region: betaCoV_Nsp5_Mpro;
                     cd21666"
                     /db_xref="CDD:394887"
     misc_feature    order(10064..10087,10094..10096,10406..10408,10418..10438,
                     10463..10477,10550..10552,10568..10570,10910..10912,
                     10922..10924,10946..10951)
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="homodimer interface [polypeptide binding]; other
                     site"
                     /db_xref="CDD:394887"
     misc_feature    order(10127..10135,10175..10177,10472..10489,10541..10552,
                     10568..10570,10613..10630)
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="polypeptide substrate binding site [polypeptide
                     binding]; other site"
                     /db_xref="CDD:394887"
     mat_peptide     10973..11842
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /product="nsp6"
                     /note="nsp6_TM; putative transmembrane domain; produced by
                     both pp1a and pp1ab"
                     /protein_id="YP_009742613.1"
     misc_feature    10973..11842
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="betacoronavirus non-structural protein 6; Region:
                     betaCoV-Nsp6; cd21560"
                     /db_xref="CDD:394846"
     mat_peptide     11843..12091
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /product="nsp7"
                     /note="produced by both pp1a and pp1ab"
                     /protein_id="YP_009742614.1"
     misc_feature    11843..12091
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="betacoronavirus non-structural protein 7; Region:
                     betaCoV_Nsp7; cd21827"
                     /db_xref="CDD:409253"
     misc_feature    order(11846..11848,11855..11866,11873..11881,11885..11890,
                     11897..11899,11924..11926,11933..11935,11951..11953,
                     11987..12004,12008..12025,12044..12058)
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="oligomer interface [polypeptide binding]; other
                     site"
                     /db_xref="CDD:409253"
     mat_peptide     12092..12685
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /product="nsp8"
                     /note="produced by both pp1a and pp1ab"
                     /protein_id="YP_009742615.1"
     misc_feature    12092..12682
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="nsp8 replicase; Region: nsp8; pfam08717"
                     /db_xref="CDD:400866"
     mat_peptide     12686..13024
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /product="nsp9"
                     /note="ssRNA-binding protein; produced by both pp1a and
                     pp1ab"
                     /protein_id="YP_009742616.1"
     misc_feature    12686..13024
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="betacoronavirus non-structural protein 9; Region:
                     betaCoV_Nsp9; cd21898"
                     /db_xref="CDD:409331"
     misc_feature    12686..12703
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="N-finger; other site"
                     /db_xref="CDD:409331"
     misc_feature    order(12692..12697,12704..12709,12902..12907,12971..12976,
                     12980..12988,12992..13000,13004..13009)
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="homodimer interface [polypeptide binding]; other
                     site"
                     /db_xref="CDD:409331"
     mat_peptide     13025..13441
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /product="nsp10"
                     /note="nsp10_CysHis; formerly known as growth-factor-like
                     protein (GFL); produced by both pp1a and pp1ab"
                     /protein_id="YP_009742617.1"
     misc_feature    13025..13417
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="alphacoronavirus and betacoronavirus non-structural
                     protein 14; Region: alpha_betaCoV_Nsp10; cd21901"
                     /db_xref="CDD:409326"
     misc_feature    order(13025..13048,13058..13060,13064..13072,13076..13084,
                     13097..13102,13109..13114,13121..13123,13142..13159,
                     13196..13201,13229..13231,13235..13240,13250..13252,
                     13256..13273,13286..13294,13301..13312)
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="Nsp14 interface [polypeptide binding]; other site"
                     /db_xref="CDD:409326"
     misc_feature    order(13064..13072,13076..13084,13097..13099,13142..13144,
                     13148..13159,13196..13204,13256..13267,13274..13276,
                     13307..13312,13367..13369)
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="oligomer interface [polypeptide binding]; other
                     site"
                     /db_xref="CDD:409326"
     misc_feature    order(13097..13099,13148..13159,13196..13204,13274..13276,
                     13307..13312,13367..13369)
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="homotrimer interface [polypeptide binding]; other
                     site"
                     /db_xref="CDD:409326"
     misc_feature    order(13142..13165,13193..13201,13229..13240,13253..13258,
                     13262..13264,13301..13312)
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /note="Nsp16 interface [polypeptide binding]; other site"
                     /db_xref="CDD:409326"
     mat_peptide     13442..13480
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /product="nsp11"
                     /note="produced by pp1a only"
                     /protein_id="YP_009725312.1"
     stem_loop       13476..13503
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /inference="COORDINATES:
                     profile:Rfam-release-14.1:RF00507,Infernal:1.1.2"
                     /function="Coronavirus frameshifting stimulation element
                     stem-loop 1"
     stem_loop       13488..13542
                     /gene="ORF1ab"
                     /locus_tag="GU280_gp01"
                     /inference="COORDINATES:
                     profile:Rfam-release-14.1:RF00507,Infernal:1.1.2"
                     /function="Coronavirus frameshifting stimulation element
                     stem-loop 2"
     gene            21563..25384
                     /gene="S"
                     /locus_tag="GU280_gp02"
                     /gene_synonym="spike glycoprotein"
                     /db_xref="GeneID:43740568"
     CDS             21563..25384
                     /gene="S"
                     /locus_tag="GU280_gp02"
                     /gene_synonym="spike glycoprotein"
                     /note="structural protein; spike protein"
                     /codon_start=1
                     /product="surface glycoprotein"
                     /protein_id="YP_009724390.1"
                     /db_xref="GeneID:43740568"
                     /translation="MFVFLVLLPLVSSQCVNLTTRTQLPPAYTNSFTRGVYYPDKVFR
                     SSVLHSTQDLFLPFFSNVTWFHAIHVSGTNGTKRFDNPVLPFNDGVYFASTEKSNIIR
                     GWIFGTTLDSKTQSLLIVNNATNVVIKVCEFQFCNDPFLGVYYHKNNKSWMESEFRVY
                     SSANNCTFEYVSQPFLMDLEGKQGNFKNLREFVFKNIDGYFKIYSKHTPINLVRDLPQ
                     GFSALEPLVDLPIGINITRFQTLLALHRSYLTPGDSSSGWTAGAAAYYVGYLQPRTFL
                     LKYNENGTITDAVDCALDPLSETKCTLKSFTVEKGIYQTSNFRVQPTESIVRFPNITN
                     LCPFGEVFNATRFASVYAWNRKRISNCVADYSVLYNSASFSTFKCYGVSPTKLNDLCF
                     TNVYADSFVIRGDEVRQIAPGQTGKIADYNYKLPDDFTGCVIAWNSNNLDSKVGGNYN
                     YLYRLFRKSNLKPFERDISTEIYQAGSTPCNGVEGFNCYFPLQSYGFQPTNGVGYQPY
                     RVVVLSFELLHAPATVCGPKKSTNLVKNKCVNFNFNGLTGTGVLTESNKKFLPFQQFG
                     RDIADTTDAVRDPQTLEILDITPCSFGGVSVITPGTNTSNQVAVLYQDVNCTEVPVAI
                     HADQLTPTWRVYSTGSNVFQTRAGCLIGAEHVNNSYECDIPIGAGICASYQTQTNSPR
                     RARSVASQSIIAYTMSLGAENSVAYSNNSIAIPTNFTISVTTEILPVSMTKTSVDCTM
                     YICGDSTECSNLLLQYGSFCTQLNRALTGIAVEQDKNTQEVFAQVKQIYKTPPIKDFG
                     GFNFSQILPDPSKPSKRSFIEDLLFNKVTLADAGFIKQYGDCLGDIAARDLICAQKFN
                     GLTVLPPLLTDEMIAQYTSALLAGTITSGWTFGAGAALQIPFAMQMAYRFNGIGVTQN
                     VLYENQKLIANQFNSAIGKIQDSLSSTASALGKLQDVVNQNAQALNTLVKQLSSNFGA
                     ISSVLNDILSRLDKVEAEVQIDRLITGRLQSLQTYVTQQLIRAAEIRASANLAATKMS
                     ECVLGQSKRVDFCGKGYHLMSFPQSAPHGVVFLHVTYVPAQEKNFTTAPAICHDGKAH
                     FPREGVFVSNGTHWFVTQRNFYEPQIITTDNTFVSGNCDVVIGIVNNTVYDPLQPELD
                     SFKEELDKYFKNHTSPDVDLGDISGINASVVNIQKEIDRLNEVAKNLNESLIDLQELG
                     KYEQYIKWPWYIWLGFIAGLIAIVMVTIMLCCMTSCCSCLKGCCSCGSCCKFDEDDSE
                     PVLKGVKLHYT"
     misc_feature    21599..22474
                     /gene="S"
                     /locus_tag="GU280_gp02"
                     /gene_synonym="spike glycoprotein"
                     /note="N-terminal domain of the S1 subunit of the Spike
                     (S) protein from Severe acute respiratory syndrome
                     coronavirus and related betacoronaviruses in the B
                     lineage; Region: SARS-CoV-like_Spike_S1_NTD; cd21624"
                     /db_xref="CDD:394950"
     misc_feature    order(21674..21676,21683..21697,21701..21703,22061..22063,
                     22154..22159,22232..22234,22250..22252,22256..22258,
                     22262..22264,22406..22408)
                     /gene="S"
                     /locus_tag="GU280_gp02"
                     /gene_synonym="spike glycoprotein"
                     /note="trimer interface [polypeptide binding]; other site"
                     /db_xref="CDD:394950"
     misc_feature    22517..23185
                     /gene="S"
                     /locus_tag="GU280_gp02"
                     /gene_synonym="spike glycoprotein"
                     /note="receptor-binding domain of the S1 subunit of severe
                     acute respiratory syndrome coronavirus 2 Spike (S)
                     protein; Region: SARS-CoV-2_Spike_S1_RBD; cd21480"
                     /db_xref="CDD:394827"
     misc_feature    order(22625..22627,22682..22693,22703..22714,22718..22720,
                     22730..22732,22748..22750,22775..22780,22784..22786,
                     22802..22807,22811..22813,22832..22834,22841..22843,
                     23102..23104,23108..23113,23117..23119)
                     /gene="S"
                     /locus_tag="GU280_gp02"
                     /gene_synonym="spike glycoprotein"
                     /note="trimer interface [polypeptide binding]; other site"
                     /db_xref="CDD:394827"
     misc_feature    order(22778..22780,22787..22789,22811..22813,22907..22909,
                     22919..22921,22925..22930,23027..23029,23033..23035,
                     23039..23041)
                     /gene="S"
                     /locus_tag="GU280_gp02"
                     /gene_synonym="spike glycoprotein"
                     /note="putative receptor binding site [polypeptide
                     binding]; other site"
                     /db_xref="CDD:394827"
     misc_feature    order(22874..22876,22901..22930,23027..23047,23081..23086)
                     /gene="S"
                     /locus_tag="GU280_gp02"
                     /gene_synonym="spike glycoprotein"
                     /note="receptor binding motif; other site"
                     /db_xref="CDD:394827"
     misc_feature    23189..25186
                     /gene="S"
                     /locus_tag="GU280_gp02"
                     /gene_synonym="spike glycoprotein"
                     /note="SD-1 and SD-2 subdomains, the S1/S2 cleavage
                     region, and the S2 fusion subunit of the spike (S)
                     glycoprotein from SARS-CoV-2 (COVID-19) and related
                     betacoronaviruses in the B lineage; Region:
                     SARS-CoV-like_Spike_SD1-2_S1-S2_S2; cd22378"
                     /db_xref="CDD:411965"
     misc_feature    order(23576..23578,23642..23659,23687..23689)
                     /gene="S"
                     /locus_tag="GU280_gp02"
                     /gene_synonym="spike glycoprotein"
                     /note="S1/S2 cleavage region; other site"
                     /db_xref="CDD:411965"
     misc_feature    23954..23980
                     /gene="S"
                     /locus_tag="GU280_gp02"
                     /gene_synonym="spike glycoprotein"
                     /note="fusion peptide; other site"
                     /db_xref="CDD:411965"
     misc_feature    24008..24061
                     /gene="S"
                     /locus_tag="GU280_gp02"
                     /gene_synonym="spike glycoprotein"
                     /note="internal fusion peptide; other site"
                     /db_xref="CDD:411965"
     misc_feature    24314..24511
                     /gene="S"
                     /locus_tag="GU280_gp02"
                     /gene_synonym="spike glycoprotein"
                     /note="heptad repeat 1 [structural motif]; Region: heptad
                     repeat 1"
                     /db_xref="CDD:411965"
     misc_feature    25046..25171
                     /gene="S"
                     /locus_tag="GU280_gp02"
                     /gene_synonym="spike glycoprotein"
                     /note="heptad repeat 2 [structural motif]; Region: heptad
                     repeat 2"
                     /db_xref="CDD:411965"
     misc_feature    25259..25381
                     /gene="S"
                     /locus_tag="GU280_gp02"
                     /gene_synonym="spike glycoprotein"
                     /note="Coronavirus spike glycoprotein S2, intravirion;
                     Region: CoV_S2_C; pfam19214"
                     /db_xref="CDD:408983"
     gene            25393..26220
                     /gene="ORF3a"
                     /locus_tag="GU280_gp03"
                     /db_xref="GeneID:43740569"
     CDS             25393..26220
                     /gene="ORF3a"
                     /locus_tag="GU280_gp03"
                     /codon_start=1
                     /product="ORF3a protein"
                     /protein_id="YP_009724391.1"
                     /db_xref="GeneID:43740569"
                     /translation="MDLFMRIFTIGTVTLKQGEIKDATPSDFVRATATIPIQASLPFG
                     WLIVGVALLAVFQSASKIITLKKRWQLALSKGVHFVCNLLLLFVTVYSHLLLVAAGLE
                     APFLYLYALVYFLQSINFVRIIMRLWLCWKCRSKNPLLYDANYFLCWHTNCYDYCIPY
                     NSVTSSIVITSGDGTTSPISEHDYQIGGYTEKWESGVKDCVVLHSYFTSDYYQLYSTQ
                     LSTDTGVEHVTFFIYNKIVDEPEEHVQIHTIDGSSGVVNPVMEPIYDEPTTTTSVPL"
     misc_feature    25405..26214
                     /gene="ORF3a"
                     /locus_tag="GU280_gp03"
                     /note="accessory protein ORF3a of severe acute respiratory
                     syndrome-associated coronavirus and similar proteins from
                     related betacoronavirus; Region: SARS-CoV-like_ORF3a;
                     cd21648"
                     /db_xref="CDD:394922"
     misc_feature    25507..25569
                     /gene="ORF3a"
                     /locus_tag="GU280_gp03"
                     /note="putative TM segment [structural motif]; Region:
                     putative TM segment"
                     /db_xref="CDD:394922"
     misc_feature    25627..25689
                     /gene="ORF3a"
                     /locus_tag="GU280_gp03"
                     /note="putative TM segment [structural motif]; Region:
                     putative TM segment"
                     /db_xref="CDD:394922"
     misc_feature    25705..25767
                     /gene="ORF3a"
                     /locus_tag="GU280_gp03"
                     /note="putative TM segment [structural motif]; Region:
                     putative TM segment"
                     /db_xref="CDD:394922"
     gene            26245..26472
                     /gene="E"
                     /locus_tag="GU280_gp04"
                     /db_xref="GeneID:43740570"
     CDS             26245..26472
                     /gene="E"
                     /locus_tag="GU280_gp04"
                     /note="ORF4; structural protein; E protein"
                     /codon_start=1
                     /product="envelope protein"
                     /protein_id="YP_009724392.1"
                     /db_xref="GeneID:43740570"
                     /translation="MYSFVSEETGTLIVNSVLLFLAFVVFLLVTLAILTALRLCAYCC
                     NIVNVSLVKPSFYVYSRVKNLNSSRVPDLLV"
     misc_feature    26248..26469
                     /gene="E"
                     /locus_tag="GU280_gp04"
                     /note="Severe acute respiratory syndrome coronavirus 2
                     Envelope small membrane protein; Region: SARS-CoV-2_E;
                     cd21536"
                     /db_xref="CDD:394862"
     misc_feature    order(26266..26268,26287..26292,26296..26301,26305..26331,
                     26335..26337,26383..26391,26413..26415,26422..26427,
                     26431..26439)
                     /gene="E"
                     /locus_tag="GU280_gp04"
                     /note="homopentameric interface [polypeptide binding];
                     other site"
                     /db_xref="CDD:394862"
     gene            26523..27191
                     /gene="M"
                     /locus_tag="GU280_gp05"
                     /db_xref="GeneID:43740571"
     CDS             26523..27191
                     /gene="M"
                     /locus_tag="GU280_gp05"
                     /note="ORF5; structural protein"
                     /codon_start=1
                     /product="membrane glycoprotein"
                     /protein_id="YP_009724393.1"
                     /db_xref="GeneID:43740571"
                     /translation="MADSNGTITVEELKKLLEQWNLVIGFLFLTWICLLQFAYANRNR
                     FLYIIKLIFLWLLWPVTLACFVLAAVYRINWITGGIAIAMACLVGLMWLSYFIASFRL
                     FARTRSMWSFNPETNILLNVPLHGTILTRPLLESELVIGAVILRGHLRIAGHHLGRCD
                     IKDLPKEITVATSRTLSYYKLGASQRVAGDSGFAAYSRYRIGNYKLNTDHSSSSDNIA
                     LLVQ"
     misc_feature    26535..27188
                     /gene="M"
                     /locus_tag="GU280_gp05"
                     /note="Membrane (or Matrix) protein from Severe acute
                     respiratory syndrome (SARS) coronavirus, SARS-CoV-2, and
                     related betacoronaviruses in the B lineage; Region:
                     SARS-like-CoV_M; cd21569"
                     /db_xref="CDD:394855"
     gene            27202..27387
                     /gene="ORF6"
                     /locus_tag="GU280_gp06"
                     /db_xref="GeneID:43740572"
     CDS             27202..27387
                     /gene="ORF6"
                     /locus_tag="GU280_gp06"
                     /codon_start=1
                     /product="ORF6 protein"
                     /protein_id="YP_009724394.1"
                     /db_xref="GeneID:43740572"
                     /translation="MFHLVDFQVTIAEILLIIMRTFKVSIWNLDYIINLIIKNLSKSL
                     TENKYSQLDEEQPMEID"
     misc_feature    27202..27384
                     /gene="ORF6"
                     /locus_tag="GU280_gp06"
                     /note="Open reading frame 6 from SARS coronavirus; Region:
                     Sars6; pfam12133"
                     /db_xref="CDD:403379"
     gene            27394..27759
                     /gene="ORF7a"
                     /locus_tag="GU280_gp07"
                     /db_xref="GeneID:43740573"
     CDS             27394..27759
                     /gene="ORF7a"
                     /locus_tag="GU280_gp07"
                     /codon_start=1
                     /product="ORF7a protein"
                     /protein_id="YP_009724395.1"
                     /db_xref="GeneID:43740573"
                     /translation="MKIILFLALITLATCELYHYQECVRGTTVLLKEPCSSGTYEGNS
                     PFHPLADNKFALTCFSTQFAFACPDGVKHVYQLRARSVSPKLFIRQEEVQELYSPIFL
                     IVAAIVFITLCFTLKRKTE"
     misc_feature    27394..27756
                     /gene="ORF7a"
                     /locus_tag="GU280_gp07"
                     /note="Severe Acute Respiratory Syndrome coronavirus 2
                     (SARS-CoV-2) structural accessory protein ORF7a and a bat
                     coronavirus (BatCoV RaTG13) from related betacoronaviruses
                     in the subgenera Sarbecovirus (B lineage); Region:
                     ORF7a_SARS-CoV-2-like; cd21684"
                     /db_xref="CDD:394935"
     misc_feature    27439..27465
                     /gene="ORF7a"
                     /locus_tag="GU280_gp07"
                     /note="Ig strand A [structural motif]; Region: Ig strand
                     A"
                     /db_xref="CDD:394935"
     misc_feature    27469..27492
                     /gene="ORF7a"
                     /locus_tag="GU280_gp07"
                     /note="Ig strand B [structural motif]; Region: Ig strand
                     B"
                     /db_xref="CDD:394935"
     misc_feature    27505..27519
                     /gene="ORF7a"
                     /locus_tag="GU280_gp07"
                     /note="Ig strand C [structural motif]; Region: Ig strand
                     C"
                     /db_xref="CDD:394935"
     misc_feature    27532..27543
                     /gene="ORF7a"
                     /locus_tag="GU280_gp07"
                     /note="Ig strand D [structural motif]; Region: Ig strand
                     D"
                     /db_xref="CDD:394935"
     misc_feature    27547..27567
                     /gene="ORF7a"
                     /locus_tag="GU280_gp07"
                     /note="Ig strand E [structural motif]; Region: Ig strand
                     E"
                     /db_xref="CDD:394935"
     misc_feature    27571..27597
                     /gene="ORF7a"
                     /locus_tag="GU280_gp07"
                     /note="Ig strand F [structural motif]; Region: Ig strand
                     F"
                     /db_xref="CDD:394935"
     misc_feature    27601..27633
                     /gene="ORF7a"
                     /locus_tag="GU280_gp07"
                     /note="Ig strand G [structural motif]; Region: Ig strand
                     G"
                     /db_xref="CDD:394935"
     gene            27756..27887
                     /gene="ORF7b"
                     /locus_tag="GU280_gp08"
                     /db_xref="GeneID:43740574"
     CDS             27756..27887
                     /gene="ORF7b"
                     /locus_tag="GU280_gp08"
                     /codon_start=1
                     /product="ORF7b"
                     /protein_id="YP_009725318.1"
                     /db_xref="GeneID:43740574"
                     /translation="MIELSLIDFYLCFLAFLLFLVLIMLIIFWFSLELQDHNETCHA"
     misc_feature    27756..27884
                     /gene="ORF7b"
                     /locus_tag="GU280_gp08"
                     /note="Structural accessory protein ORF7b of Severe Acute
                     Respiratory Syndrome coronavirus 2 and similar proteins;
                     Region: ORF7b_SARS-CoV-2; cd21623"
                     /db_xref="CDD:394938"
     gene            27894..28259
                     /gene="ORF8"
                     /locus_tag="GU280_gp09"
                     /db_xref="GeneID:43740577"
     CDS             27894..28259
                     /gene="ORF8"
                     /locus_tag="GU280_gp09"
                     /codon_start=1
                     /product="ORF8 protein"
                     /protein_id="YP_009724396.1"
                     /db_xref="GeneID:43740577"
                     /translation="MKFLVFLGIITTVAAFHQECSLQSCTQHQPYVVDDPCPIHFYSK
                     WYIRVGARKSAPLIELCVDEAGSKSPIQYIDIGNYTVSCLPFTINCQEPKLGSLVVRC
                     SFYEDFLEYHDVRVVLDFI"
     misc_feature    27894..28256
                     /gene="ORF8"
                     /locus_tag="GU280_gp09"
                     /note="SARS-CoV-2 ORF8 immunoglobulin (Ig) domain protein
                     and related proteins; Region: ORF8-Ig_SARS-CoV-2-like;
                     cd21641"
                     /db_xref="CDD:394945"
     misc_feature    27948..27968
                     /gene="ORF8"
                     /locus_tag="GU280_gp09"
                     /note="Ig strand A [structural motif]; Region: Ig strand
                     A"
                     /db_xref="CDD:394945"
     misc_feature    27981..28004
                     /gene="ORF8"
                     /locus_tag="GU280_gp09"
                     /note="Ig strand B [structural motif]; Region: Ig strand
                     B"
                     /db_xref="CDD:394945"
     misc_feature    28020..28034
                     /gene="ORF8"
                     /locus_tag="GU280_gp09"
                     /note="Ig strand C [structural motif]; Region: Ig strand
                     C"
                     /db_xref="CDD:394945"
     misc_feature    28044..28058
                     /gene="ORF8"
                     /locus_tag="GU280_gp09"
                     /note="Ig strand C' [structural motif]; Region: Ig strand
                     C'"
                     /db_xref="CDD:394945"
     misc_feature    28062..28076
                     /gene="ORF8"
                     /locus_tag="GU280_gp09"
                     /note="Ig strand C' [structural motif]; Region: Ig strand
                     C'"
                     /db_xref="CDD:394945"
     misc_feature    28095..28109
                     /gene="ORF8"
                     /locus_tag="GU280_gp09"
                     /note="Ig strand D [structural motif]; Region: Ig strand
                     D"
                     /db_xref="CDD:394945"
     misc_feature    28113..28124
                     /gene="ORF8"
                     /locus_tag="GU280_gp09"
                     /note="Ig strand E [structural motif]; Region: Ig strand
                     E"
                     /db_xref="CDD:394945"
     misc_feature    28149..28166
                     /gene="ORF8"
                     /locus_tag="GU280_gp09"
                     /note="Ig strand F [structural motif]; Region: Ig strand
                     F"
                     /db_xref="CDD:394945"
     misc_feature    28179..28199
                     /gene="ORF8"
                     /locus_tag="GU280_gp09"
                     /note="Ig strand G [structural motif]; Region: Ig strand
                     G"
                     /db_xref="CDD:394945"
     gene            28274..29533
                     /gene="N"
                     /locus_tag="GU280_gp10"
                     /db_xref="GeneID:43740575"
     CDS             28274..29533
                     /gene="N"
                     /locus_tag="GU280_gp10"
                     /note="ORF9; structural protein"
                     /codon_start=1
                     /product="nucleocapsid phosphoprotein"
                     /protein_id="YP_009724397.2"
                     /db_xref="GeneID:43740575"
                     /translation="MSDNGPQNQRNAPRITFGGPSDSTGSNQNGERSGARSKQRRPQG
                     LPNNTASWFTALTQHGKEDLKFPRGQGVPINTNSSPDDQIGYYRRATRRIRGGDGKMK
                     DLSPRWYFYYLGTGPEAGLPYGANKDGIIWVATEGALNTPKDHIGTRNPANNAAIVLQ
                     LPQGTTLPKGFYAEGSRGGSQASSRSSSRSRNSSRNSTPGSSRGTSPARMAGNGGDAA
                     LALLLLDRLNQLESKMSGKGQQQQGQTVTKKSAAEASKKPRQKRTATKAYNVTQAFGR
                     RGPEQTQGNFGDQELIRQGTDYKHWPQIAQFAPSASAFFGMSRIGMEVTPSGTWLTYT
                     GAIKLDDKDPNFKDQVILLNKHIDAYKTFPPTEPKKDKKKKADETQALPQRQKKQQTV
                     TLLPAADLDDFSKQLQQSMSSADSTQA"
     misc_feature    28391..29437
                     /gene="N"
                     /locus_tag="GU280_gp10"
                     /note="Coronavirus nucleocapsid protein; Region:
                     Corona_nucleoca; pfam00937"
                     /db_xref="CDD:395751"
     gene            29558..29674
                     /gene="ORF10"
                     /locus_tag="GU280_gp11"
                     /db_xref="GeneID:43740576"
     CDS             29558..29674
                     /gene="ORF10"
                     /locus_tag="GU280_gp11"
                     /codon_start=1
                     /product="ORF10 protein"
                     /protein_id="YP_009725255.1"
                     /db_xref="GeneID:43740576"
                     /translation="MGYINVFAFPFTIYSLLLCRMNSRNYIAQVDVVNFNLT"
     misc_feature    29558..29665
                     /gene="ORF10"
                     /locus_tag="GU280_gp11"
                     /note="Severe acute respiratory syndrome coronavirus 2
                     Orf10; Region: SARS-CoV-2_Orf10; cd21597"
                     /db_xref="CDD:394948"
     stem_loop       29609..29644
                     /gene="ORF10"
                     /locus_tag="GU280_gp11"
                     /inference="COORDINATES:
                     profile::Rfam-release-14.1:RF00165,Infernal:1.1.2"
                     /function="Coronavirus 3' UTR pseudoknot stem-loop 1"
     stem_loop       29629..29657
                     /gene="ORF10"
                     /locus_tag="GU280_gp11"
                     /inference="COORDINATES:
                     profile::Rfam-release-14.1:RF00165,Infernal:1.1.2"
                     /function="Coronavirus 3' UTR pseudoknot stem-loop 2"
     3'UTR           29675..29903
     stem_loop       29728..29768
                     /inference="COORDINATES:
                     profile:Rfam-release-14.1:RF00164,Infernal:1.1.2"
                     /note="basepair exception: alignment to the Rfam model
                     implies coordinates 29740:29758 form a noncanonical C:T
                     basepair, but the homologous positions form a highly
                     conserved C:G basepair in other viruses, including SARS
                     (NC_004718.3)"
                     /function="Coronavirus 3' stem-loop II-like motif (s2m)"
ORIGIN      
        1 attaaaggtt tataccttcc caggtaacaa accaaccaac tttcgatctc ttgtagatct
       61 gttctctaaa cgaactttaa aatctgtgtg gctgtcactc ggctgcatgc ttagtgcact
      121 cacgcagtat aattaataac taattactgt cgttgacagg acacgagtaa ctcgtctatc
      181 ttctgcaggc tgcttacggt ttcgtccgtg ttgcagccga tcatcagcac atctaggttt
      241 cgtccgggtg tgaccgaaag gtaagatgga gagccttgtc cctggtttca acgagaaaac
      301 acacgtccaa ctcagtttgc ctgttttaca ggttcgcgac gtgctcgtac gtggctttgg
      361 agactccgtg gaggaggtct tatcagaggc acgtcaacat cttaaagatg gcacttgtgg
      421 cttagtagaa gttgaaaaag gcgttttgcc tcaacttgaa cagccctatg tgttcatcaa
      481 acgttcggat gctcgaactg cacctcatgg tcatgttatg gttgagctgg tagcagaact
      541 cgaaggcatt cagtacggtc gtagtggtga gacacttggt gtccttgtcc ctcatgtggg
      601 cgaaatacca gtggcttacc gcaaggttct tcttcgtaag aacggtaata aaggagctgg
      661 tggccatagt tacggcgccg atctaaagtc atttgactta ggcgacgagc ttggcactga
      721 tccttatgaa gattttcaag aaaactggaa cactaaacat agcagtggtg ttacccgtga
      781 actcatgcgt gagcttaacg gaggggcata cactcgctat gtcgataaca acttctgtgg
      841 ccctgatggc taccctcttg agtgcattaa agaccttcta gcacgtgctg gtaaagcttc
      901 atgcactttg tccgaacaac tggactttat tgacactaag aggggtgtat actgctgccg
      961 tgaacatgag catgaaattg cttggtacac ggaacgttct gaaaagagct atgaattgca
     1021 gacacctttt gaaattaaat tggcaaagaa atttgacacc ttcaatgggg aatgtccaaa
     1081 ttttgtattt cccttaaatt ccataatcaa gactattcaa ccaagggttg aaaagaaaaa
     1141 gcttgatggc tttatgggta gaattcgatc tgtctatcca gttgcgtcac caaatgaatg
     1201 caaccaaatg tgcctttcaa ctctcatgaa gtgtgatcat tgtggtgaaa cttcatggca
     1261 gacgggcgat tttgttaaag ccacttgcga attttgtggc actgagaatt tgactaaaga
     1321 aggtgccact acttgtggtt acttacccca aaatgctgtt gttaaaattt attgtccagc
     1381 atgtcacaat tcagaagtag gacctgagca tagtcttgcc gaataccata atgaatctgg
     1441 cttgaaaacc attcttcgta agggtggtcg cactattgcc tttggaggct gtgtgttctc
     1501 ttatgttggt tgccataaca agtgtgccta ttgggttcca cgtgctagcg ctaacatagg
     1561 ttgtaaccat acaggtgttg ttggagaagg ttccgaaggt cttaatgaca accttcttga
     1621 aatactccaa aaagagaaag tcaacatcaa tattgttggt gactttaaac ttaatgaaga
     1681 gatcgccatt attttggcat ctttttctgc ttccacaagt gcttttgtgg aaactgtgaa
     1741 aggtttggat tataaagcat tcaaacaaat tgttgaatcc tgtggtaatt ttaaagttac
     1801 aaaaggaaaa gctaaaaaag gtgcctggaa tattggtgaa cagaaatcaa tactgagtcc
     1861 tctttatgca tttgcatcag aggctgctcg tgttgtacga tcaattttct cccgcactct
     1921 tgaaactgct caaaattctg tgcgtgtttt acagaaggcc gctataacaa tactagatgg
     1981 aatttcacag tattcactga gactcattga tgctatgatg ttcacatctg atttggctac
     2041 taacaatcta gttgtaatgg cctacattac aggtggtgtt gttcagttga cttcgcagtg
     2101 gctaactaac atctttggca ctgtttatga aaaactcaaa cccgtccttg attggcttga
     2161 agagaagttt aaggaaggtg tagagtttct tagagacggt tgggaaattg ttaaatttat
     2221 ctcaacctgt gcttgtgaaa ttgtcggtgg acaaattgtc acctgtgcaa aggaaattaa
     2281 ggagagtgtt cagacattct ttaagcttgt aaataaattt ttggctttgt gtgctgactc
     2341 tatcattatt ggtggagcta aacttaaagc cttgaattta ggtgaaacat ttgtcacgca
     2401 ctcaaaggga ttgtacagaa agtgtgttaa atccagagaa gaaactggcc tactcatgcc
     2461 tctaaaagcc ccaaaagaaa ttatcttctt agagggagaa acacttccca cagaagtgtt
     2521 aacagaggaa gttgtcttga aaactggtga tttacaacca ttagaacaac ctactagtga
     2581 agctgttgaa gctccattgg ttggtacacc agtttgtatt aacgggctta tgttgctcga
     2641 aatcaaagac acagaaaagt actgtgccct tgcacctaat atgatggtaa caaacaatac
     2701 cttcacactc aaaggcggtg caccaacaaa ggttactttt ggtgatgaca ctgtgataga
     2761 agtgcaaggt tacaagagtg tgaatatcac ttttgaactt gatgaaagga ttgataaagt
     2821 acttaatgag aagtgctctg cctatacagt tgaactcggt acagaagtaa atgagttcgc
     2881 ctgtgttgtg gcagatgctg tcataaaaac tttgcaacca gtatctgaat tacttacacc
     2941 actgggcatt gatttagatg agtggagtat ggctacatac tacttatttg atgagtctgg
     3001 tgagtttaaa ttggcttcac atatgtattg ttctttctac cctccagatg aggatgaaga
     3061 agaaggtgat tgtgaagaag aagagtttga gccatcaact caatatgagt atggtactga
     3121 agatgattac caaggtaaac ctttggaatt tggtgccact tctgctgctc ttcaacctga
     3181 agaagagcaa gaagaagatt ggttagatga tgatagtcaa caaactgttg gtcaacaaga
     3241 cggcagtgag gacaatcaga caactactat tcaaacaatt gttgaggttc aacctcaatt
     3301 agagatggaa cttacaccag ttgttcagac tattgaagtg aatagtttta gtggttattt
     3361 aaaacttact gacaatgtat acattaaaaa tgcagacatt gtggaagaag ctaaaaaggt
     3421 aaaaccaaca gtggttgtta atgcagccaa tgtttacctt aaacatggag gaggtgttgc
     3481 aggagcctta aataaggcta ctaacaatgc catgcaagtt gaatctgatg attacatagc
     3541 tactaatgga ccacttaaag tgggtggtag ttgtgtttta agcggacaca atcttgctaa
     3601 acactgtctt catgttgtcg gcccaaatgt taacaaaggt gaagacattc aacttcttaa
     3661 gagtgcttat gaaaatttta atcagcacga agttctactt gcaccattat tatcagctgg
     3721 tatttttggt gctgacccta tacattcttt aagagtttgt gtagatactg ttcgcacaaa
     3781 tgtctactta gctgtctttg ataaaaatct ctatgacaaa cttgtttcaa gctttttgga
     3841 aatgaagagt gaaaagcaag ttgaacaaaa gatcgctgag attcctaaag aggaagttaa
     3901 gccatttata actgaaagta aaccttcagt tgaacagaga aaacaagatg ataagaaaat
     3961 caaagcttgt gttgaagaag ttacaacaac tctggaagaa actaagttcc tcacagaaaa
     4021 cttgttactt tatattgaca ttaatggcaa tcttcatcca gattctgcca ctcttgttag
     4081 tgacattgac atcactttct taaagaaaga tgctccatat atagtgggtg atgttgttca
     4141 agagggtgtt ttaactgctg tggttatacc tactaaaaag gctggtggca ctactgaaat
     4201 gctagcgaaa gctttgagaa aagtgccaac agacaattat ataaccactt acccgggtca
     4261 gggtttaaat ggttacactg tagaggaggc aaagacagtg cttaaaaagt gtaaaagtgc
     4321 cttttacatt ctaccatcta ttatctctaa tgagaagcaa gaaattcttg gaactgtttc
     4381 ttggaatttg cgagaaatgc ttgcacatgc agaagaaaca cgcaaattaa tgcctgtctg
     4441 tgtggaaact aaagccatag tttcaactat acagcgtaaa tataagggta ttaaaataca
     4501 agagggtgtg gttgattatg gtgctagatt ttacttttac accagtaaaa caactgtagc
     4561 gtcacttatc aacacactta acgatctaaa tgaaactctt gttacaatgc cacttggcta
     4621 tgtaacacat ggcttaaatt tggaagaagc tgctcggtat atgagatctc tcaaagtgcc
     4681 agctacagtt tctgtttctt cacctgatgc tgttacagcg tataatggtt atcttacttc
     4741 ttcttctaaa acacctgaag aacattttat tgaaaccatc tcacttgctg gttcctataa
     4801 agattggtcc tattctggac aatctacaca actaggtata gaatttctta agagaggtga
     4861 taaaagtgta tattacacta gtaatcctac cacattccac ctagatggtg aagttatcac
     4921 ctttgacaat cttaagacac ttctttcttt gagagaagtg aggactatta aggtgtttac
     4981 aacagtagac aacattaacc tccacacgca agttgtggac atgtcaatga catatggaca
     5041 acagtttggt ccaacttatt tggatggagc tgatgttact aaaataaaac ctcataattc
     5101 acatgaaggt aaaacatttt atgttttacc taatgatgac actctacgtg ttgaggcttt
     5161 tgagtactac cacacaactg atcctagttt tctgggtagg tacatgtcag cattaaatca
     5221 cactaaaaag tggaaatacc cacaagttaa tggtttaact tctattaaat gggcagataa
     5281 caactgttat cttgccactg cattgttaac actccaacaa atagagttga agtttaatcc
     5341 acctgctcta caagatgctt attacagagc aagggctggt gaagctgcta acttttgtgc
     5401 acttatctta gcctactgta ataagacagt aggtgagtta ggtgatgtta gagaaacaat
     5461 gagttacttg tttcaacatg ccaatttaga ttcttgcaaa agagtcttga acgtggtgtg
     5521 taaaacttgt ggacaacagc agacaaccct taagggtgta gaagctgtta tgtacatggg
     5581 cacactttct tatgaacaat ttaagaaagg tgttcagata ccttgtacgt gtggtaaaca
     5641 agctacaaaa tatctagtac aacaggagtc accttttgtt atgatgtcag caccacctgc
     5701 tcagtatgaa cttaagcatg gtacatttac ttgtgctagt gagtacactg gtaattacca
     5761 gtgtggtcac tataaacata taacttctaa agaaactttg tattgcatag acggtgcttt
     5821 acttacaaag tcctcagaat acaaaggtcc tattacggat gttttctaca aagaaaacag
     5881 ttacacaaca accataaaac cagttactta taaattggat ggtgttgttt gtacagaaat
     5941 tgaccctaag ttggacaatt attataagaa agacaattct tatttcacag agcaaccaat
     6001 tgatcttgta ccaaaccaac catatccaaa cgcaagcttc gataatttta agtttgtatg
     6061 tgataatatc aaatttgctg atgatttaaa ccagttaact ggttataaga aacctgcttc
     6121 aagagagctt aaagttacat ttttccctga cttaaatggt gatgtggtgg ctattgatta
     6181 taaacactac acaccctctt ttaagaaagg agctaaattg ttacataaac ctattgtttg
     6241 gcatgttaac aatgcaacta ataaagccac gtataaacca aatacctggt gtatacgttg
     6301 tctttggagc acaaaaccag ttgaaacatc aaattcgttt gatgtactga agtcagagga
     6361 cgcgcaggga atggataatc ttgcctgcga agatctaaaa ccagtctctg aagaagtagt
     6421 ggaaaatcct accatacaga aagacgttct tgagtgtaat gtgaaaacta ccgaagttgt
     6481 aggagacatt atacttaaac cagcaaataa tagtttaaaa attacagaag aggttggcca
     6541 cacagatcta atggctgctt atgtagacaa ttctagtctt actattaaga aacctaatga
     6601 attatctaga gtattaggtt tgaaaaccct tgctactcat ggtttagctg ctgttaatag
     6661 tgtcccttgg gatactatag ctaattatgc taagcctttt cttaacaaag ttgttagtac
     6721 aactactaac atagttacac ggtgtttaaa ccgtgtttgt actaattata tgccttattt
     6781 ctttacttta ttgctacaat tgtgtacttt tactagaagt acaaattcta gaattaaagc
     6841 atctatgccg actactatag caaagaatac tgttaagagt gtcggtaaat tttgtctaga
     6901 ggcttcattt aattatttga agtcacctaa tttttctaaa ctgataaata ttataatttg
     6961 gtttttacta ttaagtgttt gcctaggttc tttaatctac tcaaccgctg ctttaggtgt
     7021 tttaatgtct aatttaggca tgccttctta ctgtactggt tacagagaag gctatttgaa
     7081 ctctactaat gtcactattg caacctactg tactggttct ataccttgta gtgtttgtct
     7141 tagtggttta gattctttag acacctatcc ttctttagaa actatacaaa ttaccatttc
     7201 atcttttaaa tgggatttaa ctgcttttgg cttagttgca gagtggtttt tggcatatat
     7261 tcttttcact aggtttttct atgtacttgg attggctgca atcatgcaat tgtttttcag
     7321 ctattttgca gtacatttta ttagtaattc ttggcttatg tggttaataa ttaatcttgt
     7381 acaaatggcc ccgatttcag ctatggttag aatgtacatc ttctttgcat cattttatta
     7441 tgtatggaaa agttatgtgc atgttgtaga cggttgtaat tcatcaactt gtatgatgtg
     7501 ttacaaacgt aatagagcaa caagagtcga atgtacaact attgttaatg gtgttagaag
     7561 gtccttttat gtctatgcta atggaggtaa aggcttttgc aaactacaca attggaattg
     7621 tgttaattgt gatacattct gtgctggtag tacatttatt agtgatgaag ttgcgagaga
     7681 cttgtcacta cagtttaaaa gaccaataaa tcctactgac cagtcttctt acatcgttga
     7741 tagtgttaca gtgaagaatg gttccatcca tctttacttt gataaagctg gtcaaaagac
     7801 ttatgaaaga cattctctct ctcattttgt taacttagac aacctgagag ctaataacac
     7861 taaaggttca ttgcctatta atgttatagt ttttgatggt aaatcaaaat gtgaagaatc
     7921 atctgcaaaa tcagcgtctg tttactacag tcagcttatg tgtcaaccta tactgttact
     7981 agatcaggca ttagtgtctg atgttggtga tagtgcggaa gttgcagtta aaatgtttga
     8041 tgcttacgtt aatacgtttt catcaacttt taacgtacca atggaaaaac tcaaaacact
     8101 agttgcaact gcagaagctg aacttgcaaa gaatgtgtcc ttagacaatg tcttatctac
     8161 ttttatttca gcagctcggc aagggtttgt tgattcagat gtagaaacta aagatgttgt
     8221 tgaatgtctt aaattgtcac atcaatctga catagaagtt actggcgata gttgtaataa
     8281 ctatatgctc acctataaca aagttgaaaa catgacaccc cgtgaccttg gtgcttgtat
     8341 tgactgtagt gcgcgtcata ttaatgcgca ggtagcaaaa agtcacaaca ttgctttgat
     8401 atggaacgtt aaagatttca tgtcattgtc tgaacaacta cgaaaacaaa tacgtagtgc
     8461 tgctaaaaag aataacttac cttttaagtt gacatgtgca actactagac aagttgttaa
     8521 tgttgtaaca acaaagatag cacttaaggg tggtaaaatt gttaataatt ggttgaagca
     8581 gttaattaaa gttacacttg tgttcctttt tgttgctgct attttctatt taataacacc
     8641 tgttcatgtc atgtctaaac atactgactt ttcaagtgaa atcataggat acaaggctat
     8701 tgatggtggt gtcactcgtg acatagcatc tacagatact tgttttgcta acaaacatgc
     8761 tgattttgac acatggttta gccagcgtgg tggtagttat actaatgaca aagcttgccc
     8821 attgattgct gcagtcataa caagagaagt gggttttgtc gtgcctggtt tgcctggcac
     8881 gatattacgc acaactaatg gtgacttttt gcatttctta cctagagttt ttagtgcagt
     8941 tggtaacatc tgttacacac catcaaaact tatagagtac actgactttg caacatcagc
     9001 ttgtgttttg gctgctgaat gtacaatttt taaagatgct tctggtaagc cagtaccata
     9061 ttgttatgat accaatgtac tagaaggttc tgttgcttat gaaagtttac gccctgacac
     9121 acgttatgtg ctcatggatg gctctattat tcaatttcct aacacctacc ttgaaggttc
     9181 tgttagagtg gtaacaactt ttgattctga gtactgtagg cacggcactt gtgaaagatc
     9241 agaagctggt gtttgtgtat ctactagtgg tagatgggta cttaacaatg attattacag
     9301 atctttacca ggagttttct gtggtgtaga tgctgtaaat ttacttacta atatgtttac
     9361 accactaatt caacctattg gtgctttgga catatcagca tctatagtag ctggtggtat
     9421 tgtagctatc gtagtaacat gccttgccta ctattttatg aggtttagaa gagcttttgg
     9481 tgaatacagt catgtagttg cctttaatac tttactattc cttatgtcat tcactgtact
     9541 ctgtttaaca ccagtttact cattcttacc tggtgtttat tctgttattt acttgtactt
     9601 gacattttat cttactaatg atgtttcttt tttagcacat attcagtgga tggttatgtt
     9661 cacaccttta gtacctttct ggataacaat tgcttatatc atttgtattt ccacaaagca
     9721 tttctattgg ttctttagta attacctaaa gagacgtgta gtctttaatg gtgtttcctt
     9781 tagtactttt gaagaagctg cgctgtgcac ctttttgtta aataaagaaa tgtatctaaa
     9841 gttgcgtagt gatgtgctat tacctcttac gcaatataat agatacttag ctctttataa
     9901 taagtacaag tattttagtg gagcaatgga tacaactagc tacagagaag ctgcttgttg
     9961 tcatctcgca aaggctctca atgacttcag taactcaggt tctgatgttc tttaccaacc
    10021 accacaaacc tctatcacct cagctgtttt gcagagtggt tttagaaaaa tggcattccc
    10081 atctggtaaa gttgagggtt gtatggtaca agtaacttgt ggtacaacta cacttaacgg
    10141 tctttggctt gatgacgtag tttactgtcc aagacatgtg atctgcacct ctgaagacat
    10201 gcttaaccct aattatgaag atttactcat tcgtaagtct aatcataatt tcttggtaca
    10261 ggctggtaat gttcaactca gggttattgg acattctatg caaaattgtg tacttaagct
    10321 taaggttgat acagccaatc ctaagacacc taagtataag tttgttcgca ttcaaccagg
    10381 acagactttt tcagtgttag cttgttacaa tggttcacca tctggtgttt accaatgtgc
    10441 tatgaggccc aatttcacta ttaagggttc attccttaat ggttcatgtg gtagtgttgg
    10501 ttttaacata gattatgact gtgtctcttt ttgttacatg caccatatgg aattaccaac
    10561 tggagttcat gctggcacag acttagaagg taacttttat ggaccttttg ttgacaggca
    10621 aacagcacaa gcagctggta cggacacaac tattacagtt aatgttttag cttggttgta
    10681 cgctgctgtt ataaatggag acaggtggtt tctcaatcga tttaccacaa ctcttaatga
    10741 ctttaacctt gtggctatga agtacaatta tgaacctcta acacaagacc atgttgacat
    10801 actaggacct ctttctgctc aaactggaat tgccgtttta gatatgtgtg cttcattaaa
    10861 agaattactg caaaatggta tgaatggacg taccatattg ggtagtgctt tattagaaga
    10921 tgaatttaca ccttttgatg ttgttagaca atgctcaggt gttactttcc aaagtgcagt
    10981 gaaaagaaca atcaagggta cacaccactg gttgttactc acaattttga cttcactttt
    11041 agttttagtc cagagtactc aatggtcttt gttctttttt ttgtatgaaa atgccttttt
    11101 accttttgct atgggtatta ttgctatgtc tgcttttgca atgatgtttg tcaaacataa
    11161 gcatgcattt ctctgtttgt ttttgttacc ttctcttgcc actgtagctt attttaatat
    11221 ggtctatatg cctgctagtt gggtgatgcg tattatgaca tggttggata tggttgatac
    11281 tagtttgtct ggttttaagc taaaagactg tgttatgtat gcatcagctg tagtgttact
    11341 aatccttatg acagcaagaa ctgtgtatga tgatggtgct aggagagtgt ggacacttat
    11401 gaatgtcttg acactcgttt ataaagttta ttatggtaat gctttagatc aagccatttc
    11461 catgtgggct cttataatct ctgttacttc taactactca ggtgtagtta caactgtcat
    11521 gtttttggcc agaggtattg tttttatgtg tgttgagtat tgccctattt tcttcataac
    11581 tggtaataca cttcagtgta taatgctagt ttattgtttc ttaggctatt tttgtacttg
    11641 ttactttggc ctcttttgtt tactcaaccg ctactttaga ctgactcttg gtgtttatga
    11701 ttacttagtt tctacacagg agtttagata tatgaattca cagggactac tcccacccaa
    11761 gaatagcata gatgccttca aactcaacat taaattgttg ggtgttggtg gcaaaccttg
    11821 tatcaaagta gccactgtac agtctaaaat gtcagatgta aagtgcacat cagtagtctt
    11881 actctcagtt ttgcaacaac tcagagtaga atcatcatct aaattgtggg ctcaatgtgt
    11941 ccagttacac aatgacattc tcttagctaa agatactact gaagcctttg aaaaaatggt
    12001 ttcactactt tctgttttgc tttccatgca gggtgctgta gacataaaca agctttgtga
    12061 agaaatgctg gacaacaggg caaccttaca agctatagcc tcagagttta gttcccttcc
    12121 atcatatgca gcttttgcta ctgctcaaga agcttatgag caggctgttg ctaatggtga
    12181 ttctgaagtt gttcttaaaa agttgaagaa gtctttgaat gtggctaaat ctgaatttga
    12241 ccgtgatgca gccatgcaac gtaagttgga aaagatggct gatcaagcta tgacccaaat
    12301 gtataaacag gctagatctg aggacaagag ggcaaaagtt actagtgcta tgcagacaat
    12361 gcttttcact atgcttagaa agttggataa tgatgcactc aacaacatta tcaacaatgc
    12421 aagagatggt tgtgttccct tgaacataat acctcttaca acagcagcca aactaatggt
    12481 tgtcatacca gactataaca catataaaaa tacgtgtgat ggtacaacat ttacttatgc
    12541 atcagcattg tgggaaatcc aacaggttgt agatgcagat agtaaaattg ttcaacttag
    12601 tgaaattagt atggacaatt cacctaattt agcatggcct cttattgtaa cagctttaag
    12661 ggccaattct gctgtcaaat tacagaataa tgagcttagt cctgttgcac tacgacagat
    12721 gtcttgtgct gccggtacta cacaaactgc ttgcactgat gacaatgcgt tagcttacta
    12781 caacacaaca aagggaggta ggtttgtact tgcactgtta tccgatttac aggatttgaa
    12841 atgggctaga ttccctaaga gtgatggaac tggtactatc tatacagaac tggaaccacc
    12901 ttgtaggttt gttacagaca cacctaaagg tcctaaagtg aagtatttat actttattaa
    12961 aggattaaac aacctaaata gaggtatggt acttggtagt ttagctgcca cagtacgtct
    13021 acaagctggt aatgcaacag aagtgcctgc caattcaact gtattatctt tctgtgcttt
    13081 tgctgtagat gctgctaaag cttacaaaga ttatctagct agtgggggac aaccaatcac
    13141 taattgtgtt aagatgttgt gtacacacac tggtactggt caggcaataa cagttacacc
    13201 ggaagccaat atggatcaag aatcctttgg tggtgcatcg tgttgtctgt actgccgttg
    13261 ccacatagat catccaaatc ctaaaggatt ttgtgactta aaaggtaagt atgtacaaat
    13321 acctacaact tgtgctaatg accctgtggg ttttacactt aaaaacacag tctgtaccgt
    13381 ctgcggtatg tggaaaggtt atggctgtag ttgtgatcaa ctccgcgaac ccatgcttca
    13441 gtcagctgat gcacaatcgt ttttaaacgg gtttgcggtg taagtgcagc ccgtcttaca
    13501 ccgtgcggca caggcactag tactgatgtc gtatacaggg cttttgacat ctacaatgat
    13561 aaagtagctg gttttgctaa attcctaaaa actaattgtt gtcgcttcca agaaaaggac
    13621 gaagatgaca atttaattga ttcttacttt gtagttaaga gacacacttt ctctaactac
    13681 caacatgaag aaacaattta taatttactt aaggattgtc cagctgttgc taaacatgac
    13741 ttctttaagt ttagaataga cggtgacatg gtaccacata tatcacgtca acgtcttact
    13801 aaatacacaa tggcagacct cgtctatgct ttaaggcatt ttgatgaagg taattgtgac
    13861 acattaaaag aaatacttgt cacatacaat tgttgtgatg atgattattt caataaaaag
    13921 gactggtatg attttgtaga aaacccagat atattacgcg tatacgccaa cttaggtgaa
    13981 cgtgtacgcc aagctttgtt aaaaacagta caattctgtg atgccatgcg aaatgctggt
    14041 attgttggtg tactgacatt agataatcaa gatctcaatg gtaactggta tgatttcggt
    14101 gatttcatac aaaccacgcc aggtagtgga gttcctgttg tagattctta ttattcattg
    14161 ttaatgccta tattaacctt gaccagggct ttaactgcag agtcacatgt tgacactgac
    14221 ttaacaaagc cttacattaa gtgggatttg ttaaaatatg acttcacgga agagaggtta
    14281 aaactctttg accgttattt taaatattgg gatcagacat accacccaaa ttgtgttaac
    14341 tgtttggatg acagatgcat tctgcattgt gcaaacttta atgttttatt ctctacagtg
    14401 ttcccaccta caagttttgg accactagtg agaaaaatat ttgttgatgg tgttccattt
    14461 gtagtttcaa ctggatacca cttcagagag ctaggtgttg tacataatca ggatgtaaac
    14521 ttacatagct ctagacttag ttttaaggaa ttacttgtgt atgctgctga ccctgctatg
    14581 cacgctgctt ctggtaatct attactagat aaacgcacta cgtgcttttc agtagctgca
    14641 cttactaaca atgttgcttt tcaaactgtc aaacccggta attttaacaa agacttctat
    14701 gactttgctg tgtctaaggg tttctttaag gaaggaagtt ctgttgaatt aaaacacttc
    14761 ttctttgctc aggatggtaa tgctgctatc agcgattatg actactatcg ttataatcta
    14821 ccaacaatgt gtgatatcag acaactacta tttgtagttg aagttgttga taagtacttt
    14881 gattgttacg atggtggctg tattaatgct aaccaagtca tcgtcaacaa cctagacaaa
    14941 tcagctggtt ttccatttaa taaatggggt aaggctagac tttattatga ttcaatgagt
    15001 tatgaggatc aagatgcact tttcgcatat acaaaacgta atgtcatccc tactataact
    15061 caaatgaatc ttaagtatgc cattagtgca aagaatagag ctcgcaccgt agctggtgtc
    15121 tctatctgta gtactatgac caatagacag tttcatcaaa aattattgaa atcaatagcc
    15181 gccactagag gagctactgt agtaattgga acaagcaaat tctatggtgg ttggcacaac
    15241 atgttaaaaa ctgtttatag tgatgtagaa aaccctcacc ttatgggttg ggattatcct
    15301 aaatgtgata gagccatgcc taacatgctt agaattatgg cctcacttgt tcttgctcgc
    15361 aaacatacaa cgtgttgtag cttgtcacac cgtttctata gattagctaa tgagtgtgct
    15421 caagtattga gtgaaatggt catgtgtggc ggttcactat atgttaaacc aggtggaacc
    15481 tcatcaggag atgccacaac tgcttatgct aatagtgttt ttaacatttg tcaagctgtc
    15541 acggccaatg ttaatgcact tttatctact gatggtaaca aaattgccga taagtatgtc
    15601 cgcaatttac aacacagact ttatgagtgt ctctatagaa atagagatgt tgacacagac
    15661 tttgtgaatg agttttacgc atatttgcgt aaacatttct caatgatgat actctctgac
    15721 gatgctgttg tgtgtttcaa tagcacttat gcatctcaag gtctagtggc tagcataaag
    15781 aactttaagt cagttcttta ttatcaaaac aatgttttta tgtctgaagc aaaatgttgg
    15841 actgagactg accttactaa aggacctcat gaattttgct ctcaacatac aatgctagtt
    15901 aaacagggtg atgattatgt gtaccttcct tacccagatc catcaagaat cctaggggcc
    15961 ggctgttttg tagatgatat cgtaaaaaca gatggtacac ttatgattga acggttcgtg
    16021 tctttagcta tagatgctta cccacttact aaacatccta atcaggagta tgctgatgtc
    16081 tttcatttgt acttacaata cataagaaag ctacatgatg agttaacagg acacatgtta
    16141 gacatgtatt ctgttatgct tactaatgat aacacttcaa ggtattggga acctgagttt
    16201 tatgaggcta tgtacacacc gcatacagtc ttacaggctg ttggggcttg tgttctttgc
    16261 aattcacaga cttcattaag atgtggtgct tgcatacgta gaccattctt atgttgtaaa
    16321 tgctgttacg accatgtcat atcaacatca cataaattag tcttgtctgt taatccgtat
    16381 gtttgcaatg ctccaggttg tgatgtcaca gatgtgactc aactttactt aggaggtatg
    16441 agctattatt gtaaatcaca taaaccaccc attagttttc cattgtgtgc taatggacaa
    16501 gtttttggtt tatataaaaa tacatgtgtt ggtagcgata atgttactga ctttaatgca
    16561 attgcaacat gtgactggac aaatgctggt gattacattt tagctaacac ctgtactgaa
    16621 agactcaagc tttttgcagc agaaacgctc aaagctactg aggagacatt taaactgtct
    16681 tatggtattg ctactgtacg tgaagtgctg tctgacagag aattacatct ttcatgggaa
    16741 gttggtaaac ctagaccacc acttaaccga aattatgtct ttactggtta tcgtgtaact
    16801 aaaaacagta aagtacaaat aggagagtac acctttgaaa aaggtgacta tggtgatgct
    16861 gttgtttacc gaggtacaac aacttacaaa ttaaatgttg gtgattattt tgtgctgaca
    16921 tcacatacag taatgccatt aagtgcacct acactagtgc cacaagagca ctatgttaga
    16981 attactggct tatacccaac actcaatatc tcagatgagt tttctagcaa tgttgcaaat
    17041 tatcaaaagg ttggtatgca aaagtattct acactccagg gaccacctgg tactggtaag
    17101 agtcattttg ctattggcct agctctctac tacccttctg ctcgcatagt gtatacagct
    17161 tgctctcatg ccgctgttga tgcactatgt gagaaggcat taaaatattt gcctatagat
    17221 aaatgtagta gaattatacc tgcacgtgct cgtgtagagt gttttgataa attcaaagtg
    17281 aattcaacat tagaacagta tgtcttttgt actgtaaatg cattgcctga gacgacagca
    17341 gatatagttg tctttgatga aatttcaatg gccacaaatt atgatttgag tgttgtcaat
    17401 gccagattac gtgctaagca ctatgtgtac attggcgacc ctgctcaatt acctgcacca
    17461 cgcacattgc taactaaggg cacactagaa ccagaatatt tcaattcagt gtgtagactt
    17521 atgaaaacta taggtccaga catgttcctc ggaacttgtc ggcgttgtcc tgctgaaatt
    17581 gttgacactg tgagtgcttt ggtttatgat aataagctta aagcacataa agacaaatca
    17641 gctcaatgct ttaaaatgtt ttataagggt gttatcacgc atgatgtttc atctgcaatt
    17701 aacaggccac aaataggcgt ggtaagagaa ttccttacac gtaaccctgc ttggagaaaa
    17761 gctgtcttta tttcacctta taattcacag aatgctgtag cctcaaagat tttgggacta
    17821 ccaactcaaa ctgttgattc atcacagggc tcagaatatg actatgtcat attcactcaa
    17881 accactgaaa cagctcactc ttgtaatgta aacagattta atgttgctat taccagagca
    17941 aaagtaggca tactttgcat aatgtctgat agagaccttt atgacaagtt gcaatttaca
    18001 agtcttgaaa ttccacgtag gaatgtggca actttacaag ctgaaaatgt aacaggactc
    18061 tttaaagatt gtagtaaggt aatcactggg ttacatccta cacaggcacc tacacacctc
    18121 agtgttgaca ctaaattcaa aactgaaggt ttatgtgttg acatacctgg catacctaag
    18181 gacatgacct atagaagact catctctatg atgggtttta aaatgaatta tcaagttaat
    18241 ggttacccta acatgtttat cacccgcgaa gaagctataa gacatgtacg tgcatggatt
    18301 ggcttcgatg tcgaggggtg tcatgctact agagaagctg ttggtaccaa tttaccttta
    18361 cagctaggtt tttctacagg tgttaaccta gttgctgtac ctacaggtta tgttgataca
    18421 cctaataata cagatttttc cagagttagt gctaaaccac cgcctggaga tcaatttaaa
    18481 cacctcatac cacttatgta caaaggactt ccttggaatg tagtgcgtat aaagattgta
    18541 caaatgttaa gtgacacact taaaaatctc tctgacagag tcgtatttgt cttatgggca
    18601 catggctttg agttgacatc tatgaagtat tttgtgaaaa taggacctga gcgcacctgt
    18661 tgtctatgtg atagacgtgc cacatgcttt tccactgctt cagacactta tgcctgttgg
    18721 catcattcta ttggatttga ttacgtctat aatccgttta tgattgatgt tcaacaatgg
    18781 ggttttacag gtaacctaca aagcaaccat gatctgtatt gtcaagtcca tggtaatgca
    18841 catgtagcta gttgtgatgc aatcatgact aggtgtctag ctgtccacga gtgctttgtt
    18901 aagcgtgttg actggactat tgaatatcct ataattggtg atgaactgaa gattaatgcg
    18961 gcttgtagaa aggttcaaca catggttgtt aaagctgcat tattagcaga caaattccca
    19021 gttcttcacg acattggtaa ccctaaagct attaagtgtg tacctcaagc tgatgtagaa
    19081 tggaagttct atgatgcaca gccttgtagt gacaaagctt ataaaataga agaattattc
    19141 tattcttatg ccacacattc tgacaaattc acagatggtg tatgcctatt ttggaattgc
    19201 aatgtcgata gatatcctgc taattccatt gtttgtagat ttgacactag agtgctatct
    19261 aaccttaact tgcctggttg tgatggtggc agtttgtatg taaataaaca tgcattccac
    19321 acaccagctt ttgataaaag tgcttttgtt aatttaaaac aattaccatt tttctattac
    19381 tctgacagtc catgtgagtc tcatggaaaa caagtagtgt cagatataga ttatgtacca
    19441 ctaaagtctg ctacgtgtat aacacgttgc aatttaggtg gtgctgtctg tagacatcat
    19501 gctaatgagt acagattgta tctcgatgct tataacatga tgatctcagc tggctttagc
    19561 ttgtgggttt acaaacaatt tgatacttat aacctctgga acacttttac aagacttcag
    19621 agtttagaaa atgtggcttt taatgttgta aataagggac actttgatgg acaacagggt
    19681 gaagtaccag tttctatcat taataacact gtttacacaa aagttgatgg tgttgatgta
    19741 gaattgtttg aaaataaaac aacattacct gttaatgtag catttgagct ttgggctaag
    19801 cgcaacatta aaccagtacc agaggtgaaa atactcaata atttgggtgt ggacattgct
    19861 gctaatactg tgatctggga ctacaaaaga gatgctccag cacatatatc tactattggt
    19921 gtttgttcta tgactgacat agccaagaaa ccaactgaaa cgatttgtgc accactcact
    19981 gtcttttttg atggtagagt tgatggtcaa gtagacttat ttagaaatgc ccgtaatggt
    20041 gttcttatta cagaaggtag tgttaaaggt ttacaaccat ctgtaggtcc caaacaagct
    20101 agtcttaatg gagtcacatt aattggagaa gccgtaaaaa cacagttcaa ttattataag
    20161 aaagttgatg gtgttgtcca acaattacct gaaacttact ttactcagag tagaaattta
    20221 caagaattta aacccaggag tcaaatggaa attgatttct tagaattagc tatggatgaa
    20281 ttcattgaac ggtataaatt agaaggctat gccttcgaac atatcgttta tggagatttt
    20341 agtcatagtc agttaggtgg tttacatcta ctgattggac tagctaaacg ttttaaggaa
    20401 tcaccttttg aattagaaga ttttattcct atggacagta cagttaaaaa ctatttcata
    20461 acagatgcgc aaacaggttc atctaagtgt gtgtgttctg ttattgattt attacttgat
    20521 gattttgttg aaataataaa atcccaagat ttatctgtag tttctaaggt tgtcaaagtg
    20581 actattgact atacagaaat ttcatttatg ctttggtgta aagatggcca tgtagaaaca
    20641 ttttacccaa aattacaatc tagtcaagcg tggcaaccgg gtgttgctat gcctaatctt
    20701 tacaaaatgc aaagaatgct attagaaaag tgtgaccttc aaaattatgg tgatagtgca
    20761 acattaccta aaggcataat gatgaatgtc gcaaaatata ctcaactgtg tcaatattta
    20821 aacacattaa cattagctgt accctataat atgagagtta tacattttgg tgctggttct
    20881 gataaaggag ttgcaccagg tacagctgtt ttaagacagt ggttgcctac gggtacgctg
    20941 cttgtcgatt cagatcttaa tgactttgtc tctgatgcag attcaacttt gattggtgat
    21001 tgtgcaactg tacatacagc taataaatgg gatctcatta ttagtgatat gtacgaccct
    21061 aagactaaaa atgttacaaa agaaaatgac tctaaagagg gttttttcac ttacatttgt
    21121 gggtttatac aacaaaagct agctcttgga ggttccgtgg ctataaagat aacagaacat
    21181 tcttggaatg ctgatcttta taagctcatg ggacacttcg catggtggac agcctttgtt
    21241 actaatgtga atgcgtcatc atctgaagca tttttaattg gatgtaatta tcttggcaaa
    21301 ccacgcgaac aaatagatgg ttatgtcatg catgcaaatt acatattttg gaggaataca
    21361 aatccaattc agttgtcttc ctattcttta tttgacatga gtaaatttcc ccttaaatta
    21421 aggggtactg ctgttatgtc tttaaaagaa ggtcaaatca atgatatgat tttatctctt
    21481 cttagtaaag gtagacttat aattagagaa aacaacagag ttgttatttc tagtgatgtt
    21541 cttgttaaca actaaacgaa caatgtttgt ttttcttgtt ttattgccac tagtctctag
    21601 tcagtgtgtt aatcttacaa ccagaactca attaccccct gcatacacta attctttcac
    21661 acgtggtgtt tattaccctg acaaagtttt cagatcctca gttttacatt caactcagga
    21721 cttgttctta cctttctttt ccaatgttac ttggttccat gctatacatg tctctgggac
    21781 caatggtact aagaggtttg ataaccctgt cctaccattt aatgatggtg tttattttgc
    21841 ttccactgag aagtctaaca taataagagg ctggattttt ggtactactt tagattcgaa
    21901 gacccagtcc ctacttattg ttaataacgc tactaatgtt gttattaaag tctgtgaatt
    21961 tcaattttgt aatgatccat ttttgggtgt ttattaccac aaaaacaaca aaagttggat
    22021 ggaaagtgag ttcagagttt attctagtgc gaataattgc acttttgaat atgtctctca
    22081 gccttttctt atggaccttg aaggaaaaca gggtaatttc aaaaatctta gggaatttgt
    22141 gtttaagaat attgatggtt attttaaaat atattctaag cacacgccta ttaatttagt
    22201 gcgtgatctc cctcagggtt tttcggcttt agaaccattg gtagatttgc caataggtat
    22261 taacatcact aggtttcaaa ctttacttgc tttacataga agttatttga ctcctggtga
    22321 ttcttcttca ggttggacag ctggtgctgc agcttattat gtgggttatc ttcaacctag
    22381 gacttttcta ttaaaatata atgaaaatgg aaccattaca gatgctgtag actgtgcact
    22441 tgaccctctc tcagaaacaa agtgtacgtt gaaatccttc actgtagaaa aaggaatcta
    22501 tcaaacttct aactttagag tccaaccaac agaatctatt gttagatttc ctaatattac
    22561 aaacttgtgc ccttttggtg aagtttttaa cgccaccaga tttgcatctg tttatgcttg
    22621 gaacaggaag agaatcagca actgtgttgc tgattattct gtcctatata attccgcatc
    22681 attttccact tttaagtgtt atggagtgtc tcctactaaa ttaaatgatc tctgctttac
    22741 taatgtctat gcagattcat ttgtaattag aggtgatgaa gtcagacaaa tcgctccagg
    22801 gcaaactgga aagattgctg attataatta taaattacca gatgatttta caggctgcgt
    22861 tatagcttgg aattctaaca atcttgattc taaggttggt ggtaattata attacctgta
    22921 tagattgttt aggaagtcta atctcaaacc ttttgagaga gatatttcaa ctgaaatcta
    22981 tcaggccggt agcacacctt gtaatggtgt tgaaggtttt aattgttact ttcctttaca
    23041 atcatatggt ttccaaccca ctaatggtgt tggttaccaa ccatacagag tagtagtact
    23101 ttcttttgaa cttctacatg caccagcaac tgtttgtgga cctaaaaagt ctactaattt
    23161 ggttaaaaac aaatgtgtca atttcaactt caatggttta acaggcacag gtgttcttac
    23221 tgagtctaac aaaaagtttc tgcctttcca acaatttggc agagacattg ctgacactac
    23281 tgatgctgtc cgtgatccac agacacttga gattcttgac attacaccat gttcttttgg
    23341 tggtgtcagt gttataacac caggaacaaa tacttctaac caggttgctg ttctttatca
    23401 ggatgttaac tgcacagaag tccctgttgc tattcatgca gatcaactta ctcctacttg
    23461 gcgtgtttat tctacaggtt ctaatgtttt tcaaacacgt gcaggctgtt taataggggc
    23521 tgaacatgtc aacaactcat atgagtgtga catacccatt ggtgcaggta tatgcgctag
    23581 ttatcagact cagactaatt ctcctcggcg ggcacgtagt gtagctagtc aatccatcat
    23641 tgcctacact atgtcacttg gtgcagaaaa ttcagttgct tactctaata actctattgc
    23701 catacccaca aattttacta ttagtgttac cacagaaatt ctaccagtgt ctatgaccaa
    23761 gacatcagta gattgtacaa tgtacatttg tggtgattca actgaatgca gcaatctttt
    23821 gttgcaatat ggcagttttt gtacacaatt aaaccgtgct ttaactggaa tagctgttga
    23881 acaagacaaa aacacccaag aagtttttgc acaagtcaaa caaatttaca aaacaccacc
    23941 aattaaagat tttggtggtt ttaatttttc acaaatatta ccagatccat caaaaccaag
    24001 caagaggtca tttattgaag atctactttt caacaaagtg acacttgcag atgctggctt
    24061 catcaaacaa tatggtgatt gccttggtga tattgctgct agagacctca tttgtgcaca
    24121 aaagtttaac ggccttactg ttttgccacc tttgctcaca gatgaaatga ttgctcaata
    24181 cacttctgca ctgttagcgg gtacaatcac ttctggttgg acctttggtg caggtgctgc
    24241 attacaaata ccatttgcta tgcaaatggc ttataggttt aatggtattg gagttacaca
    24301 gaatgttctc tatgagaacc aaaaattgat tgccaaccaa tttaatagtg ctattggcaa
    24361 aattcaagac tcactttctt ccacagcaag tgcacttgga aaacttcaag atgtggtcaa
    24421 ccaaaatgca caagctttaa acacgcttgt taaacaactt agctccaatt ttggtgcaat
    24481 ttcaagtgtt ttaaatgata tcctttcacg tcttgacaaa gttgaggctg aagtgcaaat
    24541 tgataggttg atcacaggca gacttcaaag tttgcagaca tatgtgactc aacaattaat
    24601 tagagctgca gaaatcagag cttctgctaa tcttgctgct actaaaatgt cagagtgtgt
    24661 acttggacaa tcaaaaagag ttgatttttg tggaaagggc tatcatctta tgtccttccc
    24721 tcagtcagca cctcatggtg tagtcttctt gcatgtgact tatgtccctg cacaagaaaa
    24781 gaacttcaca actgctcctg ccatttgtca tgatggaaaa gcacactttc ctcgtgaagg
    24841 tgtctttgtt tcaaatggca cacactggtt tgtaacacaa aggaattttt atgaaccaca
    24901 aatcattact acagacaaca catttgtgtc tggtaactgt gatgttgtaa taggaattgt
    24961 caacaacaca gtttatgatc ctttgcaacc tgaattagac tcattcaagg aggagttaga
    25021 taaatatttt aagaatcata catcaccaga tgttgattta ggtgacatct ctggcattaa
    25081 tgcttcagtt gtaaacattc aaaaagaaat tgaccgcctc aatgaggttg ccaagaattt
    25141 aaatgaatct ctcatcgatc tccaagaact tggaaagtat gagcagtata taaaatggcc
    25201 atggtacatt tggctaggtt ttatagctgg cttgattgcc atagtaatgg tgacaattat
    25261 gctttgctgt atgaccagtt gctgtagttg tctcaagggc tgttgttctt gtggatcctg
    25321 ctgcaaattt gatgaagacg actctgagcc agtgctcaaa ggagtcaaat tacattacac
    25381 ataaacgaac ttatggattt gtttatgaga atcttcacaa ttggaactgt aactttgaag
    25441 caaggtgaaa tcaaggatgc tactccttca gattttgttc gcgctactgc aacgataccg
    25501 atacaagcct cactcccttt cggatggctt attgttggcg ttgcacttct tgctgttttt
    25561 cagagcgctt ccaaaatcat aaccctcaaa aagagatggc aactagcact ctccaagggt
    25621 gttcactttg tttgcaactt gctgttgttg tttgtaacag tttactcaca ccttttgctc
    25681 gttgctgctg gccttgaagc cccttttctc tatctttatg ctttagtcta cttcttgcag
    25741 agtataaact ttgtaagaat aataatgagg ctttggcttt gctggaaatg ccgttccaaa
    25801 aacccattac tttatgatgc caactatttt ctttgctggc atactaattg ttacgactat
    25861 tgtatacctt acaatagtgt aacttcttca attgtcatta cttcaggtga tggcacaaca
    25921 agtcctattt ctgaacatga ctaccagatt ggtggttata ctgaaaaatg ggaatctgga
    25981 gtaaaagact gtgttgtatt acacagttac ttcacttcag actattacca gctgtactca
    26041 actcaattga gtacagacac tggtgttgaa catgttacct tcttcatcta caataaaatt
    26101 gttgatgagc ctgaagaaca tgtccaaatt cacacaatcg acggttcatc cggagttgtt
    26161 aatccagtaa tggaaccaat ttatgatgaa ccgacgacga ctactagcgt gcctttgtaa
    26221 gcacaagctg atgagtacga acttatgtac tcattcgttt cggaagagac aggtacgtta
    26281 atagttaata gcgtacttct ttttcttgct ttcgtggtat tcttgctagt tacactagcc
    26341 atccttactg cgcttcgatt gtgtgcgtac tgctgcaata ttgttaacgt gagtcttgta
    26401 aaaccttctt tttacgttta ctctcgtgtt aaaaatctga attcttctag agttcctgat
    26461 cttctggtct aaacgaacta aatattatat tagtttttct gtttggaact ttaattttag
    26521 ccatggcaga ttccaacggt actattaccg ttgaagagct taaaaagctc cttgaacaat
    26581 ggaacctagt aataggtttc ctattcctta catggatttg tcttctacaa tttgcctatg
    26641 ccaacaggaa taggtttttg tatataatta agttaatttt cctctggctg ttatggccag
    26701 taactttagc ttgttttgtg cttgctgctg tttacagaat aaattggatc accggtggaa
    26761 ttgctatcgc aatggcttgt cttgtaggct tgatgtggct cagctacttc attgcttctt
    26821 tcagactgtt tgcgcgtacg cgttccatgt ggtcattcaa tccagaaact aacattcttc
    26881 tcaacgtgcc actccatggc actattctga ccagaccgct tctagaaagt gaactcgtaa
    26941 tcggagctgt gatccttcgt ggacatcttc gtattgctgg acaccatcta ggacgctgtg
    27001 acatcaagga cctgcctaaa gaaatcactg ttgctacatc acgaacgctt tcttattaca
    27061 aattgggagc ttcgcagcgt gtagcaggtg actcaggttt tgctgcatac agtcgctaca
    27121 ggattggcaa ctataaatta aacacagacc attccagtag cagtgacaat attgctttgc
    27181 ttgtacagta agtgacaaca gatgtttcat ctcgttgact ttcaggttac tatagcagag
    27241 atattactaa ttattatgag gacttttaaa gtttccattt ggaatcttga ttacatcata
    27301 aacctcataa ttaaaaattt atctaagtca ctaactgaga ataaatattc tcaattagat
    27361 gaagagcaac caatggagat tgattaaacg aacatgaaaa ttattctttt cttggcactg
    27421 ataacactcg ctacttgtga gctttatcac taccaagagt gtgttagagg tacaacagta
    27481 cttttaaaag aaccttgctc ttctggaaca tacgagggca attcaccatt tcatcctcta
    27541 gctgataaca aatttgcact gacttgcttt agcactcaat ttgcttttgc ttgtcctgac
    27601 ggcgtaaaac acgtctatca gttacgtgcc agatcagttt cacctaaact gttcatcaga
    27661 caagaggaag ttcaagaact ttactctcca atttttctta ttgttgcggc aatagtgttt
    27721 ataacacttt gcttcacact caaaagaaag acagaatgat tgaactttca ttaattgact
    27781 tctatttgtg ctttttagcc tttctgctat tccttgtttt aattatgctt attatctttt
    27841 ggttctcact tgaactgcaa gatcataatg aaacttgtca cgcctaaacg aacatgaaat
    27901 ttcttgtttt cttaggaatc atcacaactg tagctgcatt tcaccaagaa tgtagtttac
    27961 agtcatgtac tcaacatcaa ccatatgtag ttgatgaccc gtgtcctatt cacttctatt
    28021 ctaaatggta tattagagta ggagctagaa aatcagcacc tttaattgaa ttgtgcgtgg
    28081 atgaggctgg ttctaaatca cccattcagt acatcgatat cggtaattat acagtttcct
    28141 gtttaccttt tacaattaat tgccaggaac ctaaattggg tagtcttgta gtgcgttgtt
    28201 cgttctatga agacttttta gagtatcatg acgttcgtgt tgttttagat ttcatctaaa
    28261 cgaacaaact aaaatgtctg ataatggacc ccaaaatcag cgaaatgcac cccgcattac
    28321 gtttggtgga ccctcagatt caactggcag taaccagaat ggagaacgca gtggggcgcg
    28381 atcaaaacaa cgtcggcccc aaggtttacc caataatact gcgtcttggt tcaccgctct
    28441 cactcaacat ggcaaggaag accttaaatt ccctcgagga caaggcgttc caattaacac
    28501 caatagcagt ccagatgacc aaattggcta ctaccgaaga gctaccagac gaattcgtgg
    28561 tggtgacggt aaaatgaaag atctcagtcc aagatggtat ttctactacc taggaactgg
    28621 gccagaagct ggacttccct atggtgctaa caaagacggc atcatatggg ttgcaactga
    28681 gggagccttg aatacaccaa aagatcacat tggcacccgc aatcctgcta acaatgctgc
    28741 aatcgtgcta caacttcctc aaggaacaac attgccaaaa ggcttctacg cagaagggag
    28801 cagaggcggc agtcaagcct cttctcgttc ctcatcacgt agtcgcaaca gttcaagaaa
    28861 ttcaactcca ggcagcagta ggggaacttc tcctgctaga atggctggca atggcggtga
    28921 tgctgctctt gctttgctgc tgcttgacag attgaaccag cttgagagca aaatgtctgg
    28981 taaaggccaa caacaacaag gccaaactgt cactaagaaa tctgctgctg aggcttctaa
    29041 gaagcctcgg caaaaacgta ctgccactaa agcatacaat gtaacacaag ctttcggcag
    29101 acgtggtcca gaacaaaccc aaggaaattt tggggaccag gaactaatca gacaaggaac
    29161 tgattacaaa cattggccgc aaattgcaca atttgccccc agcgcttcag cgttcttcgg
    29221 aatgtcgcgc attggcatgg aagtcacacc ttcgggaacg tggttgacct acacaggtgc
    29281 catcaaattg gatgacaaag atccaaattt caaagatcaa gtcattttgc tgaataagca
    29341 tattgacgca tacaaaacat tcccaccaac agagcctaaa aaggacaaaa agaagaaggc
    29401 tgatgaaact caagccttac cgcagagaca gaagaaacag caaactgtga ctcttcttcc
    29461 tgctgcagat ttggatgatt tctccaaaca attgcaacaa tccatgagca gtgctgactc
    29521 aactcaggcc taaactcatg cagaccacac aaggcagatg ggctatataa acgttttcgc
    29581 ttttccgttt acgatatata gtctactctt gtgcagaatg aattctcgta actacatagc
    29641 acaagtagat gtagttaact ttaatctcac atagcaatct ttaatcagtg tgtaacatta
    29701 gggaggactt gaaagagcca ccacattttc accgaggcca cgcggagtac gatcgagtgt
    29761 acagtgaaca atgctaggga gagctgccta tatggaagag ccctaatgtg taaaattaat
    29821 tttagtagtg ctatccccat gtgattttaa tagcttctta ggagaatgac aaaaaaaaaa
    29881 aaaaaaaaaa aaaaaaaaaa aaa
//
DBGET integrated database retrieval system