LOCUS NC_003900 11824 bp ss-RNA linear VRL 13-AUG-2018
DEFINITION Aura virus, complete genome.
ACCESSION NC_003900
VERSION NC_003900.1
DBLINK BioProject: PRJNA485481
KEYWORDS RefSeq.
SOURCE Aura virus
ORGANISM Aura virus
Viruses; Riboviria; Orthornavirae; Kitrinoviricota; Alsuviricetes;
Martellivirales; Togaviridae; Alphavirus.
REFERENCE 1 (bases 1 to 11824)
AUTHORS Rumenapf,T., Strauss,E.G. and Strauss,J.H.
TITLE Aura virus is a New World representative of Sindbis-like viruses
JOURNAL Virology 208 (2), 621-633 (1995)
PUBMED 7747434
REFERENCE 2 (bases 1 to 11824)
CONSRTM NCBI Genome Project
TITLE Direct Submission
JOURNAL Submitted (05-MAY-2009) National Center for Biotechnology
Information, NIH, Bethesda, MD 20894, USA
REFERENCE 3 (bases 1 to 11824)
AUTHORS Ruemenapf,T.H.
TITLE Direct Submission
JOURNAL Submitted (08-FEB-1999) Vet. Virology, Justus-Liebig University,
Frankfurter Str. 107, Giessen D-35392, Germany
COMMENT VALIDATED REFSEQ: This record has undergone validation or
preliminary review. The reference sequence was derived from
AF126284.
The mature peptides were predicted and annotated at the NCBI by
analogy with the annotation of the genome of Semliki forest virus
(NC_003215) and other alphaviruses.
COMPLETENESS: full length.
FEATURES Location/Qualifiers
source 1..11824
/organism="Aura virus"
/mol_type="genomic RNA"
/db_xref="taxon:44158"
gene 78..7577
/locus_tag="AURAVgp1"
/db_xref="GeneID:944525"
CDS 78..7577
/locus_tag="AURAVgp1"
/inference="non-experimental evidence, no additional
details recorded"
/note="The mature peptides were predicted and annotated at
the NCBI by analogy with the annotation of the genome of
Semliki forest virus (NC_003215) and other alphaviruses;
Inferred from its homology with non-structural polyprotein
precursor nsP1234 of Sindbis virus (PMID 2521676 and PMID
3339717). Contains leaky opal termination codon"
/codon_start=1
/transl_except=(pos:5724..5726,aa:Trp)
/product="Polyprotein 1"
/protein_id="NP_632023.2"
/db_xref="GeneID:944525"
/translation="MEKPTVHVDVDPQSPFVLQLQKSFPQFEIVAQQVTPNDHANARA
FSHLASKLIEHEIPTSVTILDIGSAPARRMYSEHKYHCVCPMRSPEDPDRLMNYASRL
ADKAGEITNKRLHDKLADLKSVLESPDAETGTICFHNDVICRTTAEVSVMQNVYINAP
STIYHQALKGVRKLYWIGFDTTQFMFSSMAGSYPSYNTNWADERVLEARNIGLCSTKL
REGTMGKLSTFRKKALKPGTNVYFSVGSTLYPENRADLQSWHLPSVFHLKGKQSFTCR
CDTAVNCEGYVVKKITISPGITGRVNRYTVTNNSEGFLLCKITDTVKGERVSFPVCTY
IPPSICDQMTGILATDIQPEDAQKLLVGLNQRIVVNGKTNRNTNTMQNYLLPAVATGL
SKWAKERKADCSDEKPLNVRERKLAFGCLWAFKTKKIHSFYRPPGTQTIVKVAAEFSA
FPMSSVWTTSLPMSLRQKVKLLLVKKTNKPVVTITDTAVKNAQEAYNEAVETAEAEEK
AKALPPLKPTAPPVAEDVKCEVTDLVDDAGAALVETPRGKIKIIPQEGDVRIGSYTVI
SPAAVLRNQQLEPIHELAEQVKIITHGGRTGRYSVEPYDAKVLLPTGCPMSWQHFAAL
SESATLVYNEREFLNRKLHHIATKGAAKNTEEEQYKVCKAKDTDHEYVYDVDARKCVK
REHAQGLVLVGELTNPPYHELAYEGLRTRPAAPYHIETLGVIGTPGSGKSAIIKSTVT
LKDLVTSGKKENCKEIENDVQKMRGMTIATRTVDSVLLNGWKKAVDVLYVDEAFACHA
GTLMALIAIVKPRRKVVLCGDPKQWPFFNLMQLKVNFNNPERDLCTSTHYKYISRRCT
QPVTAIVSTLHYDGKMRTTNPCKRAIEIDVNGSTKPKKGDIVLTCFRGWVKQGQIDYP
GPGGHDRAASQGLTRRGVYAVRQKVNENPLYAEKSEHVNVLLTRTEDRIVWKTLQGDP
WIKYLTNVPKGNFTATLEEWQAEHEDIMKAINSTSTVSDPFASKVNTCWAKAIIPILR
TAGIELTFEQWEDLFPQFRNDQPYSVMYALDVICTKMFGMDLSSGIFSRPEIPLTFHP
ADVGRVRAHWDNSPGGQKFGYNKAVIPTCKKYPVYLRAGKGDQILPIYGRVSVPSARN
NLVPLNRNLPHSLTASLQKKEAAPLHKFLNQLPGHSMLLVSKETCYCVSKRITWVAPL
GVRGADHNHDLHFGFPPLSRYDLVVVNMGQPYRFHHYQQCEEHAGLMRTLARSALNCL
KPGGTLALKAYGFADSNSEDVVLSLARKFVRASAVRPSCTQFNTEMFFVFRQLDNDRE
RQFTQHHLNLAVSNIFDNYKDGSGAAPSYRVKRMNIADCTEEAVVNAANARGKPGDGV
CRAIFKKWPKSFENATTEVETAVMKPCHNKVVIHAVGPDFRKYTLEEATKLLQNAYHD
VAKIVNEKGISSVAIPLLSTGIYAAGADRLDLSLRCLFTALDRTDADVTIYCLDKKWE
QRIADAIRMREQVTELKDPDIEIDEGLTRVHPDSCLKDHIGYSTQYGKLYSYFEGTKF
HQTAKDIAEIRALFPDVQAANEQICLYTLGEPMESIREKCPVEDSPASAPPKTIPCLC
MYAMTAERICRVRSNSVTNITVCSSFPLPKYRIKNVQKIQCTKVVLFNPDVPPYIPAR
VYINKDEPPVTPHTDSPPDTCSSRLSLTPTLSNAESDIVSLTFSEIDSELSSLNEPAR
HVMISSFKLRYTAIQALPQKLSWMREDRTPRQPPPVPPPRPKRAAKLSRLANQLNELR
RHATISSVQAEVHYNSGFTPEAELNERGSILRKPPPVPPLRPKQTTNLSRLANQLSMP
ITFGDFAEGELDRLLTPSPTPTFGDFSQEEMDRFFGNRQYWLTGVGGYIFSSDTGPGH
LQQKSVIQNSTTEILIERSRLEKIHAPVLDLQKEEMLKCRYQMSPTVANKSRYQSRKV
ENMKAVTTGRLLDGLKMYVTPDVEAECYKYTYPKPMYSASVPDRFVSPEVAVAVCNNF
FHENYPTVASYQITDEYDAYLDMVEGSVSCLDTATFCPAKLRSFPKTHSYLEPTLRSA
VPSAFQNTLQNVLSAATKRNCNVTQMRELPVLDSAVFNVECFKKYACNTDYWEEFKEK
PIRITTECVTSYVARLKGPEAAALFAKTHQLVPLQEVPMDRFVMDMKRDVKVTPGTKH
TEERPKVQVIQAAEPLATAYLCGIHRELVRRLTAVLLPNIHTLFDMSAEDFDAIIAAN
FSYVHPVLETDIGSFDKSQDDSLALTALMILEDLGVDDRLMDLIECAFGEITSVHLPT
ATTFKFGAMMKSGMFLTLFVNTVLNVVIASRVLEQRLRDSKCAAFIGDDNIIHGVVSD
KIMADRCATWMNMEVKIIDAVIGIKAPYFCGGFILEDQVTHTACRVSDPLKRLFKLGK
PLPVDDEQDHDRRRALEDETRAWFRVGIQGELLKAVESRYEVQEVQPVLLALATFSRS
DKAFKALRGSPRHLYGGPK"
mat_peptide 78..5744
/locus_tag="AURAVgp1"
/product="Nonstructural protein P123"
/note="together with nsP4, constitutes the minus-strand
polymerase"
/protein_id="NP_819014.1"
mat_peptide 78..1694
/locus_tag="AURAVgp1"
/product="Nonstructural protein nsP1"
/note="capping enzyme and polymerase complex component;
involved in initiation of negative strand RNA synthesis;
in SFV nsP1, residues 245-264 are responsible for binding
with anionic membrane phospholipids; probably mediates
association of viral replicative complex with membranes;
guanine-7N-methyltransferase; guanylyltransferase"
/protein_id="NP_819010.1"
misc_feature 120..1196
/locus_tag="AURAVgp1"
/note="Viral methyltransferase; Region: Vmethyltransf;
pfam01660"
/db_xref="CDD:396298"
mat_peptide 1695..4112
/locus_tag="AURAVgp1"
/product="NTPase"
/note="viral RNA capping enzyme and polymerase complex
component; the NTPase/RNA helicase/5'-triphosphatase
domain is located in the N-terminal half of the protein
and the proteinase at the C-terminus; the proteinase
domain mediates nonstructural polyprotein processing;
Nonstructural protein nsP2; RNA 5'-triphosphatase;
papain-like proteinase"
/protein_id="NP_819011.1"
misc_feature 2250..2951
/locus_tag="AURAVgp1"
/note="Viral (Superfamily 1) RNA helicase; Region:
Viral_helicase1; pfam01443"
/db_xref="CDD:366646"
mat_peptide 4113..5744
/locus_tag="AURAVgp1"
/product="Nonstructural protein nsP3"
/note="phosphoprotein, polymerase complex component;
predicted phosphoesterase (similar to the Appr-1'-p
processing enzyme) formerly known as 'X-domain'; together
with the membrane binding nsP1 domain, SFV nsP3 mediates
targeting the non-structural polyprotein to the endosome."
/protein_id="NP_819012.1"
misc_feature 4158..4541
/locus_tag="AURAVgp1"
/note="X-domain (or Mac1 domain) of viral non-structural
protein 3 and related macrodomains; Region:
Macro_X_Nsp3-like; cd21557"
/db_xref="CDD:438957"
misc_feature order(4176..4184,4194..4214,4431..4436,4440..4454,
4536..4538)
/locus_tag="AURAVgp1"
/note="ADP-ribose binding site [chemical binding]; other
site"
/db_xref="CDD:438957"
mat_peptide 5745..7574
/locus_tag="AURAVgp1"
/product="Nonstructural protein nsP4"
/note="RNA-dependent RNA polymerase catalytic subunit"
/protein_id="NP_819013.1"
misc_feature 6192..7565
/locus_tag="AURAVgp1"
/note="RNA-dependent RNA polymerase (RdRp) in the family
Togaviridae of positive-sense single-stranded RNA
[(+)ssRNA] viruses; Region: Togaviridae_RdRp; cd23250"
/db_xref="CDD:438100"
misc_feature 6837..6881
/locus_tag="AURAVgp1"
/note="conserved polymerase motif A; other site"
/db_xref="CDD:438100"
misc_feature order(6852..6854,7134..7142)
/locus_tag="AURAVgp1"
/note="catalytic residues [active]"
/db_xref="CDD:438100"
misc_feature 7023..7094
/locus_tag="AURAVgp1"
/note="conserved polymerase motif B; other site"
/db_xref="CDD:438100"
misc_feature 7116..7160
/locus_tag="AURAVgp1"
/note="conserved polymerase motif C; other site"
/db_xref="CDD:438100"
gene 7628..11362
/locus_tag="AURAVgp2"
/db_xref="GeneID:944526"
CDS 7628..11362
/locus_tag="AURAVgp2"
/note="The mature peptides were predicted and annotated at
the NCBI by analogy with the annotation of the genome of
Semliki forest virus (NC_003215) and other alphaviruses;
Structural polyprotein"
/codon_start=1
/product="Polyprotein 2"
/protein_id="NP_632024.1"
/db_xref="GeneID:944526"
/translation="MNSVFYNPFGRGAYAQPPIAWRPRRRAAPAPRPSGLTTQIQQLT
RAVRALVLDNATRRQRPAPRTRPRKPKTQKPKPKKQNQKPPQQQKKGKNQPQQPKKPK
PGKRQRTALKFEADRTFVGKNEDGKIMGYAVAMEGKVIKPLHVKGTIDHPALAKLKFT
KSSSYDMEFAKLPTEMKSDAFGYTTEHPEVFYNWHHGAVQFSGGRFTIPTGVGGPGDS
GRPILDNSGKVVAIVLGGANEVPGTALSVVTWNKKGAAIKTTHEDTVEWSRAITAMCI
LQNVTFPCDRPPTCYNRNPDLTLTMLETNVNHPSYDVLLDAALRCPTRRHVRSTPTDD
FTLTAPYLGLCHRCKTMEPCYSPIKIEKVWDDADDGVLRIQVSAQLGYNRAGTAASAR
LRFMGGGVPPEIQEGAIADFKVFTSKPCLHLSHKGYFVIVKCPPGDSITTSLKVHGSD
QTCTIPMRVGYKFVGREKYTLPPMHGTQIPCLTYERTREKSAGYVTMHRPGQQSITML
MEESGGEVYVQPTSGRNVTYECKCGDFKTGTVTARTKIDGCTERKQCIAISADHVKWV
FNSPDLIRHTDHTAQGKLHIPFPLQQAQCTVPLAHLPGVKHAYRSMSLTLHAEHPTLL
TTRHLGENPQPTAEWIVGSVTRNFSITIQGFEYTWGNQKPVRVYAQESAPGNPHGWPH
EIVRHYYHLYPFYTVTVLSGMGLAICAGLVISILCCCKARRDCLTPYQLAPNATVPFL
VTLCCCFQRTSADEFTDTMGYLWQHSQTMFWIQLVIPLAAVITLVRCCSCCLPFLLVA
SPPNKADAYEHTITVPNAPLNSYKALVERPGYAPLNLEVMVMNTQIIPSVKREYITCR
YHTVVPSPQIKCCGTVECPKGEKADYTCKVFTGVYPFLWGGAQCFCDSENSQLSDKYV
ELSTDCATDHAEAVRVHTASVKSQLRITYGNSTAQVDVFVNGVTPARSKDMKLIAGPL
STTFSPFDNKVIIYHGKVYNYDFPEFGAGTPGAFGDVQASSTTGSDLLANTAIHLQRP
EARNIHVPYTQAPSGFEFWKNNSGQPLSDTAPFGCKVNVNPLRADKCAVGSLPISVDI
PDAAFTRVSEPLPSLLKCTVTSCTYSTDYGGVLVLTYESDRAGQCAVHSHSSTAVLRD
PSVYVEQKGETTLKFSTRSLQADFEVSMCGTRTTCHAQCQPPTEHVMNRPQKSTPDFS
SAISKTSWNWITALMGGISSIAAIAAIVLVIALVFTAQHR"
mat_peptide 7628..8428
/locus_tag="AURAVgp2"
/product="C protein"
/note="the capsid protein is a serine proteinase that
releases itself from the precursor"
/protein_id="NP_819015.1"
misc_feature 7955..8428
/locus_tag="AURAVgp2"
/note="Alphavirus core protein; Region: Peptidase_S3;
pfam00944"
/db_xref="CDD:366379"
mat_peptide 8429..8611
/locus_tag="AURAVgp2"
/product="E3 protein"
/note="In some alphaviruses, E3 was not detected"
/protein_id="NP_819016.1"
mat_peptide 8612..9883
/locus_tag="AURAVgp2"
/product="E2 protein"
/protein_id="NP_819017.1"
misc_feature 8645..9865
/locus_tag="AURAVgp2"
/note="Alphavirus E2 glycoprotein; Region:
Alpha_E2_glycop; pfam00943"
/db_xref="CDD:425959"
misc_feature 9872..11290
/locus_tag="AURAVgp2"
/note="Alphavirus E1 glycoprotein; Region:
Alpha_E1_glycop; pfam01589"
/db_xref="CDD:279870"
mat_peptide 9884..10045
/locus_tag="AURAVgp2"
/product="6K protein"
/protein_id="NP_819018.1"
mat_peptide 10046..11359
/locus_tag="AURAVgp2"
/product="E1 protein"
/protein_id="NP_819019.1"
gene 7628..10032
/locus_tag="AURAVgp3"
/db_xref="GeneID:13165419"
CDS join(7628..10012,10012..10032)
/locus_tag="AURAVgp3"
/note="Truncated verstion of structural polyprotein that
will be produced when frameshifting occurs at nt 10012;
Truncated version of structural polyprotein and transframe
fusion protein have been added by NCBI Staff with the kind
help of Dr. David Karlin (Department of Zoology,
University of Oxford, UK) and Dr. Andrew Firth (Department
of Pathology, University of Cambridge, UK)"
/codon_start=1
/product="truncated polyprotein"
/protein_id="YP_006491231.1"
/db_xref="GeneID:13165419"
/translation="MNSVFYNPFGRGAYAQPPIAWRPRRRAAPAPRPSGLTTQIQQLT
RAVRALVLDNATRRQRPAPRTRPRKPKTQKPKPKKQNQKPPQQQKKGKNQPQQPKKPK
PGKRQRTALKFEADRTFVGKNEDGKIMGYAVAMEGKVIKPLHVKGTIDHPALAKLKFT
KSSSYDMEFAKLPTEMKSDAFGYTTEHPEVFYNWHHGAVQFSGGRFTIPTGVGGPGDS
GRPILDNSGKVVAIVLGGANEVPGTALSVVTWNKKGAAIKTTHEDTVEWSRAITAMCI
LQNVTFPCDRPPTCYNRNPDLTLTMLETNVNHPSYDVLLDAALRCPTRRHVRSTPTDD
FTLTAPYLGLCHRCKTMEPCYSPIKIEKVWDDADDGVLRIQVSAQLGYNRAGTAASAR
LRFMGGGVPPEIQEGAIADFKVFTSKPCLHLSHKGYFVIVKCPPGDSITTSLKVHGSD
QTCTIPMRVGYKFVGREKYTLPPMHGTQIPCLTYERTREKSAGYVTMHRPGQQSITML
MEESGGEVYVQPTSGRNVTYECKCGDFKTGTVTARTKIDGCTERKQCIAISADHVKWV
FNSPDLIRHTDHTAQGKLHIPFPLQQAQCTVPLAHLPGVKHAYRSMSLTLHAEHPTLL
TTRHLGENPQPTAEWIVGSVTRNFSITIQGFEYTWGNQKPVRVYAQESAPGNPHGWPH
EIVRHYYHLYPFYTVTVLSGMGLAICAGLVISILCCCKARRDCLTPYQLAPNATVPFL
VTLCCCFQRTSADEFTDTMGYLWQHSQTMFWIQLVIPLAAVITLVRCCSCCLPFLIGC
QSS"
misc_feature 7955..8428
/locus_tag="AURAVgp3"
/note="Alphavirus core protein; Region: Peptidase_S3;
pfam00944"
/db_xref="CDD:366379"
misc_feature 8645..9865
/locus_tag="AURAVgp3"
/note="Alphavirus E2 glycoprotein; Region:
Alpha_E2_glycop; pfam00943"
/db_xref="CDD:425959"
misc_feature join(9872..10012,10012..>10014)
/locus_tag="AURAVgp3"
/note="Alphavirus E1 glycoprotein; Region:
Alpha_E1_glycop; pfam01589"
/db_xref="CDD:279870"
mat_peptide join(9884..10012,10012..10029)
/locus_tag="AURAVgp3"
/product="transframe fusion polyprotein"
/note="Transframe fusion (TF) protein expressed via
programmed ribosomal frameshifting. TF protein presumably
plays a stabilizing role in the virion structure."
/protein_id="YP_006491232.1"
ORIGIN
1 atagcggacg gactagtact tgtactacag aattaactgc cgtgtgccgc ccgctaaact
61 agccccaatc atcgaaaatg gagaaaccga cagtgcacgt tgacgtagac ccccaaagtc
121 cgtttgtgct acaactgcag aagagtttcc cacaattcga gattgtggct cagcaggtca
181 ctccgaatga ccatgctaat gccagagctt tttcgcatct ggctagtaaa ctgatcgaac
241 atgagatccc cacctcagtt acgatcttgg acataggaag cgcaccagct cgtagaatgt
301 attccgagca taagtatcac tgtgtgtgcc ccatgcgtag tcctgaagac ccggaccgtc
361 ttatgaatta cgcatcccga ctcgcagaca aagcagggga aattaccaac aagaggctgc
421 atgataaact tgcagacctc aagtcggtcc tcgagtcgcc ggatgctgaa actggtacca
481 tttgtttcca caatgacgta atatgccgta cgacagcgga ggtatcagtt atgcaaaatg
541 tgtatatcaa tgcaccttcg accatttacc atcaggccct aaagggagtc agaaaactgt
601 attggatcgg gttcgataca acgcagttta tgttctcctc gatggcaggg tcgtatccgt
661 cctacaatac taattgggcc gatgaaaggg tgctggaagc gcgtaatata ggcctatgta
721 gcacgaagct gagagagggt acgatgggca aactgtctac cttccggaaa aaggccttga
781 aacctggaac taacgtgtac ttctctgtcg gttcgacact ctaccctgag aatagagcgg
841 acctgcagag ttggcaccta ccatctgtgt tccacttgaa aggtaaacaa tcctttacgt
901 gccgctgtga tacggcggtt aactgcgaag gatacgtagt caagaagatc accatcagcc
961 ccgggatcac ggggcgtgtc aatcggtaca ctgtgactaa caacagcgag ggattcttgc
1021 tgtgtaagat cacagatacg gtcaaagggg agcgtgtatc gttccctgtc tgtacgtata
1081 ttccaccttc aatctgtgac caaatgacag gtatattggc cactgatatc caacccgaag
1141 acgcgcaaaa gttgctggta ggactgaacc aacgcatagt cgtgaacgga aaaactaata
1201 gaaacaccaa cacgatgcag aactatctcc tgcccgcggt ggctacaggt ctgagtaaat
1261 gggccaaaga aagaaaggca gactgcagtg acgagaaacc attgaatgtg agagaacgca
1321 aactagcttt cggttgccta tgggctttca agaccaagaa gatccattct ttttaccgcc
1381 cgccaggcac gcagactata gtaaaagtcg cagcggaatt cagtgcgttc cctatgtcct
1441 cggtgtggac tacgtcactg ccaatgtcac tgagacagaa agttaaactg cttcttgtaa
1501 agaaaaccaa taaaccggta gtcactatta ctgacactgc ggtaaaaaac gcacaagagg
1561 catataacga agccgtcgag acagcagaag cggaggagaa agcgaaggcc ttacctccgc
1621 tgaagccgac ggcaccccct gtagcggagg acgtcaaatg cgaggtcacc gacctggtag
1681 acgatgcggg agcggccctg gtcgagacgc cccggggaaa gataaaaatt atcccacagg
1741 aaggggacgt gcgtattggt tcctacacag tcatttctcc agcggcagtc cttagaaatc
1801 aacaactgga gccaatccac gagttagcag agcaggtgaa aattatcacg cacggtggcc
1861 gaacaggcag gtattccgtc gaaccttacg atgctaaggt tctcctgcca acaggatgcc
1921 ccatgtcctg gcaacatttc gcggccttga gcgaaagcgc tacgttagtc tacaatgaga
1981 gagagttcct gaaccggaaa ctccatcaca tcgctacgaa gggtgcggca aaaaacactg
2041 aggaagaaca atacaaagta tgcaaagcta aagacacgga tcatgagtac gtatacgacg
2101 tagatgccag aaaatgcgta aaaagagagc atgcacaagg gctagtacta gttggggaac
2161 taactaatcc gccttaccac gagctggcat acgaaggatt acgtacacga cccgctgccc
2221 cttaccatat cgaaacactg ggggtcattg gaacaccggg gtcaggtaag tcggccatca
2281 taaaatctac ggtaacacta aaagacctcg taactagcgg taagaaagaa aattgcaaag
2341 aaatagagaa tgacgtccag aaaatgcggg gaatgactat agctacgaga acggtagact
2401 cggtacttct taatggatgg aagaaagcag tagacgtcct atatgtggat gaagcgtttg
2461 catgtcatgc aggcacctta atggcattga ttgccattgt caaaccgaga cgtaaagtag
2521 tactgtgcgg cgacccgaag cagtggccct tctttaattt aatgcaactg aaggtaaact
2581 tcaacaaccc cgagcgagac ctgtgtactt ccacccatta taaatatatc tctcgcaggt
2641 gcacccaacc tgttacagcc atagtgtcta cattacacta tgacggaaag atgaggacta
2701 cgaatccctg caaaagggct atcgaaatag acgtaaacgg atcgactaag cccaagaaag
2761 gagacatagt gttgacgtgt ttccgtgggt gggttaagca ggggcaaatc gattaccccg
2821 gacccggagg tcatgaccgt gcagcttctc aagggctaac cagaaggggc gtttatgcgg
2881 tcagacagaa agtaaatgaa aacccactat atgcagagaa gtcagaacac gttaacgtgt
2941 tacttactag gacggaagat cgcatagtgt ggaagacact gcaaggggat ccttggatta
3001 agtacctcac taacgttcca aaagggaact ttacagccac tttagaagaa tggcaggcgg
3061 aacacgagga cattatgaag gccattaatt ctacatccac agtatctgac cctttcgcca
3121 gcaaagtgaa tacatgctgg gctaaagcta ttatacccat cctaagaacg gcagggatag
3181 aacttacatt cgagcagtgg gaagatctat tcccgcaatt tcgtaatgac caaccttact
3241 ccgtgatgta tgccctagat gtgatatgta ccaagatgtt cggcatggat ctgagcagtg
3301 ggatcttctc tcgtcctgag atacctctaa cgttccatcc cgcggacgtc ggccgagtga
3361 gagctcactg ggataactcc ccaggagggc agaagtttgg gtataacaag gcggtaatcc
3421 caacttgcaa gaaataccca gtgtacttaa gagcaggaaa aggggaccaa atactcccca
3481 tatatggcag agtttcagtc ccatcggcac ggaacaattt agttccctta aacagaaatc
3541 taccacactc gctaactgca agcctgcaga aaaaagaagc agctcccttg cacaagttcc
3601 ttaaccaact accaggacac agtatgctgc tggtctctaa ggaaacatgc tattgcgtgt
3661 ccaagcgaat cacatgggtc gctccgctgg gagtcagagg agctgaccac aaccatgacc
3721 tgcatttcgg gttcccacca ctgtccagat acgaccttgt ggtggttaat atgggacaac
3781 cgtacaggtt ccatcactac cagcagtgcg aggagcatgc cggcctcatg aggacgttgg
3841 cccggtcagc actcaactgc ctaaaaccag gaggaacatt agccctgaaa gcatatggtt
3901 tcgccgactc caatagtgag gacgttgttc tgtctttagc gaggaaattc gtgcgggcat
3961 ccgcagtgag accatcgtgt acacagttta acacagagat gttctttgta tttaggcagc
4021 tggacaacga tcgtgagcgc caattcactc agcatcactt gaatttagca gtatccaata
4081 tattcgacaa ttataaagac ggatccggag cagctccttc ttatcgcgtt aagagaatga
4141 atatcgcaga ctgcacagaa gaagcagtgg tgaacgcagc taacgcgcgg ggaaaacctg
4201 gggacggagt atgcagagct atcttcaaaa agtggccgaa gtcatttgag aacgctacca
4261 ctgaagtgga aaccgcggtc atgaaaccat gccacaacaa ggttgttata catgcagtgg
4321 gtcctgattt tagaaagtac acgttggagg aagcgacgaa gctactgcag aacgcatacc
4381 atgatgtggc aaagatagtg aacgagaaag gcatctcctc ggtagctata ccgctgctct
4441 caacaggtat ctatgctgcc ggagctgatc gcctggatct ctcgctgaga tgtcttttca
4501 ccgcgctgga tcgtacggat gcggatgtca caatatattg cctagataag aagtgggagc
4561 aacgcatagc agatgctatt aggatgcgag aacaagtaac tgaattaaaa gatccggaca
4621 tagagataga tgaaggatta acccgggtac acccagatag ctgcctcaag gatcacatag
4681 gctacagtac ccagtatggg aaattgtact catactttga aggtactaaa ttccaccaaa
4741 ccgcaaaaga catagccgag attcgtgcgc tgtttcctga tgtacaagcc gctaacgaac
4801 aaatctgcct gtacacttta ggcgaaccga tggagtccat acgcgaaaag tgcccagtcg
4861 aagactcccc ggcatcagca cctcctaaga caataccttg cctatgtatg tatgctatga
4921 cagccgaacg tatttgccgc gtacgcagta actccgtaac gaacataacg gtgtgctcat
4981 cctttccgtt acccaagtac cgaataaaga acgtacaaaa gatacagtgc acgaaagtcg
5041 tgctatttaa cccggatgta ccgccctaca ttcccgcgag agtatacatt aacaaggacg
5101 agcctcctgt tactccccat accgacagcc cgccggacac ttgctcttcg aggctatcgc
5161 tgacgcccac gctctctaac gcagaatcgg atatcgtatc tctaacgttt tcggaaatcg
5221 atagcgagct gtcgtcacta aacgagcctg cgaggcacgt aatgatatcc tcgttcaagc
5281 tgaggtacac tgcgatccag gctttacctc agaagctgag ttggatgaga gaggatcgca
5341 caccgagaca gccgcctcct gtaccgccac cacgaccaaa acgagccgcg aaattatccc
5401 gactagcaaa ccagctaaac gagctacgga ggcatgcgac gatatcctcc gttcaagctg
5461 aggtacacta caattcaggt tttaccccag aagccgaatt gaatgagaga ggatcgatac
5521 tgaggaagcc gcctcccgta ccgccactac gaccaaaaca aactacgaac ttatctcgac
5581 tcgcaaacca actatctatg ccgataacat tcggagattt tgccgaagga gagctagaca
5641 ggctgctaac gccatcaccg acccctacgt ttggggactt ctctcaggaa gaaatggata
5701 gattcttcgg aaacagacaa tattgactaa ccggggtagg tgggtacata ttctcttctg
5761 atacaggccc ggggcatctg caacaaaaat cagttattca aaacagcacc accgagatac
5821 taatagaacg aagtagacta gaaaaaattc atgcccctgt actggaccta cagaaagaag
5881 agatgctgaa gtgtaggtac cagatgtcac ccaccgtagc caacaagagc aggtaccagt
5941 cacggaaagt agagaacatg aaagcagtga cgacagggag gttgttagat ggtctgaaaa
6001 tgtacgtcac cccggacgtt gaagcagaat gctacaagta cacgtatcca aaaccgatgt
6061 attctgccag cgtgcctgac cgcttcgtat cacctgaagt ggcagtagcg gtctgcaaca
6121 acttcttcca tgagaattat cctacggtag cctcttacca gataactgac gagtacgatg
6181 cctacttaga catggtcgag gggtctgtat cgtgtttaga tactgctaca ttctgcccag
6241 cgaaactgag gagcttccca aagacacact cttacctgga acctacgttg cggagtgcgg
6301 ttccatcggc gtttcagaac accctgcaaa acgtattatc agcagctact aaaaggaact
6361 gcaatgtaac acaaatgaga gaactgcctg tgctagattc cgccgtgttc aatgtggagt
6421 gctttaaaaa gtatgcatgt aatacagact actgggaaga atttaaagag aaaccaataa
6481 gaataacaac agagtgtgtc acctcatacg tggcccgatt aaaaggaccg gaagctgcag
6541 cactattcgc aaagacacat caactggtcc cgctacagga ggtccctatg gataggtttg
6601 taatggacat gaaaagagat gtaaaggtga cacccggcac taaacatacc gaagagcgtc
6661 ccaaggtgca agttattcaa gcagcagaac cactggccac ggcttacttg tgcggtatac
6721 accgggagct agtacggagg cttacagcag tactgttacc taacatccat acactattcg
6781 acatgtctgc cgaggatttc gacgcgatca tagcagcaaa tttttcttac gtgcacccgg
6841 tgctagaaac agatataggc tctttcgaca agagccagga tgattcatta gccctgactg
6901 cactgatgat tctggaagac ttaggagtgg atgaccgcct gatggacctc attgaatgtg
6961 ccttcgggga gatcacaagc gtccacctac cgacagcaac cacgtttaaa tttggtgcta
7021 tgatgaagtc cggaatgttt ttaacactgt ttgttaacac tgtgttgaat gtagttattg
7081 ctagccgagt tctagaacaa cggttgaggg actccaaatg cgcagctttc ataggggacg
7141 acaacattat acatggtgtc gtttcggaca agataatggc cgacagatgc gccacttgga
7201 tgaatatgga agtgaaaatc atagacgcgg tcattggtat caaggcccct tacttttgcg
7261 gcggatttat tctagaagac caagtgacgc acacagcgtg ccgggtctct gatccgctta
7321 agagactgtt caaactaggc aagcctctac cagtagatga tgagcaggat cacgacaggc
7381 gtagagcgct ggaagatgag acaagggcgt ggttcagagt ggggatccaa ggagaacttt
7441 tgaaggccgt cgagtcacgc tatgaggtac aagaggttca gccagtgctg ttagccctcg
7501 ctacgttctc gcgtagcgat aaagcattta aagcactacg cgggagccca agacacctct
7561 acggtggtcc taaatagacg gtgtagcaca gtagatcgtc taaattattt cacatcttta
7621 cgttactatg aactctgtct tttacaatcc gtttggccga ggtgcctacg ctcaacctcc
7681 aatagcatgg aggccaagac gtagggctgc acctgcgcct cgaccatccg ggttgactac
7741 ccagatccaa cagctcacta gggctgttag agctttggtg ctggacaatg ctacacgtcg
7801 ccagcgcccg gctcctcgca cgcgcccgag gaagccgaag actcaaaaac ctaagccgaa
7861 gaagcaaaac cagaaaccac cacaacagca gaagaaaggg aaaaatcagc cccaacaacc
7921 gaagaaaccg aagcccggta aacgacagcg taccgccctg aaatttgaag ccgaccgcac
7981 atttgtcggg aagaatgaag acggcaagat tatgggatac gccgttgcca tggaagggaa
8041 agtgataaaa ccactacatg taaaaggaac cattgaccac ccggccctag cgaaacttaa
8101 attcactaaa tcttcttctt acgacatgga gtttgctaaa ctaccgaccg aaatgaaaag
8161 cgacgcattc gggtatacaa cggaacaccc cgaagtattt tacaactggc atcacggagc
8221 tgtccaattt tccggcggaa ggttcaccat ccctacagga gtcggaggcc ccggagatag
8281 cggaaggcct atactggata actccggaaa agtggtagcc atagtcctag gaggagctaa
8341 tgaagtgcca ggaacggcac tttctgttgt cacctggaat aagaagggag ccgctattaa
8401 aaccacccac gaagatactg tagagtggtc gcgggctatt accgctatgt gcatcctgca
8461 gaacgtcaca ttcccatgtg accgaccgcc aacttgctat aatcgtaatc ctgacttgac
8521 cctaaccatg ttggaaacaa atgtcaatca cccttcgtac gacgttctgc tggacgctgc
8581 tctgaggtgc cccacgagac ggcacgtcag atcaacgccc accgatgact tcactctcac
8641 agcaccgtac ctcggcttgt gtcacagatg taagacgatg gaaccatgct acagccctat
8701 aaaaatcgaa aaagtgtggg atgatgccga tgacggagtt ctccgtatac aagtaagtgc
8761 ccagttaggg tacaacaggg cgggcactgc agctagcgcc cgactccggt tcatgggcgg
8821 aggagtgcct ccggaaatcc aggagggagc aattgcagat tttaaggtct tcacgtccaa
8881 accatgttta cacctatcac ataaaggata ctttgtcatt gtcaagtgcc ctcctggtga
8941 tagtattaca acatcattga aagtgcatgg ctcggatcaa acctgcacaa ttccaatgcg
9001 agtaggttac aagttcgtag gcagggaaaa atatactctg ccaccaatgc atgggacaca
9061 aataccttgc cttacctacg aaaggacacg agagaaaagt gcaggatacg tgaccatgca
9121 tcgtcccgga caacaatcca taaccatgct gatggaagag agcggagggg aggtgtacgt
9181 acaaccgacc agtgggcgaa acgtcaccta cgagtgtaaa tgcggagact ttaaaactgg
9241 gactgtcact gcgcgcacta aaatagacgg ctgtacagaa aggaaacaat gcattgcgat
9301 ttctgccgac cacgtcaaat gggtgtttaa ctcccctgac ttgatcaggc ataccgacca
9361 cacagcccaa gggaagttgc atataccatt cccgctacag caggctcaat gtacagtacc
9421 actggcgcac cttccaggcg ttaagcatgc ttatcgcagt atgtctctga cactgcacgc
9481 tgagcatcct acattgctta ctacccgcca tcttggagaa aatcctcagc ccactgcaga
9541 atggattgtc gggagtgtaa ctcgaaactt ctccataacc atacaagggt tcgagtatac
9601 ttggggaaat cagaaaccgg tccgagtgta cgcgcaggaa tcggcacctg gcaatcctca
9661 tggctggcca catgaaatcg tacgccatta ctaccacctc tatcccttct acaccgttac
9721 agtgctgagc ggcatgggac tggccatatg cgctggctta gtgatcagta ttttatgctg
9781 ctgcaaagca agaagggatt gcctaacacc ttaccaactg gccccgaacg ctaccgtacc
9841 atttctggta acattgtgtt gctgtttcca acggacttca gcggatgaat ttaccgatac
9901 catggggtac ctatggcaac acagtcaaac aatgttctgg atacaattgg tcataccttt
9961 agcagcagtg ataactttgg ttagatgttg ctcctgctgt ctaccttttt tattggttgc
10021 cagtcctcct aacaaagcgg acgcctacga acatacgatc actgtcccaa atgcgccgtt
10081 gaactcgtat aaagcactag tggaacggcc tgggtatgcc cccttgaatc ttgaagtcat
10141 ggtcatgaac acccagatca taccatcggt taaacgtgaa tacattacct gcaggtacca
10201 caccgttgtt ccttcaccgc agattaaatg ttgcggaact gtcgaatgcc cgaaaggtga
10261 aaaagcagac tatacctgca aggtgttcac tggtgtgtac ccatttctgt ggggaggagc
10321 acagtgtttt tgcgactccg aaaacagtca gcttagcgac aagtacgtcg aactgtcaac
10381 agattgcgcc acagaccatg ccgaggcggt cagagtacac acggcttcgg tgaaatcaca
10441 gctccgaata acctacggga actccacagc acaagtagac gtatttgtca acggtgtgac
10501 tccagccagg agcaaagaca tgaaattgat agccggccca ttatctacta cattttcccc
10561 gtttgataat aaggtcatta tatatcatgg gaaagtctat aactatgact tcccggaatt
10621 tggggccgga acacctggag ctttcggaga tgtccaagcg tcatccacca ccggatcaga
10681 tctattagca aacacagcaa ttcatttgca gaggccggaa gccagaaaca tacacgtccc
10741 gtacacccaa gctccaagcg ggttcgaatt ctggaagaat aacagcggtc agcctttatc
10801 tgacactgcc cctttcggat gcaaagtcaa tgtcaacccg ctacgtgcag acaagtgtgc
10861 cgtgggatca ctcccgatat ccgtggatat accggacgct gcatttacac gcgtatccga
10921 gcccctgcca tcactgctta agtgcaccgt tactagttgc acatactcta cagactatgg
10981 cggagtgctc gtgttgacat acgagtcgga tcgcgcgggg caatgcgctg tacactcgca
11041 ttcatcaaca gcggtactgc gagacccatc ggtatacgtc gagcaaaaag gggagactac
11101 acttaaattt agtacgcgtt ccttgcaggc agacttcgag gtatcgatgt gcggaacgag
11161 aaccacttgc catgcccaat gtcaaccacc aacggaacac gtaatgaaca gaccccagaa
11221 gtcgactcca gacttctcct cagcgatatc caaaacatca tggaactgga ttacagcgct
11281 tatgggggga atttccagta tagctgctat agccgcaatt gtgctggtca tagcattagt
11341 atttacagca caacacagat gaatacaaac ttattacatt acgtatgtat tcgatgttat
11401 atgtataaca atgaacgtaa actcgatgta cttccgagga tgtgggtgca taatgccata
11461 cagcggtcca cttatcatga tatagtttta tagttaccac cggtgaaact cgatgtattt
11521 ccgaggaagt gtggtgcata atgccacaca ccggttgaat ttaatagttt tagtcaagac
11581 tgagaaaact cgatgtactt ccgaggatgt ggtgcataat gtcacacatc agtcttatca
11641 tattagttca aaggcaatat tacccctgaa tagtaacaaa actagaaaat cgtcaaccac
11701 cacggatacc gggatcggtg tcctactacg gtagttgaaa ataactttat agaattttaa
11761 aattttcttt attaaaatct tttgtttttc ttttattatt tcaaaatttt gtttttaata
11821 tttc
//