GenomeNet

Database: RefSeq
Entry: NC_005283
LinkDB: NC_005283
Original site: NC_005283 
LOCUS       NC_005283              15702 bp    RNA     linear   VRL 13-AUG-2018
DEFINITION  Dolphin morbillivirus, complete genome.
ACCESSION   NC_005283
VERSION     NC_005283.1
DBLINK      BioProject: PRJNA485481
KEYWORDS    RefSeq; complete genome; F gene; fusion protein; H gene;
            haemagglutinin protein; L gene; large protein; M gene; matrix
            protein; N gene; nucleocapsid protein; P/V/C gene; phosphoprotein.
SOURCE      Dolphin morbillivirus
  ORGANISM  Dolphin morbillivirus
            Viruses; Riboviria; Orthornavirae; Negarnaviricota;
            Haploviricotina; Monjiviricetes; Mononegavirales; Paramyxoviridae;
            Orthoparamyxovirinae; Morbillivirus; Cetacean morbillivirus.
REFERENCE   1
  AUTHORS   Rima,B.K., Collin,A.M. and Earle,J.A.
  TITLE     Completion of the sequence of a cetacean morbillivirus and
            comparative analysis of the complete genome sequences of four
            morbilliviruses
  JOURNAL   Virus Genes 30 (1), 113-119 (2005)
   PUBMED   15744569
REFERENCE   2  (bases 1 to 15702)
  CONSRTM   NCBI Genome Project
  TITLE     Direct Submission
  JOURNAL   Submitted (05-DEC-2003) National Center for Biotechnology
            Information, NIH, Bethesda, MD 20894, USA
REFERENCE   3  (bases 1 to 15702)
  AUTHORS   Rima,B.K.
  TITLE     Direct Submission
  JOURNAL   Submitted (17-OCT-2003) Rima B.K., School of Biology and
            Biochemistry, The Queen's University of Belfast, 97 Lisburn Road,
            Belfast, N. Ireland, BT9 7BL, United Kingdom
COMMENT     PROVISIONAL REFSEQ: This record has not yet been subject to final
            NCBI review. The reference sequence is identical to AJ608288.
            COMPLETENESS: full length.
FEATURES             Location/Qualifiers
     source          1..15702
                     /organism="Dolphin morbillivirus"
                     /mol_type="genomic RNA"
                     /db_xref="taxon:37131"
     misc_feature    1..52
                     /note="leader RNA sequence"
     misc_feature    53..55
                     /note="CTT intergenic sequence"
     gene            108..1679
                     /gene="N"
                     /locus_tag="DolMVgp1"
                     /db_xref="GeneID:2658690"
     CDS             108..1679
                     /gene="N"
                     /locus_tag="DolMVgp1"
                     /codon_start=1
                     /product="nucleocapsid protein"
                     /protein_id="NP_945024.1"
                     /db_xref="GOA:Q66412"
                     /db_xref="InterPro:IPR002021"
                     /db_xref="UniProtKB/TrEMBL:Q66412"
                     /db_xref="GeneID:2658690"
                     /translation="MATLLRSLALFKRNKDRTPLIAGSGGAIRGIKHVIVVPVPGDSS
                     IVTRSRLLDRLVRLAGDPYISGPKLTGVMISILSLFVESPSQLIQRITDDPDVSIRLV
                     EVIQSEKSLSGLTFASRGANMEDEADDYFSIQAGEEGDTRGTHWFENKEIVEIEVQDP
                     EEFNILLASILAQIWILLAKAVTAPDTAADSETRRWIKYTQQRRVVGEFRLDKGWLDA
                     VRNRIAEDLSLRRFMVALILDIKRTPGNKPRIAEMICDIDTYIVEAGLASFILTIKFG
                     IETMYPALGLHEFSGELTTVESLMNLYQQMGETAPYMVILENSIQNKFSAGSYPLLWS
                     YAMGVGVELENSMGGLNFGRSYFDPAYFRLGQEMVRRSAGKVSSSLAAELGITAEDAK
                     LVSEIAAQANDDRANRAIGPKQNQISFLHPDRGDASTPGNILRANEGDGSTRMKRGGN
                     IATPKGTSIDQTSTTLSKDTLDIDEQSDNTDDPISIQKSAEALAKMRAMAKLLENQGP
                     RDVTAHVYNDKDLLG"
     misc_feature    108..1673
                     /gene="N"
                     /locus_tag="DolMVgp1"
                     /note="Paramyxovirus nucleocapsid protein; Region:
                     Paramyxo_ncap; pfam00973"
                     /db_xref="CDD:144532"
     misc_feature    1739..1741
                     /note="CTT intergenic sequence"
     gene            1801..3321
                     /gene="P/V/C"
                     /locus_tag="DolMVgp2"
                     /db_xref="GeneID:2658694"
     CDS             1801..3321
                     /gene="P/V/C"
                     /locus_tag="DolMVgp2"
                     /codon_start=1
                     /product="phosphoprotein"
                     /protein_id="NP_945025.1"
                     /db_xref="GOA:Q709E8"
                     /db_xref="InterPro:IPR004897"
                     /db_xref="InterPro:IPR016075"
                     /db_xref="UniProtKB/TrEMBL:Q709E8"
                     /db_xref="GeneID:2658694"
                     /translation="MAEEQAYHVNKGLECLKSLRENPPDAVEIKEAQIIRSKAACEES
                     SESHHQDNSEKDTLDFDESCSSAIRPETYRMLLGDDTGFRAPGYIPNEGEPEPGDIGK
                     EEPAVRCYHVYDHGGQAVEGVKDADLLVVPTGSDDDAEFRDGDESSLESDGESGTVDT
                     RGNSSSNRGSAPRIKVERSSDVETISSEELQGLIRSQSQKHNGFGVDRFLKVPPIPTS
                     VPLDPAPKSIKKGTGERSALSGTETEFSLTGGATRLAQESRWASSESSAPAENVRQSV
                     TNAERTQKPPQGSGTTASQKSQNNGHSDDEYEDELFMEVQEIKTAITKINEDNQQIIS
                     KLDSIMLLKGEIESIKKQINKQNITISTIEGHLSSIMIAIPGFGKDPNDPTADVELNP
                     DLRPIISRDAGRALAEVLKRPAVERNPKVTPKVHPGSKGQILRDLQLKPVDRKMSSAV
                     GFVPTDDLPSRSVLRSMIKSSNLESEHKRSMIGLLNDVKSGKDLGEFYQMVKKIIK"
     misc_feature    1810..2736
                     /gene="P/V/C"
                     /locus_tag="DolMVgp2"
                     /note="Paramyxovirus structural protein V/P N-terminus;
                     Region: Paramyxo_PNT; pfam13825"
                     /db_xref="CDD:290537"
     misc_feature    2770..3303
                     /gene="P/V/C"
                     /locus_tag="DolMVgp2"
                     /note="Paramyxovirus P/V phosphoprotein C-terminal;
                     Region: Paramyx_P_V_C; pfam03210"
                     /db_xref="CDD:281237"
     CDS             1823..2356
                     /gene="P/V/C"
                     /locus_tag="DolMVgp2"
                     /codon_start=1
                     /product="C protein"
                     /protein_id="NP_945026.1"
                     /db_xref="InterPro:IPR003875"
                     /db_xref="UniProtKB/TrEMBL:Q709E7"
                     /db_xref="GeneID:2658694"
                     /translation="MSIRDLSVSNLSEKIRPMLSKLRKPKLSEARPPAKNQARVITRT
                     TPKKTLLISTNHALQQLDQKRTACYLVMIQDLEHQVTSLMKESPSQETSERRNLQYDV
                     TMFMITAVKRLKESRMLTCSWFQQAVMMMQNSETEMRALSRAMVNLALLIPEEILPLT
                     GDLLPGLRSRDRLTLRL"
     misc_feature    1916..2323
                     /gene="P/V/C"
                     /locus_tag="DolMVgp2"
                     /note="Non-structural protein C; Region: Paramyxo_NS_C;
                     pfam02725"
                     /db_xref="CDD:280822"
     misc_feature    3397..3399
                     /note="CTT intergenic sequence"
     gene            3432..4439
                     /gene="M"
                     /locus_tag="DolMVgp3"
                     /db_xref="GeneID:2658691"
     CDS             3432..4439
                     /gene="M"
                     /locus_tag="DolMVgp3"
                     /codon_start=1
                     /product="matrix protein"
                     /protein_id="NP_945027.1"
                     /db_xref="GOA:Q709E6"
                     /db_xref="InterPro:IPR000982"
                     /db_xref="UniProtKB/TrEMBL:Q709E6"
                     /db_xref="GeneID:2658691"
                     /translation="MTEVYDFDRSAWDVKGSIAPIEPTTYPDGRLIPQVRVIDPGLGD
                     RKDECFMYIFLLGILEDNDIMSPPIGRTFGSLPLGVGRSTAKPEELLKEATELDIVVR
                     RTAGLNEKLVFYNNTPLMLLTPWKKVLTAGSVFSANQVCNAVNLIPLDTPQRFRVVYM
                     SITRLSDNGCYRVPRKMLEFRSANALAFNILVTIRIENAGIVSRPYMSMMRDPQATFM
                     IHIGNFRRKKNEAYSADYCKMKIEKMGLVFALGGIGGTSLHIRCTGKMSKTLHAQLGF
                     KKILCYPLMDVNEDLNRYLWRAECKIVRIQAVLQPSVPQEFRVYDDVIINDDQGLFKI
                     L"
     misc_feature    3441..4424
                     /gene="M"
                     /locus_tag="DolMVgp3"
                     /note="Viral matrix protein; Region: Matrix; pfam00661"
                     /db_xref="CDD:279054"
     misc_feature    4853..4855
                     /note="CTT intergenic sequence"
     gene            5277..6935
                     /gene="F"
                     /locus_tag="DolMVgp4"
                     /db_xref="GeneID:2658692"
     CDS             5277..6935
                     /gene="F"
                     /locus_tag="DolMVgp4"
                     /codon_start=1
                     /product="fusion protein"
                     /protein_id="NP_945028.1"
                     /db_xref="GOA:Q66409"
                     /db_xref="HSSP:1SVF"
                     /db_xref="InterPro:IPR000776"
                     /db_xref="UniProtKB/TrEMBL:Q66409"
                     /db_xref="GeneID:2658692"
                     /translation="MAASNGGVMYQSFLTIIILVIMTEGQIHWGNLSKIGIVGTGSAS
                     YKVMTRPNHQYLVIKLMPNVTMIDNCTRTEVTEYRKLLKTVLEPVKNALTVITKNIKP
                     IQSLTTSRRSKRFAGVVLAGVALGVATAAQITAGVALHQSIMNSQSIDNLRTSLEKSN
                     QAIEEIRQASQETVLAVQGVQDFINNELIPSMHQLSCEMLGQKLGLKLLRYYTEILSI
                     FGPSLRDPVSAEISIQALSYALGGDINKILEKLGYSGADLLAILESRGIKAKVTHVDL
                     EGYFIVLSIAYPTLSEVKGVIVHKLEAVSYNLGSQEWYTTLPKYVATNGYLISNFDES
                     SCAFMSEVTICSQNALYPMSPLLQQCLRGSTASCARSLVSGTIGNRFILSKGNLIANC
                     ASVLCKCYSTGTIISQDPDKLLTFVAADKCPLVEVDGITIQVGSREYPDSVYVSRIDL
                     GPAISLEKLDVGTNLGSALTKLDNAKDLLDSSNQILENVRRSSFGGAMYIGILVCAGA
                     LVILCVLVYCCRRHCRKRVQTPPKATPGLKPDLTGTTKSYVRSL"
     misc_feature    5358..6755
                     /gene="F"
                     /locus_tag="DolMVgp4"
                     /note="Fusion glycoprotein F0; Region: Fusion_gly;
                     pfam00523"
                     /db_xref="CDD:278924"
     misc_feature    6771..>6902
                     /gene="F"
                     /locus_tag="DolMVgp4"
                     /note="Transmembrane 52; Region: TMEM52; pfam14979"
                     /db_xref="CDD:291640"
     misc_feature    7068..7070
                     /note="CTT intergenic sequence"
     gene            7091..8905
                     /gene="H"
                     /locus_tag="DolMVgp5"
                     /db_xref="GeneID:2658693"
     CDS             7091..8905
                     /gene="H"
                     /locus_tag="DolMVgp5"
                     /codon_start=1
                     /product="haemagglutinin protein"
                     /protein_id="NP_945029.1"
                     /db_xref="GOA:Q66411"
                     /db_xref="InterPro:IPR000665"
                     /db_xref="InterPro:IPR011040"
                     /db_xref="UniProtKB/TrEMBL:Q66411"
                     /db_xref="GeneID:2658693"
                     /translation="MSSPRDKVDAFYKDIPRPRNNRVLLDNERVIIERPLILVGVLAV
                     MFLSLVGLLAIAGVRLQKATTNSIEVNRKLSTNLETTVSIEHHVKDVLTPLFKIIGDE
                     VGLRMPQKLTEIMQFISNKIKFLNPDREYDFNDLHWCVNPPDQVKIDYAQYCNHIAAE
                     ELIVTKFKELMNHSLDMSKGRIFPPKNCSGSVITRGQTIKPGLTLVNIYTTRNFEVSF
                     MVTVISGGMYGKTYFLKPPEPDDPFEFQAFRIFEVGLVRDVGSREPVLQMTNFMVIDE
                     DEGLNFCLLSVGELRLAAVCVRGRPVVTKDIGGYKDEPFKVVTLGIIGGGLSNQKTEI
                     YPTIDSSIEKLYITSHRGIIRNSKARWSVPAIRSDDKDKMEKCTQALCKSRPPPSCNS
                     SDWEPLTSNRIPAYAYIALEIKEDSGLELDITSNYGPLIIHGAGMDIYEGPSSNQDWL
                     AIPPLSQSVLGVINKVDFTAGFDIKPHTLTTAVDYESGKCYVPVELSGAKDQDLKLES
                     NLVVLPTKDFGYVTATYDTSRSEHAIVYYVYDTARSSSYFFPFRIKARGEPIYLRIEC
                     FPWSRQLWCHHYCMINSTVSNEIVVVDNLVSINMSCSR"
     misc_feature    7649..8881
                     /gene="H"
                     /locus_tag="DolMVgp5"
                     /note="sialidases/neuraminidases; Region: Sialidase;
                     cl21531"
                     /db_xref="CDD:419713"
     misc_feature    9017..9019
                     /note="CTT intergenic sequence"
     gene            9042..15593
                     /gene="L"
                     /locus_tag="DolMVgp6"
                     /db_xref="GeneID:2658695"
     CDS             9042..15593
                     /gene="L"
                     /locus_tag="DolMVgp6"
                     /codon_start=1
                     /product="large protein"
                     /protein_id="NP_945030.1"
                     /db_xref="GOA:Q709E3"
                     /db_xref="InterPro:IPR001016"
                     /db_xref="InterPro:IPR014023"
                     /db_xref="InterPro:IPR016269"
                     /db_xref="UniProtKB/TrEMBL:Q709E3"
                     /db_xref="GeneID:2658695"
                     /translation="MESISINQILYPEVHLDSPIVTNKLVAILEYSRVTHGYILEDQT
                     LTKNIRYRVENGYSNQMIINNLEIGNVVNLRLMSYPYHRHKIYPDCNYDLFHISDHQI
                     SSRLLTLFKKGNTIYTKISGKIIECMKGVNSRLGISSDLSKEVTTGITDLGAYMQSSQ
                     WYGPFLYWFTIKTEMRSIIKSATHTSHRHRIVPSFVHGERCEVLISRDLVTIINKRSQ
                     DIYYLTFEMVLMYCDVVEGRLMTETAMTVDPRYTELLCRVKYLWNLIDGFFPTLGNST
                     YQIVALLEPLSLAYLQLKDITLELRGAFLSHCFNEIHDILESSGVLTEETYSDVVNAL
                     DYIFITDDIHLTGEIFSFFRSFGHPRLEAVTAANNVRKYMNQPKVINYETMMKGHAIF
                     CGIIINGYRDRHGGSWPPISLPTHASSIVRNALASGEGLTYSQCIDNWRSFAGVKFGC
                     FMPLSLDSDLTMYLKDKALAALKKEWDSAYPKEYLRYNPPKPTGSRRLVNVFLDDSTF
                     DPYNMILYVINGSYLEDPDFNLSYSLKEKEIKEVGRLFAKMTYQMRACQVIAENLISN
                     GIGKYFKDNGMAKDEHDLTKALHTLAVSGIPKNKKDYHRGEGGRQTNPWWFGDKSKIN
                     KRHGQTSTAHSNYAGAGCGIKNGHDQEAYETVSAFITTDLKKYCLNWRYETISIFAQR
                     LNEIYGLPSFFQWLHKRLEKSVLYVSDPHCPPDLDTHMDLDAVPNSQIFIKYPMGGIE
                     GYCQKLWTISTIPYLYLAAHESGVRIASLVQGDNQTIAVTKRVPSTWPYDLKKREATK
                     ITIEYFLILRQRLHDIGHHLKANETIISSHFFVYSKGIYYDGMLISQSLKSVARCVFW
                     SETIVDETRAACSNISTTLAKSIERGFDRYLAYSLNVLKIIQQILISLGFTINTSMTQ
                     DIAIPLLQNQDLLIKMALLPAPIGGLNYLNMSRLFVRNIGDPVTSSLADLKRMIIAGI
                     MPEESIHQVMTQQPGDSSFLDWASDPYSANLPCVQSITRLLKNITARHVLINSPNPML
                     RGLFHADSHEVDESLATFLMDRHIIRPRAAHEILDNSIAVARESLAGMLDTTKGLIRA
                     SMKRGGLTPRIITRLSNYDYDQSKMGISLLTVKKRNNLIDRESCSVQLARALRSHMWA
                     KLARGRSIYGLEVPVVLESMKGYIIKRHESCSLCETGSLNYGWFFGPANCQLDNISKE
                     TSSLRVPYIGSTTEERTDMKLAFAKSPSRSLKSAVRIATVYSWAYGDDDQSWHEAWTL
                     ARQRANITLEELRMITPISTSTNLAHRLRDRNTQVKYSGTSLIRVARYTTISNDNLSF
                     IIADKKVDTNFIYQQGMLLGLGILETYFRLQTNTGSSNTVLHLHVEAECCVIPMTDHP
                     RVPSHRTAPSARKMCTNPLIYDNSPIIEKDAVRLYSQSHRKHLVEFVTWSTGQLYHVL
                     AKSTAMSMIELVTKFEKDHLNEIAALIGDDDINSFITEFLLVEPRLFTVYLGQCAAIN
                     WAFEIHYHRPSGKYQMGELLFSFLCRMSKGVFKILTNALSHPKVYRRFWDCGIIEPIH
                     GPSLDTQNLHLTVCNMIYHCYMIYLDLLLNDELDDFTFLLCESDEDVVSDRFENIQAR
                     YLCILADLYCNAKNCPSIRELAPIKKCAVLTQFIKSEALISPGGLDWNDEPIVVDHFS
                     CSLTYLRRGAVKQIRLRVDPGFVSEVLIDASDHNLGPIKAKEIKLDSINFYPPKEDVA
                     RLLSTIGTAQHDLPIIGTRVINYEVHAYRRIGLNSSACYKAVEVSSVIKSMIEPGEDG
                     LFLGEGSGSMLVTYREILKLKRCYYNRGVSVESRSGQREISPYPSEVSLVEHQLGLDR
                     SVKVLFNGKPEVTWVGNVDCYKYIISNIPSSSLGLIHSDIETLPNKDLVEKLEELTAI
                     LSMTFILGKIGSLLIIKIMPTSGDLVQGFIGYTTPFFRESIIVYPRYSNFISTECYLV
                     FVGLKYNRLINPEGIKQQLLKLSIRTSPGFVAHLLSMKQANYLQSLIGLPVQKGFFNR
                     VLSGLTPIEKVLINCGLTVNGPKVCKNLVHHDIASGSEGLVNSTVILYKELARFKENT
                     RSQQGMFHAYPVLADSRQRELVSRIARKYWGYIILYSTEQGALNQLVRNLKAGYLLFD
                     VHHNFLVKNLSKSERVLIRTLIPRREWLFKLETSEIKEWFKLIGYGALIRE"
     misc_feature    9078..12395
                     /gene="L"
                     /locus_tag="DolMVgp6"
                     /note="Mononegavirales RNA dependent RNA polymerase;
                     Region: Mononeg_RNA_pol; pfam00946"
                     /db_xref="CDD:279314"
     misc_feature    12744..15572
                     /gene="L"
                     /locus_tag="DolMVgp6"
                     /note="mRNA capping enzyme, paramyxovirus family; Region:
                     paramyx_RNAcap; TIGR04198"
                     /db_xref="CDD:275046"
     misc_feature    15663..15702
                     /note="trailer RNA sequence"
ORIGIN      
        1 accagacaaa gctggctagg ggtagaataa cagataatga taaattatca tacttaggat
       61 taatgatcct atcaattggc acaggatttg gataaaggtt cacagtcatg gcgacacttc
      121 ttcggagtct agctctgttc aagaggaaca aagatagaac acccctaatt gcaggttcag
      181 gaggagccat aagagggatt aagcatgtta tcgtagtccc agtacccggt gattcatcga
      241 ttgtcactag gtcaagatta ttggacaggt tagtaagact tgctggtgac ccttatataa
      301 gtggacctaa gctgacaggc gtcatgatca gtatactatc attgtttgtt gaatcaccca
      361 gtcaactaat acagcgaatc actgacgacc cagatgtcag catcagatta gttgaagtaa
      421 tccaaagtga gaagtcccta tcagggctca cttttgcatc cagaggcgct aatatggagg
      481 atgaggcgga tgactatttc tctattcaag caggggagga aggggacacc agaggaaccc
      541 attggtttga gaacaaagag atagttgaaa ttgaggttca agacccagaa gaattcaaca
      601 tactattggc atctattctt gcacaaattt ggatcctatt agccaaggca gtcactgctc
      661 cagatactgc agctgactcc gagacgaggc ggtggattaa atatactcag caacgccgtg
      721 tagtgggtga gtttcggctt gacaagggat ggttggatgc tgtgagaaat cggattgcgg
      781 aggacctatc gttgaggaga ttcatggtgg cattaatttt agatatcaag agaacaccag
      841 ggaacaaacc caggattgcc gagatgatat gcgacataga cacctatatc gtcgaggcag
      901 gtcttgctag cttcatccta actatcaaat tcgggatcga aacaatgtac ccggccttag
      961 ggttgcatga attctcgggt gaattaacta cagttgagtc tctaatgaac ctctaccaac
     1021 agatgggcga gactgcaccg tacatggtaa ttcttgaaaa ctcaattcag aacaagttca
     1081 gtgcaggttc atacccgcta ttatggagct atgcaatggg agttggagtt gaacttgaaa
     1141 attcaatggg tggacttaat tttggtcgtt cttacttcga ccctgcatat ttcagactgg
     1201 gtcaagagat ggtcaggaga tcagcaggca aggtgagctc atcgctagcc gcagaactag
     1261 ggatcacagc cgaggacgcc aaacttgtct ccgagattgc tgcgcaggct aatgacgaca
     1321 gagctaatag agcaataggt cccaaacaaa accagatatc gtttcttcat cctgacagag
     1381 gcgatgccag tactccaggg aacatccttc gcgcaaacga gggtgacggg tccacccgga
     1441 tgaaaagagg ggggaacatt gctacaccaa aagggacaag catagatcag acatcaacga
     1501 ctctcagcaa agacactcta gacattgatg aacagtccga caacactgac gacccaatta
     1561 gtatccaaaa atcagctgag gcattagcca agatgagagc catggccaag ctattggaaa
     1621 accaaggccc gcgtgatgtc actgcgcacg tttataatga taaagatcta cttggctgaa
     1681 caaaaggatc cactctgagc agcgtcagac tttgtatttt catcaatatt acaaaaaact
     1741 taggaccaaa gtccaaggaa ttgacctcca catcatttca tccaccgaca ccatccccaa
     1801 atggcagagg agcaggccta tcatgtcaat aagggacttg agtgtctcaa atctctcaga
     1861 gaaaatccgc ccgatgctgt cgaaattaag gaagcccaaa ttatccgaag caaggccgcc
     1921 tgcgaagaat caagcgagag tcatcaccag gacaactccg aaaaagacac tcttgatttc
     1981 gacgaatcat gctcttcagc aattagacca gaaacgtacc gcatgttact tggtgatgat
     2041 acaggattta gagcaccagg ttacatccct aatgaaggag agcccgagcc aggagacatc
     2101 ggaaaggagg aacctgcagt acgatgttac catgtttatg atcacggcgg tcaagcggtt
     2161 gaaggagtca aggatgctga cctgctcgtg gttccaacag gcagtgatga tgatgcagaa
     2221 ttcagagacg gagatgagag ctctctcgag agcgatggtg aatctggcac tgttgatacc
     2281 agaggaaatt cttcctctaa caggggatct gctcccagga ttaaggtcga gagatcgtct
     2341 gacgttgaga ctataagcag tgaagagcta caaggactga ttagatctca gagtcaaaaa
     2401 cataatggat ttggagtaga cagattccta aaggtcccac caattccaac ctcagtgccg
     2461 ctggaccccg ctcccaaatc cattaaaaag ggcacaggag agagatcagc cttatctggg
     2521 acggagaccg agttttcatt gacaggtggt gcaacccgac ttgctcaaga atcaagatgg
     2581 gcatcgtcag agtcaagtgc acctgcggag aatgtccgcc agtctgtgac gaatgcagag
     2641 aggacccaga aacccccaca aggatctggt accacagcct cccagaaatc ccagaacaat
     2701 ggccattctg atgatgagta tgaagatgaa ctctttatgg aggtacagga gattaaaaca
     2761 gcaattacaa agattaatga agataatcag cagataatat caaaattgga ctctataatg
     2821 ctactaaagg gtgaaattga atctatcaag aaacagatca ataagcaaaa tatcactata
     2881 tcaactatcg aaggccacct gtccagcata atgatagcaa ttccaggatt cggtaaagat
     2941 cccaatgatc ctactgcaga cgtagaactc aatcccgatc tgagacccat aattagccgt
     3001 gatgcaggaa gagctctagc tgaggtcctc aagaggccag cagtcgagag aaatccaaag
     3061 gtcaccccaa aggtccatcc aggatccaag gggcagattc tgagggatct gcaacttaag
     3121 ccggtagaca ggaaaatgag ctctgctgtg ggatttgtcc caactgatga tctgccgtct
     3181 cggagtgtgc ttcgctccat gattaagtcc agcaatcttg aatcagaaca caaacgaagc
     3241 atgatagggc tcttgaatga tgtcaaaagt ggcaaggatc ttggagaatt ttatcagatg
     3301 gtgaaaaaaa tcatcaagta acataccaac ctacatctta ttgcttgctt gcctagctta
     3361 gtagtaatgt tagtcaaaat gatcaattat aaaaaactta ggattcaaga ctaagttgac
     3421 taccgtcgaa aatgaccgag gtttacgact ttgatagatc agcctgggat gtcaaggggt
     3481 ctattgctcc catagagccc actacttatc ccgatggaag actgatccct caagttagag
     3541 ttatagatcc aggcttaggt gacagaaagg atgagtgttt catgtatatc ttccttctgg
     3601 gcatattaga ggataatgat ataatgagcc ccccaatcgg gaggactttc ggctcactcc
     3661 cgctcggagt aggacgatcg actgctaaac ctgaagaact gcttaaagag gccaccgaat
     3721 tggatattgt ggtaaggagg acagcggggt taaatgagaa actagtcttc tacaacaata
     3781 caccgcttat gttattgact ccgtggaaga aagtacttac tgctggaagt gtctttagtg
     3841 caaaccaagt ctgcaatgct gtcaatctta tcccacttga cactccgcaa aggttcagag
     3901 ttgtctatat gagtataacc agactctcag acaatggatg ctatagggtt cccagaaaga
     3961 tgctggagtt cagatctgct aatgctctgg cattcaatat attagtgacc atcagaattg
     4021 aaaacgccgg aattgtaagc cgcccataca tgagcatgat gagagaccca caggcgactt
     4081 tcatgatcca tattgggaac ttccgcagaa agaagaatga ggcttactct gcagattact
     4141 gcaagatgaa gatagaaaaa atgggcctcg tctttgctct tggcggaatc ggagggacca
     4201 gcctacatat cagatgtact gggaagatga gtaagactct acatgctcag ttagggttta
     4261 agaaaatact ctgttacccc ttaatggatg tcaacgaaga ccttaatagg tacttgtggc
     4321 gtgcagaatg caagattgtg agaatccaag ctgttcttca gccatcagtc ccgcaggaat
     4381 tcagagttta tgatgatgtt attatcaatg atgatcaagg gttattcaag attctttaaa
     4441 aagctaaaga ataagacttc tcatttcagg ctcatagcta gttcatccag cagaatcaac
     4501 agagcttata tatattggat aaatcagttc aaagctcaat gtagttggag tttgttgtag
     4561 ggttagtatt gagagataat ggaagtcctg ataaaattca ccattcagct aatcaatggt
     4621 agataattat ctgcatgcgc acatagaagc ttagagtctt gcttctcatc ctgattcttg
     4681 ataattccag cagaacaaag cttcaaacaa cactgtactc agtttcgaat tacctgatca
     4741 accacagatc ctctcctcga caatcagggg ttaaacagtc ctacatccag tgaccccaaa
     4801 agccaagacg tgacatcact cgcaaccaac tgatgcaaat tattaaataa aacttaggag
     4861 taaagtaatt aagactccaa gcaacacccg acaacccact acacccaaaa acagacaacg
     4921 acaaccggca gaacaaacag atctcgcata ggcaaagaga ccacaccaca attcctcctt
     4981 tgcaatcgac cacaccagcc ccaggctcca tctccaggac caccagtcat ccattggctg
     5041 caaaaaacag gctcgatcag catagggcaa gccgatattc ttgctatccc caatccagat
     5101 ccggttgatg cgctcctgac caatcaccgc gagcgaccaa gaatccagag ctctcgatct
     5161 tgaagacaac ttttcatctg gtcgtcaaca ttgagtcact gtcgctagga gaaaatccgt
     5221 agtcaaatat tgtcgaaaag gtagcactta tccttaggac cccagtactc tcattcatgg
     5281 ccgctagtaa cggcggtgtt atgtatcagt catttctcac aatcataatc cttgtcatta
     5341 tgactgaagg gcaaatccat tgggggaatc tctccaaaat tgggatagtt gggacaggaa
     5401 gtgccagcta taaagtgatg accaggccta accatcagta tctagttatc aaattgatgc
     5461 cgaatgtaac tatgattgac aattgcacaa gaactgaagt gacagagtac aggaaactac
     5521 ttaagacagt attagaaccg gtcaagaacg ctcttacagt gataacaaag aatattaagc
     5581 ctatacagtc tttaaccacc agtagaagga gcaagaggtt tgcaggagtg gtcttggcag
     5641 gtgtggcatt gggagttgct actgccgccc agataactgc aggagttgcc cttcatcagt
     5701 ccatcatgaa ctcccagtca attgataatc taagaacaag tttagaaaag tccaaccaag
     5761 ccatagaaga gattagacaa gcatcccaag aaactgtctt agctgttcaa ggggtccaag
     5821 acttcatcaa caatgaactg attccatcta tgcaccagct gtcttgcgag atgctaggac
     5881 agaaattggg tctaaaacta ttaaggtatt acacagagat attgtccatt ttcggaccta
     5941 gcctcagaga tcctgtctct gccgaaatat ctattcaagc actgagttat gctcttggag
     6001 gagatattaa caaaatctta gaaaagttag gctatagtgg tgcagattta cttgcaatac
     6061 tagagagtcg agggataaaa gcaaaagtta cccatgttga tttagaaggt tactttatag
     6121 tactgagtat tgcatacccg actctatcag aggtcaaggg ggtgatagta cacaaattgg
     6181 aggcagtttc ctacaactta ggatctcaag agtggtacac aacattaccc aaatatgtgg
     6241 cgaccaatgg ctatctaatc tctaattttg atgaatcttc gtgtgcattt atgtcagaag
     6301 tgactatttg tagtcaaaat gcactttacc ccatgagtcc gttgctccaa caatgtctta
     6361 gaggatctac cgcatcatgc gcgcgaagct tagtatctgg gacaattggg aatcgattca
     6421 tattatccaa agggaactta atcgctaact gcgcatcggt actctgcaaa tgttactcta
     6481 ctggcaccat aattagccag gatccagata aactcttaac attcgtcgcc gcagacaaat
     6541 gccctttagt tgaggtagac ggaatcacaa tccaagtggg atccagggag tatcctgatt
     6601 cagtttatgt cagtagaatt gatctcggcc cagctatatc actggagaag ttagatgtag
     6661 gcacaaatct gggcagtgcc ctgacaaaac tggacaatgc taaagatcta ctagactcat
     6721 cgaatcaaat tctagagaat gtcaggagaa gttcctttgg aggagcaatg tacataggga
     6781 tccttgtatg tgcaggagct ttagtgattc tgtgcgtatt ggtatactgt tgtaggaggc
     6841 actgtcgcaa acgtgtccaa acacccccta aagcaactcc cggattaaag cctgatctaa
     6901 caggtactac caaatcatat gtaagatcct tgtgatttta atagttgcat gatggacatt
     6961 ccgtaacacc aaaacctagg caaatataca aatactggat tattggtcat acctgagcca
     7021 ttatatatac aatatagaga gtctaataaa taaataatta aagaaaactt agggtgcaag
     7081 ttgaccaacc atgtcttctc cgcgtgacaa ggtcgacgca ttctacaagg acataccaag
     7141 acctagaaac aatagggttt tgctagacaa tgagagggtt atcatagaaa gaccgttgat
     7201 actcgtgggt gtgctagccg ttatgtttct tagtctggta ggactacttg ccattgcagg
     7261 agttagactg caaaaagcaa cgactaacag cattgaggtg aacagaaaac ttagcacaaa
     7321 tttggaaaca accgtgtcta ttgaacatca tgttaaggat gtcttaactc ctttgtttaa
     7381 gatcatcgga gacgaagtgg ggttacggat gccccagaag ttaacagaga taatgcagtt
     7441 catatcaaac aagatcaaat tcctcaaccc agatagagaa tacgacttca atgatctaca
     7501 ctggtgtgtt aacccccctg atcaagtcaa aattgactat gctcagtact gcaaccacat
     7561 cgcagccgag gagcttatag ttactaagtt caaagagtta atgaaccact ctctagatat
     7621 gagtaaagga aggatattcc ctcctaagaa ttgctcgggc tcagtcatca ccagaggtca
     7681 aactataaaa ccagggctca ccttggttaa catatataca acaaggaact tcgaagtctc
     7741 ctttatggtc acagtaatat ctggggggat gtatgggaaa acatattttt tgaaaccacc
     7801 cgaacctgat gatccatttg agttccaagc atttaggatc tttgaggtag gattagtcag
     7861 agacgtaggg agccgtgaac ccgtgctgca gatgacaaat ttcatggtga ttgatgagga
     7921 cgaaggtctg aatttttgtt tgttgtcagt cggagaactt agacttgcgg ctgtctgcgt
     7981 cagagggaga ccagttgtta caaaggacat aggaggttac aaagatgagc cctttaaagt
     8041 tgttacgttg ggcatcatag ggggtggttt gagtaatcag aaaactgaga tctacccgac
     8101 gattgattct tctatcgaga agttatacat aacttctcat agaggtataa tccgaaactc
     8161 aaaagctagg tggtcagtac ctgcaattag aagtgatgac aaagacaaaa tggagaaatg
     8221 cactcaagca ttgtgcaaga gtagaccacc cccctcatgc aatagttctg attgggagcc
     8281 attgacgagc aaccggatcc cagcttatgc atatattgca ctagagatta aagaggactc
     8341 aggtctagag cttgatatta cgtcaaacta cggacccttg ataatccatg gagcagggat
     8401 ggacatttac gaaggcccca gtagcaatca agactggctg gccattcctc ctttgtcaca
     8461 atcagtactc ggtgtcatta acaaggtcga ttttacagca ggatttgata tcaaaccaca
     8521 tacccttaca actgcagtcg attacgagag tggaaaatgc tatgtcccgg ttgagttgtc
     8581 aggagccaag gatcaggacc ttaaattaga atcaaacctt gttgtattgc ctacaaagga
     8641 ctttggttat gttacagcaa catatgacac ctcaagaagt gagcatgcaa tagtttatta
     8701 tgtgtatgac acagctaggt cttcatcata tttcttccca ttcagaatca aggcaagagg
     8761 agagccaatt tatctgagga tagagtgttt cccctggtcc aggcaactct ggtgtcatca
     8821 ctactgcatg attaatagta cagtatccaa tgagattgta gtggttgata acctagtaag
     8881 tatcaatatg agctgcagcc gttagatgaa cgactgttga gcctccacca actgtcccag
     8941 tatgccgcac catcagcact cccactctcc aatccatcct ctaatcagtg gtcgctctag
     9001 accccattaa gaaaaactta gggaccagga ttattgcaga gatggagtcc atctcaatca
     9061 accagatcct ataccctgaa gtgcacctgg atagtcctat agtcactaat aagttagtag
     9121 ctatactgga gtactctcgg gtaacccatg ggtacattct ggaggatcag acattgacaa
     9181 agaatattcg ctatagggtc gagaatggtt actccaatca aatgattata aataacctcg
     9241 aaattgggaa tgtggtaaat ctaagactca tgagttaccc ctaccacagg cacaagatct
     9301 accccgactg taattatgac ctattccata tttctgatca tcaaatttct tccaggctgt
     9361 taaccctatt caaaaaggga aatacaatat atactaagat cagcggaaaa attattgaat
     9421 gtatgaaggg agtgaattca aggctaggga taagcagtga tttaagtaaa gaagttacga
     9481 cagggatcac agatttggga gcctacatgc agagttctca atggtatggg ccttttctat
     9541 attggttcac aattaaaact gagatgcggt ccatcatcaa atcagctaca cacacaagtc
     9601 atagacatag gatagtacct tcatttgtgc atggtgagag atgtgaagtt ttgatatcta
     9661 gagacttggt aaccataatt aataaaaggt cacaagatat ctactatctc acatttgaaa
     9721 tggtacttat gtactgcgat gtagtggaag gaaggttaat gacggaaact gcaatgactg
     9781 tggaccctag gtacactgag ctcctctgta gagtaaagta cctatggaat ttgatagacg
     9841 gcttcttccc caccttaggc aactccacat atcaaatcgt ggcattatta gagccattgt
     9901 ctctggccta tctccagttg aaagatatca cacttgaatt gaggggcgca ttccttagtc
     9961 actgttttaa tgaaatccat gacattttgg agagtagtgg agttttaact gaggagacat
    10021 attctgatgt tgtcaatgca ttggattaca tcttcattac tgacgacatc cacctgacag
    10081 gggagatatt ttcattcttc cgaagtttcg gtcacccgcg tcttgaagct gtaacagcag
    10141 ccaacaatgt gagaaagtat atgaaccaac caaaagttat caactacgag actatgatga
    10201 agggtcatgc tatattttgt ggtattatca ttaatgggta ccgtgatagg cacggaggta
    10261 gttggcctcc gatttccctt ccgactcatg catcttcaat agttaggaat gccctggctt
    10321 caggtgaggg attgacctac agtcaatgta ttgataattg gaggtcattt gctggggtta
    10381 agtttgggtg tttcatgccc ttgagcctag atagtgacct gactatgtat ctcaaggata
    10441 aagcactagc agcactcaag aaagaatggg attcggctta cccaaaggaa taccttcgtt
    10501 ataatcctcc caagccgaca ggatcgagaa gactggtcaa tgtttttctt gatgactcta
    10561 catttgatcc ttacaatatg atcctctacg tgataaatgg ctcctatttg gaagaccctg
    10621 atttcaactt atcttacagc ctaaaagaaa aagagattaa agaagttggg agactatttg
    10681 ccaaaatgac ctaccaaatg agggcttgcc aagtcattgc agagaattta atatcaaatg
    10741 gaatagggaa gtatttcaaa gacaacggta tggccaaaga tgaacatgat cttacaaagg
    10801 cactccacac actagcggtt tcaggtatac ctaagaacaa aaaagattac catagaggtg
    10861 aaggtgggag acagaccaat ccttggtggt ttggtgacaa gtcaaagatt aataagcgac
    10921 atggacaaac ctcaacggct cattctaatt atgctggtgc cgggtgcgga atcaagaatg
    10981 gacatgatca ggaggcttac gagactgtca gtgcttttat cacgactgat cttaagaaat
    11041 actgtttaaa ttggaggtat gagacaatca gcatatttgc ccagagacta aatgagattt
    11101 atgggttgcc ttcattcttc cagtggctac acaagaggtt ggagaaatca gtcctatatg
    11161 taagcgatcc tcactgtcca ccagatcttg atactcatat ggatttagat gccgtcccaa
    11221 actctcaaat atttataaag tacccaatgg gagggatcga aggatactgt cagaagctct
    11281 ggacaataag tacaataccc tacctgtacc tagcggctca tgagagtgga gttaggattg
    11341 cctcattggt tcaaggagac aaccagacca tagccgtcac aaaaagagtc ccaagcacct
    11401 ggccctatga tctcaagaag agggaggcta caaaaataac tattgagtac ttcttaatcc
    11461 taaggcagcg attgcacgat atagggcatc atttgaaagc aaatgaaaca attatatcct
    11521 cacatttctt tgtttactca aaaggtatat attatgatgg aatgctgatc tctcaatcac
    11581 ttaaaagtgt ggctcggtgt gttttctggt cggaaacaat tgtggatgaa actagagcag
    11641 cttgcagtaa tatctcgaca actcttgcaa aaagcataga aaggggattt gaccggtatt
    11701 tggcttactc attgaatgta ctcaagatta ttcaacagat ccttatctcc ctagggttta
    11761 caataaacac ttctatgact caagacatag caatcccatt actgcagaat caagatttgt
    11821 tgattaagat ggcactacta cctgctccta taggtggcct caattatctt aatatgagca
    11881 gattattcgt aaggaatata ggtgatccag ttacgtcctc tttagctgac ctcaaaagaa
    11941 tgataattgc agggattatg cctgaggagt cgatccacca agttatgaca caacaaccag
    12001 gggattcctc tttcctagat tgggcgagtg atccctactc agctaatcta ccatgtgtcc
    12061 agagtatcac tcggctgctt aagaatataa ctgcgaggca tgtactaata aatagtccca
    12121 atccgatgtt aagggggtta ttccatgcgg atagccatga ggtagatgag agtttagcaa
    12181 cttttctcat ggatagacac atcatcagac ctagagcagc tcatgagatt ctcgacaaca
    12241 gcatcgcagt ggctcgcgaa tcacttgcag ggatgctcga tactactaaa gggctaataa
    12301 gggctagtat gaaaagaggt ggactgaccc cacgaatcat cacgagatta tcaaattatg
    12361 attatgatca gtctaaaatg ggaatatcac ttttaactgt gaagaaacgg aacaatctta
    12421 tcgataggga gtcttgctca gtccagctgg cccgagccct cagaagtcac atgtgggcga
    12481 aactcgcaag aggaaggtcg atatacggac ttgaggttcc tgttgtacta gaatcaatga
    12541 aagggtacat tattaaacgt catgagtctt gttcattatg tgaaactggc tcactgaatt
    12601 atggttggtt ctttggtccc gccaattgcc aattagataa tatttccaag gagacatcat
    12661 ctttacgagt tccctatatt gggtcaacaa cagaggaaag gacagacatg aaattagcat
    12721 ttgccaagtc ccctagtcgc tccctcaaat ctgccgtgag aattgctaca gtgtactctt
    12781 gggcttatgg tgatgacgac cagtcatggc atgaggcatg gactctagcg aggcagagag
    12841 ctaacattac cttagaggag ttaaggatga ttaccccaat atcgacatcc accaatttgg
    12901 ctcatcgatt gagggatcgt aacactcagg taaagtactc aggtacatcg ttaatccgag
    12961 tagcgaggta tacaacaata tcgaatgata acttatcatt cattatagca gataagaagg
    13021 tggatacaaa tttcatttat cagcaaggaa tgttgttggg attgggaatc ctagaaacat
    13081 atttcaggtt acagacaaat acggggtcct ctaatacagt gttacatcta catgttgaag
    13141 cagaatgctg tgtaatccct atgactgacc atcccagagt cccaagtcac cgtactgcac
    13201 ctagcgccag gaaaatgtgc accaatccat tgatctacga taattctcca atcattgaga
    13261 aggatgcggt acgtctatat tcccagagcc atagaaagca cctcgtggag tttgttactt
    13321 ggtcaacggg gcagctgtat cacgtactag cgaaatcaac tgctatgtca atgatcgaat
    13381 tggttaccaa atttgagaaa gatcatctga acgaaatagc tgctttaata ggagacgatg
    13441 atatcaatag tttcatcact gaattccttc tagtagagcc tagactattt acagtatact
    13501 taggccaatg tgcagctatt aattgggctt tcgagataca ttatcaccgt ccttcaggca
    13561 agtaccaaat gggagagttg ctcttttctt ttttgtgtag gatgagtaaa ggtgttttta
    13621 aaatccttac taatgctttg agtcatccta aagtctacag gagattttgg gactgtggaa
    13681 taattgagcc aatacacggc ccctctctag atactcaaaa tttacatctc acggtctgta
    13741 acatgatcta ccactgttat atgatatatc tagatttgct cttgaatgat gagcttgatg
    13801 atttcacttt cctattgtgc gagagtgatg aagatgtagt cagtgaccgg tttgagaata
    13861 tccaagcaag gtacctttgt attcttgcag atctgtactg taatgcaaaa aactgcccgt
    13921 caataaggga gcttgcacct ataaagaagt gtgcagtact tacacaattc attaaatcgg
    13981 aagcattaat ttctcctggg gggttggatt ggaatgatga accgatagta gtagaccact
    14041 tttcttgctc cttaacatat ctcagaagag gggcggttaa acagatcagg ctgagggttg
    14101 atcctgggtt cgtatcggaa gttctaattg acgcttctga ccataacctg ggaccaatta
    14161 aagctaagga gattaaatta gatagcataa atttctatcc ccccaaggaa gatgttgcac
    14221 gattactcag tactataggc acagcccagc atgatctccc cataatcggt actagagtaa
    14281 tcaactatga agtacacgct tacaggagaa tagggttgaa ctcatctgct tgctacaagg
    14341 cagtagaagt atcctcggta atcaagagca tgattgaacc aggagaggat gggttattct
    14401 tgggtgaagg ttcaggttct atgctagtca cgtataggga aatactcaag cttaaacgtt
    14461 gttattacaa cagaggagtg tcagtcgagt ctagatctgg gcaaagagag atatctccct
    14521 atccatcaga agttagtctt gttgaacacc aactagggct ggatcgtagt gttaaggtat
    14581 tattcaacgg gaaaccagag gtgacatggg taggtaatgt agattgttat aagtatatca
    14641 ttagtaatat accatcttca agtctgggac taatccactc agatatcgag accttaccaa
    14701 ataaggacct agttgaaaag ctggaagaat taactgccat attgtcgatg acatttattt
    14761 taggtaaaat aggatctctc ctaataatca agataatgcc gacaagcggg gacttagttc
    14821 aaggattcat cggatacacc actccttttt ttagagagag cattattgta taccctagat
    14881 acagtaactt tatctccact gagtgttatc tagtatttgt tgggcttaag tacaatagat
    14941 taatcaaccc tgaaggaata aaacaacaac tactaaaatt gagcataaga acatcaccgg
    15001 gatttgtagc acatttgtta tctatgaagc aagcaaacta cctacaatcc cttattggac
    15061 tccctgttca gaaaggattc tttaatagag tgttgagtgg gttgaccccg attgaaaaag
    15121 ttcttattaa ttgcgggtta actgtgaacg ggcccaaggt atgtaagaac ttagtacatc
    15181 atgatatcgc gtctggatca gagggtcttg tcaactcaac tgttatttta tacaaagaat
    15241 tggcaaggtt taaggaaaac acaaggagtc aacaaggcat gtttcacgct tatccggttc
    15301 ttgcagatag taggcaaaga gaattagtat ccagaatcgc taggaagtat tgggggtata
    15361 ttatactgta ttcaacagaa cagggggcgc tcaatcaatt ggtaagaaat cttaaagcgg
    15421 gctacttatt atttgatgtt catcacaatt ttctcgtcaa gaatctttcc aaatccgaaa
    15481 gggtcttaat acggactctt atcccacgga gagagtggtt atttaagctt gagacaagtg
    15541 agataaagga gtggtttaag ttgatagggt acggtgccct catcagagag taataactaa
    15601 tgaagtggac ccctgctcct gtccttgtca gagtgatatc agattataat tattaagaaa
    15661 aacaagattc gatttaagta cctataccca gctttgtctg gt
//
DBGET integrated database retrieval system