LOCUS NC_031138 6441 bp RNA linear VRL 13-AUG-2018
DEFINITION Huangpi Tick Virus 2 strain H114-17 RNA-dependent RNA polymerase
(L) gene, complete cds.
ACCESSION NC_031138
VERSION NC_031138.1
DBLINK BioProject: PRJNA485481
KEYWORDS RefSeq.
SOURCE Huangpi Tick Virus 2
ORGANISM Huangpi Tick Virus 2
Viruses; Riboviria; Orthornavirae; Negarnaviricota;
Polyploviricotina; Ellioviricetes; Bunyavirales; Phenuiviridae;
Phlebovirus.
REFERENCE 1 (bases 1 to 6441)
AUTHORS Li,C.X., Shi,M., Tian,J.H., Lin,X.D., Kang,Y.J., Chen,L.J.,
Qin,X.C., Xu,J., Holmes,E.C. and Zhang,Y.Z.
TITLE Unprecedented genomic diversity of RNA viruses in arthropods
reveals the ancestry of negative-sense RNA viruses
JOURNAL Elife 4 (2015) In press
PUBMED 25633976
REMARK Publication Status: Available-Online prior to print
REFERENCE 2 (bases 1 to 6441)
CONSRTM NCBI Genome Project
TITLE Direct Submission
JOURNAL Submitted (23-SEP-2016) National Center for Biotechnology
Information, NIH, Bethesda, MD 20894, USA
REFERENCE 3 (bases 1 to 6441)
AUTHORS Li,C.-X., Shi,M., Tian,J.-H., Lin,X.-D., Kang,Y.-J., Qin,X.-C.,
Chen,L.-J., Xu,J. and Holmes,E.C.
TITLE Direct Submission
JOURNAL Submitted (28-SEP-2014) Department of Zoonosis, State Key
Laboratory for Infectious Disease Prevention and Control, National
Institute of Communicable Disease Control and Prevention, Chinese
Center for Disease Control and Prevention, Changping Liuzi 5,
Beijing, Beijing 102206, China
COMMENT PROVISIONAL REFSEQ: This record has not yet been subject to final
NCBI review. The reference sequence is identical to KM817668.
##Assembly-Data-START##
Sequencing Technology :: Sanger dideoxy sequencing
##Assembly-Data-END##
COMPLETENESS: full length.
FEATURES Location/Qualifiers
source 1..6441
/organism="Huangpi Tick Virus 2"
/mol_type="genomic RNA"
/strain="H114-17"
/host="Haemaphysalis sp."
/db_xref="taxon:1608048"
/segment="L"
/country="China"
/collection_date="2013"
gene 17..6382
/gene="L"
/locus_tag="BI075_sLgp1"
/db_xref="GeneID:29061202"
CDS 17..6382
/gene="L"
/locus_tag="BI075_sLgp1"
/codon_start=1
/product="RNA-dependent RNA polymerase"
/protein_id="YP_009293590.1"
/db_xref="GeneID:29061202"
/translation="MFHRVDDLSEICRRSEKVDSLTVPDFRTYWSTVTLPPPLHHVYK
DGSDIIIDFDLSTLDTSSQTGSSIRTTYKVKADDAGTLIHDFTFAHWSETTDEPLQNH
FPSVRDDANRWTPDFISTRLDGIKDVVEFTTFRSVDERAARQRFMDKITKYEYPLELR
SRATPGTLLFAICVFRGGVVTNLDLTDAEVDEVCFRFSVVQAVFATLQNQMLVQEVQD
PEETRLERQVQQTFLRIQPEWDTTEKNFYPFTRDLYHSFQNGVMDEDYLRGALKHCYA
EAKKDVEDRNFIHVTTDSSERIVLNGEEAAKSITEFVEAVDAKALRSDHDHKSTIPFP
GIIPKVQGNTLSLSGLKDVSFSGITADSTGKAWAEAISRIHSDDVERADEHEELEREI
ALNGMNPDETEDYKKSRQQYHRVDLSNLDSSDRVELAKQGVEAKEFRDHPAIQQKRQE
SKRTFSLHADTSDIDCFLTEEGNLFDETLSQEAPPAVESCVESSARFQSLHGIDHKTN
PWSLNVLSFLRLPIGVWLLMATCIGVELSISLKQHCGRRKFILKKLRFFDVYLLVKPT
NSGSHVFYSVGFHKSAILGMLHSSNVFKAVKEDEGWCWTEFHSFKMSKLTNPVKALSS
LCSGYWYWREFFEIPFWTGSTDDYSNNIRQANEMFKLTIMLLLEDKARTEEIVTLSRY
IMMEGFKSVPELPKPHKMIEKLPTILRTKLQVWLTRKMLESISRVSQRPFQICNDQGT
LYWRGMFNPFTGESIMSTQKLISLFYLGYLKNKEESPEQNASIKMYSKILEYELKHPG
RYDYLGMMDPPADDCRYHEYSPSLIRLLCSTSIQFLRHQLGEGWRETLHKSIIHEIAH
LDLEKLATLKASSKFDEKWYSYNPSETYHRSKVIERVADYVNDKTSHVFQILEGCLLK
VESRGCMHICLFKKPQHGGLREIYVLGFEERVVQLVLETIARCICKHFPSETLTNPKH
KLTIPESHGRFAMKICGNQHQTVGTSDDARTWNQGHHVSKFALLLMSFTKSELHPFVF
RACSLFMKKRIMLDQNLLRILETNSNLLTDDETLRNLHSVYHGNEHVSWMTFKGGFIQ
TETGMMQGILHFTSSLFHTILQTWMKRVVAGSLKAILGINSTLNPHVDVLQSSDDSGM
LVSFPTDDPTLTMRCRQKTATLFEYKRKVGKLIGIYPSVKSTSNTLFVLEFNSEFFFH
TNHNRPVFRWIAAAGTISEQESLAARQEEMSNNLTSVLEGGGSFSLVSFCQYSQMLLH
YVLLGMTVSPVFLEFMCSAREYMDPALGFFLMDPPFSPGLCGFKYNLWVACRKGRLGL
KYRYFLNIMDGLATPEEKKASWKCLDTTSSGTFVQAVLIRWGDRKKWERLVKKVVTED
DWVDRIDENPFLLYRRPMTGQEVRLKLAVKMHSPGVASSLSKGNAVVRIIASSVYILT
RAVLSDNLMMLEENRIAKKTSLLQRVLGFNGLLGSSGPHLTEDQFLLLFPHHQDYLGI
SDRLGQLRSIAGRFSAKKTHITQTRIEIIEKERFMKVRPEDLVSDMWFGTSRSRVNTK
QFEKEWILLKTTVEWLRDTAEETLTLSPFSHHPALQNFFSRLETKGRSVRITGSPIKQ
RSGVSNMMTAVRDNFFPGFILSDVYDSAGLERSESAGLMRHCIFLILAGPYTEYRKTK
MVEEVCCSLPTISFKLNQYKSRINSLALIQHFLKNPNDEKIFDHICNTNSGVIGGFTQ
PQKSRPLGQGRLYYGPGVWRGLVYGMNIQIEVNSPPDSDYTYLQAVTIDHDSSKDFLP
GFLKTWCEEMNVTNLYSPRYSRSKKLLFFIYNFSVKPLRNPSGCPVYTESFKLFTNSS
LRIDLLGFKVRSSVLNFRYYENERERGKSNGRGMNLVSFFSRDSDAGLDEASSLSSLM
DQKIFSFSNNEPSTSWMTMRSLSSVSLNILLSKSSESHMLRAGIDKEVLKRCIKEALI
SSLKRMGVFLSDLKEAVDKMTDYAYTTAMEDCFNFAFEITEITSDDSDLFLNEPPQTA
QWDPTDFELDMSDLGPFGSMAIEEATNTRFYHHRIMEDVARKMVATLGHRGVRDLIIN
STYPRVHKELVQEWCSYLEINFESLAAKEEEAFGIALGPVIGLEQIG"
misc_feature 200..448
/gene="L"
/locus_tag="BI075_sLgp1"
/note="L protein N-terminus; Region: L_protein_N;
pfam15518"
/db_xref="CDD:373916"
misc_feature 587..1318
/gene="L"
/locus_tag="BI075_sLgp1"
/note="Protein of unknown function (DUF3770); Region:
DUF3770; pfam12603"
/db_xref="CDD:289377"
misc_feature 1862..3940
/gene="L"
/locus_tag="BI075_sLgp1"
/note="Bunyavirus RNA dependent RNA polymerase; Region:
Bunya_RdRp; pfam04196"
/db_xref="CDD:282102"
ORIGIN
1 acacagaggg gccaagatgt ttcatagagt agatgattta agcgagatct gcagacgatc
61 agagaaagtt gattctctga ccgtccctga cttcaggact tactggtcaa ctgtcacttt
121 acctcccccc cttcatcatg tgtacaagga tggatcagac atcatcatag atttcgatct
181 gtccacactt gacacttctt cccagactgg atccagcatc agaacaacct acaaggtcaa
241 ggcagacgac gctggcactc tcattcatga cttcaccttt gcgcattgga gtgagacgac
301 tgatgagcct ctacaaaatc acttcccaag cgtgagagat gacgccaaca gatggacgcc
361 agatttcatt agcactagac tggatggcat caaggatgtt gtggagttca caactttcag
421 aagcgtggat gagagagctg cccgtcaaag gttcatggac aagatcacta aatatgagta
481 cccccttgag ctccggtccc gtgcaacccc tggcaccttg ttgtttgcta tttgcgtgtt
541 ccgaggtggg gtggtgacga acctggattt aactgatgca gaagtagatg aggtctgctt
601 ccgattcagt gttgttcagg ctgtgtttgc aaccctccag aaccaaatgc tagtgcagga
661 ggttcaggat cctgaagaaa ccagacttga aaggcaggtg caacaaacat ttctcagaat
721 ccagcctgaa tgggacacta ctgagaagaa cttctacccc ttcacaaggg atctttacca
781 ttcatttcaa aatggtgtca tggatgaaga ctatctcaga ggagctctaa agcattgtta
841 tgcagaagca aagaaggatg tggaagaccg aaacttcatt catgtgacca cagattcatc
901 agaaagaatt gtgctaaatg gagaggaagc agcaaaatcc atcacggaat ttgtagaggc
961 agtggatgcc aaggccctcc gtagtgatca tgatcacaag agtacaatac catttcctgg
1021 aatcatcccc aaggtacaag gaaacactct ttcactctct ggattaaagg acgtatcctt
1081 ctcagggata acagcagact ctactggaaa ggcatgggct gaagcaatct cacgcatcca
1141 ttctgatgat gtggaacgtg ctgatgagca tgaagaatta gagagagaaa ttgctctgaa
1201 tggaatgaat ccagacgaga ctgaagacta caagaagtct aggcagcagt atcacagagt
1261 tgatctttcc aatctagatt ctagtgacag ggtggaactg gctaagcaag gtgttgaggc
1321 aaaggagttc cgagaccacc ctgctattca gcagaaaaga caggagtcaa aaaggacatt
1381 ctctctccat gcagacacca gtgatataga ctgtttcttg actgaagaag gtaacctttt
1441 tgacgagact ctgtcgcaag aagcacctcc tgcagtagaa agttgtgttg aatcatctgc
1501 tagattccag tctctccatg gaattgacca caagacgaac ccctggagtc tgaatgtgct
1561 atccttccta aggctcccaa taggggtgtg gctattaatg gcaacatgca ttggggttga
1621 actgtccatt agcctgaagc aacattgcgg caggagaaag ttcattctaa agaaactgag
1681 gttctttgat gtctaccttt tggtaaaacc aactaactct ggcagtcatg ttttctacag
1741 cgttggtttt cacaaatcag caatactagg gatgttgcat tctagcaatg tgttcaaggc
1801 agttaaagag gatgaaggct ggtgttggac agaattccat tcttttaaga tgtccaaatt
1861 gacaaatcct gttaaggcac tgtccagttt atgctcaggt tactggtact ggcgtgaatt
1921 cttcgagatt ccattttgga ctggatccac agatgattac tcaaacaaca ttaggcaagc
1981 taatgagatg ttcaaattga caatcatgct tcttcttgaa gacaaagctc gcactgagga
2041 gattgtcacc ttgagcaggt atatcatgat ggaagggttt aaatcagtcc cagaattgcc
2101 gaagccacac aagatgattg agaaacttcc aacaattctg cggacaaagc tgcaggtctg
2161 gttgacaagg aagatgcttg aaagcatctc aagagtttca caaagaccat tccaaatctg
2221 caacgaccag ggaactctgt actggcgagg gatgttcaac ccattcactg gggaatcaat
2281 catgtcaacg caaaagctca tctctctctt ctatctgggg tatctaaaga acaaagagga
2341 gagtccagag caaaatgcat ccataaagat gtattctaaa attctggaat atgaacttaa
2401 gcatcctgga agatatgatt atctcggaat gatggacccc ccagcagatg attgcagata
2461 tcatgaatac tctccttctc tgattcgcct gctgtgctcc acttcaatcc agtttttaag
2521 acaccagctg ggagagggat ggagagaaac actccataag tccataattc atgagatagc
2581 tcatctagat ctagagaaac ttgccactct gaaagcttcc agcaagtttg atgaaaagtg
2641 gtattcttac aacccctctg aaacttatca tcgaagcaaa gtcattgaga gagttgctga
2701 ttatgtaaat gacaagacat cccatgtttt ccaaattctg gagggatgcc ttttgaaggt
2761 tgagtctcgt ggctgcatgc acatctgcct gtttaagaaa ccacaacatg gagggttgag
2821 agagatttat gttcttggat ttgaggaacg tgtggtccaa ctggtcctag agaccattgc
2881 cagatgcatc tgcaagcatt tcccttctga aacactgact aaccccaagc acaagttaac
2941 aattccagaa tctcatggca gatttgcgat gaagatctgc gggaatcaac accagacagt
3001 ggggacctcc gatgatgcca gaacatggaa tcagggacat catgtttcaa aattcgctct
3061 cttgttgatg agtttcacaa aatctgagct gcatccattc gtttttaggg cttgcagcct
3121 tttcatgaag aagagaatca tgttggatca aaacttactc cggatccttg agacaaactc
3181 taaccttctg actgatgatg aaaccttgag aaacctgcac tctgtttacc atggcaacga
3241 acatgtgtca tggatgacat tcaaaggcgg atttattcag acagagacag gcatgatgca
3301 aggcatcctt cacttcacgt ctagtttgtt ccacacaatc ttgcaaacat ggatgaagcg
3361 agtagttgca gggtcgctta aagccattct aggaattaac agcacgctca atcctcatgt
3421 tgacgtgctc cagagttctg acgacagtgg aatgttggtt tccttcccta ctgatgatcc
3481 taccctcacc atgagatgca gacagaaaac tgctactctt tttgaataca aaaggaaggt
3541 ggggaagttg ataggaatct acccctctgt caaaagcact agcaacaccc tctttgttct
3601 cgagttcaat tctgaattct tctttcacac aaaccacaac cggcctgttt tccgatggat
3661 tgcagcagca ggtactatct cagagcagga aagtttggca gctaggcaag aagaaatgag
3721 taacaattta acatcagtgt tagagggagg agggagcttc tcattagtgt ccttctgcca
3781 gtacagccag atgctcctcc attacgtttt actaggtatg actgtctctc cagttttcct
3841 tgagttcatg tgctctgcaa gggagtacat ggatccagct ttaggtttct tcctcatgga
3901 cccacccttt tctcctggcc tgtgtggatt taagtacaat ctgtgggttg cctgcaggaa
3961 gggcagattg ggtctgaaat acagatactt tctgaacatc atggatggac ttgccacccc
4021 tgaggagaaa aaggccagct ggaagtgcct ggataccaca tcatcaggaa cctttgtgca
4081 agctgttcta atccgatggg gcgacaggaa gaagtgggaa agactggtga aaaaggttgt
4141 gactgaggat gactgggttg acagaattga tgagaatcct ttcctgctgt acaggagacc
4201 aatgacagga caagaagtgc ggctcaaact ggctgtaaag atgcactctc ctggagtggc
4261 aagttcactc tcaaaaggca atgccgttgt gaggataatt gcctcatcag tctacattct
4321 gaccagggca gtcttgagcg acaacctgat gatgctagag gaaaatagga ttgccaagaa
4381 aacaagcctc ttgcagaggg ttttaggatt caatggcctt cttggctcat caggtcctca
4441 tctcactgaa gatcagttct tgttgctgtt ccctcatcat caagattact tgggcatcag
4501 tgacagactg ggccaactgc ggagcatagc cggacgtttt tctgccaaga aaactcatat
4561 cacccagact agaattgaaa tcatagaaaa agaacggttc atgaaggtga ggccagagga
4621 tttagttagt gacatgtggt ttggtactag taggagccgg gtaaacacaa aacaatttga
4681 gaaagaatgg atcttgctca agaccactgt ggagtggttg cgagacacag ctgaagaaac
4741 tctgactctc agtccctttt ctcaccatcc tgcattacaa aatttcttct caaggcttga
4801 aacgaaaggt cgctctgttc gcatcactgg cagccccata aagcaaagat ctggagtttc
4861 aaacatgatg actgctgttc gagacaactt cttcccagga ttcatcctct ctgatgttta
4921 tgactcagca gggttagaga gatcagaatc tgctggcttg atgaggcact gtatcttcct
4981 gattcttgct gggccatata ctgaatacag gaagaccaag atggttgagg aagtgtgttg
5041 ttctttacca accatctctt tcaaactgaa ccagtataaa tccaggatta actctttggc
5101 tctcattcag cactttctta agaacccaaa tgatgaaaaa atctttgacc acatctgcaa
5161 caccaacagc ggagtgatag ggggctttac ccaacctcag aaaagcagac cactaggtca
5221 gggaagatta tactatggcc ctggagtctg gagaggtcta gtttatggaa tgaacatcca
5281 aattgaagtc aatagtcctc ctgactcaga ttacacttat ctgcaagctg tcacgattga
5341 tcatgacagc agtaaagatt tcctgcctgg cttccttaaa acctggtgtg aagagatgaa
5401 tgtcacaaat ttgtactccc ccaggtactc cagaagcaag aaactattat tcttcatcta
5461 caacttctca gtaaaaccat tgaggaaccc ctctggatgc cctgtgtaca cagagagctt
5521 taaactcttc accaattcaa gtcttaggat tgacctatta ggatttaaag tgaggagcag
5581 tgttctaaac tttaggtatt atgaaaatga acgagaaaga gggaagagca atggaagagg
5641 aatgaacttg gtgagtttct ttagcaggga ttctgatgct ggacttgatg aagcgtcttc
5701 attaagctct ctcatggatc agaagatctt ctctttctct aacaatgagc ctagcacatc
5761 atggatgaca atgagatcac taagttctgt ttccctgaac atactcctct caaagagtag
5821 tgaatcccac atgctcagag ctgggattga taaggaggtc ttgaagcgat gcataaagga
5881 ggccctaatc agctctctca agagaatggg ggttttcttg tcagatttaa aggaagcagt
5941 tgacaaaatg acagactatg cctacaccac tgcaatggag gattgtttca actttgcttt
6001 cgaaattact gaaatcacgt ctgatgactc tgatctcttc ctgaatgagc ctcctcagac
6061 tgcccagtgg gacccaacag actttgaact tgacatgtct gatctagggc cctttggttc
6121 aatggccatt gaagaggcaa caaacacaag attttaccac caccggataa tggaagatgt
6181 agccagaaag atggttgcaa ccctgggcca cagaggagtg agggacttga taatcaacag
6241 tacataccct agggtccaca aagagctagt tcaggagtgg tgctcatatc ttgaaataaa
6301 ttttgagtcc ctggctgcaa aggaggaaga ggcttttggc atagctctgg gaccagttat
6361 tggtcttgaa cagattggtt aatttggctc tcatttcttt gtggaaattg agtttgattt
6421 tggtgttcag atccttcgtg t
//