LOCUS NC_031139 3568 bp RNA linear VRL 13-AUG-2018
DEFINITION Huangpi Tick Virus 2 strain H114-17 glycoprotein precursor (G)
gene, complete cds.
ACCESSION NC_031139
VERSION NC_031139.1
DBLINK BioProject: PRJNA485481
KEYWORDS RefSeq.
SOURCE Huangpi Tick Virus 2
ORGANISM Huangpi Tick Virus 2
Viruses; Riboviria; Orthornavirae; Negarnaviricota;
Polyploviricotina; Ellioviricetes; Bunyavirales; Phenuiviridae;
Phlebovirus.
REFERENCE 1 (bases 1 to 3568)
AUTHORS Li,C.X., Shi,M., Tian,J.H., Lin,X.D., Kang,Y.J., Chen,L.J.,
Qin,X.C., Xu,J., Holmes,E.C. and Zhang,Y.Z.
TITLE Unprecedented genomic diversity of RNA viruses in arthropods
reveals the ancestry of negative-sense RNA viruses
JOURNAL Elife 4 (2015) In press
PUBMED 25633976
REMARK Publication Status: Available-Online prior to print
REFERENCE 2 (bases 1 to 3568)
CONSRTM NCBI Genome Project
TITLE Direct Submission
JOURNAL Submitted (23-SEP-2016) National Center for Biotechnology
Information, NIH, Bethesda, MD 20894, USA
REFERENCE 3 (bases 1 to 3568)
AUTHORS Li,C.-X., Shi,M., Tian,J.-H., Lin,X.-D., Kang,Y.-J., Qin,X.-C.,
Chen,L.-J., Xu,J. and Holmes,E.C.
TITLE Direct Submission
JOURNAL Submitted (28-SEP-2014) Department of Zoonosis, State Key
Laboratory for Infectious Disease Prevention and Control, National
Institute of Communicable Disease Control and Prevention, Chinese
Center for Disease Control and Prevention, Changping Liuzi 5,
Beijing, Beijing 102206, China
COMMENT PROVISIONAL REFSEQ: This record has not yet been subject to final
NCBI review. The reference sequence is identical to KM817707.
##Assembly-Data-START##
Sequencing Technology :: Sanger dideoxy sequencing
##Assembly-Data-END##
COMPLETENESS: full length.
FEATURES Location/Qualifiers
source 1..3568
/organism="Huangpi Tick Virus 2"
/mol_type="genomic RNA"
/strain="H114-17"
/host="Haemaphysalis sp."
/db_xref="taxon:1608048"
/segment="M"
/country="China"
/collection_date="2013"
gene 382..3420
/gene="G"
/locus_tag="BI075_sMgp1"
/db_xref="GeneID:29061204"
CDS 382..3420
/gene="G"
/locus_tag="BI075_sMgp1"
/codon_start=1
/product="glycoprotein precursor"
/protein_id="YP_009293591.1"
/db_xref="GeneID:29061204"
/translation="MFFQRITLFLALSGMAMGFLEDLIQHVRTASQPEKKQFLQGLPR
SHELSKGLVEVPTGLEVSKIGFLPMENFTDKLSDXVSREMDCSGGRKTFLALDPKNKR
ISNLTCQASHILSKDCQTCISGTPPVLNPPFSIILYDDMICQFETEATARMKQPTTSF
CSVAGNSLRECKGIVENTVEKISWILLKEKVVFLEGHSLSWREGPWLSLFDCKNTTDG
KSQCDLSECKSGRCTGDAPFCSQFACEQSNPVCRCSRNLIPGILHVTIGDNTVIPQCF
GHSKWVVQRSRRHVQASSPRSCIDCSVECGVDHVIVVVRHFNPGYYQMCLGPICYTGM
AKGKEFRVDIHPMSRISTEEIDMKIWSDTKADRFDLRTSCHHMSACDLINCFFCSANW
VNIHCFGKEKWVLISLVFSLVCIIVGMALKAVQRIVMFCIWVLKPIIWICTVCCKYTS
KKTFQKMLRMRDVMREIDEESGIDLPLLVPIAETPAPSSATPRENRLARKAKVMMLVS
ILSLFSRSEACSDSIKLISPAKDCQQIAPNKYSCTFTTTALIPVAPIGQVSCVLLTSQ
TGESLGIMKIKTLEARLSCLKSDLYWIPKATHQCLGARRCRLVGECINDACMKMSEND
YSSEWGARDQIMSRLGWSSCNPQCGGIGCGCFNVNPSCFYLRKTFINSESLIYKAFEC
PSWTHSIKVEIFFNRSSQEVLLLPDAPQKTPWGRVQLSSVSAAPNLGFSDCFFESQAG
EMFHAPCNRRGEVSTGKLGEIQCPTASDAMQISTNCFSDQSLIHHLINKDVVHCSSQL
LDPKEVLKKNKLPSTIGNVIFYPSSGSVYASSSTKISATILLRLNEVQMESVTDRNKC
SVRFVNLTGCYNCEAGAILKIETVTDFGTADAVISCPEIGLLTYIISTSVLSSVDTII
HLNRSKISTTCSATCPNSNENFKIDGELTYLKDVDFRHHNETTTPIIQKGKGGIDWFG
WLHFSWFQWIWTILFIGFIVISTVVGFLILKFFLVKIKRT"
misc_feature 637..1797
/gene="G"
/locus_tag="BI075_sMgp1"
/note="Phlebovirus glycoprotein G1; Region:
Phlebovirus_G1; pfam07243"
/db_xref="CDD:369282"
misc_feature 1936..2844
/gene="G"
/locus_tag="BI075_sMgp1"
/note="Phlebovirus glycoprotein G2; Region:
Phlebovirus_G2; pfam07245"
/db_xref="CDD:429364"
misc_feature 2947..3387
/gene="G"
/locus_tag="BI075_sMgp1"
/note="Phlebovirus glycoprotein G2 C-terminal domain;
Region: Phlebo_G2_C; pfam19019"
/db_xref="CDD:408789"
ORIGIN
1 acacaaagac ggcaagatgt tttttcaaag aatcaccctc tttctggcat tgtctggaat
61 ggcaatgggg ttcctggagg atctgatcca gcacatccgg actgcttctc agcctgagaa
121 aaagcagttc ttacaaggac ttcccaggtc tcacgaactc acacaaagac ggcaagatgt
181 tttttcaaag aatcaccctc tctctggcat tgtctggaat ggcaatgggg ttcctggagg
241 atctgatcca gcacgtccgg actgcttctc agcctgagaa aaagcagttc ttacaaggac
301 ttcccaggtc tcacgaactc tcaaaaggat tagttgaagt gcacaaagac ggcaagatgt
361 gtgtgacaca gagaaggcaa gatgtttttt caaagaatca ccctctttct ggcattgtct
421 ggaatggcaa tggggttcct ggaggatctg atccagcacg tccggactgc ttctcagcct
481 gagaaaaagc agttcttaca aggacttccc aggtctcacg aactctcaaa aggattagtt
541 gaagtgccta caggattgga ggttagcaaa ataggcttcc ttcccatgga gaattttact
601 gataagctat cagatrgagt cagtcgagag atggactgtt ctggtggtag gaaaacattt
661 ctggcactgg atcccaaaaa caagcggata agcaacctaa catgccaggc aagtcacatc
721 ctaagtaaag actgtcaaac ctgtattagt gggaccccac ccgtcttaaa cccgcctttc
781 agcatcatac tatacgatga catgatctgt cagtttgaga cggaagcaac agcacgaatg
841 aaacaaccaa caacatcatt ctgcagcgtg gcaggaaact cattgagaga gtgcaaaggc
901 attgtagaga acacagttga gaagatcagt tggattttgc tcaaagaaaa agtggttttc
961 ctggaaggac actcattgag ctggagagaa ggcccttggc tgtctttatt tgattgcaaa
1021 aatacaacgg atgggaaatc acaatgtgac ctttcagaat gtaagtcagg aagatgtaca
1081 ggagatgctc ccttttgttc ccaatttgct tgtgaacaat ctaacccagt ctgtcggtgc
1141 agtagaaacc tcataccagg aattttgcat gtcacgatag gcgacaacac agtcatacca
1201 caatgctttg gacactccaa atgggttgtt caacgatcaa gaaggcatgt ccaggcatca
1261 tcccccagat cgtgcattga ttgttctgtt gagtgtggtg tcgatcatgt gatcgtagtt
1321 gtacgacatt ttaatccagg atattaccag atgtgtttag ggccaatttg ttacactggg
1381 atggcaaaag ggaaagaatt cagagtagat attcatccta tgtctagaat tagcacggaa
1441 gagatcgaca tgaagatatg gtctgacacc aaagctgaca gatttgatct acgaacttca
1501 tgtcatcaca tgtctgcttg tgatcttatt aattgctttt tctgctcagc aaactgggtg
1561 aacattcact gttttggtaa ggagaaatgg gtgttgatat cactagtctt ttccctagtt
1621 tgcattatcg ttggcatggc tttaaaagct gtccaaagga ttgtgatgtt ctgcatttgg
1681 gtcttgaagc ctatcatctg gatttgcacg gtctgctgca aatacaccag taaaaaaaca
1741 tttcaaaaga tgctcagaat gagagatgtg atgagagaaa tagatgaaga atcaggcatt
1801 gatctaccac tactagtccc aattgctgag actcctgcac cgtcatcagc aacgcccaga
1861 gaaaacagac ttgctagaaa agcaaaagtg atgatgctgg tctctatact ctccctcttt
1921 tcaagaagtg aggcgtgttc tgattccatc aagctgatct cccctgcaaa ggattgccag
1981 caaattgcgc ccaataaata ctcatgcacc ttcactacca cagcactcat acctgtggct
2041 cccattggac aagtatcctg tgtattgttg accagccaga caggggagtc actgggaatc
2101 atgaaaatca agaccttgga ggcacgtctg agttgtctaa agagtgatct ttactggatt
2161 cctaaagcta cacatcagtg tttgggtgct aggagatgtc gattggtagg tgagtgcata
2221 aatgatgcat gcatgaaaat gagtgaaaat gactacagct ctgaatgggg tgctagggac
2281 cagatcatga gccggttagg atggagctct tgcaatcccc aatgtggtgg cataggctgt
2341 gggtgcttta atgtgaatcc aagctgtttt tacttgcgaa agaccttcat aaattctgaa
2401 tcccttattt ataaggcatt tgagtgcccc tcctggaccc attcaattaa agtagagata
2461 ttcttcaaca gatcaagtca agaggtccta ttgcttcctg atgctccaca aaagactcca
2521 tgggggcggg tgcagctatc ctcagttagt gcagccccca atcttggctt ttcagattgc
2581 ttttttgagt cacaagcagg ggagatgttc catgcaccat gcaaccgaag aggagaagtg
2641 tcgacaggga aactaggaga gatacaatgt cccacagcat ctgacgcaat gcagatctct
2701 actaattgtt tctcagatca atctctcatc catcatttga taaacaagga tgtggtccac
2761 tgttctagtc agctcctaga tccaaaggaa gtgctgaaga aaaacaaatt gccatccact
2821 atcggcaatg tcatattcta cccctcaagt ggatctgttt atgcatcatc atccacaaag
2881 atctcagcca ccatcctgct cagactcaat gaggtccaga tggagagtgt gactgatagg
2941 aataaatgtt ctgtaagatt tgtgaattta acaggctgtt ataattgtga agcaggagcc
3001 atccttaaaa tagagacagt cactgatttt gggactgcag atgcagttat ttcttgtcca
3061 gagattggtc tgctcacata catcatcagc acttctgttc tatcatcagt ggacaccata
3121 attcacctga atagatcaaa aatctctact acatgttccg caacgtgtcc caacagcaat
3181 gaaaatttca agatagatgg cgaattgact tacctgaaag atgttgattt tagacatcat
3241 aatgagacaa ccacccccat tattcaaaag ggtaaaggag gaattgattg gtttggctgg
3301 ctgcactttt catggtttca gtggatttgg acaatattat ttataggctt tatagtaata
3361 tcaactgttg tggggttctt gattttaaag ttcttccttg tcaaaatcaa gaggacgtaa
3421 attcaaactg gataccattt ttaatctggg cactctttta agagtgtcca aggcagtgcc
3481 tgaatccttt atggtattct ttcagcagat cacttttgca gatcagttct aactgtgtta
3541 ttcctttaaa tcttgcccgt ctctgtgt
//