GenomeNet

Database: RefSeq
Entry: NC_039221
LinkDB: NC_039221
Original site: NC_039221 
LOCUS       NC_039221               4024 bp    RNA     linear   VRL 04-JUN-2019
DEFINITION  Thiafora virus isolate AnD 11411 envelope glycoprotein gene,
            complete cds.
ACCESSION   NC_039221
VERSION     NC_039221.1
DBLINK      BioProject: PRJNA485481
KEYWORDS    RefSeq.
SOURCE      Orthonairovirus thiaforaense
  ORGANISM  Orthonairovirus thiaforaense
            Viruses; Riboviria; Orthornavirae; Negarnaviricota;
            Polyploviricotina; Ellioviricetes; Bunyavirales; Nairoviridae;
            Orthonairovirus.
REFERENCE   1  (bases 1 to 4024)
  CONSRTM   NCBI Genome Project
  TITLE     Direct Submission
  JOURNAL   Submitted (04-JUN-2019) National Center for Biotechnology
            Information, NIH, Bethesda, MD 20894, USA
COMMENT     PROVISIONAL REFSEQ: This record has not yet been subject to final
            NCBI review. The reference sequence is identical to KR537451.
            GenBank Accession Numbers KR537450-KR537452 represent sequences
            from the 3 segments of Thiafora virus.
            COMPLETENESS: full length.
FEATURES             Location/Qualifiers
     source          1..4024
                     /organism="Orthonairovirus thiaforaense"
                     /mol_type="genomic RNA"
                     /isolate="AnD 11411"
                     /host="Crocidura sp."
                     /db_xref="taxon:3052537"
                     /segment="M"
                     /country="Senegal"
                     /collection_date="17-Feb-1971"
     gene            42..3887
                     /locus_tag="D1Y41_sMgp1"
                     /db_xref="GeneID:37629384"
     CDS             42..3887
                     /locus_tag="D1Y41_sMgp1"
                     /codon_start=1
                     /product="envelope glycoprotein"
                     /protein_id="YP_009513192.1"
                     /db_xref="GeneID:37629384"
                     /translation="METKWVVAVALVMQLYQLLSLTVSSTSTTTPTVESTTSPNTSSA
                     TNATTGASGPTAQGISSSTSNQSVKSVDQIVSYAANIWDSTLEALFTKDTSCNKNLSW
                     CRLEVTDTHGLTPYVKHLYNLSNSKYNTFCQSKKGNFGFIKKEKFTFDVTTGPERMLL
                     KDIQCANVIYDGVTKDGYLLHVLFGGRRVFFYSCRYTIITKNCRIRASHDAPVPLPGY
                     GNWTTAMYSVFLEHRSAAETCKIYFPCLNKGKPLGNGGFKIKGFFVTGLTKPEGTGRK
                     LLTAADPEPDEDCGSASHLKQITNHHLVTDFKDGPGDVISICNGSTFFHGRMPDGLGC
                     HSIRSIKVSHHCGHHSTKCTVEPDLKACSHGKCILIRMSNRGMVKLTRGTTVETIRCG
                     TECMLPPLDGEGDILIDCPGGTQHFLQKNIIDLDCPNYPYFKEFMLYVCRASHRPKTT
                     IAFFAWLSVGYILLSTVCSISLWSLKIMCKGIELCKSKLPHNSGECKVCRQHYSSELG
                     QQLHEANCKNGLCPYCSNRLPENSLYKHAEVCPRRKVTEDTIREHDDFNSTPWFFVFI
                     FGISEYKGTTIKRATWLMILLALLLVSLSPVYGNELFFDGIGDGQLEKGLWDEEIGLV
                     ENCHQECFITETECLCPSFETGRKLLFFHLLNKQIRSGRKMKLLSSISLDTPWGVVKI
                     EKSFKPVLSMSNLQLSWTSEEEVGGKIVVSGKSTAVLKLKEKTGMVWELSSSKATEKK
                     TLIVSVMDYTQEYKTQLQYLTGDRLVSEWPRATCTGPCPDRCACHTSTCTWKAWPNSR
                     KWTCNPTWCWGVGTGCTCCGMDIEKPYQNYLAAKWSTEYIKTDVVVCVETSDEERHCD
                     IIQAGSRFHLGPITVLISDPQNVVKKLPSEVATIHKVQSSEVDVMHVDRILTANGLCK
                     LQSCTHGSPGDIQIFKPDYLVKHSISKRINAIEDHWWANDTWMSWQGTDLDYYCTTGS
                     WPTCTYSGVVRQNTDSFKNLETTEFDLLEEYFFHSSRVEVHGRTLNFPVKSRPKEGGG
                     ELTVLVEVNGLELHSKLIVPVGLMFKITSCKGCYSCSSGFICDVVLGVDSPPELTVHA
                     ECTNPNIVLTEGSLIARQGQTSTAKIKGFSVLKTTRLCIVLQESKVLEKLIEDCTELK
                     LDDPKDVIIERGSTLLSHQNDSCSYGLGCWLANVGTFGSGLSMIFQNYFGSVILGFLF
                     FVLPVVLLLVFFCLGDKIFFCRKFKWCFKSNKEDKEKFKQMVTELKQTNLIKKMKEEA
                     KTSWRGLANKALGKSTKEE"
     misc_feature    393..1787
                     /locus_tag="D1Y41_sMgp1"
                     /note="Nairovirus M polyprotein-like; Region:
                     Nairovirus_M; pfam07948"
                     /db_xref="CDD:285223"
     misc_feature    2112..3644
                     /locus_tag="D1Y41_sMgp1"
                     /note="Hantavirus glycoprotein G2; Region: Hanta_G2;
                     pfam01561"
                     /db_xref="CDD:426324"
ORIGIN      
        1 tctcaaagaa agtctagcgg caaactcgtc acagacttgc aatggagaca aaatgggtgg
       61 ttgctgttgc tctcgttatg cagctgtacc aacttctgag cctaactgtg tcgtcaacga
      121 gcactacgac ccctacagta gaatcgacca ccagcccaaa tacaagcagc gccacaaatg
      181 caaccacagg tgcctcgggc ccgactgccc agggcatttc gtcctcgact tccaaccagt
      241 ctgtgaagag tgtggatcag attgtgagtt atgcagcaaa catctgggat tcaacattgg
      301 aagctctgtt taccaaggac accagttgca ataagaactt gagttggtgc agactggagg
      361 tcacggatac gcatggtcta actccgtatg taaaacatct atacaacctc tctaatagca
      421 aatacaacac gttctgccaa agcaagaaag ggaactttgg attcataaaa aaagaaaaat
      481 tcacatttga tgtgaccact ggaccagaaa gaatgctgtt aaaagacatt caatgtgcaa
      541 acgtcatata tgatggtgtt actaaagacg gttacctgct gcatgttctt ttcggtggaa
      601 gacgtgtctt cttttatagt tgtaggtata ccatcataac aaaaaactgt aggattaggg
      661 caagccatga tgcaccagtt cctcttccag gctatggcaa ctggaccacg gcaatgtact
      721 ctgttttcct agagcataga agtgctgcag aaacatgcaa aatttatttc ccatgcttga
      781 acaaaggaaa gcctttgggt aatggaggtt ttaaaatcaa gggatttttc gtcactggtt
      841 taaccaagcc ggaagggaca gggaggaaac tactaactgc agctgaccct gaaccagatg
      901 aagactgcgg ctcagcttca cacctgaaac aaattacgaa ccaccacctt gtaacggact
      961 tcaaggacgg acctggtgat gtaataagta tttgcaatgg ttcaacgttc ttccatggaa
     1021 gaatgccaga tggattaggc tgtcacagta ttagaagtat aaaggtcagt catcactgtg
     1081 gacaccacag tacaaagtgc acagtcgagc ctgacttaaa agcatgcagc cacggtaaat
     1141 gtatattaat tagaatgagc aacaggggca tggtgaaact gaccagaggt accactgtgg
     1201 aaaccattag gtgtggtacc gaatgcatgc ttcctccgct cgatggtgaa ggtgacatat
     1261 tgattgactg tcctggagga acacaacact tcctccaaaa gaacattata gatttagatt
     1321 gcccaaatta tccctacttc aaggagttta tgctatatgt gtgcagagct tcccacagac
     1381 caaaaacaac catagccttc tttgcatggc tgtcggttgg ctacatcttg ctctccacag
     1441 tttgtagcat tagtttgtgg tcattgaaaa tcatgtgtaa aggtatagaa ctgtgtaaat
     1501 ctaaattgcc tcacaactct ggggaatgta aagtatgcag gcagcattat tcaagtgagc
     1561 ttggtcaaca gctccatgaa gcaaactgca aaaatgggct gtgtccttac tgctcaaaca
     1621 gacttcctga gaatagtttg tacaaacatg cggaggtgtg tccaaggaga aaagtaactg
     1681 aagacacaat cagagagcat gatgacttta attcaacccc ctggttcttt gtatttattt
     1741 ttgggattag tgaatacaaa ggcacaacaa ttaaaagagc aacttggttg atgattcttt
     1801 tggctctgct cttagtttca ctttccccag tgtatggcaa cgaactcttc tttgatggta
     1861 taggagatgg acaattagaa aaaggattat gggacgaaga gattggtcta gttgaaaact
     1921 gccaccaaga gtgttttatt acagaaactg agtgtctgtg cccaagtttt gaaacaggta
     1981 ggaaactact ttttttccat ttactaaata aacaaattag atctggaaga aaaatgaagc
     2041 tactaagcag tatctcatta gatactccct ggggtgtggt gaagatagag aagagcttca
     2101 aacctgtcct ctccatgtca aacttacagc tatcatggac aagtgaagaa gaggtgggag
     2161 gcaagattgt agtgtcagga aagtcaacag cagtgttgaa gctaaaagag aaaactggca
     2221 tggtctggga attatcctcc agtaaagcaa ctgagaaaaa aacacttatt gtttctgtga
     2281 tggattacac acaagaatac aaaacacagc tgcaatattt aacaggggac aggctggtgt
     2341 ctgagtggcc aagagctacc tgcacgggcc cctgtccgga taggtgtgct tgtcacactt
     2401 caacatgcac ttggaaagcc tggcctaaca gtaggaaatg gacttgcaat cctacttggt
     2461 gctggggagt tggtacaggt tgcacatgct gtggaatgga cattgaaaaa ccttatcaaa
     2521 actacttagc agctaagtgg agtacggagt atataaaaac agatgttgtt gtttgtgttg
     2581 aaacatctga tgaggaaagg cactgtgaca taatacaggc agggtcaaga ttccatctag
     2641 gaccaataac agttctaatt tcagacccac agaatgttgt aaaaaaactg ccttctgagg
     2701 ttgctaccat tcacaaagtt cagtcaagtg aagttgacgt aatgcatgta gacagaatcc
     2761 tcacggcaaa cggactttgc aaacttcaga gctgcacaca tggttctcca ggagacatac
     2821 agatcttcaa accggattac ttagtgaagc acagtatatc taaaagaatc aatgcaatag
     2881 aagatcattg gtgggcaaac gacacttgga tgtcatggca agggacagac ttggactact
     2941 actgcaccac aggaagctgg cccacatgca cctattctgg cgttgttaga cagaatactg
     3001 actctttcaa aaatttggag acaacagaat ttgacttact agaggagtat ttcttccatt
     3061 cttcaagggt tgaagttcat ggcagaactt taaacttccc tgtcaaatcg aggcccaaag
     3121 aagggggggg tgaactgaca gtgttggttg aagtaaatgg attagagcta cattctaaat
     3181 tgatagtgcc agtgggcctg atgtttaaaa taacatcgtg caaggggtgt tactcctgtt
     3241 cttctggatt catctgtgat gtagtgttgg gagtggatag tcctcctgag ttgactgtac
     3301 atgcagaatg cacaaaccca aacatagttt taacagaagg aagtctaatt gctagacaag
     3361 gacagacctc tactgcaaaa ataaaggggt tttcagttct taagactaca agactatgca
     3421 ttgtcctaca agagtctaag gtgttggaga aactaattga agactgtaca gagctcaaac
     3481 tagatgatcc caaagacgtt attattgaaa ggggaagtac cctgctttcc catcaaaatg
     3541 atagctgcag ctatggacta ggatgctggc tggctaacgt tggcactttt ggctccggac
     3601 tgagtatgat cttccagaat tattttggct cagttatctt aggctttctt ttctttgttt
     3661 tacctgtggt cttgttgtta gtctttttct gcttgggtga caagatattc ttctgtcgga
     3721 aatttaaatg gtgttttaag agcaacaagg aagacaaaga aaagtttaaa cagatggtta
     3781 cagagttaaa gcagaccaat cttataaaga aaatgaagga agaagccaag acaagttgga
     3841 gaggcttggc aaacaaagca ctaggaaagt caaccaagga agagtagcca caaatcatac
     3901 aaatccttta gtactttcaa tcattaatca cacttgtccg cacccaacag ccagccccca
     3961 ttctccggca gtagacacaa cacaaataca acagatgtga acgagtatgc cgcttctata
     4021 tctt
//
DBGET integrated database retrieval system