LOCUS NC_039221 4024 bp RNA linear VRL 04-JUN-2019
DEFINITION Thiafora virus isolate AnD 11411 envelope glycoprotein gene,
complete cds.
ACCESSION NC_039221
VERSION NC_039221.1
DBLINK BioProject: PRJNA485481
KEYWORDS RefSeq.
SOURCE Orthonairovirus thiaforaense
ORGANISM Orthonairovirus thiaforaense
Viruses; Riboviria; Orthornavirae; Negarnaviricota;
Polyploviricotina; Ellioviricetes; Bunyavirales; Nairoviridae;
Orthonairovirus.
REFERENCE 1 (bases 1 to 4024)
CONSRTM NCBI Genome Project
TITLE Direct Submission
JOURNAL Submitted (04-JUN-2019) National Center for Biotechnology
Information, NIH, Bethesda, MD 20894, USA
COMMENT PROVISIONAL REFSEQ: This record has not yet been subject to final
NCBI review. The reference sequence is identical to KR537451.
GenBank Accession Numbers KR537450-KR537452 represent sequences
from the 3 segments of Thiafora virus.
COMPLETENESS: full length.
FEATURES Location/Qualifiers
source 1..4024
/organism="Orthonairovirus thiaforaense"
/mol_type="genomic RNA"
/isolate="AnD 11411"
/host="Crocidura sp."
/db_xref="taxon:3052537"
/segment="M"
/country="Senegal"
/collection_date="17-Feb-1971"
gene 42..3887
/locus_tag="D1Y41_sMgp1"
/db_xref="GeneID:37629384"
CDS 42..3887
/locus_tag="D1Y41_sMgp1"
/codon_start=1
/product="envelope glycoprotein"
/protein_id="YP_009513192.1"
/db_xref="GeneID:37629384"
/translation="METKWVVAVALVMQLYQLLSLTVSSTSTTTPTVESTTSPNTSSA
TNATTGASGPTAQGISSSTSNQSVKSVDQIVSYAANIWDSTLEALFTKDTSCNKNLSW
CRLEVTDTHGLTPYVKHLYNLSNSKYNTFCQSKKGNFGFIKKEKFTFDVTTGPERMLL
KDIQCANVIYDGVTKDGYLLHVLFGGRRVFFYSCRYTIITKNCRIRASHDAPVPLPGY
GNWTTAMYSVFLEHRSAAETCKIYFPCLNKGKPLGNGGFKIKGFFVTGLTKPEGTGRK
LLTAADPEPDEDCGSASHLKQITNHHLVTDFKDGPGDVISICNGSTFFHGRMPDGLGC
HSIRSIKVSHHCGHHSTKCTVEPDLKACSHGKCILIRMSNRGMVKLTRGTTVETIRCG
TECMLPPLDGEGDILIDCPGGTQHFLQKNIIDLDCPNYPYFKEFMLYVCRASHRPKTT
IAFFAWLSVGYILLSTVCSISLWSLKIMCKGIELCKSKLPHNSGECKVCRQHYSSELG
QQLHEANCKNGLCPYCSNRLPENSLYKHAEVCPRRKVTEDTIREHDDFNSTPWFFVFI
FGISEYKGTTIKRATWLMILLALLLVSLSPVYGNELFFDGIGDGQLEKGLWDEEIGLV
ENCHQECFITETECLCPSFETGRKLLFFHLLNKQIRSGRKMKLLSSISLDTPWGVVKI
EKSFKPVLSMSNLQLSWTSEEEVGGKIVVSGKSTAVLKLKEKTGMVWELSSSKATEKK
TLIVSVMDYTQEYKTQLQYLTGDRLVSEWPRATCTGPCPDRCACHTSTCTWKAWPNSR
KWTCNPTWCWGVGTGCTCCGMDIEKPYQNYLAAKWSTEYIKTDVVVCVETSDEERHCD
IIQAGSRFHLGPITVLISDPQNVVKKLPSEVATIHKVQSSEVDVMHVDRILTANGLCK
LQSCTHGSPGDIQIFKPDYLVKHSISKRINAIEDHWWANDTWMSWQGTDLDYYCTTGS
WPTCTYSGVVRQNTDSFKNLETTEFDLLEEYFFHSSRVEVHGRTLNFPVKSRPKEGGG
ELTVLVEVNGLELHSKLIVPVGLMFKITSCKGCYSCSSGFICDVVLGVDSPPELTVHA
ECTNPNIVLTEGSLIARQGQTSTAKIKGFSVLKTTRLCIVLQESKVLEKLIEDCTELK
LDDPKDVIIERGSTLLSHQNDSCSYGLGCWLANVGTFGSGLSMIFQNYFGSVILGFLF
FVLPVVLLLVFFCLGDKIFFCRKFKWCFKSNKEDKEKFKQMVTELKQTNLIKKMKEEA
KTSWRGLANKALGKSTKEE"
misc_feature 393..1787
/locus_tag="D1Y41_sMgp1"
/note="Nairovirus M polyprotein-like; Region:
Nairovirus_M; pfam07948"
/db_xref="CDD:285223"
misc_feature 2112..3644
/locus_tag="D1Y41_sMgp1"
/note="Hantavirus glycoprotein G2; Region: Hanta_G2;
pfam01561"
/db_xref="CDD:426324"
ORIGIN
1 tctcaaagaa agtctagcgg caaactcgtc acagacttgc aatggagaca aaatgggtgg
61 ttgctgttgc tctcgttatg cagctgtacc aacttctgag cctaactgtg tcgtcaacga
121 gcactacgac ccctacagta gaatcgacca ccagcccaaa tacaagcagc gccacaaatg
181 caaccacagg tgcctcgggc ccgactgccc agggcatttc gtcctcgact tccaaccagt
241 ctgtgaagag tgtggatcag attgtgagtt atgcagcaaa catctgggat tcaacattgg
301 aagctctgtt taccaaggac accagttgca ataagaactt gagttggtgc agactggagg
361 tcacggatac gcatggtcta actccgtatg taaaacatct atacaacctc tctaatagca
421 aatacaacac gttctgccaa agcaagaaag ggaactttgg attcataaaa aaagaaaaat
481 tcacatttga tgtgaccact ggaccagaaa gaatgctgtt aaaagacatt caatgtgcaa
541 acgtcatata tgatggtgtt actaaagacg gttacctgct gcatgttctt ttcggtggaa
601 gacgtgtctt cttttatagt tgtaggtata ccatcataac aaaaaactgt aggattaggg
661 caagccatga tgcaccagtt cctcttccag gctatggcaa ctggaccacg gcaatgtact
721 ctgttttcct agagcataga agtgctgcag aaacatgcaa aatttatttc ccatgcttga
781 acaaaggaaa gcctttgggt aatggaggtt ttaaaatcaa gggatttttc gtcactggtt
841 taaccaagcc ggaagggaca gggaggaaac tactaactgc agctgaccct gaaccagatg
901 aagactgcgg ctcagcttca cacctgaaac aaattacgaa ccaccacctt gtaacggact
961 tcaaggacgg acctggtgat gtaataagta tttgcaatgg ttcaacgttc ttccatggaa
1021 gaatgccaga tggattaggc tgtcacagta ttagaagtat aaaggtcagt catcactgtg
1081 gacaccacag tacaaagtgc acagtcgagc ctgacttaaa agcatgcagc cacggtaaat
1141 gtatattaat tagaatgagc aacaggggca tggtgaaact gaccagaggt accactgtgg
1201 aaaccattag gtgtggtacc gaatgcatgc ttcctccgct cgatggtgaa ggtgacatat
1261 tgattgactg tcctggagga acacaacact tcctccaaaa gaacattata gatttagatt
1321 gcccaaatta tccctacttc aaggagttta tgctatatgt gtgcagagct tcccacagac
1381 caaaaacaac catagccttc tttgcatggc tgtcggttgg ctacatcttg ctctccacag
1441 tttgtagcat tagtttgtgg tcattgaaaa tcatgtgtaa aggtatagaa ctgtgtaaat
1501 ctaaattgcc tcacaactct ggggaatgta aagtatgcag gcagcattat tcaagtgagc
1561 ttggtcaaca gctccatgaa gcaaactgca aaaatgggct gtgtccttac tgctcaaaca
1621 gacttcctga gaatagtttg tacaaacatg cggaggtgtg tccaaggaga aaagtaactg
1681 aagacacaat cagagagcat gatgacttta attcaacccc ctggttcttt gtatttattt
1741 ttgggattag tgaatacaaa ggcacaacaa ttaaaagagc aacttggttg atgattcttt
1801 tggctctgct cttagtttca ctttccccag tgtatggcaa cgaactcttc tttgatggta
1861 taggagatgg acaattagaa aaaggattat gggacgaaga gattggtcta gttgaaaact
1921 gccaccaaga gtgttttatt acagaaactg agtgtctgtg cccaagtttt gaaacaggta
1981 ggaaactact ttttttccat ttactaaata aacaaattag atctggaaga aaaatgaagc
2041 tactaagcag tatctcatta gatactccct ggggtgtggt gaagatagag aagagcttca
2101 aacctgtcct ctccatgtca aacttacagc tatcatggac aagtgaagaa gaggtgggag
2161 gcaagattgt agtgtcagga aagtcaacag cagtgttgaa gctaaaagag aaaactggca
2221 tggtctggga attatcctcc agtaaagcaa ctgagaaaaa aacacttatt gtttctgtga
2281 tggattacac acaagaatac aaaacacagc tgcaatattt aacaggggac aggctggtgt
2341 ctgagtggcc aagagctacc tgcacgggcc cctgtccgga taggtgtgct tgtcacactt
2401 caacatgcac ttggaaagcc tggcctaaca gtaggaaatg gacttgcaat cctacttggt
2461 gctggggagt tggtacaggt tgcacatgct gtggaatgga cattgaaaaa ccttatcaaa
2521 actacttagc agctaagtgg agtacggagt atataaaaac agatgttgtt gtttgtgttg
2581 aaacatctga tgaggaaagg cactgtgaca taatacaggc agggtcaaga ttccatctag
2641 gaccaataac agttctaatt tcagacccac agaatgttgt aaaaaaactg ccttctgagg
2701 ttgctaccat tcacaaagtt cagtcaagtg aagttgacgt aatgcatgta gacagaatcc
2761 tcacggcaaa cggactttgc aaacttcaga gctgcacaca tggttctcca ggagacatac
2821 agatcttcaa accggattac ttagtgaagc acagtatatc taaaagaatc aatgcaatag
2881 aagatcattg gtgggcaaac gacacttgga tgtcatggca agggacagac ttggactact
2941 actgcaccac aggaagctgg cccacatgca cctattctgg cgttgttaga cagaatactg
3001 actctttcaa aaatttggag acaacagaat ttgacttact agaggagtat ttcttccatt
3061 cttcaagggt tgaagttcat ggcagaactt taaacttccc tgtcaaatcg aggcccaaag
3121 aagggggggg tgaactgaca gtgttggttg aagtaaatgg attagagcta cattctaaat
3181 tgatagtgcc agtgggcctg atgtttaaaa taacatcgtg caaggggtgt tactcctgtt
3241 cttctggatt catctgtgat gtagtgttgg gagtggatag tcctcctgag ttgactgtac
3301 atgcagaatg cacaaaccca aacatagttt taacagaagg aagtctaatt gctagacaag
3361 gacagacctc tactgcaaaa ataaaggggt tttcagttct taagactaca agactatgca
3421 ttgtcctaca agagtctaag gtgttggaga aactaattga agactgtaca gagctcaaac
3481 tagatgatcc caaagacgtt attattgaaa ggggaagtac cctgctttcc catcaaaatg
3541 atagctgcag ctatggacta ggatgctggc tggctaacgt tggcactttt ggctccggac
3601 tgagtatgat cttccagaat tattttggct cagttatctt aggctttctt ttctttgttt
3661 tacctgtggt cttgttgtta gtctttttct gcttgggtga caagatattc ttctgtcgga
3721 aatttaaatg gtgttttaag agcaacaagg aagacaaaga aaagtttaaa cagatggtta
3781 cagagttaaa gcagaccaat cttataaaga aaatgaagga agaagccaag acaagttgga
3841 gaggcttggc aaacaaagca ctaggaaagt caaccaagga agagtagcca caaatcatac
3901 aaatccttta gtactttcaa tcattaatca cacttgtccg cacccaacag ccagccccca
3961 ttctccggca gtagacacaa cacaaataca acagatgtga acgagtatgc cgcttctata
4021 tctt
//