LOCUS       OR149007                5590 bp    DNA     circular PHG 27-AUG-2023
DEFINITION  Microvirus D_HF6_100, complete genome.
ACCESSION   OR149007
VERSION     OR149007.1
DBLINK      BioProject: PRJNA956591
            BioSample: SAMN35328321
            Sequence Read Archive: SRR24738785
KEYWORDS    .
SOURCE      Microvirus D_HF6_100
  ORGANISM  Microvirus D_HF6_100
            Viruses; Monodnaviria; Sangervirae; Phixviricota;
            Malgrandaviricetes; Petitvirales; Microviridae.
REFERENCE   1  (bases 1 to 5590)
  AUTHORS   Paietta,E.N., Kraberger,S., Custer,J.M., Vargas,K.L., Epsy,C.,
            Ehmke,E., Yoder,A.D. and Varsani,A.
  TITLE     Characterization of diverse anelloviruses, cressdnaviruses, and
            phages in the human oral virome in North Carolina
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 5590)
  AUTHORS   Paietta,E.N., Kraberger,S., Custer,J.M., Vargas,K.L., Epsy,C.,
            Ehmke,E., Yoder,A.D. and Varsani,A.
  TITLE     Direct Submission
  JOURNAL   Submitted (16-JUN-2023) The Biodesign Center of Fundamental and
            Applied Microbiomics, Arizona State University, 1001 S. McAllister
            Ave, Tempe, AZ 85287-5001, USA
COMMENT     ##Assembly-Data-START##
            Assembly Method       :: MegaHit v. 1.2.9
            Sequencing Technology :: Illumina
            ##Assembly-Data-END##
FEATURES Location/Qualifiers
source 1..5590
/organism="Microvirus D_HF6_100"
/mol_type="genomic DNA"
/isolate="D_HF6_100"
/isolation_source="saliva (passive drool collection)"
/host="Homo sapiens"
/db_xref="taxon:3071187"
/country="USA: Durham, NC"
/collection_date="30-Aug-2021"
CDS 1..1779
/codon_start=1
/transl_table=11
/product="major capsid protein"
/protein_id="WMC01534.1"
/translation="MDALKNNVNFSKFDLSHTHKTSMDMGQLVPIACIPTLPGDKINV
DVDAFIRGMPTLVPIMDKVDIKINHFYVPYRVLWSRFEEFISHSDRHKLRNDEKPTMP
VFDTNQLFAVFQRGVNGSGVSADAFKVGTKDYKDSLKSLTKKYKLGRLLNYLSMDNMI
TSNNSGNPTKKDKDLVSLMPVLAYNRIFLDYYAPQRWLNYFQSNNKPHWFMELSKLLE
EIKNSNNYLLDASKVGSESKYDIFKLFGLGSVVDASNGVSHLFQDKIFNLFSLKNSYW
NQDYFTSALPEPTLFGDIKLPLFNEDIPDNQKHLIASGGARVEFASSGNYANESDIKH
NNSVLSTIRDLRKAVSLQHYFETLSQAGGRYLETMEVMFGQRLPDDMLNYSEYLGGSV
IPLFVNEVEQTAPYESKAGDKTYLGDLSGKPVGAGSGENIFFEADEYGIYMAIAHIVP
KRSYYNLGLRYWRELEPLDLPNPAFEGLGDQAVYRYEIGSALAQNAWDVFGYVPRYAH
YKTVLDKFSGEMEHSLKQWHLGDYSYGQAKDAQLNINPESFMCAPRNDIFHVPDEPDK
FICTYNLKIDAVRPLSYEAPVGVSRI"
CDS 1999..2385
/codon_start=1
/transl_table=11
/product="hypothetical protein"
/protein_id="WMC01532.1"
/translation="MDNNNKKNNVMRRETNSYKDYDFDLEEYQKENQKLDKDIKQENY
VKPVDTSVSAMVSRGIVADNSQLVYGEDIPPMSKMSLMELHKMKQLYSDKVQGLEADI
KYQEENYKRLQELKAQELKDVEPKTE"
CDS 2388..3344
/codon_start=1
/transl_table=11
/product="DNA pilot protein"
/protein_id="WMC01531.1"
/translation="MDWISTAIAGVSQGVNALLQSGQNRKNREFQERENEKARQFAVQ
QWNAQNEYNLPTNQMTRLRDAGINPHLAYSNGTPMNSSNAPAAPSGVGSLPPGEAPKF
NLGELYQTLLTKSQIKNMDADTAKKEAEKREVEARTENTTTDTEIKKVELNHKDREIM
AKINVDEQQVEESKSRIESSQVANRKMEQEIENLKSQKNLTDQQVENLKKTIYLIMAQ
IDNTNADTKLKEAQRETELVRKGNVEADTRLKNAQVIGQNINNMFSPMLLSATVTKVW
TEIKKIGVDIGHIQHQELSSIVGALLKYVRDGGNSMYMSSLP"
CDS 3346..4752
/codon_start=1
/transl_table=11
/product="replication initiator protein"
/protein_id="WMC01535.1"
/translation="MCLDPKVIKSKTRGAENYIGSLYYSDDRGNPYTVVPCGKCIECR
NLYIEQWQIRWKEQIKDSVENSCYMLTLTYNDENLPTEVIDEETGEVISEVTTLRYSD
VTKFLKRLRKRQDKYIKENGLDHVAIKYHYCGEYGTKFTRRPHYHMLITNVIIPIDGI
GNFKNNTFNDIWKNGHVHIGTDVTEKSMRYILKYTLKNVYNQDEKETIQETKTIARSY
CGATCFDDVPEFHKFSEREIIDYWSDKLERDRNIYFDLPFNRSDSESLNSFALEFLEQ
KKKEVETIQEEFKVRNICRIYRTGKNKGRVVEKAICSKGIGKGYLTEKNIGYHKSNLD
LGYMDYEDGKGWKERPLPRYYRDYIFNPILKIDEKKEYYRSIGIEPTKEVLKRQIRKY
RECQEDYEDTLIYKKRVMMYRRNISEYLEILEKIDKVGEENYYFEINSYKNVKSQQYM
SNLAKYLAGVSYREPEFM"
CDS 5287..5415
/codon_start=1
/transl_table=11
/product="hypothetical protein"
/protein_id="WMC01533.1"
/translation="MKIIWIKIVDVIVKNVLPVVVNILVDLLETKVENAKKKVELV"
ORIGIN
1 atggatgctt taaaaaataa tgtaaacttt agtaagtttg atttatccca tactcacaag
61 acaagtatgg atatgggtca attagttcct atcgcgtgta ttcctacttt gcctggagac
121 aagataaatg tagatgtaga cgcttttatt agaggtatgc cgacattagt tccgataatg
181 gataaagtgg atataaaaat taatcatttc tatgtcccgt atagagttct atggtctaga
241 tttgaggagt ttatttctca ttctgatagg cataaattga gaaatgatga aaaacctacg
301 atgcctgtat ttgatactaa tcagttgttt gctgtatttc aaagaggtgt aaatggttct
361 ggtgtttcag cggatgcctt taaagtaggt actaaggatt ataaagattc tttaaaatca
421 ttaactaaga agtataaact aggtagatta ttaaattatc tatctatgga taatatgatt
481 acttctaata attcaggaaa tcctacaaag aaagacaaag atttggtttc tcttatgcct
541 gttctggcat acaatagaat attcttagac tattatgcac cgcaaagatg gttaaattat
601 ttccaaagta acaataagcc tcactggttc atggagcttt ctaagttgtt agaggagata
661 aaaaatagta ataattatct gttagatgct tcaaaagtag gtagtgaaag taaatatgat
721 atatttaaat tatttggatt aggttctgta gtagatgcta gtaatggtgt gagtcattta
781 tttcaggaca agatatttaa tttatttagt ttaaaaaatt catactggaa tcaggactat
841 tttacttctg ctcttccgga gcctactctg tttggtgata ttaaattacc attgtttaac
901 gaggatattc cggacaatca gaaacactta attgcttccg gtggtgcccg tgtagagttc
961 gcaagttctg gaaattatgc caacgagtca gatattaagc ataataatag tgtactgtct
1021 actattagag acctaagaaa ggctgtaagt cttcagcatt attttgagac gctttctcaa
1081 gctggtggta gatacctgga aacaatggaa gttatgttcg gtcaaagatt accggatgat
1141 atgctaaatt attctgaata tctcggaggt tctgttattc cattatttgt aaatgaagta
1201 gagcaaaccg ctccatatga gagtaaagct ggtgataaaa cctatttagg tgatttatca
1261 ggaaaacctg tcggagctgg tagcggtgaa aatatattct ttgaagctga tgaatacggt
1321 atttacatgg caattgctca catagtacct aaaagaagtt actataattt aggtttaaga
1381 tactggagag agttggaacc attggattta ccaaatccag cgtttgaagg tttaggagac
1441 caggctgttt atagatacga aataggtagc gcgttggcac aaaatgcctg ggatgttttc
1501 ggatatgttc cgaggtacgc tcattataaa acagtattag ataaattttc tggagaaatg
1561 gagcactcgt taaaacagtg gcatttagga gattactcct atggacaagc taaagatgca
1621 caattaaata taaatcctga gtcgtttatg tgtgccccta gaaatgatat attccatgtt
1681 ccggatgaac cggacaagtt tatttgtaca tataatttga aaatagatgc tgtacgccct
1741 ctatcttatg aggcaccagt aggagtaagt agaatttaga atgattatat aagtttaatt
1801 tttaattagt taattatgag tagaggaaag cgaagatata acctatcccg tggaggtttt
1861 aggttatctt aagaaaattg atgtgttcta tgtttaggga gttaagagcg aataatagta
1921 gtttttgctc ccttttttac aaaaaaggcg cacgcccccc ttgattaaag tagtgcgcat
1981 tgacacctaa tattgtcaat ggataataac aataaaaaaa ataatgtaat gagaagagaa
2041 acaaacagtt acaaagatta cgattttgat ttagaagagt atcaaaaaga aaatcagaag
2101 ttagataagg atattaagca agaaaactat gtaaaacctg tagatacttc cgtttctgcg
2161 atggtttccc gtggtatagt tgctgataat agtcagttag tgtatggcga ggatatccct
2221 cctatgtcta agatgtcatt gatggagttg cataaaatga aacaactgta ttctgataaa
2281 gttcaaggat tagaggctga tataaaatac caggaagaaa attataaaag acttcaagag
2341 ttaaaagctc aagagttaaa agatgttgaa ccaaaaactg aatagttatg gactggattt
2401 ctacggcaat agctggagtt tcgcaaggag ttaacgcctt gttgcaatct ggacaaaaca
2461 ggaaaaatag agaattccaa gagcgagaaa atgaaaaagc tagacagttc gctgttcagc
2521 aatggaatgc gcaaaatgaa tataatttac ctactaatca aatgacaagg ttgagagatg
2581 ctggaataaa tccacatctg gcatattcaa acggtacgcc tatgaatagt tctaatgctc
2641 cggctgctcc ttctggagta ggttctctcc cacctggtga agcccctaag tttaatttag
2701 gtgagttata ccaaacctta ttaactaagt ctcagataaa gaatatggat gcagacactg
2761 ccaagaaaga agctgaaaaa agagaggtag aagctcggac tgaaaatact actactgata
2821 cggaaattaa gaaagtagag ctaaaccata aggatagaga gattatggct aaaataaatg
2881 tagatgagca acaggtagaa gaaagtaagt caagaataga gagctctcag gtagcaaatc
2941 gtaagatgga acaggagata gaaaatttga aatctcaaaa aaatcttaca gaccagcagg
3001 ttgaaaatct taaaaagaca atttatctaa tcatggctca aatagataat acaaatgctg
3061 atactaaact aaaagaagct caaagagaga ccgaattagt gcgtaaaggt aatgtagaag
3121 ctgatactag gctaaagaat gcacaggtaa taggtcagaa tataaataac atgttttctc
3181 ctatgctgtt aagtgctact gttactaaag tctggactga gataaagaaa ataggtgtag
3241 atattggaca tatacaacat caggaattga gttctattgt tggagcttta ttgaaatatg
3301 ttagagatgg tggaaattct atgtatatga gttctttacc ttaaaatgtg tcttgatcct
3361 aaagtcataa agtctaaaac tcgcggagct gagaattaca tagggagcct ctattatagt
3421 gatgatagag gtaatcccta tactgttgtg ccctgtggaa agtgtataga atgtaggaac
3481 ctgtatattg aacagtggca aatccgttgg aaagaacaga taaaagatag tgtagagaat
3541 tcctgctata tgcttacttt gacttataac gatgagaatc tacctacgga agtaatagat
3601 gaggagactg gagaagttat tagtgaagtt acaaccttaa gatatagtga tgttactaag
3661 ttcttaaaga gattaagaaa aagacaggat aagtatatta aagaaaatgg attagatcat
3721 gttgctataa aatatcatta ctgcggtgaa tatggtacga agttcaccag acgccctcat
3781 tatcatatgt tgattacaaa tgtgatcatt ccgatagatg gtataggtaa ttttaaaaat
3841 aataccttta atgatatctg gaaaaatgga catgttcata taggaacgga tgtaacggaa
3901 aaatctatga ggtatatttt aaaatatacc cttaaaaatg tttataatca ggatgaaaag
3961 gaaacaatac aagagaccaa gacgattgcg cgaagttatt gcggagctac ttgcttcgat
4021 gatgtgccag aattccacaa gttcagtgaa agagaaataa tagattattg gagtgataag
4081 cttgaaagag atagaaatat atactttgat ttacccttta ataggtctga tagtgaaagc
4141 cttaatagtt ttgctctgga atttctggag cagaaaaaga aagaagtaga aactatccag
4201 gaagaattta aagtaagaaa tatttgtagg atatatagaa caggtaaaaa taaaggtcgt
4261 gtagttgaaa aagctatttg tagtaaaggt ataggtaaag gatatttaac tgaaaaaaat
4321 ataggttatc ataagtctaa tttagattta ggctatatgg attatgaaga tggtaaagga
4381 tggaaagagc gtccattacc gagatattat agagattata tttttaatcc aatcttaaaa
4441 atagatgaga aaaaagaata ttataggtct attggtatag agcctacaaa agaggttctg
4501 aaaagacaga taagaaaata tagagaatgc caggaagatt atgaagatac gctaatatat
4561 aaaaagcgtg taatgatgta caggcgtaat atttctgaat atctggaaat cttggaaaag
4621 atagataaag taggtgaaga aaattattat tttgaaatta attcttataa aaatgtaaaa
4681 tctcaacagt atatgtctaa tcttgctaaa tacttagcag gtgtaagcta tagagagcct
4741 gaatttatgt aaaacatgat aaaaatcatg taaatatatt tagaataata ttttatattt
4801 gtaatgttaa atttgattta atggactgga tattttggct tgttgctatg ttggttttcg
4861 tgttcttctt tataggagat cagaagaaaa acaagtaaat agtttttttt tttttttttt
4921 ttcggggttc gctcccgaaa gagcgagaaa aagaataagc gtaatgctgg taatcgtatg
4981 cctatggcta cgaaaatgag cctgtattat taatgtggtt gtttttgccc ttaatttata
5041 aaaattgtaa aaacggcaaa agcagaagtg aaacgacttt tgtcatttta aaatttttga
5101 tataaatttg cagaaagaaa cagatttaaa ataatacgca aattcgtgtt actcattggc
5161 gtaattgttt tttggactta tcattatgat atttatcagt tagcccctcc cctatcgcgg
5221 tttaactggc ttgaataaag gaaaaaaagg tgtaatttat attatgttaa attaaaatgt
5281 tttattatga aaattatttg gattaaaatc gttgatgtaa tcgttaaaaa tgtgttacca
5341 gttgtggtaa atattttggt tgatttatta gaaacaaagg tagaaaatgc gaaaaagaaa
5401 gttgaactgg tttaaaaagt cgtgcaggtt cgctctgctg ttcctgccct acttattcta
5461 tttgtttaca aagttcgtaa ctacgacaaa tgacggtata caaatgttat cttctaaatt
5521 gttagataag ttaaaattat ctgattatga gaagttgaag tatgatgaaa aatatgaaat
5581 actgtaatta
//