GenomeNet

Database: GenBank
Entry: OR149007
LinkDB: OR149007
LOCUS       OR149007                5590 bp    DNA     circular PHG 27-AUG-2023
DEFINITION  Microvirus D_HF6_100, complete genome.
ACCESSION   OR149007
VERSION     OR149007.1
DBLINK      BioProject: PRJNA956591
            BioSample: SAMN35328321
            Sequence Read Archive: SRR24738785
KEYWORDS    .
SOURCE      Microvirus D_HF6_100
  ORGANISM  Microvirus D_HF6_100
            Viruses; Monodnaviria; Sangervirae; Phixviricota;
            Malgrandaviricetes; Petitvirales; Microviridae.
REFERENCE   1  (bases 1 to 5590)
  AUTHORS   Paietta,E.N., Kraberger,S., Custer,J.M., Vargas,K.L., Epsy,C.,
            Ehmke,E., Yoder,A.D. and Varsani,A.
  TITLE     Characterization of diverse anelloviruses, cressdnaviruses, and
            phages in the human oral virome in North Carolina
  JOURNAL   Unpublished
REFERENCE   2  (bases 1 to 5590)
  AUTHORS   Paietta,E.N., Kraberger,S., Custer,J.M., Vargas,K.L., Epsy,C.,
            Ehmke,E., Yoder,A.D. and Varsani,A.
  TITLE     Direct Submission
  JOURNAL   Submitted (16-JUN-2023) The Biodesign Center of Fundamental and
            Applied Microbiomics, Arizona State University, 1001 S. McAllister
            Ave, Tempe, AZ 85287-5001, USA
COMMENT     ##Assembly-Data-START##
            Assembly Method       :: MegaHit v. 1.2.9
            Sequencing Technology :: Illumina
            ##Assembly-Data-END##
FEATURES             Location/Qualifiers
     source          1..5590
                     /organism="Microvirus D_HF6_100"
                     /mol_type="genomic DNA"
                     /isolate="D_HF6_100"
                     /isolation_source="saliva (passive drool collection)"
                     /host="Homo sapiens"
                     /db_xref="taxon:3071187"
                     /country="USA: Durham, NC"
                     /collection_date="30-Aug-2021"
     CDS             1..1779
                     /codon_start=1
                     /transl_table=11
                     /product="major capsid protein"
                     /protein_id="WMC01534.1"
                     /translation="MDALKNNVNFSKFDLSHTHKTSMDMGQLVPIACIPTLPGDKINV
                     DVDAFIRGMPTLVPIMDKVDIKINHFYVPYRVLWSRFEEFISHSDRHKLRNDEKPTMP
                     VFDTNQLFAVFQRGVNGSGVSADAFKVGTKDYKDSLKSLTKKYKLGRLLNYLSMDNMI
                     TSNNSGNPTKKDKDLVSLMPVLAYNRIFLDYYAPQRWLNYFQSNNKPHWFMELSKLLE
                     EIKNSNNYLLDASKVGSESKYDIFKLFGLGSVVDASNGVSHLFQDKIFNLFSLKNSYW
                     NQDYFTSALPEPTLFGDIKLPLFNEDIPDNQKHLIASGGARVEFASSGNYANESDIKH
                     NNSVLSTIRDLRKAVSLQHYFETLSQAGGRYLETMEVMFGQRLPDDMLNYSEYLGGSV
                     IPLFVNEVEQTAPYESKAGDKTYLGDLSGKPVGAGSGENIFFEADEYGIYMAIAHIVP
                     KRSYYNLGLRYWRELEPLDLPNPAFEGLGDQAVYRYEIGSALAQNAWDVFGYVPRYAH
                     YKTVLDKFSGEMEHSLKQWHLGDYSYGQAKDAQLNINPESFMCAPRNDIFHVPDEPDK
                     FICTYNLKIDAVRPLSYEAPVGVSRI"
     CDS             1999..2385
                     /codon_start=1
                     /transl_table=11
                     /product="hypothetical protein"
                     /protein_id="WMC01532.1"
                     /translation="MDNNNKKNNVMRRETNSYKDYDFDLEEYQKENQKLDKDIKQENY
                     VKPVDTSVSAMVSRGIVADNSQLVYGEDIPPMSKMSLMELHKMKQLYSDKVQGLEADI
                     KYQEENYKRLQELKAQELKDVEPKTE"
     CDS             2388..3344
                     /codon_start=1
                     /transl_table=11
                     /product="DNA pilot protein"
                     /protein_id="WMC01531.1"
                     /translation="MDWISTAIAGVSQGVNALLQSGQNRKNREFQERENEKARQFAVQ
                     QWNAQNEYNLPTNQMTRLRDAGINPHLAYSNGTPMNSSNAPAAPSGVGSLPPGEAPKF
                     NLGELYQTLLTKSQIKNMDADTAKKEAEKREVEARTENTTTDTEIKKVELNHKDREIM
                     AKINVDEQQVEESKSRIESSQVANRKMEQEIENLKSQKNLTDQQVENLKKTIYLIMAQ
                     IDNTNADTKLKEAQRETELVRKGNVEADTRLKNAQVIGQNINNMFSPMLLSATVTKVW
                     TEIKKIGVDIGHIQHQELSSIVGALLKYVRDGGNSMYMSSLP"
     CDS             3346..4752
                     /codon_start=1
                     /transl_table=11
                     /product="replication initiator protein"
                     /protein_id="WMC01535.1"
                     /translation="MCLDPKVIKSKTRGAENYIGSLYYSDDRGNPYTVVPCGKCIECR
                     NLYIEQWQIRWKEQIKDSVENSCYMLTLTYNDENLPTEVIDEETGEVISEVTTLRYSD
                     VTKFLKRLRKRQDKYIKENGLDHVAIKYHYCGEYGTKFTRRPHYHMLITNVIIPIDGI
                     GNFKNNTFNDIWKNGHVHIGTDVTEKSMRYILKYTLKNVYNQDEKETIQETKTIARSY
                     CGATCFDDVPEFHKFSEREIIDYWSDKLERDRNIYFDLPFNRSDSESLNSFALEFLEQ
                     KKKEVETIQEEFKVRNICRIYRTGKNKGRVVEKAICSKGIGKGYLTEKNIGYHKSNLD
                     LGYMDYEDGKGWKERPLPRYYRDYIFNPILKIDEKKEYYRSIGIEPTKEVLKRQIRKY
                     RECQEDYEDTLIYKKRVMMYRRNISEYLEILEKIDKVGEENYYFEINSYKNVKSQQYM
                     SNLAKYLAGVSYREPEFM"
     CDS             5287..5415
                     /codon_start=1
                     /transl_table=11
                     /product="hypothetical protein"
                     /protein_id="WMC01533.1"
                     /translation="MKIIWIKIVDVIVKNVLPVVVNILVDLLETKVENAKKKVELV"
ORIGIN      
        1 atggatgctt taaaaaataa tgtaaacttt agtaagtttg atttatccca tactcacaag
       61 acaagtatgg atatgggtca attagttcct atcgcgtgta ttcctacttt gcctggagac
      121 aagataaatg tagatgtaga cgcttttatt agaggtatgc cgacattagt tccgataatg
      181 gataaagtgg atataaaaat taatcatttc tatgtcccgt atagagttct atggtctaga
      241 tttgaggagt ttatttctca ttctgatagg cataaattga gaaatgatga aaaacctacg
      301 atgcctgtat ttgatactaa tcagttgttt gctgtatttc aaagaggtgt aaatggttct
      361 ggtgtttcag cggatgcctt taaagtaggt actaaggatt ataaagattc tttaaaatca
      421 ttaactaaga agtataaact aggtagatta ttaaattatc tatctatgga taatatgatt
      481 acttctaata attcaggaaa tcctacaaag aaagacaaag atttggtttc tcttatgcct
      541 gttctggcat acaatagaat attcttagac tattatgcac cgcaaagatg gttaaattat
      601 ttccaaagta acaataagcc tcactggttc atggagcttt ctaagttgtt agaggagata
      661 aaaaatagta ataattatct gttagatgct tcaaaagtag gtagtgaaag taaatatgat
      721 atatttaaat tatttggatt aggttctgta gtagatgcta gtaatggtgt gagtcattta
      781 tttcaggaca agatatttaa tttatttagt ttaaaaaatt catactggaa tcaggactat
      841 tttacttctg ctcttccgga gcctactctg tttggtgata ttaaattacc attgtttaac
      901 gaggatattc cggacaatca gaaacactta attgcttccg gtggtgcccg tgtagagttc
      961 gcaagttctg gaaattatgc caacgagtca gatattaagc ataataatag tgtactgtct
     1021 actattagag acctaagaaa ggctgtaagt cttcagcatt attttgagac gctttctcaa
     1081 gctggtggta gatacctgga aacaatggaa gttatgttcg gtcaaagatt accggatgat
     1141 atgctaaatt attctgaata tctcggaggt tctgttattc cattatttgt aaatgaagta
     1201 gagcaaaccg ctccatatga gagtaaagct ggtgataaaa cctatttagg tgatttatca
     1261 ggaaaacctg tcggagctgg tagcggtgaa aatatattct ttgaagctga tgaatacggt
     1321 atttacatgg caattgctca catagtacct aaaagaagtt actataattt aggtttaaga
     1381 tactggagag agttggaacc attggattta ccaaatccag cgtttgaagg tttaggagac
     1441 caggctgttt atagatacga aataggtagc gcgttggcac aaaatgcctg ggatgttttc
     1501 ggatatgttc cgaggtacgc tcattataaa acagtattag ataaattttc tggagaaatg
     1561 gagcactcgt taaaacagtg gcatttagga gattactcct atggacaagc taaagatgca
     1621 caattaaata taaatcctga gtcgtttatg tgtgccccta gaaatgatat attccatgtt
     1681 ccggatgaac cggacaagtt tatttgtaca tataatttga aaatagatgc tgtacgccct
     1741 ctatcttatg aggcaccagt aggagtaagt agaatttaga atgattatat aagtttaatt
     1801 tttaattagt taattatgag tagaggaaag cgaagatata acctatcccg tggaggtttt
     1861 aggttatctt aagaaaattg atgtgttcta tgtttaggga gttaagagcg aataatagta
     1921 gtttttgctc ccttttttac aaaaaaggcg cacgcccccc ttgattaaag tagtgcgcat
     1981 tgacacctaa tattgtcaat ggataataac aataaaaaaa ataatgtaat gagaagagaa
     2041 acaaacagtt acaaagatta cgattttgat ttagaagagt atcaaaaaga aaatcagaag
     2101 ttagataagg atattaagca agaaaactat gtaaaacctg tagatacttc cgtttctgcg
     2161 atggtttccc gtggtatagt tgctgataat agtcagttag tgtatggcga ggatatccct
     2221 cctatgtcta agatgtcatt gatggagttg cataaaatga aacaactgta ttctgataaa
     2281 gttcaaggat tagaggctga tataaaatac caggaagaaa attataaaag acttcaagag
     2341 ttaaaagctc aagagttaaa agatgttgaa ccaaaaactg aatagttatg gactggattt
     2401 ctacggcaat agctggagtt tcgcaaggag ttaacgcctt gttgcaatct ggacaaaaca
     2461 ggaaaaatag agaattccaa gagcgagaaa atgaaaaagc tagacagttc gctgttcagc
     2521 aatggaatgc gcaaaatgaa tataatttac ctactaatca aatgacaagg ttgagagatg
     2581 ctggaataaa tccacatctg gcatattcaa acggtacgcc tatgaatagt tctaatgctc
     2641 cggctgctcc ttctggagta ggttctctcc cacctggtga agcccctaag tttaatttag
     2701 gtgagttata ccaaacctta ttaactaagt ctcagataaa gaatatggat gcagacactg
     2761 ccaagaaaga agctgaaaaa agagaggtag aagctcggac tgaaaatact actactgata
     2821 cggaaattaa gaaagtagag ctaaaccata aggatagaga gattatggct aaaataaatg
     2881 tagatgagca acaggtagaa gaaagtaagt caagaataga gagctctcag gtagcaaatc
     2941 gtaagatgga acaggagata gaaaatttga aatctcaaaa aaatcttaca gaccagcagg
     3001 ttgaaaatct taaaaagaca atttatctaa tcatggctca aatagataat acaaatgctg
     3061 atactaaact aaaagaagct caaagagaga ccgaattagt gcgtaaaggt aatgtagaag
     3121 ctgatactag gctaaagaat gcacaggtaa taggtcagaa tataaataac atgttttctc
     3181 ctatgctgtt aagtgctact gttactaaag tctggactga gataaagaaa ataggtgtag
     3241 atattggaca tatacaacat caggaattga gttctattgt tggagcttta ttgaaatatg
     3301 ttagagatgg tggaaattct atgtatatga gttctttacc ttaaaatgtg tcttgatcct
     3361 aaagtcataa agtctaaaac tcgcggagct gagaattaca tagggagcct ctattatagt
     3421 gatgatagag gtaatcccta tactgttgtg ccctgtggaa agtgtataga atgtaggaac
     3481 ctgtatattg aacagtggca aatccgttgg aaagaacaga taaaagatag tgtagagaat
     3541 tcctgctata tgcttacttt gacttataac gatgagaatc tacctacgga agtaatagat
     3601 gaggagactg gagaagttat tagtgaagtt acaaccttaa gatatagtga tgttactaag
     3661 ttcttaaaga gattaagaaa aagacaggat aagtatatta aagaaaatgg attagatcat
     3721 gttgctataa aatatcatta ctgcggtgaa tatggtacga agttcaccag acgccctcat
     3781 tatcatatgt tgattacaaa tgtgatcatt ccgatagatg gtataggtaa ttttaaaaat
     3841 aataccttta atgatatctg gaaaaatgga catgttcata taggaacgga tgtaacggaa
     3901 aaatctatga ggtatatttt aaaatatacc cttaaaaatg tttataatca ggatgaaaag
     3961 gaaacaatac aagagaccaa gacgattgcg cgaagttatt gcggagctac ttgcttcgat
     4021 gatgtgccag aattccacaa gttcagtgaa agagaaataa tagattattg gagtgataag
     4081 cttgaaagag atagaaatat atactttgat ttacccttta ataggtctga tagtgaaagc
     4141 cttaatagtt ttgctctgga atttctggag cagaaaaaga aagaagtaga aactatccag
     4201 gaagaattta aagtaagaaa tatttgtagg atatatagaa caggtaaaaa taaaggtcgt
     4261 gtagttgaaa aagctatttg tagtaaaggt ataggtaaag gatatttaac tgaaaaaaat
     4321 ataggttatc ataagtctaa tttagattta ggctatatgg attatgaaga tggtaaagga
     4381 tggaaagagc gtccattacc gagatattat agagattata tttttaatcc aatcttaaaa
     4441 atagatgaga aaaaagaata ttataggtct attggtatag agcctacaaa agaggttctg
     4501 aaaagacaga taagaaaata tagagaatgc caggaagatt atgaagatac gctaatatat
     4561 aaaaagcgtg taatgatgta caggcgtaat atttctgaat atctggaaat cttggaaaag
     4621 atagataaag taggtgaaga aaattattat tttgaaatta attcttataa aaatgtaaaa
     4681 tctcaacagt atatgtctaa tcttgctaaa tacttagcag gtgtaagcta tagagagcct
     4741 gaatttatgt aaaacatgat aaaaatcatg taaatatatt tagaataata ttttatattt
     4801 gtaatgttaa atttgattta atggactgga tattttggct tgttgctatg ttggttttcg
     4861 tgttcttctt tataggagat cagaagaaaa acaagtaaat agtttttttt tttttttttt
     4921 ttcggggttc gctcccgaaa gagcgagaaa aagaataagc gtaatgctgg taatcgtatg
     4981 cctatggcta cgaaaatgag cctgtattat taatgtggtt gtttttgccc ttaatttata
     5041 aaaattgtaa aaacggcaaa agcagaagtg aaacgacttt tgtcatttta aaatttttga
     5101 tataaatttg cagaaagaaa cagatttaaa ataatacgca aattcgtgtt actcattggc
     5161 gtaattgttt tttggactta tcattatgat atttatcagt tagcccctcc cctatcgcgg
     5221 tttaactggc ttgaataaag gaaaaaaagg tgtaatttat attatgttaa attaaaatgt
     5281 tttattatga aaattatttg gattaaaatc gttgatgtaa tcgttaaaaa tgtgttacca
     5341 gttgtggtaa atattttggt tgatttatta gaaacaaagg tagaaaatgc gaaaaagaaa
     5401 gttgaactgg tttaaaaagt cgtgcaggtt cgctctgctg ttcctgccct acttattcta
     5461 tttgtttaca aagttcgtaa ctacgacaaa tgacggtata caaatgttat cttctaaatt
     5521 gttagataag ttaaaattat ctgattatga gaagttgaag tatgatgaaa aatatgaaat
     5581 actgtaatta
//
DBGET integrated database retrieval system