GenomeNet

Database: RefSeq
Entry: NC_039089
LinkDB: NC_039089
Original site: NC_039089 
LOCUS       NC_039089               8017 bp    DNA     circular VRL 24-AUG-2018
DEFINITION  Human papillomavirus type 71 DNA, complete genome.
ACCESSION   NC_039089
VERSION     NC_039089.1
DBLINK      BioProject: PRJNA485481
KEYWORDS    RefSeq.
SOURCE      human papillomavirus 71
  ORGANISM  human papillomavirus 71
            Viruses; Monodnaviria; Shotokuvirae; Cossaviricota;
            Papovaviricetes; Zurhausenvirales; Papillomaviridae;
            Firstpapillomavirinae; Alphapapillomavirus; Alphapapillomavirus 14.
REFERENCE   1
  AUTHORS   Matsukura,T. and Sugase,M.
  TITLE     Relationships between 80 human papillomavirus genotypes and
            different grades of cervical intraepithelial neoplasia: association
            and causality
  JOURNAL   Virology 283 (1), 139-147 (2001)
   PUBMED   11312670
REFERENCE   2  (bases 1 to 8017)
  CONSRTM   NCBI Genome Project
  TITLE     Direct Submission
  JOURNAL   Submitted (24-AUG-2018) National Center for Biotechnology
            Information, NIH, Bethesda, MD 20894, USA
REFERENCE   3  (bases 1 to 8017)
  AUTHORS   Matsukura,T.T. and Sugase,M.
  TITLE     Direct Submission
  JOURNAL   Submitted (22-MAR-2000) Toshihiko T. Matsukura, National Institute
            of Infectious Diseases, Department of Virology II; 1-23-1 Toyama,
            Shinjuku, Tokyo 160, Japan (E-mail:toshi@nih.go.jp,
            Tel:81-03-5285-1111)
COMMENT     PROVISIONAL REFSEQ: This record has not yet been subject to final
            NCBI review. The reference sequence is identical to AB040456.
            COMPLETENESS: full length.
FEATURES             Location/Qualifiers
     source          1..8017
                     /organism="human papillomavirus 71"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:120686"
     misc_feature    1..8017
                     /note="whole genome of human papillomavirus type 71"
     gene            834..2762
                     /gene="E1"
                     /locus_tag="D1R35_gp1"
                     /db_xref="GeneID:918588"
     CDS             834..2762
                     /gene="E1"
                     /locus_tag="D1R35_gp1"
                     /codon_start=1
                     /product="hypothetical protein"
                     /protein_id="NP_073548.1"
                     /db_xref="GeneID:918588"
                     /translation="MADCEGTGTDDDGGEAGTGAGGWFFVEAIVDRCTGPQPSSDEDE
                     DDNDTGEDMVDFINDRHAGDGQEVAAEVYRQQEALDDEAIVQPLKRKFLASPLSAGVC
                     VDKELSPRLDAISIGRESQKAKRRLFELQDSGYGNTQVDTEAAGNQVPRDGTPGGLHT
                     EQEEERQGGDGAAGIDSTPAHTGTNTVLALLNSSNRRATLLGKFKDLYGLSYMELVRQ
                     FKSNKTTCLDWVVCAFGVYCTVAEGVKTLIQQHCEYAHIQQQTCSWGVVILMLLRYKC
                     AKNRDTVAKGLSMLLNIPETNMLIEPPKIRSTPAALYWFRASMGNASDIFGETPEWIV
                     RQTVVGHSMEECQFQLSVMVQWAYDHDITDESILAYEYARLADVDSNAAAFLASNCQA
                     KYVKDACTMCRHYKRAEQAQMSMSQWISFRSAKITEEGDWRTIVKYLRHQDIEFITFI
                     IALKNFLKGIPKKSCLVFYGPADTGKSYFCMSLLRFLGGVVISYANSSSHFWLQPLAD
                     AKLGLIDDVTPNCWSYIDVYLRNALDGNQICIDRKHRPLLQLKCPPLLITTNTNPLEE
                     ERWKFLRSRLQLFTFKNAFPLNSKGDPMYPLNDANWKCFFQRLWARLDLHEQDEQEDN
                     GDTGQPFRCVPGDVTRTV"
     misc_feature    834..2759
                     /gene="E1"
                     /locus_tag="D1R35_gp1"
                     /note="E1; Provisional; Region: PHA02774"
                     /db_xref="CDD:222927"
ORIGIN      
        1 ttgttctact tcttactcat tattataaat tataatgttt gtataagaaa tataggtgta
       61 accgaaaacg gtgtaaccga aatgggtgca tatataaagc aatgcttggt tcagcagatt
      121 tagctatgtc cagcggggac gcatacccca ccaacctctt cagactgtgt aaccagtacg
      181 acgtggacct gcaggacctg aacctaacct gcatattctg cagaacaatt aacagacgtg
      241 gaagtcgtgg ccttgcatat aaggagctga aagttgtgtg gagaagtggt tttccgtttg
      301 ctgcatgtgc ctgctgtttg gaaatagctg gaaaactaag gcaacttaga tattggcaat
      361 tttcaggctt tgcaaacaca gtggaattag acaccggaac gccagttaca gagcaactaa
      421 tacggtgcta cgtgtgtcac aagccattgt gtagtgtgga aaaagaaaga ataattacag
      481 aaggcaggcg atttcataaa atagcaggcc attggcgcgg tgcttgccta cagtgttgga
      541 aaccatgcga ggccaacaat gtaccttaaa gacattgtcc tgcagctaca gccagaggtt
      601 gttgacctgt attgtcacga gcaatttgcc agctcagacg aggaagataa tagggtggac
      661 ggtgagcaac ccacagaacc agcacagcag gcatataggg tggtttcata ctgtggtagg
      721 tgctgtcgtg cagttaggct tgtggtggaa agcgacgaag cagacataag agcgcttcaa
      781 cagctgttac tgggcacact gacaatagtg tgtcccatct gcgtgtagct gccatggccg
      841 actgtgaagg tacaggtact gatgatgatg ggggcgaagc gggaacaggc gcgggagggt
      901 ggttttttgt agaagctatt gtggacagat gtaccggccc acagccatcc agtgatgagg
      961 atgaggatga caatgatact ggggaagata tggtggattt tataaatgac aggcatgcag
     1021 gggacggaca ggaagtggca gcagaggtgt acagacagca agaagcatta gatgacgagg
     1081 caattgtgca gcccctaaaa cgaaagtttc ttgcaagtcc tttgtctgca ggggtgtgtg
     1141 tagacaaaga gttaagcccg cggctagatg ctatatccat aggcagggaa tcccaaaaag
     1201 caaaacgaag gttatttgaa ctacaggaca gtgggtatgg caatacgcaa gtggatactg
     1261 aagcggcagg aaaccaggta ccaagggacg ggacgccagg ggggctgcac acagaacagg
     1321 aggaggagcg tcaggggggg gatggggcgg caggcataga cagtacacca gcacacacag
     1381 gaacaaatac agtgttggct ttgttaaact ccagtaatcg aagggcaaca ctgctaggta
     1441 agtttaaaga cttatatggg ttatcatata tggaattggt acggcaattt aaaagtaata
     1501 aaacaacatg tttggactgg gtagtatgtg cgtttggtgt gtattgcacg gttgcggagg
     1561 gtgtaaaaac cttgatacag caacactgtg aatacgcaca catacaacaa caaacatgtt
     1621 cctggggagt ggttatatta atgctgttgc gatataagtg tgctaaaaac agagataccg
     1681 tggcaaaggg attaagcatg ttattaaata taccagaaac aaacatgcta atagaaccac
     1741 caaaaataag aagtacgcct gctgcattat attggtttag ggcgagcatg ggaaacgcaa
     1801 gtgacatatt cggggaaacg ccagaatgga tagttagaca aactgtggta ggacacagca
     1861 tggaggaatg ccagtttcag ttatcagtaa tggtgcaatg ggcatatgac catgatataa
     1921 cagatgaaag tatattggca tatgaatatg cccgtctggc tgatgtagat agtaatgcag
     1981 cagccttttt agcaagcaat tgtcaagcca aatatgtgaa agatgcatgc acaatgtgta
     2041 gacattataa acgagcagag caggcacaaa tgtctatgtc acaatggata tcatttagaa
     2101 gtgctaaaat aacagaagaa ggggattggc gaacaatagt aaaatattta agacaccagg
     2161 atatagaatt tattacattt attatagcat taaaaaattt tttaaagggt ataccaaaga
     2221 aaagctgttt agtattttat gggcctgcag acactggcaa atcctatttt tgcatgagcc
     2281 tattacggtt tttgggtggg gttgtcattt cctatgccaa ttccagcagc catttttggt
     2341 tgcagccgtt agctgatgca aaattaggac taatagatga tgtaacccct aattgctgga
     2401 gctatataga tgtatatcta agaaatgcat tagacggtaa tcaaatatgt atagatagaa
     2461 aacacaggcc attactgcag ttaaagtgtc ccccattgtt aataacaaca aataccaatc
     2521 cgttagagga ggaacgatgg aagtttttac gtagtagact gcagctgttt acatttaaaa
     2581 atgcatttcc tttaaattca aaaggggacc ctatgtatcc actaaatgat gcaaactgga
     2641 aatgcttttt tcaaaggttg tgggctcggt tagacttaca cgagcaggac gagcaggagg
     2701 acaatggaga cactggccag ccgtttagat gcgtgccagg agacgttact agaactgtat
     2761 gaaaaagata gtgaccagct acaggaccaa attaaacact ggcaacacgt gcgatgggaa
     2821 aatgtgttgt tatttaaggc aagggaagca ggaattactc atcttgccca ccaggtggtg
     2881 cctgtgcttg gtattgctaa agccaaagct tgtaaagcaa ttgaaattca gttggcatta
     2941 aagacattac ttaacagtcc ctatagcaac gaacgatgga cattgcgtga cacgagccag
     3001 gaaatgtggg acgcagtgcc taagcaatgc tggaaaaaaa aaggctacac tgtagaagtg
     3061 cgatacgatt gcaaagagga aaagacaatg tgttacacat gttggaggga aatatatgtg
     3121 caaaacagta caaatgagac atgggaaaaa gtgtgtggcc tggtggacca tgcgggcata
     3181 tactatttac acgatgggat acgtgtagac tgtgtattat tctccaagga agcagtaata
     3241 tatggggaca caggcatctg ggaagtacat gtgggttcaa gggtgattta tgatgcattc
     3301 gactcctctg tgtctagcac ccaggacacc gagcaagacc aagtacccac tattaaacct
     3361 actgaccacg gacccgactc gcacccccaa caggcctcca ccaccaccca agtgctgggc
     3421 accaacgaaa cccaagtgtc gaccccgcca tttaagcgac agcgactcgg agacagacag
     3481 cggacccttg agcagcccga ttctacaaaa gcaccacagc agctggcacg tgtcaacctt
     3541 aggacccagt gtgacactga cggcgcacac gagcacggga ggacacgtga ctgtaacagt
     3601 gcacctgtaa tacaccttag aggtgaagcc aataaactaa agtgtttaag gtataggttg
     3661 caaaaacata aatctgtact gtttgccaaa gcatcctcca cgtggcattg ggccactggc
     3721 acagaggaca atacatgtaa aacaacattt gtaacattgt ggcatgatag tgtggaacag
     3781 cgggcacaat ttctagccac tgtacatatt cctaagggca tagaggcctt accaggatat
     3841 atgtcattgt ttgcataatc tttgtaaata ttgtatatat tgtatctatt gtatagacgc
     3901 ggaacgttaa gggtacacac ctacctgtag tgctggagcc ctttctatgc tgggtgtctg
     3961 catggaccta tgcactacta ctactaatta gcttttggct gtctatttta tcttctctta
     4021 ctgccttttt aatttttttt gttactgtgt ttcttgggtt tctagcacta tatatacagg
     4081 cagcagcgtc ccttacctaa ctgtgacttg tgactaccac acaaccagcc aatactgcta
     4141 ctacgtgtac ataacctatc catttgtgtt atagattgca tatatgtatc ctgttgtggt
     4201 aaaggattcc caaggcggac attatgatat tgtggtgtgg ggccctgatg atgtagatgt
     4261 attgtttgtg tttttagtgt tggtgtgtct tatgttgctt ctgtttttgt tacggttgat
     4321 gcagtaggta cccccccttt tgtattgccc tgtttataca tatacatata catattgttt
     4381 ttatttggtt tttgtttttg tttttttgtg tgtgcctgtg tgtgtaaata aacacattta
     4441 caatgccacg tgcacggcgt cgcaaacgtg cttcagttac acagctatat cagtcctgta
     4501 agctaacagg cacatgtccc cctgatgtta ttaataaggt ggagcacaat accttggctg
     4561 ataaaatatt acgctggggt agtttaggaa tatttttggg aggtttgggc attggcacag
     4621 ggtctggcac cggtgggcgc acaggctata ttcccattgg tacacgtcct cctactgttg
     4681 tggatgtagg acctcctgca cgcccccctg tggttataga aacagtaggg gcctctgatc
     4741 catctattgt gtctttggta gaggattcta gtattataga agcaggagca ccatatccta
     4801 actttactgg cacaggtggg tttgaggtca ctacagcatc cactactact cctgctgtgt
     4861 tagacattac tccaggtaac actgtgcagg ttagtagcag tagttttact aacccatcct
     4921 ttactgaacc tgccttagtg gagccccctc aaacaggtga ggtttcggga catattttgg
     4981 ttagtacctc tacatctggc acccatggct atgaggaaat acccatgcaa acatttgcgt
     5041 cggagggaac aggcaatgaa cctataagta gtacacctat tcctggggta cgcaggttag
     5101 caggccctcg cctgtatagt agggcctatc agcaggtgcg ggtggatgaa tccacatttc
     5161 ttcgccaccc tgcatctatg gttacatatg acaaccctgt gtatgaccca gaggaaacta
     5221 taatatttga acatcctagc atacaccagg ctcctgatcc agcatttatg gatattgtgg
     5281 ccttgcacag gccggccctt actgcccgta aaggtacggt acgtttcagt cgtttaggac
     5341 aaaaatctac ccttcgcacc cgtagtggta aacaaatagg ggcgcgggta catttttatc
     5401 atgacattag ccctatacaa cccaccgaac acttagaact gcagccacta gggcgggcct
     5461 tacaacaaga acctattgac acattatatg acatatatct gacacagatt attccaatga
     5521 tactgtcatt caacctactt ctgtgtccag caggcctaca cctactacta tacccctctg
     5581 taactgccac atcagccgtg tctgcctctc gcacacaaaa tgttacagca cctttgtctg
     5641 caggagcaga tgttccagtg tttgatggcc ctgacattga tttttccacc tcccatgcca
     5701 ctactcctac tcctgtagtg ccgtccattg cacctcccag ttcttttatt gtgtatggaa
     5761 ctgaggtatt atttaatgcc tagttatata ttttttccta aaaaactaaa cgtgtccact
     5821 atttttttgc agatggcttt gtggcggcct agtgacagca aggtatacct gcctcctgcc
     5881 ccccgtatcc aagttctcag caccgacgac tatgttacca gaacaaaact attttattat
     5941 gctggtagtt ctagattact tactgttggc catccatatt ttcctattcg ccaggcaagt
     6001 ggtaaaaatc gtatagttgt ccccaaagtg tctggatacc aatatagagt gtttcgtgtg
     6061 cggctacccg acccaaataa atttggacta cctgatgctt cattatacaa tcccgatacc
     6121 cagcgcttag tgtgggcttg taagggtctt gaggtaggcc gcggacagcc tttaggaata
     6181 ggtgttagtg gccatccatt gtttaacaag cttaatgaca ctgaaaatgc cacgctgttt
     6241 gatgttaatc ctggtgagga tactcgggat aatgtttcta tggattataa acaaacacaa
     6301 ctatgtatta ttggttgcaa gcctccctta ggtgaacatt gggcaaaagg tactccatgc
     6361 agtggcgctt cagctgccgc tggtagttgc cccccacttg aacttgccag tactgttata
     6421 caggatggtg atatggtaga cacaggtttt ggggcaatgg attttgcagc cctgcaaaca
     6481 aataggtctg acgttccctt ggatattgtt actacaacat gtaaatatca gattatttcg
     6541 aatggctgca gagccattgg tgatcgcatg tttttcttgc gccgagagca atgttttaag
     6601 acattttata ataggcaggc acacctaggc agggttcctg atgactatta tttaaaggtt
     6661 caccttctac ctttcgtgct tcccctacaa gctctcttta tgcatccaca cctagtgggt
     6721 ccatggttac ctcagagtca cagttgttta acaagcctta ctggctacaa cgtgcacagg
     6781 gcacaaacaa tggcatttgt tgggcaatct gctttttgta acagttgtga cacatcacgt
     6841 agtacaaata tgtccatctg tgctaccaaa actgttgagt ctacatataa agcctctagt
     6901 ttcatggaat atttgagaca tggagaagaa tttgatttgc aatttatatt tcaactatgt
     6961 gttattaatt taacagctga aattatggcc tacttacatg gcatggatgc tacattactg
     7021 gaggactgga attttggttc cttaccacct cctactgcta gtcttggtga tacctaccgc
     7081 tttttacagt ctcaggccat aacctgtcag aaaaacagtc ctcctcctgc agaaaaaaag
     7141 gacccctatg cagatcttac attttgggag gtggatttaa aggagcggtt ttcactagaa
     7201 ttggatcagt ttccattggg gcgcaagttt ttgctgcaaa gtggcacccg ctcgcggcct
     7261 actgcattgt cccgtaaaag ggttgcagca tctaccacat ccaccgcccc caaacgtaaa
     7321 cgtgttaaac gctcccggta gtagtggtgt gtgtgtgtgt gagtatgtat gtgtatgtgt
     7381 gtgtgtgtgt gtgtgtgtgt gtgtgtgtgc atgtttatat atgtatatgt gtttgttgtg
     7441 tatattgtat gttgttgtgt acgtgtgtat atactgtgcg accccctggc ttcctgtatg
     7501 catgtttgtg ttgttaataa atgcgtgtca tgtatgtgtg ttatttggtt gcacccctgt
     7561 gagtaagtgt gcaagcagat ataacccccc ctgtccggtg acgccttact gaccagtgac
     7621 catgccctgc ctttggcccc tggttatatt taaggaacga accgttttcg gttgcctaaa
     7681 atggccgccg ttttggctgg cggccgccgt tttggcgggc tactatacaa tgggtaatcc
     7741 ttgtattgct tcatatcctt tccaacacag gtgtgcatac catcataagg tgtgcctggc
     7801 agttatttgg cacactataa atgcatgtta taattaactt taatacatgt gttactcacc
     7861 gtgcaatacc cattgtttgg ggcatatagt tgtgctgact atgttcgcct aagtacgttt
     7921 ttggcaacag cagtgtagtt tctgtccaag aatgtgtctg ctaactttta taatgtactt
     7981 agcaacatgg tttatacaca cctaatccgg ttgttcc
//
DBGET integrated database retrieval system