LOCUS       NC_039089               8017 bp    DNA     circular VRL 24-AUG-2018
DEFINITION  Human papillomavirus type 71 DNA, complete genome.
ACCESSION   NC_039089
VERSION     NC_039089.1
DBLINK      BioProject: PRJNA485481
KEYWORDS    RefSeq.
SOURCE      human papillomavirus 71
  ORGANISM  human papillomavirus 71
            Viruses; Monodnaviria; Shotokuvirae; Cossaviricota;
            Papovaviricetes; Zurhausenvirales; Papillomaviridae;
            Firstpapillomavirinae; Alphapapillomavirus; Alphapapillomavirus 14.
REFERENCE   1
  AUTHORS   Matsukura,T. and Sugase,M.
  TITLE     Relationships between 80 human papillomavirus genotypes and
            different grades of cervical intraepithelial neoplasia: association
            and causality
  JOURNAL   Virology 283 (1), 139-147 (2001)
   PUBMED   11312670
REFERENCE   2  (bases 1 to 8017)
  CONSRTM   NCBI Genome Project
  TITLE     Direct Submission
  JOURNAL   Submitted (24-AUG-2018) National Center for Biotechnology
            Information, NIH, Bethesda, MD 20894, USA
REFERENCE   3  (bases 1 to 8017)
  AUTHORS   Matsukura,T.T. and Sugase,M.
  TITLE     Direct Submission
  JOURNAL   Submitted (22-MAR-2000) Toshihiko T. Matsukura, National Institute
            of Infectious Diseases, Department of Virology II; 1-23-1 Toyama,
            Shinjuku, Tokyo 160, Japan (E-mail:toshi@nih.go.jp,
            Tel:81-03-5285-1111)
COMMENT     PROVISIONAL REFSEQ: This record has not yet been subject to final
            NCBI review. The reference sequence is identical to AB040456.
            COMPLETENESS: full length.
FEATURES             Location/Qualifiers
     source          1..8017
                     /organism="human papillomavirus 71"
                     /mol_type="genomic DNA"
                     /db_xref="taxon:120686"
     misc_feature    1..8017
                     /note="whole genome of human papillomavirus type 71"
     gene            834..2762
                     /gene="E1"
                     /locus_tag="D1R35_gp1"
                     /db_xref="GeneID:918588"
     CDS             834..2762
                     /gene="E1"
                     /locus_tag="D1R35_gp1"
                     /codon_start=1
                     /product="hypothetical protein"
                     /protein_id="NP_073548.1"
                     /db_xref="GeneID:918588"
                     /translation="MADCEGTGTDDDGGEAGTGAGGWFFVEAIVDRCTGPQPSSDEDE
                     DDNDTGEDMVDFINDRHAGDGQEVAAEVYRQQEALDDEAIVQPLKRKFLASPLSAGVC
                     VDKELSPRLDAISIGRESQKAKRRLFELQDSGYGNTQVDTEAAGNQVPRDGTPGGLHT
                     EQEEERQGGDGAAGIDSTPAHTGTNTVLALLNSSNRRATLLGKFKDLYGLSYMELVRQ
                     FKSNKTTCLDWVVCAFGVYCTVAEGVKTLIQQHCEYAHIQQQTCSWGVVILMLLRYKC
                     AKNRDTVAKGLSMLLNIPETNMLIEPPKIRSTPAALYWFRASMGNASDIFGETPEWIV
                     RQTVVGHSMEECQFQLSVMVQWAYDHDITDESILAYEYARLADVDSNAAAFLASNCQA
                     KYVKDACTMCRHYKRAEQAQMSMSQWISFRSAKITEEGDWRTIVKYLRHQDIEFITFI
                     IALKNFLKGIPKKSCLVFYGPADTGKSYFCMSLLRFLGGVVISYANSSSHFWLQPLAD
                     AKLGLIDDVTPNCWSYIDVYLRNALDGNQICIDRKHRPLLQLKCPPLLITTNTNPLEE
                     ERWKFLRSRLQLFTFKNAFPLNSKGDPMYPLNDANWKCFFQRLWARLDLHEQDEQEDN
                     GDTGQPFRCVPGDVTRTV"
     misc_feature    834..2759
                     /gene="E1"
                     /locus_tag="D1R35_gp1"
                     /note="E1; Provisional; Region: PHA02774"
                     /db_xref="CDD:222927"
ORIGIN
1 ttgttctact tcttactcat tattataaat tataatgttt gtataagaaa tataggtgta
61 accgaaaacg gtgtaaccga aatgggtgca tatataaagc aatgcttggt tcagcagatt
121 tagctatgtc cagcggggac gcatacccca ccaacctctt cagactgtgt aaccagtacg
181 acgtggacct gcaggacctg aacctaacct gcatattctg cagaacaatt aacagacgtg
241 gaagtcgtgg ccttgcatat aaggagctga aagttgtgtg gagaagtggt tttccgtttg
301 ctgcatgtgc ctgctgtttg gaaatagctg gaaaactaag gcaacttaga tattggcaat
361 tttcaggctt tgcaaacaca gtggaattag acaccggaac gccagttaca gagcaactaa
421 tacggtgcta cgtgtgtcac aagccattgt gtagtgtgga aaaagaaaga ataattacag
481 aaggcaggcg atttcataaa atagcaggcc attggcgcgg tgcttgccta cagtgttgga
541 aaccatgcga ggccaacaat gtaccttaaa gacattgtcc tgcagctaca gccagaggtt
601 gttgacctgt attgtcacga gcaatttgcc agctcagacg aggaagataa tagggtggac
661 ggtgagcaac ccacagaacc agcacagcag gcatataggg tggtttcata ctgtggtagg
721 tgctgtcgtg cagttaggct tgtggtggaa agcgacgaag cagacataag agcgcttcaa
781 cagctgttac tgggcacact gacaatagtg tgtcccatct gcgtgtagct gccatggccg
841 actgtgaagg tacaggtact gatgatgatg ggggcgaagc gggaacaggc gcgggagggt
901 ggttttttgt agaagctatt gtggacagat gtaccggccc acagccatcc agtgatgagg
961 atgaggatga caatgatact ggggaagata tggtggattt tataaatgac aggcatgcag
1021 gggacggaca ggaagtggca gcagaggtgt acagacagca agaagcatta gatgacgagg
1081 caattgtgca gcccctaaaa cgaaagtttc ttgcaagtcc tttgtctgca ggggtgtgtg
1141 tagacaaaga gttaagcccg cggctagatg ctatatccat aggcagggaa tcccaaaaag
1201 caaaacgaag gttatttgaa ctacaggaca gtgggtatgg caatacgcaa gtggatactg
1261 aagcggcagg aaaccaggta ccaagggacg ggacgccagg ggggctgcac acagaacagg
1321 aggaggagcg tcaggggggg gatggggcgg caggcataga cagtacacca gcacacacag
1381 gaacaaatac agtgttggct ttgttaaact ccagtaatcg aagggcaaca ctgctaggta
1441 agtttaaaga cttatatggg ttatcatata tggaattggt acggcaattt aaaagtaata
1501 aaacaacatg tttggactgg gtagtatgtg cgtttggtgt gtattgcacg gttgcggagg
1561 gtgtaaaaac cttgatacag caacactgtg aatacgcaca catacaacaa caaacatgtt
1621 cctggggagt ggttatatta atgctgttgc gatataagtg tgctaaaaac agagataccg
1681 tggcaaaggg attaagcatg ttattaaata taccagaaac aaacatgcta atagaaccac
1741 caaaaataag aagtacgcct gctgcattat attggtttag ggcgagcatg ggaaacgcaa
1801 gtgacatatt cggggaaacg ccagaatgga tagttagaca aactgtggta ggacacagca
1861 tggaggaatg ccagtttcag ttatcagtaa tggtgcaatg ggcatatgac catgatataa
1921 cagatgaaag tatattggca tatgaatatg cccgtctggc tgatgtagat agtaatgcag
1981 cagccttttt agcaagcaat tgtcaagcca aatatgtgaa agatgcatgc acaatgtgta
2041 gacattataa acgagcagag caggcacaaa tgtctatgtc acaatggata tcatttagaa
2101 gtgctaaaat aacagaagaa ggggattggc gaacaatagt aaaatattta agacaccagg
2161 atatagaatt tattacattt attatagcat taaaaaattt tttaaagggt ataccaaaga
2221 aaagctgttt agtattttat gggcctgcag acactggcaa atcctatttt tgcatgagcc
2281 tattacggtt tttgggtggg gttgtcattt cctatgccaa ttccagcagc catttttggt
2341 tgcagccgtt agctgatgca aaattaggac taatagatga tgtaacccct aattgctgga
2401 gctatataga tgtatatcta agaaatgcat tagacggtaa tcaaatatgt atagatagaa
2461 aacacaggcc attactgcag ttaaagtgtc ccccattgtt aataacaaca aataccaatc
2521 cgttagagga ggaacgatgg aagtttttac gtagtagact gcagctgttt acatttaaaa
2581 atgcatttcc tttaaattca aaaggggacc ctatgtatcc actaaatgat gcaaactgga
2641 aatgcttttt tcaaaggttg tgggctcggt tagacttaca cgagcaggac gagcaggagg
2701 acaatggaga cactggccag ccgtttagat gcgtgccagg agacgttact agaactgtat
2761 gaaaaagata gtgaccagct acaggaccaa attaaacact ggcaacacgt gcgatgggaa
2821 aatgtgttgt tatttaaggc aagggaagca ggaattactc atcttgccca ccaggtggtg
2881 cctgtgcttg gtattgctaa agccaaagct tgtaaagcaa ttgaaattca gttggcatta
2941 aagacattac ttaacagtcc ctatagcaac gaacgatgga cattgcgtga cacgagccag
3001 gaaatgtggg acgcagtgcc taagcaatgc tggaaaaaaa aaggctacac tgtagaagtg
3061 cgatacgatt gcaaagagga aaagacaatg tgttacacat gttggaggga aatatatgtg
3121 caaaacagta caaatgagac atgggaaaaa gtgtgtggcc tggtggacca tgcgggcata
3181 tactatttac acgatgggat acgtgtagac tgtgtattat tctccaagga agcagtaata
3241 tatggggaca caggcatctg ggaagtacat gtgggttcaa gggtgattta tgatgcattc
3301 gactcctctg tgtctagcac ccaggacacc gagcaagacc aagtacccac tattaaacct
3361 actgaccacg gacccgactc gcacccccaa caggcctcca ccaccaccca agtgctgggc
3421 accaacgaaa cccaagtgtc gaccccgcca tttaagcgac agcgactcgg agacagacag
3481 cggacccttg agcagcccga ttctacaaaa gcaccacagc agctggcacg tgtcaacctt
3541 aggacccagt gtgacactga cggcgcacac gagcacggga ggacacgtga ctgtaacagt
3601 gcacctgtaa tacaccttag aggtgaagcc aataaactaa agtgtttaag gtataggttg
3661 caaaaacata aatctgtact gtttgccaaa gcatcctcca cgtggcattg ggccactggc
3721 acagaggaca atacatgtaa aacaacattt gtaacattgt ggcatgatag tgtggaacag
3781 cgggcacaat ttctagccac tgtacatatt cctaagggca tagaggcctt accaggatat
3841 atgtcattgt ttgcataatc tttgtaaata ttgtatatat tgtatctatt gtatagacgc
3901 ggaacgttaa gggtacacac ctacctgtag tgctggagcc ctttctatgc tgggtgtctg
3961 catggaccta tgcactacta ctactaatta gcttttggct gtctatttta tcttctctta
4021 ctgccttttt aatttttttt gttactgtgt ttcttgggtt tctagcacta tatatacagg
4081 cagcagcgtc ccttacctaa ctgtgacttg tgactaccac acaaccagcc aatactgcta
4141 ctacgtgtac ataacctatc catttgtgtt atagattgca tatatgtatc ctgttgtggt
4201 aaaggattcc caaggcggac attatgatat tgtggtgtgg ggccctgatg atgtagatgt
4261 attgtttgtg tttttagtgt tggtgtgtct tatgttgctt ctgtttttgt tacggttgat
4321 gcagtaggta cccccccttt tgtattgccc tgtttataca tatacatata catattgttt
4381 ttatttggtt tttgtttttg tttttttgtg tgtgcctgtg tgtgtaaata aacacattta
4441 caatgccacg tgcacggcgt cgcaaacgtg cttcagttac acagctatat cagtcctgta
4501 agctaacagg cacatgtccc cctgatgtta ttaataaggt ggagcacaat accttggctg
4561 ataaaatatt acgctggggt agtttaggaa tatttttggg aggtttgggc attggcacag
4621 ggtctggcac cggtgggcgc acaggctata ttcccattgg tacacgtcct cctactgttg
4681 tggatgtagg acctcctgca cgcccccctg tggttataga aacagtaggg gcctctgatc
4741 catctattgt gtctttggta gaggattcta gtattataga agcaggagca ccatatccta
4801 actttactgg cacaggtggg tttgaggtca ctacagcatc cactactact cctgctgtgt
4861 tagacattac tccaggtaac actgtgcagg ttagtagcag tagttttact aacccatcct
4921 ttactgaacc tgccttagtg gagccccctc aaacaggtga ggtttcggga catattttgg
4981 ttagtacctc tacatctggc acccatggct atgaggaaat acccatgcaa acatttgcgt
5041 cggagggaac aggcaatgaa cctataagta gtacacctat tcctggggta cgcaggttag
5101 caggccctcg cctgtatagt agggcctatc agcaggtgcg ggtggatgaa tccacatttc
5161 ttcgccaccc tgcatctatg gttacatatg acaaccctgt gtatgaccca gaggaaacta
5221 taatatttga acatcctagc atacaccagg ctcctgatcc agcatttatg gatattgtgg
5281 ccttgcacag gccggccctt actgcccgta aaggtacggt acgtttcagt cgtttaggac
5341 aaaaatctac ccttcgcacc cgtagtggta aacaaatagg ggcgcgggta catttttatc
5401 atgacattag ccctatacaa cccaccgaac acttagaact gcagccacta gggcgggcct
5461 tacaacaaga acctattgac acattatatg acatatatct gacacagatt attccaatga
5521 tactgtcatt caacctactt ctgtgtccag caggcctaca cctactacta tacccctctg
5581 taactgccac atcagccgtg tctgcctctc gcacacaaaa tgttacagca cctttgtctg
5641 caggagcaga tgttccagtg tttgatggcc ctgacattga tttttccacc tcccatgcca
5701 ctactcctac tcctgtagtg ccgtccattg cacctcccag ttcttttatt gtgtatggaa
5761 ctgaggtatt atttaatgcc tagttatata ttttttccta aaaaactaaa cgtgtccact
5821 atttttttgc agatggcttt gtggcggcct agtgacagca aggtatacct gcctcctgcc
5881 ccccgtatcc aagttctcag caccgacgac tatgttacca gaacaaaact attttattat
5941 gctggtagtt ctagattact tactgttggc catccatatt ttcctattcg ccaggcaagt
6001 ggtaaaaatc gtatagttgt ccccaaagtg tctggatacc aatatagagt gtttcgtgtg
6061 cggctacccg acccaaataa atttggacta cctgatgctt cattatacaa tcccgatacc
6121 cagcgcttag tgtgggcttg taagggtctt gaggtaggcc gcggacagcc tttaggaata
6181 ggtgttagtg gccatccatt gtttaacaag cttaatgaca ctgaaaatgc cacgctgttt
6241 gatgttaatc ctggtgagga tactcgggat aatgtttcta tggattataa acaaacacaa
6301 ctatgtatta ttggttgcaa gcctccctta ggtgaacatt gggcaaaagg tactccatgc
6361 agtggcgctt cagctgccgc tggtagttgc cccccacttg aacttgccag tactgttata
6421 caggatggtg atatggtaga cacaggtttt ggggcaatgg attttgcagc cctgcaaaca
6481 aataggtctg acgttccctt ggatattgtt actacaacat gtaaatatca gattatttcg
6541 aatggctgca gagccattgg tgatcgcatg tttttcttgc gccgagagca atgttttaag
6601 acattttata ataggcaggc acacctaggc agggttcctg atgactatta tttaaaggtt
6661 caccttctac ctttcgtgct tcccctacaa gctctcttta tgcatccaca cctagtgggt
6721 ccatggttac ctcagagtca cagttgttta acaagcctta ctggctacaa cgtgcacagg
6781 gcacaaacaa tggcatttgt tgggcaatct gctttttgta acagttgtga cacatcacgt
6841 agtacaaata tgtccatctg tgctaccaaa actgttgagt ctacatataa agcctctagt
6901 ttcatggaat atttgagaca tggagaagaa tttgatttgc aatttatatt tcaactatgt
6961 gttattaatt taacagctga aattatggcc tacttacatg gcatggatgc tacattactg
7021 gaggactgga attttggttc cttaccacct cctactgcta gtcttggtga tacctaccgc
7081 tttttacagt ctcaggccat aacctgtcag aaaaacagtc ctcctcctgc agaaaaaaag
7141 gacccctatg cagatcttac attttgggag gtggatttaa aggagcggtt ttcactagaa
7201 ttggatcagt ttccattggg gcgcaagttt ttgctgcaaa gtggcacccg ctcgcggcct
7261 actgcattgt cccgtaaaag ggttgcagca tctaccacat ccaccgcccc caaacgtaaa
7321 cgtgttaaac gctcccggta gtagtggtgt gtgtgtgtgt gagtatgtat gtgtatgtgt
7381 gtgtgtgtgt gtgtgtgtgt gtgtgtgtgc atgtttatat atgtatatgt gtttgttgtg
7441 tatattgtat gttgttgtgt acgtgtgtat atactgtgcg accccctggc ttcctgtatg
7501 catgtttgtg ttgttaataa atgcgtgtca tgtatgtgtg ttatttggtt gcacccctgt
7561 gagtaagtgt gcaagcagat ataacccccc ctgtccggtg acgccttact gaccagtgac
7621 catgccctgc ctttggcccc tggttatatt taaggaacga accgttttcg gttgcctaaa
7681 atggccgccg ttttggctgg cggccgccgt tttggcgggc tactatacaa tgggtaatcc
7741 ttgtattgct tcatatcctt tccaacacag gtgtgcatac catcataagg tgtgcctggc
7801 agttatttgg cacactataa atgcatgtta taattaactt taatacatgt gttactcacc
7861 gtgcaatacc cattgtttgg ggcatatagt tgtgctgact atgttcgcct aagtacgttt
7921 ttggcaacag cagtgtagtt tctgtccaag aatgtgtctg ctaactttta taatgtactt
7981 agcaacatgg tttatacaca cctaatccgg ttgttcc
//