LOCUS NC_005947 4302 bp DNA linear VRL 13-AUG-2018
DEFINITION Avian endogenous retrovirus EAV-HP, complete genome.
ACCESSION NC_005947
VERSION NC_005947.1
DBLINK BioProject: PRJNA485481
KEYWORDS RefSeq; Gag-env fusion protein; gag-env gene; long terminal repeat.
SOURCE Avian endogenous retrovirus EAV-HP
ORGANISM Avian endogenous retrovirus EAV-HP
Viruses; Riboviria; Pararnavirae; Artverviricota; Revtraviricetes;
Ortervirales; Retroviridae.
REFERENCE 1
AUTHORS Sacco,M.A., Flannery,D.M., Howes,K. and Venugopal,K.
TITLE Avian endogenous retrovirus EAV-HP shares regions of identity with
avian leukosis virus subgroup J and the avian retrotransposon
ART-CH
JOURNAL J. Virol. 74 (3), 1296-1306 (2000)
PUBMED 10627540
REFERENCE 2 (bases 1 to 4302)
CONSRTM NCBI Genome Project
TITLE Direct Submission
JOURNAL Submitted (25-JUN-2004) National Center for Biotechnology
Information, NIH, Bethesda, MD 20894, USA
REFERENCE 3 (bases 1 to 4302)
AUTHORS Sacco,M.A.
TITLE Direct Submission
JOURNAL Submitted (09-APR-1999) Sacco M.A., Immunology and Pathology,
Institute for Animal Health, Compton, Near Newbury, Berkshire, RG20
7NN, UNITED KINGDOM
COMMENT PROVISIONAL REFSEQ: This record has not yet been subject to final
NCBI review. The reference sequence is identical to AJ238124.
COMPLETENESS: full length.
FEATURES Location/Qualifiers
source 1..4302
/organism="Avian endogenous retrovirus EAV-HP"
/proviral
/mol_type="genomic DNA"
/host="Gallus gallus domesticus(Chicken,line N)"
/db_xref="taxon:93465"
/country="United Kingdom"
/note="Chicken endogenous retrovirus EAV-HP1"
repeat_region 79..365
/rpt_type=long_terminal_repeat
primer_bind 366..383
gene 515..3892
/gene="gag-env"
/locus_tag="AEREVgp1"
/db_xref="GeneID:2853315"
CDS 515..3892
/gene="gag-env"
/locus_tag="AEREVgp1"
/codon_start=1
/product="Gag-env fusion protein"
/protein_id="YP_031678.1"
/db_xref="GOA:Q9QME4"
/db_xref="HSSP:1EOQ"
/db_xref="InterPro:IPR000721"
/db_xref="InterPro:IPR001878"
/db_xref="InterPro:IPR001969"
/db_xref="InterPro:IPR001995"
/db_xref="InterPro:IPR004028"
/db_xref="InterPro:IPR005166"
/db_xref="InterPro:IPR008916"
/db_xref="InterPro:IPR008919"
/db_xref="InterPro:IPR009007"
/db_xref="InterPro:IPR010999"
/db_xref="InterPro:IPR012344"
/db_xref="InterPro:IPR013084"
/db_xref="InterPro:IPR018061"
/db_xref="InterPro:IPR018154"
/db_xref="InterPro:IPR021109"
/db_xref="UniProtKB/TrEMBL:Q9QME4"
/db_xref="GeneID:2853315"
/translation="MDQVIKVLVQFCKDYCGKSTPSRKEIATVLSLLNELGELDSPRH
VLDSSRWDLLTLALCQRAMASQKATELKTWGLMLGALKAARAEHKLGAVMSGEGAPGS
GSLEFCRTGAQTGAQTTANKTATEREEDCEKDNEESQRLGGGATTPTAPPNSIALSPP
PPYPKQPLYPSLATTSEQGAGPSPKGKGEGRLKLTDWGQIKEEVAQKGLAATYTLPVV
VSEEGGPIWVPLDPKGVARMIEAVEKKGLKSPLTMNALEALTASGPMLPYDIENLMRM
VLKPVQYTLWREEWHTKLKQMLITAQGDQRNPIYGSDIQRLTGNAPGLLTPQAQVCQL
RPGELIATTDAAIDAFRKLARSAEPTTPWTEIAQGPTEPFQEFADRLIKAVEGSDLPR
AVHGPVILDCLYQKSSEGVQGILRAAPGRLQTPGEAIKYVLDKQKACPSVAGEVAAAV
AGVMMACREADHRSADRQLGPCFKCGQLGHIRAQCRMGTGGGVTCQQCGRKGHAAPQC
RARRPPSQGNNNGRQSEHGDIIFRPMQAPDLSLPMAALSLSTHERPLVKATISCTNLP
PDFQGPLSIFVTALIDSGADVTVVTETEWPSSWPAEASQSIMGVGGATPSRRSTNEVQ
AVVINRDGSLEKPALLTPLVARVPGTLLGRDFLRQIGIPQYPLNTFKGYVTNVTACDN
DADLASQTACLIKALNTTLPWDPQELDILGSQMIKNGTTRTCVTFGSVCYKENNRSIV
CHNFDGNFNGTGGAEAELRDFIAKWKSDDLLIRPYVNQSWTMVSPINVESFSISRRYC
GFTSNETRYYRGDLSNWCGSKRGKWSAGYSNGTKCSSNTTGCGGNCTTEWNYYAYGFT
FGKQPEVLWNNGTAKALPPGIFLICGDRAWQGIPRNALGGPCYLGQLTMLSPNFTTWI
AYGPNITGHRRSRRSLSRLSPDCGDELQLWSVTARIFASFFAPGIAAAQALKEIERLA
CWSVKQANLTSLILNAMLEDTSSIRHAVLQNRAAIDFLLLAQGHGCQDVEGMCCFNLS
DHSESIHKALQAMKEHTEKIRVEDDPIGDWFTRTFGDLGRWLAKGVKTLLFALLVIAC
LLAIIPCLIKCFQDCLLRTMNQFMDERIKYHRIREQL"
misc_feature 518..775
/gene="gag-env"
/locus_tag="AEREVgp1"
/note="Retroviral M domain; Region: Retro_M; pfam02813"
/db_xref="CDD:280904"
misc_feature 1193..1585
/gene="gag-env"
/locus_tag="AEREVgp1"
/note="gag gene protein p24 (core nucleocapsid protein);
Region: Gag_p24; pfam00607"
/db_xref="CDD:425774"
misc_feature 1592..1765
/gene="gag-env"
/locus_tag="AEREVgp1"
/note="Gag protein p24 C-terminal domain; Region:
Gag_p24_C; pfam19317"
/db_xref="CDD:437149"
misc_feature <1883..>2050
/gene="gag-env"
/locus_tag="AEREVgp1"
/note="universal minicircle sequence binding protein
(UMSBP); Provisional; Region: PTZ00368"
/db_xref="CDD:173561"
misc_feature 2234..2485
/gene="gag-env"
/locus_tag="AEREVgp1"
/note="Retropepsins, pepsin-like aspartate proteases;
Region: HIV_retropepsin_like; cd05482"
/db_xref="CDD:133149"
misc_feature order(2258..2260,2264..2266,2270..2272,2330..2338,
2465..2467)
/gene="gag-env"
/locus_tag="AEREVgp1"
/note="inhibitor binding site [active]"
/db_xref="CDD:133149"
misc_feature 2258..2266
/gene="gag-env"
/locus_tag="AEREVgp1"
/note="catalytic motif [active]"
/db_xref="CDD:133149"
misc_feature 2258..2260
/gene="gag-env"
/locus_tag="AEREVgp1"
/note="Catalytic residue [active]"
/db_xref="CDD:133149"
misc_feature order(2330..2344,2348..2362)
/gene="gag-env"
/locus_tag="AEREVgp1"
/note="Active site flap [active]"
/db_xref="CDD:133149"
misc_feature 2603..3247
/gene="gag-env"
/locus_tag="AEREVgp1"
/note="Avian retrovirus envelope protein, gp85; Region:
Avian_gp85; pfam03708"
/db_xref="CDD:281671"
misc_feature 3434..3649
/gene="gag-env"
/locus_tag="AEREVgp1"
/note="heptad repeat 1-heptad repeat 2 region (ectodomain)
of the transmembrane subunit of Rous sarcoma virus (RSV),
and related domains; Region: RSV-like_HR1-HR2; cd09949"
/db_xref="CDD:197372"
misc_feature order(3434..3442,3446..3451,3455..3460,3464..3472,
3476..3484,3488..3493,3497..3505,3509..3535,3539..3547,
3572..3577,3584..3586,3593..3604,3608..3610,3626..3628,
3635..3640,3647..3649)
/gene="gag-env"
/locus_tag="AEREVgp1"
/note="homotrimer interface [polypeptide binding]; other
site"
/db_xref="CDD:197372"
misc_feature 3434..3547
/gene="gag-env"
/locus_tag="AEREVgp1"
/note="HR1; other site"
/db_xref="CDD:197372"
misc_feature 3515..3565
/gene="gag-env"
/locus_tag="AEREVgp1"
/note="immunosuppressive region; other site"
/db_xref="CDD:197372"
misc_feature 3521..3523
/gene="gag-env"
/locus_tag="AEREVgp1"
/note="Cl binding site [ion binding]; other site"
/db_xref="CDD:197372"
misc_feature 3566..3589
/gene="gag-env"
/locus_tag="AEREVgp1"
/note="CX(6)C motif; other site"
/db_xref="CDD:197372"
misc_feature 3623..3649
/gene="gag-env"
/locus_tag="AEREVgp1"
/note="HR2; other site"
/db_xref="CDD:197372"
repeat_region 3938..4253
/rpt_type=long_terminal_repeat
ORIGIN
1 tccagctgct catctgcagg acagagcgct tgacaaccca tgagaggaac tgttgtaggc
61 gtagcgaggg aaacgaggtg tgttgtaggc gtagcgaggg aaacgaggtg tgacgcgtgc
121 aggttcctat gccacctgtg tgttatgcca cctgtgtgtg ccaagtgtaa cttcgtgatt
181 ggaggaaaca cttgtattta aacacgtagc ctatagcaat aaacgccatt tgcctcactt
241 actcctgggg tctgggtgag catctggccc cgacctggta aagggtcggt ttcgcccagc
301 agtaagccct acatgtggac agaggacgaa caccggaaga gcgaacggag actacatgca
361 acaagtggtg accccgacgt gatctgtctg ctgagggtag ctgcccggca tgcccgtgac
421 gtggagaaga gcctgtgagg tggcggcagg tcgcgcacgg aagacgtatt gcggttaaga
481 gcctgcgagg tggcggcagg tcgcatacgt aacaatggac caagtcatta aggtacttgt
541 gcagttttgt aaggactact gtggaaaatc tactccttcc cggaaggaga tcgcgacagt
601 tttgtcgctg ttaaatgagc tgggggagct ggactctccc cgccacgttt tggattcgag
661 taggtgggac ttgctcacct tggcgctatg ccagcgcgcc atggccagtc agaaggctac
721 ggaacttaaa acgtggggac tgatgttagg agcccttaag gcggccaggg cagagcacaa
781 acttggcgcg gtcatgagcg gggagggagc tccggggagc ggatctctgg agttttgcag
841 gaccggcgct cagaccggcg ctcagacgac agcgaataaa acggcgacgg agagagagga
901 agattgcgag aaggacaacg aagagtcgca gaggctcggg gggggtgcaa cgacgccgac
961 ggcccctcct aattctattg cgctgtcgcc gcccccgccc taccctaagc agccgttata
1021 tccttccttg gcaacgacat cggagcaggg ggcgggacca tccccaaaag ggaaggggga
1081 ggggagactt aagcttactg attgggggca gatcaaagag gaagtggcac agaaaggcct
1141 ggcggcaact tatacactcc cagttgttgt ctcggaggag ggaggcccaa tctgggtccc
1201 gttggaccca aagggggtag caaggatgat tgaggcagta gaaaagaaag ggctgaagtc
1261 gccattgacg atgaatgccc ttgaggccct cacagcatcg ggcccaatgc tgccttatga
1321 tattgaaaat ttaatgcgca tggtgttgaa accagtgcag tatacgctct ggagagagga
1381 gtggcatacc aaattaaaac aaatgctgat tacggcgcag ggtgatcaaa gaaaccccat
1441 atatgggtct gatatacaga gattaacggg caatgcacca ggtctgctga cccctcaagc
1501 acaagtttgt caacttagac caggagagtt gatagcgact acggacgctg caatagacgc
1561 gttccgaaag cttgcaagga gcgctgagcc cactactccg tggacagaga tcgcgcaagg
1621 ccccacagag ccgtttcagg agtttgcaga cagactaatt aaggcagtgg aaggctcgga
1681 tctgcccaga gcagttcacg gtcccgtcat cctggattgc ttgtatcaga agtccagtga
1741 aggggtgcag gggattttgc gagcggcgcc ggggaggctc caaacccctg gtgaggccat
1801 caagtatgtc ctagataagc aaaaggcctg tccgtctgtg gcaggggagg tagctgcggc
1861 ggtagcagga gtgatgatgg cctgtaggga ggcggaccat cgtagtgcgg accgacagtt
1921 aggaccttgc tttaaatgtg gccagttggg ccatattagg gcccaatgca gaatgggcac
1981 gggtggaggt gtaacatgtc agcagtgtgg gcggaagggt catgcagcac cgcaatgcag
2041 ggcccgtagg cctccaagcc agggaaataa caacgggaga cagtccgagc acggtgacat
2101 catttttcga cctatgcagg cccctgacct aagtttaccc atggcggcgc tgtctctaag
2161 tacccatgag cgccctctgg tgaaagccac tatttcttgc accaacctcc cgccggattt
2221 tcaaggccct ctatctatct ttgtcactgc cctcatagat tccggcgccg acgttactgt
2281 ggtcacggaa acagaatggc catcctcgtg gcccgcggag gcctcgcagt ctattatggg
2341 ggttggaggg gcgaccccct cacgccggtc taccaatgag gtacaagcgg ttgtgattaa
2401 cagggatggc tccttagaga aaccggcgtt gcttacgcca ttggtggcgc gtgtccccgg
2461 aactcttctg gggcgggatt tcttgcgaca gataggcatt ccacagtatc ctctgaacac
2521 cttcaaggga tacgtcacta atgttactgc ttgcgataac gatgccgatt tagccagcca
2581 aacagcatgc ttgataaagg ctctaaatac aaccctccct tgggaccccc aagaattgga
2641 tattttaggg tcccagatga tcaagaacgg aacaacacgt acgtgtgtta cctttggttc
2701 agtgtgctat aaagagaaca atcgcagtat agtctgtcac aattttgatg ggaattttaa
2761 tgggactggt ggggcggaag cagaattgcg tgacttcata gcaaaatgga aaagtgatga
2821 ccttcttata aggccctatg tcaaccaatc atggacgatg gtaagtccaa taaacgtaga
2881 gagtttttca ataagtcgta gatattgtgg attcaccagt aacgagactc gttactatag
2941 aggggacctt tctaattggt gtggttcaaa aaggggaaaa tggtcagcgg ggtacagcaa
3001 cgggacaaaa tgttccagca acacgacggg ttgcggtggt aattgcacaa cggaatggaa
3061 ttattatgca tatgggttta ccttcgggaa acagccagag gtgttgtgga acaatgggac
3121 tgctaaggca ctcccaccag gtattttctt gatttgtggg gacagggctt ggcaaggcat
3181 cccgcgtaat gccttgggag ggccctgtta tctaggacaa ttgactatgc tctctcctaa
3241 ctttaccacc tggatagcgt atgggccgaa cattacgggt caccgccgta gcaggcgctc
3301 gctgagtcgt ctctcgcctg actgcggtga tgagctacag ctatggagtg tgacagcccg
3361 gatatttgct tctttctttg ctcctggtat agcagcagca caggccttaa aggagatcga
3421 acgattggca tgttggtcgg ttaagcaagc gaatttaaca tcattaatat tgaatgcgat
3481 gctggaggac acgagcagca tccggcacgc agtgttgcag aatcgagcag ccatcgattt
3541 cttactcctg gcgcagggac acgggtgtca agacgtggaa gggatgtgtt gcttcaatct
3601 cagcgatcac agtgagtcca ttcacaaggc gctccaagcc atgaaggaac atacagagaa
3661 gatacgggtg gaagatgatc ccatagggga ctggtttacg cgcacgtttg gtgatctagg
3721 aaggtggctc gcgaaaggtg ttaagacgct actgtttgcc ttgcttgtca tagcctgtct
3781 attagctatc attccatgtt taatcaagtg ctttcaggat tgtctattga gaacaatgaa
3841 tcagtttatg gatgaacgca taaaatatca tagaattagg gagcagctgt aggttccgaa
3901 cgcgatgtga cgggagctat cggcataggg agggggagat gttgtaggcg tagcgaggga
3961 aacgaggtgt gttgtaggcg tagcgaggga aacgaggtgt gacgcgtgca ggttcctatg
4021 ccacctgtgt gttatgccac ctgtgtgtgc caagtgtaac ttcgtgattg gaggaaacac
4081 ttgtatttaa acacgtagcc tatagcaata aacgccattt gcctcactta ctcctggggt
4141 ctgggtgagc atctggcccc gacctggtaa agggtcggtt tcgcccagca gtaagcccta
4201 catgtggaca gaggacgaac accggaagag cgaacggaga ctacatgcaa caggaacagg
4261 gctgggccct caactctgct tgaagcagaa ggaggcgtct gc
//