LOCUS NC_026168 2130 bp DNA circular VRL 11-JAN-2019
DEFINITION Sewage-associated gemycircularvirus-3 isolate BS4149, complete
sequence.
ACCESSION NC_026168
VERSION NC_026168.1
DBLINK BioProject: PRJNA485481
KEYWORDS RefSeq.
SOURCE Sewage associated gemycircularvirus 3
ORGANISM Sewage associated gemycircularvirus 3
Viruses; Monodnaviria; Shotokuvirae; Cressdnaviricota;
Repensiviricetes; Geplafuvirales; Genomoviridae; Gemykibivirus;
Gemykibivirus sewopo1.
REFERENCE 1 (bases 1 to 2130)
AUTHORS Kraberger,S., Arguello-Astorga,G.R., Greenfield,L.G., Galilee,C.,
Law,D., Martin,D.P. and Varsani,A.
TITLE Characterisation of a diverse range of circular replication
associated protein encoding DNA viruses recovered from a sewage
treatment oxidation pond
JOURNAL Infect. Genet. Evol. 31, 73-86 (2015)
PUBMED 25583447
REFERENCE 2 (bases 1 to 2130)
CONSRTM NCBI Genome Project
TITLE Direct Submission
JOURNAL Submitted (13-JAN-2015) National Center for Biotechnology
Information, NIH, Bethesda, MD 20894, USA
REFERENCE 3 (bases 1 to 2130)
AUTHORS Kraberger,S., Arguello-Astorga,G.R., Martin,D.P. and Varsani,A.
TITLE Direct Submission
JOURNAL Submitted (25-FEB-2014) School of Biological Sciences, University
of Canterbury, Christchurch 8140, New Zealand
COMMENT PROVISIONAL REFSEQ: This record has not yet been subject to final
NCBI review. The reference sequence is identical to KJ547643.
##Assembly-Data-START##
Assembly Method :: DNAbaser v. 2011
Sequencing Technology :: Sanger dideoxy sequencing
##Assembly-Data-END##
COMPLETENESS: full length.
FEATURES Location/Qualifiers
source 1..2130
/organism="Sewage associated gemycircularvirus 3"
/mol_type="genomic DNA"
/isolate="BS4149"
/isolation_source="sewage oxidation pond"
/db_xref="taxon:1843761"
/country="New Zealand"
/collection_date="12-Sep-2012"
gene 109..1020
/locus_tag="SQ07_gp1"
/db_xref="GeneID:38745206"
CDS 109..1020
/locus_tag="SQ07_gp1"
/codon_start=1
/product="capsid protein"
/protein_id="YP_009115528.1"
/db_xref="GeneID:38745206"
/translation="MVYRRQSRRKSAYRPRSAKSRYGRKRTRATRLSRRLRPMSTRSL
LNRTSQKKRDNMLSYTNTTAADPFSTVYSATGAVMRFPVGQTNPGAQFFYVWNATARP
GETSDGQRGSKIDTSLRTSSTIYAKGLREKITVETNNSAPWEWRRICFTSKDDFGERD
PATSTFFRQTSNGMVRMLSALSSGIYLQDELFEGARNVDWLSVFTAPLSRKNFSIKYD
RTRTIRSTNNTGTIRNFKLWHPMEHNIAYQEEAVGESMTDQSVSVTGRVGMGNYYVID
MFRKHGANDDQSTLTFTPEATFFWHEK"
gene complement(976..2108)
/locus_tag="SQ07_gp2"
/db_xref="GeneID:38745207"
CDS complement(join(976..1379,1493..2108))
/locus_tag="SQ07_gp2"
/codon_start=1
/product="replication-associated protein"
/protein_id="YP_009115529.1"
/db_xref="GeneID:38745207"
/translation="MTFRFAAKYGLLTYAQIGDRDVEDFGWRVSDMLGSLGAECIVGR
ESHADGGLHIHAFFMFERKFQSRNVRVFDMDGCHPNIVRGYSTPEDGARYAIKEGDVI
AGGLDVDALGSSVAGSKTVWAQIILAESRDDFFAACAELAPRALLCSFTSLRCYADWK
YREDPAPYRHPEDVSFDTSRFPELDRWVQESLRGTARGKRCSLRSGRRRSLILYGPTK
LGKTLWARSLGNHAYFGGLFSMDESIDNVDYAVFDDMQGGLKFFHSYKFWLGAQSQFY
VTDKYKGKRLVHWGKPCIYLYNHNPLCDEGADHDWLLGNCDIVGLDADDSLLVPEEGS
LGGES"
misc_feature complement(1794..2102)
/locus_tag="SQ07_gp2"
/note="Geminivirus Rep catalytic domain; Region:
Gemini_AL1; pfam00799"
/db_xref="CDD:366313"
misc_feature complement(join(1324..1379,1493..1745))
/locus_tag="SQ07_gp2"
/note="Geminivirus rep protein central domain; Region:
Gemini_AL1_M; pfam08283"
/db_xref="CDD:285483"
gene complement(1383..2108)
/locus_tag="SQ07_gp3"
/db_xref="GeneID:38745208"
CDS complement(1383..2108)
/locus_tag="SQ07_gp3"
/codon_start=1
/product="RepA"
/protein_id="YP_009115530.1"
/db_xref="GeneID:38745208"
/translation="MTFRFAAKYGLLTYAQIGDRDVEDFGWRVSDMLGSLGAECIVGR
ESHADGGLHIHAFFMFERKFQSRNVRVFDMDGCHPNIVRGYSTPEDGARYAIKEGDVI
AGGLDVDALGSSVAGSKTVWAQIILAESRDDFFAACAELAPRALLCSFTSLRCYADWK
YREDPAPYRHPEDVSFDTSRFPELDRWVQESLRGTARGKRCSLRSGSDVDSGFLPIHG
SVPLGPIRLRRSDHSRILRNANL"
misc_feature complement(1794..2102)
/locus_tag="SQ07_gp3"
/note="Geminivirus Rep catalytic domain; Region:
Gemini_AL1; pfam00799"
/db_xref="CDD:366313"
ORIGIN
1 taatattatt ctacctttca gatggtgtca gtgtcagcta tttatctata aaaagcccac
61 gccccagccc ctttggcccc caaaattaaa tgcctcaaaa tccctcacat ggtctaccgg
121 cgccaatcgc gacgcaagtc tgcatatcga ccccgcagtg ccaagtcccg ctatggacgc
181 aagcggacaa gagccactcg gctgtcccga aggttgcgac ctatgtccac ccgaagcctt
241 ctcaaccgga ccagtcagaa aaagcgggat aacatgttgt cctacaccaa tacaacagca
301 gccgatcctt tctccaccgt gtactctgca acgggagcag tcatgcgatt tccagtggga
361 cagactaacc caggagcaca atttttctac gtatggaatg caacagcccg accaggagaa
421 acctccgacg gtcaacgagg atccaagatc gacacgtcac tgcgtacttc ctcaacaatc
481 tacgccaagg gattacgcga gaaaatcact gtggagacca ataactccgc cccctgggaa
541 tggcgccgta tatgctttac ctcgaaggac gattttgggg aacgtgatcc cgccacctcc
601 accttctttc gccagacgtc caacggcatg gtccgcatgc tgtccgcatt gtcaagcgga
661 atatacctac aggacgaact attcgagggt gcccgcaatg tcgattggct gtcagtgttt
721 accgccccgc tctcccggaa gaacttctcc attaagtacg accgaacacg gaccatccgg
781 tctacaaaca acaccggaac aattcgaaat ttcaagcttt ggcacccaat ggaacataat
841 atagcctatc aagaagaggc ggttggtgag agtatgactg atcagtctgt gtcagtcacg
901 ggtcgagtag gaatgggcaa ctattacgtc attgatatgt tccgtaagca cggggctaat
961 gacgaccaat ctaccctaac tttcaccccc gaggctacct tcttctggca cgagaagtga
1021 gtcatctgcg tccagaccca ctatgtcgca atttcccaac aaccaatcat gatccgcacc
1081 ctcgtcgcat agagggttgt ggttgtacag atagatgcag ggcttgcccc aatgaaccag
1141 tcgtttcccc ttatacttgt ccgttacgta aaattgtgac tgtgcaccca accagaattt
1201 gtaactgtga aaaaacttca atccaccttg catatcgtcg aaaacagcat agtcgacgtt
1261 gtcgatggac tcatccatgg aaaacaggcc cccaaagtaa gcgtggttgc ctaaacttcg
1321 cgcccacaga gttttgccta gttttgtggg tccatacaag atgaggcttc gtcttcggcc
1381 tattacaagt tagcattacg taaaatcctg gagtggtccg agcgccggag gcggatcgga
1441 cctaacggaa ccgagccatg aattggcaaa aaccccgagt caacatcaga acccgagcgg
1501 agcgagcagc gcttacctct agcagtccct ctaagagact cttgcaccca tcgatcgagt
1561 tcaggaaatc ttgacgtgtc aaatgataca tcctctgggt gtcgataagg agctgggtcc
1621 tctctgtact tccaatcggc ataacatcgg agtgaggtga atgaacacag aagtgccctt
1681 ggtgccagtt ccgcacatgc cgcaaaaaag tcgtctcgag actccgccag gatgatttga
1741 gcccagacag ttttagatcc agccactgag cttccaagag cgtccacgtc gagtcctcca
1801 gcaataacgt caccctcctt gatcgcgtat cgtgctccat cttcaggagt gctgtagcca
1861 cggacgatat ttggatgaca tccatccata tcaaacacac ggacatttcg tgattggaac
1921 ttccgttcga acatgaagaa agcatggata tgcagtcctc catcagcgtg agactctcgg
1981 cccacgatac actccgctcc aagcgatcca agcatgtcgc taactctcca cccgaaatcc
2041 tccacgtctc ggtctccgat ttgggcatac gtgagtaatc catacttggc ggcgaatcga
2101 aaagtcatgt gacagacact gaaaggtaga
//