LOCUS NC_014470 29276 bp ss-RNA linear VRL 31-DEC-2021
DEFINITION Bat coronavirus BM48-31/BGR/2008, complete genome.
ACCESSION NC_014470
VERSION NC_014470.1
DBLINK BioProject: PRJNA485481
KEYWORDS RefSeq.
SOURCE Bat coronavirus BM48-31/BGR/2008
ORGANISM Bat coronavirus BM48-31/BGR/2008
Viruses; Riboviria; Orthornavirae; Pisuviricota; Pisoniviricetes;
Nidovirales; Cornidovirineae; Coronaviridae; Orthocoronavirinae;
Betacoronavirus; Sarbecovirus.
REFERENCE 1 (bases 1 to 29276)
AUTHORS Drexler,J.F., Gloza-Rausch,F., Glende,J., Corman,V.M., Muth,D.,
Goettsche,M., Seebens,A., Niedrig,M., Pfefferle,S., Yordanov,S.,
Zhelyazkov,L., Hermanns,U., Vallo,P., Lukashev,A., Muller,M.A.,
Deng,H., Herrler,G. and Drosten,C.
TITLE Genomic characterization of severe acute respiratory
syndrome-related coronavirus in European bats and classification of
coronaviruses based on partial RNA-dependent RNA polymerase gene
sequences
JOURNAL J Virol 84 (21), 11336-11349 (2010)
PUBMED 20686038
REFERENCE 2 (bases 1 to 29276)
CONSRTM NCBI Genome Project
TITLE Direct Submission
JOURNAL Submitted (26-AUG-2010) National Center for Biotechnology
Information, NIH, Bethesda, MD 20894, USA
REFERENCE 3 (bases 1 to 29276)
AUTHORS Drexler,J.F., Corman,V.M. and Drosten,C.
TITLE Direct Submission
JOURNAL Submitted (09-NOV-2009) Institute of Virology, University of Bonn
Medical Centre, Sigmund Freud-Str. 25, Bonn 53127, Germany
COMMENT REVIEWED REFSEQ: This record has been curated by NCBI staff. The
reference sequence is identical to GU190215.
COMPLETENESS: full length.
FEATURES Location/Qualifiers
source 1..29276
/organism="Bat coronavirus BM48-31/BGR/2008"
/mol_type="genomic RNA"
/strain="BtCoV/BM48-31/BGR/2008"
/host="Rhinolophus blasii"
/db_xref="taxon:864596"
/country="Bulgaria"
/collection_date="2008"
5'UTR 1..190
gene 191..21384
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/db_xref="GeneID:9714832"
CDS join(191..13297,13297..21384)
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/ribosomal_slippage
/note="ORF1ab polyprotein is cleaved to yield the
RNA-dependent RNA polymerase and other nonstructural
proteins; polyprotein pp1ab"
/codon_start=1
/product="ORF1ab polyprotein"
/protein_id="YP_003858583.1"
/db_xref="GeneID:9714832"
/translation="MESLVPGFNEKTHVQLSLPVLQVRDVLVRGFGDTVEEAVAEARQ
HLIEGTCGIVDLQKGVLPQLEQPYIFLKRCDARTAPHGHVMVELVAELDGVQYGRSGE
SLGVLVPHVGETPIGYRKVLVRKNGNKGAGGHLYGADLRFYDLGDELGTDPLDDFQQD
WNTKHGSGLRRDLFRELNGGVYTRYVDNNFCGPDGYPLECIKDLLARAGKSSAPLAEQ
LDFLESKRGVYCCREHEHEIAWYTERSDKSYELQTPFDITNAKKFDSFKGECPKFVFP
LNSTVKVLQPRVEKKKTEGFLGRIRTVYQVASPGECNSMHLSTYMNCNHCGEKSWQTC
DFLTATCEMCGNQNTVEEGPTTCGYVPSNAVVKMVCPACQNPEIGPDHSVADYHNNSK
IETRLRKGGRIKSFGGCVFSYVGCYNKRAFWVPRAAANIGSNHTGVVGEGVETMNEDL
LQILSRERVVINIVGEFCLNEEIAILLASLSASTSAFVETVKNLDFKTFKKIIESCGN
YKVTKGKFKPGVWNIGTSKSLLTPLHCFSSQAAGVVRSIFSRTLATANHSIVDLHRAA
MIIFSDISDQANRVLDAMVNTSDLVTESVVVMAYLTGGLVQQVSTWLSQLLNTSVDKF
SAVLRWLEQKLQGGIDFLRQAWGILKLLVTGAYVVIRGKIQVVNTSLIECVTSFVDVV
NKVFELCTDYITVAGARVRAINFGEVLIAQSRGLYRQCVRARDQLQLLMPLKSPKDVV
FLDGDAYDTLLTSEEVTVKNGTLEALDLELSDVVTGVAEGVPVCVNGLMLLELKEKEQ
YCALSPSLLATNNVFTLKGGAPTKGVTFGEDTVVEIQGYKSVKITFELDERVDKVLNE
KCASYTVETGTTAEELACVVAESVVKTLQPISELLTPMGIDLDEWSVAKFYLFDESGE
AVLSSHMYCSFYPPDEEEEEDLEESEDVEYGTEDDYTGAPLEFGASSTVEQDEVHDEE
EDWLAPQEESEVLYDQFTDYHKLTDNVFIKCADIVEESLKVNPTVVVNAANIHLKHGG
GVARALDKATGGSMQKESNDYISTNGPLRVGGSCLLSGHNLAKHCLHVVGPNKNAGED
IKLLDAAYENFNAYEVVLSPLLSAGIFGVSPIQSLETCKRVVRNTVYIVVNDSVVFDQ
LLAKTPGKTNERPVVESSEICEEVNQKPVVEFSETKELHEETNQKLKSSEEPVKTRIE
ELNTTVDEAKFLTTKLLLYADVNGNLSEDSKVLIGNDGASFKKGAPYIVGDIISEGEL
TCVVLPTKAVGGTTHMLTRALKNVPSDTYLTTYPGQGVSGYTLDEAKAALKKSRSVFY
ILPSANVNAKEEVLGTVAWNLREMLAHAEETRKVMPVCMDVRAIISTIQRKYKGIGIQ
EGLVDYKVRFYFYSSKTPIARVISNLNSLGEPLITMPLGYVTHGLNLEESARYMRSVK
VPVVVSVSSPDAVTSYNGYVTSASKSAEEHFIETVSLAGSYKDWSYSGQRTELGVEFL
KRGDKIVYHTVGNVIEFHMEGEVLPLEKLKTLLALREVKTIKVFTTVDNINLHTQVID
MSMTYGQQLGPTYMDGADLTKVKPHASHENKTFFVLPSDDTLRIEAFEYYHTVDESFF
GRYMSALNHTKRWKYPQVGGLTSIKWADNNCYLSSVLLSLQQIDIKFNAPALQDAYYR
ARAGDAANFCALVLAYSKKTVGELGDVRETMAHLLQHANLESAKRVLNVVCKHCGQKS
TTLSGVEAVMYMGTLSYDHLKRGVKIPCVCGREATQYLVKQESTFVMMSAPPAEYTLQ
TGEFLCANEYTGNYQCGHYTHITNRETIYKIDGALLTKITEYKGPVADVFYKETSYST
DIKPVSYKLDGVTYTEINPDLNGYYKKDNAYYTEQPIDLVPTQPLPNASFDNFRFVCA
NTKFADDLNQMTGFKKPPSRDLTITFFPDLNGDVVAIDYRHYTPTFKKGAKLVHKPIL
WHVNQTTTKSTFKPNMWCLRCLYSTKPVPTSNSFEVLSSDDAQGMDNLACESQQTVAE
EVVDNPTIQKDIIECDVKTTEVVGNVILKPSADGIKVTSELEHEDLMAAYVNETSITI
KKPNELSIMLGLKTIATHGAAAINSVPWIKICAYVKPFLGYVAEQSKNCIKRCFRRVF
NDYMPFLLTLLLQLCTFTKSTNFRIKAAMPIVIARNSVIGGVRFCLDALTMYVKSPKF
SGILTVVMWLLLLSVCLGCLVYAVASFGAILSGFGLMSYCDGVRAGYVNSSNVTIPDY
CAGSLPCGVCLGGLDSLDAYPALETIQVTISSYKLDLTFVGMMAEWFLAYMLFTKFFY
LLGLFALMQLFFGLFATHFVNNSWLMWLIINVVQMAPISAMVRMYVFFASFYYVWKAY
IHVINGCTSSTCIMCYKRNRATRVECTTIVNGMKKSFYVYANGGQGFCKLHNWNCLNC
DTFCSGSTFISDEVARDLSLQFKRPINPTDQSSYNVDSVTVKDGTLYLYFQKAGKLTY
ERHPLSYFVNLDNLRANNVKGTLPINVIVFDGKSKCEEAAAKSASVYYSQLMCQPILL
LDQALISDVGDSTEVAVKMFDAYVNAFSSTFNAPMEKLKTFIATAHAEIAKGVSLDSV
LSTFLSAARQGFVDSDVDTKDVMECLKLSHHSDLEITSDSCNNFMLTYNKVENMTPRD
LGACIDCSARHINAQVAKSHNVSLVWNVKDYMSLSEQLRKQIRSAAKKNNIPFKLTCA
TTRQVVNVITTKISLKGGKFVSNNWFRFLLKMTVLMVLVAFIFYFITPTHTLMGHDVF
SSEIIGYKAIHNGVTRDVLTTDDCFANKHTGFDHWFSQRGGSYRNDKTCPVIAAVITR
EVGFIVPGLPGTVRRASNGDFLHFLPRVFSAVGNICYTPAKLIEYTDFATSACVLAAE
CTIFKDAQGKPVPYCYDTNLLEGSISYSELRPDTRYVLMDGSIIQFPSTYLEGSVRVV
TTFDSEYCRHGTCERSDAGVCLSTNGRWVLNNDYYRSIPGVFCGADASDLLFNIFTPL
VRPVGTLDISASVVAGGLIAILVTCVAYYFMKFRRAFGEYNHVVFANALLFLLSFTIL
CLTPAYTFLPGIYSLLYLYLTFYFTNDVSFLAHLQWLAMFSPIVPFWITVTYVVCISI
KHCHWFFSNYLKKRVVFNGVTFSTFEEAALCTFLLNKEMYLKLRSETLLPLTQYNRYL
ALYNKYKYFSGALDTTSYREAACCHLAKALNDFSNSGADVLYQPPQTSITSAVLQSGF
RKMAFPSGKVEGCMVQVTCGTTTLNGLWLDDVVYCPRHVICTLEDMLNPNYDDLLIRK
SNHNFLVQASNVQLRVIGHTMQNCLLKLKVDIANPKTPKYKFVRIQPGQTFSVLACYN
GAPSGVYQCAMRSNHTIKGSFLNGSCGSVGFNIDYDCVSFCYMHHMELPTGVHAGTDL
EGNFYGPFVDRQTAQAAGTDTTITLNVLAWLYAAVINGERWFLNRFTTTLNDFNLVAM
KYNYEPLTQDQVDILGPLSAQTGVAVMDMCAALKELLQNGLNGRTILGSTILEDEFTP
FDVVRQCSGVTFQGKFKKVVKGTHHWLLLTLLTSLLILVQSTQWSLFFFVYEHAFLPF
TMGVVCFAACAMVLVKHKHAFLCLFLLPSLITVAYFNMIYMPASWVMRVMTWLDLVDT
SLSGYRLKDCVMYALAAFLLILMTARTVYDDAARRVWTVMNVITLVYKVYYGNSLDQA
LAMWALVISVTSNYSGVVTTIMFLARAIVFLCVEYYPILFITGNTLQCIMLVYCFLGY
CCCCYFGLFCLLNRYFRLTLGVYDYFVSTQEFRYMNSQGLLPPKTSLDAFKLNVKLLG
IGGKPCIKVATVQSKMSDIKCTSVVLLSVLQQLRIESSSKLWAQCVQLHNDILLAKDT
TEAFEKMVSLLSVLLSMQGAVDINKLCDEMLNNRATLQAIASEFSSLPSYAAYATAQE
AYEQAVANGDSEVVLKKLKKSLNVAKSEFDRDAAMQRKLEKMADQAMTQMYKQARSED
KRAKVTSAMQTMLFTMLRKLDNDALNNIINNARDGCVPLNIIPLTTAAKLMVVVPDYN
TYKNTCDGNTFTYASALWEIQQVVDADSKVVQLSEINMDNSQNLAWPLIVTALRSNSA
VKLQNNELSPVALRQMSCAAGTTQTACTDDNALAYYNTSKGGRFVLALLSDHQDLKWA
RFPKSDGTGTIYTELEPPCRFVTDTPKGPKVKYLYFIKGLNNLNRGMVLGSLAATVRL
QAGNATEVPANSTVLSFCAFAVDPAKAYKDYLASGGQPITNCVKMLCTHTGTGQAITV
IPEANMDQESFGGASCCLYCRCHIDHPNPKGFCDLKGKYVQIPTTCTNDPVGFILRNT
VCTVCGMWKGYGCSCDQLREPVMQAADAPAFLNRVCGVSAARLTPCGTGTSTDVVYRA
FDIYNEKVAGFAKFLKTNCCRFQEVDEEGNLLDSYFVVKRHTMSNYQHEETMYNLVKE
CPAVAVHDFFKFRVDGDMVPHISRQRLTKYTMADLVYALRHFDEGNCDTLKEILVTYN
CCDDAYFNKKDWYDFVENPDILRVYACLGERVRQALLKTVQFCDAMRDAGIVGVLTLD
NQDLNGNWYDFGDFVQVAPGAGIPIVDSYYSLLMPILTLTKALAAESHMDCDTTKPLI
KWDLLKYDFTEERLCLFNRYFKYWDQTYHPNCINCLDDRCILHCANFNVLFSTVFPPT
SFGPLVRKIFVDGVPFVVSTGYHFRELGVVHNQDVNLHSSRLSFKELLVYAADPAMHA
ASGNLLLDKRTTCFSVAALTNSVAFQTVKPGNFNKDFYDFAVSKGFFKEGSSVELKHF
FFAQDGNAAISDYDYYRYNLPTMCDIRQLLFVVEVVDKYFDCYDGGCINANQVIVNNL
DKSAGFPFNKWGKARLYYDSMSYEDQDALFAYTKRNVIPTITQMNLKYAISAKNRART
VAGVSICSTMTNRQFHQKLLKSIAATRGATVVIGTSKFYGGWHNMLKTVYSDVETPNL
MGWDYPKCDRAMPNMLRIMASLVLARKHSTCCNLSHRFYGLANECAQVLSEMVMCGGS
LYVKPGGTSSGDATTAYANSVFNICQAVTANVNALLSTDGNKIADKYVRNLQHRLYEC
LYRNRDVDHEFVEEFYAYLRKHFSMMILSDDAVVCYNSNYAAQGLVASIKNFKAVLYY
QNNVFMSEAKCWTETDLTKGPHEFCSQHTMLVKQGDDYVYLPYPDPSRILGAGCFVDD
IVKTDGTLMIERFVSLAIDAYPLTKHPNQEYADVFHLYLQYIRKLHDELTGHMLDMYS
VMLTNDNTSRYWEPEFYEAMYTPHTVLQAVGACVLCNSQTSLRCGSCIRRPFLCCKCC
YDHVISTSHKLVLSVNPYVCNAPGCDVTDVTQLYLGGMSYYCKSHKPPISFPLCANGQ
VFGLYKNTCVGSDNVTDFNAIATCDWTNAGDYILANTCTERLKLFAAETLKANEETFK
LSYGIATVREVLSDRELHLSWEIGKPRPPLNRNYVFTGYRVTKNSKVQIGEYTFEKGD
YGDAVVYRGTTTYKLNVGDYFVLTSHTVMPLTAPTLVPQEHYVRITGLYPTLNISDEF
SSNVANYQKVGMQKYSTLQGPPGTGKSHFAIGLALYYPSARIVYTACSHAAVDALCEK
ALKYLPIDKCSRIIPARARVECFDKFKVNSTLEQYVFCTVNALPETTADIVVFDEVSM
ATNYDLSVVNARLRAKHYVYIGDPAQLPAPRTLLTKGTLEPEYFNSVCRLMKTIGPDM
FLGTCRRCPAEIVDTVSALVYDNKLRAHKGKSSQCFKMFYKGVITHDVSSAINRPQIG
VVREFLTRNPAWRKAVFISPYNSQNAVASKILGLPTQTVDSSQGSEYDYVIFAQTTET
AHSCNVNRFNVAITRAKVGILCIMSDKDLYDKLQFTSLEVPRRSVAVLQSENVTGLFK
DCSKLITGLHPTQAPTYLSVDTKFKTEGLCVDIPGIPKDMTYRRLISMMGFKMNYQVN
GYPNMFITRDEAIKHVRAWIGFDVEGCHATRDAVGTNLPLQLGFSTGVNLVAVPTGYV
DTSAATEFSRVNAKPPPGDQFKHLIPLMYKGLPWNIVRVKIVQMLSDTLKDLSDRVVF
VLWAHGFELTSMKYFVKIGPERTCCLCDKRATCFSTSSDTYACWHHSVGFDYVYNPFM
IDVQQWGFTGNLQSNHDQHCQVHGNAHVASCDAIMTRCLAIHECFVKRVDWSVEYPII
GDELRINVACRKVQHMVVKSALLADKFPVLHDIGNPKAIKCVPQADVEWKFYDVQPCS
DKAYKIEELFYSYATHHDKFTDGVCLFWNCNVDRYPSNAIVCRFDTRVLSNLNLPGCD
GGSLYVNKHAFHTPAFDKGAFANLKQLPFFYYSDSPCESHGKQVVSDIDYVPLKSATC
ITRCNLGGAVCRHHASEYRQYLDAYNMMISAGFSLWIYKQFDTYNLWNTFTRLQSLEN
VAYNVVNKGHFDGQAGEKPVSIINNTVYTKVDGVDVEIFENKTTLPVNVAFELWAKRN
IKPVPEIKILNNLGVDIAANTVIWDYKRESPAYISTIGVCTMTDIAKKPTENACSSLT
VFFDGRVDGQVDSFRNARNGVLITEGSVKGLNPSKGPPQASLNGVTLIGESVKTQFNY
FKKVDGVVQQLPETYFTQSRSLDDFKPRSQMEVDFLQLAMDEFIERYKLEGYAFEHIV
YGDFSHGQLGGLHLMIGLAKRSLESLLKLEDFIPIDSTVKNYFVTDAQTGSSKCVCSV
IDLLLDDFVEIIKSQDLSVVSKVVTVTIDYAEISFMLWCKDGHVETFYPKLQANQTWQ
PGVAMPNLYKMQRMLLDKCDLHNYGENAVIPKGIMMNVAKYTQLCQYLNTLTIAVPYN
MRVIHFGAGSDKGVAPGSAVLKQWLPVGTLLVDSDINDFVSDADSTLIGDCSTVYTAN
KWDLIISDMYDPKTKHILKENDSKEGFFTYLCGFIKQKLALGGSVAIKITEHSWNADL
YKLMGYFSWWTAFVTNVNASSSEAFLIGVNYLGKQKESIDGYTMHANYIFWRNTNPIQ
LSSYSLFDMSKFPLKLRGTAVMSLKDNQINDMICSLLEKGRLIIRENNKVVFSSDVLV
NN"
misc_feature 227..571
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="N-terminal domain of non-structural protein 1 from
Severe acute respiratory syndrome-related coronavirus and
betacoronavirus in the B lineage; Region:
SARS-CoV-like_Nsp1_N; cd21796"
/db_xref="CDD:439285"
misc_feature 572..730
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="C-terminal domain of non-structural protein 1 from
Severe acute respiratory syndrome-related coronavirus and
betacoronavirus in the B lineage; Region:
SARS-CoV-like_Nsp1_C; cd22662"
/db_xref="CDD:439355"
misc_feature order(632..646,650..673,707..715,719..724,728..730)
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="oligomer interface [polypeptide binding]; other
site"
/db_xref="CDD:439355"
misc_feature order(668..670,680..685,689..694,701..706,713..715)
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="RNA binding site [nucleotide binding]; other site"
/db_xref="CDD:439355"
misc_feature 734..2644
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="betacoronavirus non-structural protein 2 (Nsp2)
similar to SARS-CoV Nsp2, and related proteins from
betacoronaviruses in the B lineage; Region:
betaCoV_Nsp2_SARS-like; cd21516"
/db_xref="CDD:439199"
misc_feature 2831..3220
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="Protein of unknown function (DUF3655); Region:
DUF3655; pfam12379"
/db_xref="CDD:432517"
misc_feature 3233..3604
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="X-domain (or Mac1 domain) of viral non-structural
protein 3 and related macrodomains; Region:
Macro_X_Nsp3-like; cd21557"
/db_xref="CDD:438957"
misc_feature order(3251..3259,3269..3289,3512..3517,3521..3535,
3599..3601)
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="ADP-ribose binding site [chemical binding]; other
site"
/db_xref="CDD:438957"
misc_feature 3788..4162
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="SUD-N macrodomain (or Mac2 domain) of the SARS
Unique Domain (SUD) of SARS-CoV non-structural protein 3
and related macrodomains; Region:
Macro_cv_SUD-N_Nsp3-like; cd21562"
/db_xref="CDD:394883"
misc_feature 4190..4567
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="Single-stranded poly(A) binding domain; Region:
SUD-M; pfam11633"
/db_xref="CDD:431970"
misc_feature 4574..4774
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="C-terminal SARS-Unique Domain (SUD) of
non-structural protein 3 (Nsp3) from Severe Acute
Respiratory Syndrome coronavirus and related
betacoronaviruses in the B lineage; Region:
SUD_C_SARS-CoV_Nsp3; cd21525"
/db_xref="CDD:394841"
misc_feature 4787..5695
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="betacoronavirus papain-like protease; Region:
betaCoV_PLPro; cd21732"
/db_xref="CDD:409649"
misc_feature order(5105..5116,5264..5272,5276..5281,5288..5290,
5300..5302,5375..5377,5399..5404,5447..5449,5453..5455,
5522..5524,5570..5572,5588..5599,5681..5683)
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="ubiquitin binding site [polypeptide binding]; other
site"
/db_xref="CDD:409649"
misc_feature order(5264..5272,5519..5524,5570..5572,5579..5581,
5588..5590,5597..5599,5681..5683)
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="polypeptide substrate binding site [polypeptide
binding]; other site"
/db_xref="CDD:409649"
misc_feature 5828..6148
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="nucleic acid binding domain of non-structural
protein 3 from Severe acute respiratory syndrome-related
coronavirus and betacoronavirus in the B lineage; Region:
SARS-CoV-like_Nsp3_NAB; cd21822"
/db_xref="CDD:409348"
misc_feature 6221..6568
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="betacoronavirus-specific marker of non-structural
protein 3 from Severe acute respiratory syndrome-related
coronavirus and betacoronavirus in the B lineage; Region:
SARS-CoV-like_Nsp3_betaSM; cd21814"
/db_xref="CDD:409629"
misc_feature 6785..8377
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="C-terminus of non-structural protein 3, including
transmembrane and Y domains, from Severe acute respiratory
syndrome-related coronavirus and betacoronavirus in the B
lineage; Region: TM_Y_SARS-CoV-like_Nsp3_C; cd21717"
/db_xref="CDD:409665"
misc_feature 6785..6853
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="TM1 [structural motif]; Region: TM1"
/db_xref="CDD:409665"
misc_feature 7100..7168
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="TM2 [structural motif]; Region: TM2"
/db_xref="CDD:409665"
misc_feature 8423..9565
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="coronavirus non-structural protein 4 (Nsp4)
transmembrane domain; Region: cv_Nsp4_TM; cd21473"
/db_xref="CDD:394836"
misc_feature 8423..8491
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="putative TM helix 1 [structural motif]; Region:
putative TM helix 1"
/db_xref="CDD:394836"
misc_feature 9221..9286
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="putative TM helix 2 [structural motif]; Region:
putative TM helix 2"
/db_xref="CDD:394836"
misc_feature 9329..9394
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="putative TM helix 3 [structural motif]; Region:
putative TM helix 3"
/db_xref="CDD:394836"
misc_feature 9473..9541
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="putative TM helix 4 [structural motif]; Region:
putative TM helix 4"
/db_xref="CDD:394836"
misc_feature 9599..9877
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="Coronavirus nonstructural protein 4 C-terminus;
Region: Corona_NSP4_C; pfam16348"
/db_xref="CDD:406690"
misc_feature 9893..10783
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="betacoronavirus non-structural protein 5, also
called Main protease (Mpro); Region: betaCoV_Nsp5_Mpro;
cd21666"
/db_xref="CDD:394887"
misc_feature order(9893..9916,9923..9925,10235..10237,10247..10267,
10292..10306,10379..10381,10397..10399,10739..10741,
10751..10753,10775..10780)
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="homodimer interface [polypeptide binding]; other
site"
/db_xref="CDD:394887"
misc_feature order(9944..9946,9953..9961,10028..10030,10043..10045,
10301..10318,10370..10381,10385..10387,10397..10399,
10442..10444,10448..10456)
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="polypeptide substrate binding site [polypeptide
binding]; other site"
/db_xref="CDD:394887"
misc_feature 10802..11671
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="betacoronavirus non-structural protein 6; Region:
betaCoV-Nsp6; cd21560"
/db_xref="CDD:394846"
misc_feature 11672..11920
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="betacoronavirus non-structural protein 7; Region:
betaCoV_Nsp7; cd21827"
/db_xref="CDD:409253"
misc_feature order(11675..11677,11684..11695,11702..11710,11714..11719,
11726..11728,11753..11755,11762..11764,11780..11782,
11816..11833,11837..11854,11873..11887)
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="oligomer interface [polypeptide binding]; other
site"
/db_xref="CDD:409253"
misc_feature 11921..12511
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="nsp8 replicase; Region: nsp8; pfam08717"
/db_xref="CDD:400866"
misc_feature 12515..12853
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="betacoronavirus non-structural protein 9; Region:
betaCoV_Nsp9; cd21898"
/db_xref="CDD:409331"
misc_feature 12515..12532
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="N-finger; other site"
/db_xref="CDD:409331"
misc_feature order(12521..12529,12533..12541,12731..12736,12800..12805,
12809..12817,12821..12829,12833..12841,12845..12853)
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="homodimer interface [polypeptide binding]; other
site"
/db_xref="CDD:409331"
misc_feature 12854..13246
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="alphacoronavirus and betacoronavirus non-structural
protein 10; Region: alpha_betaCoV_Nsp10; cd21901"
/db_xref="CDD:409326"
misc_feature order(12854..12877,12887..12889,12893..12901,12905..12916,
12926..12931,12938..12943,12950..12952,12971..12988,
13025..13030,13058..13060,13064..13069,13079..13081,
13085..13102,13115..13123,13130..13141)
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="Nsp14 interface [polypeptide binding]; other site"
/db_xref="CDD:409326"
misc_feature order(12893..12901,12905..12913,12926..12928,12971..12973,
12977..12988,13025..13033,13085..13096,13103..13105,
13136..13141,13196..13198)
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="oligomer interface [polypeptide binding]; other
site"
/db_xref="CDD:409326"
misc_feature order(12914..12919,12926..12928,12977..12988,13025..13033,
13103..13105,13136..13141,13196..13198,13208..13210)
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="homotrimer interface [polypeptide binding]; other
site"
/db_xref="CDD:409326"
misc_feature order(12971..12994,13025..13030,13058..13069,13082..13087,
13091..13093,13130..13141)
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="Nsp16 interface [polypeptide binding]; other site"
/db_xref="CDD:409326"
misc_feature join(13286..13297,13297..16065)
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="Severe acute respiratory syndrome coronavirus
RNA-dependent RNA polymerase, also known as non-structural
protein 12, and similar proteins from betacoronaviruses in
the B lineage: responsible for replication and
transcription of the viral RNA genome; Region:
SARS-CoV-like_RdRp; cd21591"
/db_xref="CDD:394895"
misc_feature order(14074..14088,14236..14241,14245..14247,14251..14265,
14281..14292,14299..14301,14371..14373,14380..14385,
14389..14394,14401..14403,14407..14421,14425..14445,
14455..14457,14461..14463,14473..14478,14488..14490,
14782..14787,14794..14796,14809..14814,14818..14820,
14827..14838,15265..15267)
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="Nsp8 interaction site [polypeptide binding]; other
site"
/db_xref="CDD:394895"
misc_feature order(14494..14508,14512..14514,14527..14529,14554..14556,
14560..14562,14587..14604,14917..14919,14923..14925,
15796..15798)
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="Nsp7 interaction site [polypeptide binding]; other
site"
/db_xref="CDD:394895"
misc_feature order(14761..14763,14767..14775,14902..14904,14938..14940,
14944..14946,14962..14964,14974..14976,14986..14988,
14998..15000,15037..15039,15043..15045,15049..15051,
15313..15324,15331..15333,15541..15552,15706..15711,
15763..15765,15787..15789,15838..15843,15853..15855,
15859..15864)
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="putative RNA binding site [nucleotide binding];
other site"
/db_xref="CDD:394895"
misc_feature 14767..14808
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="conserved polymerase motif G; other site"
/db_xref="CDD:394895"
misc_feature 14881..14949
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="conserved polymerase motif F; other site"
/db_xref="CDD:394895"
misc_feature order(14914..14916,15307..15315,15328..15330,15340..15342,
15544..15546)
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="inhibitor binding site [chemical binding];
inhibition site"
/db_xref="CDD:394895"
misc_feature 15100..15150
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="conserved polymerase motif A; other site"
/db_xref="CDD:394895"
misc_feature 15307..15396
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="conserved polymerase motif B; other site"
/db_xref="CDD:394895"
misc_feature 15526..15570
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="conserved polymerase motif C; other site"
/db_xref="CDD:394895"
misc_feature 15592..15657
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="conserved polymerase motif D; other site"
/db_xref="CDD:394895"
misc_feature 15697..15732
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="conserved polymerase motif E; other site"
/db_xref="CDD:394895"
misc_feature 16066..16350
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="Cys/His rich zinc-binding domain (CH/ZBD) of
coronavirus SARS NSP13 helicase and related proteins;
Region: ZBD_cv_Nsp13-like; cd21401"
/db_xref="CDD:439168"
misc_feature order(16198..16200,16261..16263,16267..16269,16306..16308,
16333..16347)
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="oligomer interface [polypeptide binding]; other
site"
/db_xref="CDD:439168"
misc_feature 16360..16503
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="stalk domain of coronavirus Nsp13 helicase and
related proteins; Region: stalk_CoV_Nsp13-like; cd21689"
/db_xref="CDD:410205"
misc_feature order(16369..16371,16456..16458)
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="key interaction residues; other site"
/db_xref="CDD:410205"
misc_feature 16513..16749
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="1B domain of coronavirus SARS NSP13 helicase and
related proteins; Region: 1B_cv_Nsp13-like; cd21409"
/db_xref="CDD:394817"
misc_feature order(16597..16602,16606..16608,16699..16701)
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="nucleic acid substrate binding site [nucleotide
binding]; other site"
/db_xref="CDD:394817"
misc_feature 16711..16719
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="oligomer interface [polypeptide binding]; other
site"
/db_xref="CDD:394817"
misc_feature 16816..17835
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="helicase domain of betacoronavirus non-structural
protein 13; Region: betaCoV_Nsp13-helicase; cd21722"
/db_xref="CDD:409655"
misc_feature order(16918..16935,17275..17277,17392..17394,17677..17679,
17683..17685,17764..17766)
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="ATP binding site [chemical binding]; other site"
/db_xref="CDD:409655"
misc_feature order(16927..16932,17185..17190,17275..17277,17764..17766)
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="putative active site [active]"
/db_xref="CDD:409655"
misc_feature 17881..19443
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="nonstructural protein 14 of betacoronavirus;
Region: betaCoV_Nsp14; cd21659"
/db_xref="CDD:394958"
misc_feature order(17881..17883,17887..17898,17923..17952,18019..18021,
18031..18033,18046..18069,18169..18174,18238..18240,
18244..18246,18256..18261,18442..18444,18451..18456,
18463..18471,18517..18519)
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="heterodimer interface [polypeptide binding]; other
site"
/db_xref="CDD:394958"
misc_feature order(18136..18138,18142..18144,18439..18441,18670..18672,
18685..18687)
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="ExoN active site [active]"
/db_xref="CDD:394958"
misc_feature order(18742..18744,18784..18786,18793..18798,18805..18807,
18865..18876,18880..18882,18922..18930,18964..18972,
19021..19035,19069..19071,19126..19128,19132..19137,
19144..19146,19150..19152,19384..19386)
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="N7-MTase active site [active]"
/db_xref="CDD:394958"
misc_feature 19450..19632
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="N-terminal domain of alpha- and beta-coronavirus
Nonstructural protein 15 (Nsp15), and related proteins;
Region: NTD_alpha_betaCoV_Nsp15-like; cd21171"
/db_xref="CDD:439163"
misc_feature order(19450..19458,19510..19512,19516..19527,19549..19551,
19564..19566,19591..19593,19597..19608)
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="hexamer interface [polypeptide binding]; other
site"
/db_xref="CDD:439163"
misc_feature order(19474..19491,19528..19539,19543..19545,19552..19554,
19558..19575,19582..19593,19630..19632)
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="trimer interface [polypeptide binding]; other site"
/db_xref="CDD:439163"
misc_feature 19642..20037
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="middle domain of alpha- and beta-coronavirus
Nonstructural protein 15 (Nsp15), and related proteins;
Region: M_alpha_beta_cv_Nsp15-like; cd21167"
/db_xref="CDD:439161"
misc_feature order(19678..19683,19711..19713,19717..19719,19723..19725,
19729..19737,19933..19938,19942..19953)
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="trimer interface [polypeptide binding]; other site"
/db_xref="CDD:439161"
misc_feature 19756..19761
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="hexamer interface [polypeptide binding]; other
site"
/db_xref="CDD:439161"
misc_feature 20029..20481
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="Nidoviral uridylate-specific endoribonuclease
(NendoU) domain of coronavirus Nonstructural Protein 15
(Nsp15) and related proteins; Region:
NendoU_cv_Nsp15-like; cd21161"
/db_xref="CDD:439158"
misc_feature order(20053..20055,20059..20061,20167..20175,20239..20241,
20251..20256,20260..20262,20278..20280,20284..20286,
20290..20292,20296..20298,20302..20307,20311..20313)
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="trimer interface [polypeptide binding]; other site"
/db_xref="CDD:439158"
misc_feature order(20149..20151,20161..20163,20185..20190,20194..20196,
20314..20316,20323..20328,20467..20469,20473..20475)
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="putative active site [active]"
/db_xref="CDD:439158"
misc_feature 20494..21378
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="Coronavirus NSP13; Region: NSP13; pfam06460"
/db_xref="CDD:399456"
CDS 191..13312
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="polyprotein pp1a; ORF1a polyprotein is cleaved to
yield nonstructural proteins"
/codon_start=1
/product="ORF1a polyprotein"
/protein_id="YP_010229071.1"
/db_xref="GeneID:9714832"
/translation="MESLVPGFNEKTHVQLSLPVLQVRDVLVRGFGDTVEEAVAEARQ
HLIEGTCGIVDLQKGVLPQLEQPYIFLKRCDARTAPHGHVMVELVAELDGVQYGRSGE
SLGVLVPHVGETPIGYRKVLVRKNGNKGAGGHLYGADLRFYDLGDELGTDPLDDFQQD
WNTKHGSGLRRDLFRELNGGVYTRYVDNNFCGPDGYPLECIKDLLARAGKSSAPLAEQ
LDFLESKRGVYCCREHEHEIAWYTERSDKSYELQTPFDITNAKKFDSFKGECPKFVFP
LNSTVKVLQPRVEKKKTEGFLGRIRTVYQVASPGECNSMHLSTYMNCNHCGEKSWQTC
DFLTATCEMCGNQNTVEEGPTTCGYVPSNAVVKMVCPACQNPEIGPDHSVADYHNNSK
IETRLRKGGRIKSFGGCVFSYVGCYNKRAFWVPRAAANIGSNHTGVVGEGVETMNEDL
LQILSRERVVINIVGEFCLNEEIAILLASLSASTSAFVETVKNLDFKTFKKIIESCGN
YKVTKGKFKPGVWNIGTSKSLLTPLHCFSSQAAGVVRSIFSRTLATANHSIVDLHRAA
MIIFSDISDQANRVLDAMVNTSDLVTESVVVMAYLTGGLVQQVSTWLSQLLNTSVDKF
SAVLRWLEQKLQGGIDFLRQAWGILKLLVTGAYVVIRGKIQVVNTSLIECVTSFVDVV
NKVFELCTDYITVAGARVRAINFGEVLIAQSRGLYRQCVRARDQLQLLMPLKSPKDVV
FLDGDAYDTLLTSEEVTVKNGTLEALDLELSDVVTGVAEGVPVCVNGLMLLELKEKEQ
YCALSPSLLATNNVFTLKGGAPTKGVTFGEDTVVEIQGYKSVKITFELDERVDKVLNE
KCASYTVETGTTAEELACVVAESVVKTLQPISELLTPMGIDLDEWSVAKFYLFDESGE
AVLSSHMYCSFYPPDEEEEEDLEESEDVEYGTEDDYTGAPLEFGASSTVEQDEVHDEE
EDWLAPQEESEVLYDQFTDYHKLTDNVFIKCADIVEESLKVNPTVVVNAANIHLKHGG
GVARALDKATGGSMQKESNDYISTNGPLRVGGSCLLSGHNLAKHCLHVVGPNKNAGED
IKLLDAAYENFNAYEVVLSPLLSAGIFGVSPIQSLETCKRVVRNTVYIVVNDSVVFDQ
LLAKTPGKTNERPVVESSEICEEVNQKPVVEFSETKELHEETNQKLKSSEEPVKTRIE
ELNTTVDEAKFLTTKLLLYADVNGNLSEDSKVLIGNDGASFKKGAPYIVGDIISEGEL
TCVVLPTKAVGGTTHMLTRALKNVPSDTYLTTYPGQGVSGYTLDEAKAALKKSRSVFY
ILPSANVNAKEEVLGTVAWNLREMLAHAEETRKVMPVCMDVRAIISTIQRKYKGIGIQ
EGLVDYKVRFYFYSSKTPIARVISNLNSLGEPLITMPLGYVTHGLNLEESARYMRSVK
VPVVVSVSSPDAVTSYNGYVTSASKSAEEHFIETVSLAGSYKDWSYSGQRTELGVEFL
KRGDKIVYHTVGNVIEFHMEGEVLPLEKLKTLLALREVKTIKVFTTVDNINLHTQVID
MSMTYGQQLGPTYMDGADLTKVKPHASHENKTFFVLPSDDTLRIEAFEYYHTVDESFF
GRYMSALNHTKRWKYPQVGGLTSIKWADNNCYLSSVLLSLQQIDIKFNAPALQDAYYR
ARAGDAANFCALVLAYSKKTVGELGDVRETMAHLLQHANLESAKRVLNVVCKHCGQKS
TTLSGVEAVMYMGTLSYDHLKRGVKIPCVCGREATQYLVKQESTFVMMSAPPAEYTLQ
TGEFLCANEYTGNYQCGHYTHITNRETIYKIDGALLTKITEYKGPVADVFYKETSYST
DIKPVSYKLDGVTYTEINPDLNGYYKKDNAYYTEQPIDLVPTQPLPNASFDNFRFVCA
NTKFADDLNQMTGFKKPPSRDLTITFFPDLNGDVVAIDYRHYTPTFKKGAKLVHKPIL
WHVNQTTTKSTFKPNMWCLRCLYSTKPVPTSNSFEVLSSDDAQGMDNLACESQQTVAE
EVVDNPTIQKDIIECDVKTTEVVGNVILKPSADGIKVTSELEHEDLMAAYVNETSITI
KKPNELSIMLGLKTIATHGAAAINSVPWIKICAYVKPFLGYVAEQSKNCIKRCFRRVF
NDYMPFLLTLLLQLCTFTKSTNFRIKAAMPIVIARNSVIGGVRFCLDALTMYVKSPKF
SGILTVVMWLLLLSVCLGCLVYAVASFGAILSGFGLMSYCDGVRAGYVNSSNVTIPDY
CAGSLPCGVCLGGLDSLDAYPALETIQVTISSYKLDLTFVGMMAEWFLAYMLFTKFFY
LLGLFALMQLFFGLFATHFVNNSWLMWLIINVVQMAPISAMVRMYVFFASFYYVWKAY
IHVINGCTSSTCIMCYKRNRATRVECTTIVNGMKKSFYVYANGGQGFCKLHNWNCLNC
DTFCSGSTFISDEVARDLSLQFKRPINPTDQSSYNVDSVTVKDGTLYLYFQKAGKLTY
ERHPLSYFVNLDNLRANNVKGTLPINVIVFDGKSKCEEAAAKSASVYYSQLMCQPILL
LDQALISDVGDSTEVAVKMFDAYVNAFSSTFNAPMEKLKTFIATAHAEIAKGVSLDSV
LSTFLSAARQGFVDSDVDTKDVMECLKLSHHSDLEITSDSCNNFMLTYNKVENMTPRD
LGACIDCSARHINAQVAKSHNVSLVWNVKDYMSLSEQLRKQIRSAAKKNNIPFKLTCA
TTRQVVNVITTKISLKGGKFVSNNWFRFLLKMTVLMVLVAFIFYFITPTHTLMGHDVF
SSEIIGYKAIHNGVTRDVLTTDDCFANKHTGFDHWFSQRGGSYRNDKTCPVIAAVITR
EVGFIVPGLPGTVRRASNGDFLHFLPRVFSAVGNICYTPAKLIEYTDFATSACVLAAE
CTIFKDAQGKPVPYCYDTNLLEGSISYSELRPDTRYVLMDGSIIQFPSTYLEGSVRVV
TTFDSEYCRHGTCERSDAGVCLSTNGRWVLNNDYYRSIPGVFCGADASDLLFNIFTPL
VRPVGTLDISASVVAGGLIAILVTCVAYYFMKFRRAFGEYNHVVFANALLFLLSFTIL
CLTPAYTFLPGIYSLLYLYLTFYFTNDVSFLAHLQWLAMFSPIVPFWITVTYVVCISI
KHCHWFFSNYLKKRVVFNGVTFSTFEEAALCTFLLNKEMYLKLRSETLLPLTQYNRYL
ALYNKYKYFSGALDTTSYREAACCHLAKALNDFSNSGADVLYQPPQTSITSAVLQSGF
RKMAFPSGKVEGCMVQVTCGTTTLNGLWLDDVVYCPRHVICTLEDMLNPNYDDLLIRK
SNHNFLVQASNVQLRVIGHTMQNCLLKLKVDIANPKTPKYKFVRIQPGQTFSVLACYN
GAPSGVYQCAMRSNHTIKGSFLNGSCGSVGFNIDYDCVSFCYMHHMELPTGVHAGTDL
EGNFYGPFVDRQTAQAAGTDTTITLNVLAWLYAAVINGERWFLNRFTTTLNDFNLVAM
KYNYEPLTQDQVDILGPLSAQTGVAVMDMCAALKELLQNGLNGRTILGSTILEDEFTP
FDVVRQCSGVTFQGKFKKVVKGTHHWLLLTLLTSLLILVQSTQWSLFFFVYEHAFLPF
TMGVVCFAACAMVLVKHKHAFLCLFLLPSLITVAYFNMIYMPASWVMRVMTWLDLVDT
SLSGYRLKDCVMYALAAFLLILMTARTVYDDAARRVWTVMNVITLVYKVYYGNSLDQA
LAMWALVISVTSNYSGVVTTIMFLARAIVFLCVEYYPILFITGNTLQCIMLVYCFLGY
CCCCYFGLFCLLNRYFRLTLGVYDYFVSTQEFRYMNSQGLLPPKTSLDAFKLNVKLLG
IGGKPCIKVATVQSKMSDIKCTSVVLLSVLQQLRIESSSKLWAQCVQLHNDILLAKDT
TEAFEKMVSLLSVLLSMQGAVDINKLCDEMLNNRATLQAIASEFSSLPSYAAYATAQE
AYEQAVANGDSEVVLKKLKKSLNVAKSEFDRDAAMQRKLEKMADQAMTQMYKQARSED
KRAKVTSAMQTMLFTMLRKLDNDALNNIINNARDGCVPLNIIPLTTAAKLMVVVPDYN
TYKNTCDGNTFTYASALWEIQQVVDADSKVVQLSEINMDNSQNLAWPLIVTALRSNSA
VKLQNNELSPVALRQMSCAAGTTQTACTDDNALAYYNTSKGGRFVLALLSDHQDLKWA
RFPKSDGTGTIYTELEPPCRFVTDTPKGPKVKYLYFIKGLNNLNRGMVLGSLAATVRL
QAGNATEVPANSTVLSFCAFAVDPAKAYKDYLASGGQPITNCVKMLCTHTGTGQAITV
IPEANMDQESFGGASCCLYCRCHIDHPNPKGFCDLKGKYVQIPTTCTNDPVGFILRNT
VCTVCGMWKGYGCSCDQLREPVMQAADAPAFLNGFAV"
misc_feature 227..571
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="N-terminal domain of non-structural protein 1 from
Severe acute respiratory syndrome-related coronavirus and
betacoronavirus in the B lineage; Region:
SARS-CoV-like_Nsp1_N; cd21796"
/db_xref="CDD:439285"
misc_feature 572..730
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="C-terminal domain of non-structural protein 1 from
Severe acute respiratory syndrome-related coronavirus and
betacoronavirus in the B lineage; Region:
SARS-CoV-like_Nsp1_C; cd22662"
/db_xref="CDD:439355"
misc_feature order(632..646,650..673,707..715,719..724,728..730)
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="oligomer interface [polypeptide binding]; other
site"
/db_xref="CDD:439355"
misc_feature order(668..670,680..685,689..694,701..706,713..715)
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="RNA binding site [nucleotide binding]; other site"
/db_xref="CDD:439355"
misc_feature 734..2644
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="betacoronavirus non-structural protein 2 (Nsp2)
similar to SARS-CoV Nsp2, and related proteins from
betacoronaviruses in the B lineage; Region:
betaCoV_Nsp2_SARS-like; cd21516"
/db_xref="CDD:439199"
misc_feature 2831..3220
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="Protein of unknown function (DUF3655); Region:
DUF3655; pfam12379"
/db_xref="CDD:432517"
misc_feature 3233..3604
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="X-domain (or Mac1 domain) of viral non-structural
protein 3 and related macrodomains; Region:
Macro_X_Nsp3-like; cd21557"
/db_xref="CDD:438957"
misc_feature order(3251..3259,3269..3289,3512..3517,3521..3535,
3599..3601)
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="ADP-ribose binding site [chemical binding]; other
site"
/db_xref="CDD:438957"
misc_feature 3788..4162
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="SUD-N macrodomain (or Mac2 domain) of the SARS
Unique Domain (SUD) of SARS-CoV non-structural protein 3
and related macrodomains; Region:
Macro_cv_SUD-N_Nsp3-like; cd21562"
/db_xref="CDD:394883"
misc_feature 4190..4567
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="Single-stranded poly(A) binding domain; Region:
SUD-M; pfam11633"
/db_xref="CDD:431970"
misc_feature 4574..4774
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="C-terminal SARS-Unique Domain (SUD) of
non-structural protein 3 (Nsp3) from Severe Acute
Respiratory Syndrome coronavirus and related
betacoronaviruses in the B lineage; Region:
SUD_C_SARS-CoV_Nsp3; cd21525"
/db_xref="CDD:394841"
misc_feature 4787..5695
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="betacoronavirus papain-like protease; Region:
betaCoV_PLPro; cd21732"
/db_xref="CDD:409649"
misc_feature order(5105..5116,5264..5272,5276..5281,5288..5290,
5300..5302,5375..5377,5399..5404,5447..5449,5453..5455,
5522..5524,5570..5572,5588..5599,5681..5683)
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="ubiquitin binding site [polypeptide binding]; other
site"
/db_xref="CDD:409649"
misc_feature order(5264..5272,5519..5524,5570..5572,5579..5581,
5588..5590,5597..5599,5681..5683)
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="polypeptide substrate binding site [polypeptide
binding]; other site"
/db_xref="CDD:409649"
misc_feature 5828..6148
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="nucleic acid binding domain of non-structural
protein 3 from Severe acute respiratory syndrome-related
coronavirus and betacoronavirus in the B lineage; Region:
SARS-CoV-like_Nsp3_NAB; cd21822"
/db_xref="CDD:409348"
misc_feature 6221..6568
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="betacoronavirus-specific marker of non-structural
protein 3 from Severe acute respiratory syndrome-related
coronavirus and betacoronavirus in the B lineage; Region:
SARS-CoV-like_Nsp3_betaSM; cd21814"
/db_xref="CDD:409629"
misc_feature 6785..8377
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="C-terminus of non-structural protein 3, including
transmembrane and Y domains, from Severe acute respiratory
syndrome-related coronavirus and betacoronavirus in the B
lineage; Region: TM_Y_SARS-CoV-like_Nsp3_C; cd21717"
/db_xref="CDD:409665"
misc_feature 6785..6853
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="TM1 [structural motif]; Region: TM1"
/db_xref="CDD:409665"
misc_feature 7100..7168
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="TM2 [structural motif]; Region: TM2"
/db_xref="CDD:409665"
misc_feature 8423..9565
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="coronavirus non-structural protein 4 (Nsp4)
transmembrane domain; Region: cv_Nsp4_TM; cd21473"
/db_xref="CDD:394836"
misc_feature 8423..8491
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="putative TM helix 1 [structural motif]; Region:
putative TM helix 1"
/db_xref="CDD:394836"
misc_feature 9221..9286
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="putative TM helix 2 [structural motif]; Region:
putative TM helix 2"
/db_xref="CDD:394836"
misc_feature 9329..9394
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="putative TM helix 3 [structural motif]; Region:
putative TM helix 3"
/db_xref="CDD:394836"
misc_feature 9473..9541
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="putative TM helix 4 [structural motif]; Region:
putative TM helix 4"
/db_xref="CDD:394836"
misc_feature 9599..9877
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="Coronavirus nonstructural protein 4 C-terminus;
Region: Corona_NSP4_C; pfam16348"
/db_xref="CDD:406690"
misc_feature 9893..10783
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="betacoronavirus non-structural protein 5, also
called Main protease (Mpro); Region: betaCoV_Nsp5_Mpro;
cd21666"
/db_xref="CDD:394887"
misc_feature order(9893..9916,9923..9925,10235..10237,10247..10267,
10292..10306,10379..10381,10397..10399,10739..10741,
10751..10753,10775..10780)
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="homodimer interface [polypeptide binding]; other
site"
/db_xref="CDD:394887"
misc_feature order(9944..9946,9953..9961,10028..10030,10043..10045,
10301..10318,10370..10381,10385..10387,10397..10399,
10442..10444,10448..10456)
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="polypeptide substrate binding site [polypeptide
binding]; other site"
/db_xref="CDD:394887"
misc_feature 10802..11671
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="betacoronavirus non-structural protein 6; Region:
betaCoV-Nsp6; cd21560"
/db_xref="CDD:394846"
misc_feature 11672..11920
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="betacoronavirus non-structural protein 7; Region:
betaCoV_Nsp7; cd21827"
/db_xref="CDD:409253"
misc_feature order(11675..11677,11684..11695,11702..11710,11714..11719,
11726..11728,11753..11755,11762..11764,11780..11782,
11816..11833,11837..11854,11873..11887)
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="oligomer interface [polypeptide binding]; other
site"
/db_xref="CDD:409253"
misc_feature 11921..12511
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="nsp8 replicase; Region: nsp8; pfam08717"
/db_xref="CDD:400866"
misc_feature 12515..12853
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="betacoronavirus non-structural protein 9; Region:
betaCoV_Nsp9; cd21898"
/db_xref="CDD:409331"
misc_feature 12515..12532
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="N-finger; other site"
/db_xref="CDD:409331"
misc_feature order(12521..12529,12533..12541,12731..12736,12800..12805,
12809..12817,12821..12829,12833..12841,12845..12853)
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="homodimer interface [polypeptide binding]; other
site"
/db_xref="CDD:409331"
misc_feature 12854..13246
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="alphacoronavirus and betacoronavirus non-structural
protein 10; Region: alpha_betaCoV_Nsp10; cd21901"
/db_xref="CDD:409326"
misc_feature order(12854..12877,12887..12889,12893..12901,12905..12916,
12926..12931,12938..12943,12950..12952,12971..12988,
13025..13030,13058..13060,13064..13069,13079..13081,
13085..13102,13115..13123,13130..13141)
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="Nsp14 interface [polypeptide binding]; other site"
/db_xref="CDD:409326"
misc_feature order(12893..12901,12905..12913,12926..12928,12971..12973,
12977..12988,13025..13033,13085..13096,13103..13105,
13136..13141,13196..13198)
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="oligomer interface [polypeptide binding]; other
site"
/db_xref="CDD:409326"
misc_feature order(12914..12919,12926..12928,12977..12988,13025..13033,
13103..13105,13136..13141,13196..13198,13208..13210)
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="homotrimer interface [polypeptide binding]; other
site"
/db_xref="CDD:409326"
misc_feature order(12971..12994,13025..13030,13058..13069,13082..13087,
13091..13093,13130..13141)
/gene="ORF1ab"
/locus_tag="BtCoVBM48_gp1"
/note="Nsp16 interface [polypeptide binding]; other site"
/db_xref="CDD:409326"
gene 21391..25170
/gene="S"
/locus_tag="BtCoVBM48_gp2"
/db_xref="GeneID:9714824"
CDS 21391..25170
/gene="S"
/locus_tag="BtCoVBM48_gp2"
/codon_start=1
/product="spike protein"
/protein_id="YP_003858584.1"
/db_xref="GeneID:9714824"
/translation="MKFLAFLCLLGFANAQDGKCGTLSNKSPSKLTQTPSSRRGFYYF
DDIFRSSIRVLTTGHFLPFNTNLTWYLTLKSNGKQRIYYDNPNINFGDGVYFGLTEKS
NVFRGWIFGSTLDNTTQSAVLFNNGTHIVIDVCNFNFCADPMFAVNSGQPYKTWIYTS
AANCTYHRAHAFNISTNMNPGKFKHFREHLFKNVDGFLYVYHNYEPIDLNSGFPSGFS
VLKPILKLPFGLNITYVKAIMTLFSSTQSNFDADASAYFVGHLKPLTMLVDFDENGTI
IDAIDCSQDPLSELKCTTKSFTVEKGIYQTSNFRVTPTTEVVRFPNITQLCPFNEVFN
ITSFPSVYAWERMRITNCVADYSVLYNSSASFSTFQCYGVSPTKLNDLCFSSVYADYF
VVKGDDVRQIAPAQTGVIADYNYKLPDDFTGCVIAWNTNSLDSSNEFFYRRFRHGKIK
PYGRDLSNVLFNPSGGTCSAEGLNCYKPLASYGFTQSSGIGFQPYRVVVLSFELLNAP
ATVCGPKQSTELVKNKCVNFNFNGLTGTGVLTNSTKKFQPFQQFGRDVSDFTDSVRDP
KTLEILDIAPCSYGGVSVITPGTNASSSVAVLYQDVNCTDVPTMLHADQISHDWRVYA
FRNDGNIFQTQAGCLIGAAYDNSSYECDIPIGAGICAKYTNVSSTLVRSGGHSILAYT
MSLGDNQDIVYSNNTIAIPMNFSISVTTEVLPVSMTKTSVDCNMYICGDSTECSNLLL
QYGSFCTQLNRALAGIAVEQDRNTRDVFAQTKAMYKTPSLKDFGGFNFSQILPDPAKP
SSRSFIEDLLYNKVTLADPGFMKQYGDCLGGVNARDLICAQKFNGLTVLPPLLTDEMI
AAYTAALISGTATAGFTFGAGAALQIPFAMQMAYRFNGIGVTQNVLYENQKQIANQFN
KAISQIQDSLSTTTTALGKLQDVINQNAIALNTLVKQLSSNFGAISSVLNDILSRLDK
VEAEVQIDRLITGRLQSLQTYVTQQLIRAAEIRASANLAATKMSECVLGQSKRVDFCG
KGYHLMSFPQAAPHGVVFLHVTYVPSQEQNFTTAPAICHEGKAHFPREGVFVTNGTHW
FITQRNFYSPQPITTDNTFVSGNCDVVIGIVNNTVYDPLQPELDSFKEELDKYFKNHT
SQNVSLDGLNNINASVVDIKKEIEHLNEIAKSLNESLIDLQELGKYEQYIKWPWYVWL
GFIAGLIAIVMATIMLCCMTSCCSCLKGVCSCASCCKFDEDHSEPVLTGVKLHYT"
misc_feature 21442..22275
/gene="S"
/locus_tag="BtCoVBM48_gp2"
/note="N-terminal domain of the S1 subunit of the Spike
(S) protein from Severe acute respiratory syndrome
coronavirus and related betacoronaviruses in the B
lineage; Region: SARS-CoV-like_Spike_S1_NTD; cd21624"
/db_xref="CDD:394950"
misc_feature order(21517..21519,21526..21537,21739..21741,21745..21747,
21883..21885,21973..21981,22054..22056,22063..22065,
22069..22083,22204..22212)
/gene="S"
/locus_tag="BtCoVBM48_gp2"
/note="trimer interface [polypeptide binding]; other site"
/db_xref="CDD:394950"
misc_feature 22318..22974
/gene="S"
/locus_tag="BtCoVBM48_gp2"
/note="receptor-binding domain of the S1 subunit of severe
acute respiratory syndrome-related coronavirus Spike (S)
protein and similar proteins; Region:
SARS-CoV-like_Spike_S1_RBD; cd21477"
/db_xref="CDD:394824"
misc_feature order(22318..22320,22426..22428,22507..22515,22519..22524,
22531..22536,22552..22554,22723..22725,22738..22752,
22756..22761,22765..22767,22897..22902,22906..22908)
/gene="S"
/locus_tag="BtCoVBM48_gp2"
/note="trimer interface [polypeptide binding]; other site"
/db_xref="CDD:394824"
misc_feature order(22468..22473,22477..22479,22486..22488,22492..22500,
22504..22506,22510..22512,22516..22524,22531..22536,
22540..22542,22648..22656,22894..22896,22900..22902,
22906..22908)
/gene="S"
/locus_tag="BtCoVBM48_gp2"
/note="cryptic epitope [polypeptide binding]; other site"
/db_xref="CDD:394824"
misc_feature order(22615..22617,22699..22701,22711..22713,22717..22722,
22816..22818,22828..22830,22837..22839,22864..22866)
/gene="S"
/locus_tag="BtCoVBM48_gp2"
/note="receptor binding site [polypeptide binding]; other
site"
/db_xref="CDD:394824"
misc_feature order(22678..22680,22693..22770,22816..22875)
/gene="S"
/locus_tag="BtCoVBM48_gp2"
/note="receptor binding motif; other site"
/db_xref="CDD:394824"
misc_feature 22978..24972
/gene="S"
/locus_tag="BtCoVBM48_gp2"
/note="SD-1 and SD-2 subdomains, the S1/S2 cleavage
region, and the S2 fusion subunit of the spike (S)
glycoprotein from SARS-CoV-2 (COVID-19) and related
betacoronaviruses in the B lineage; Region:
SARS-CoV-like_Spike_SD1-2_S1-S2_S2; cd22378"
/db_xref="CDD:411965"
misc_feature order(23158..23160,23197..23199,23326..23328,23473..23475,
23497..23499,23749..23751,24568..24570,24640..24642,
24748..24750,24820..24822,24865..24867,24928..24930)
/gene="S"
/locus_tag="BtCoVBM48_gp2"
/note="N-linked glycosylation sites [posttranslational
modification]; other site"
/db_xref="CDD:411965"
misc_feature order(23371..23382,23389..23391,23395..23415,23419..23475)
/gene="S"
/locus_tag="BtCoVBM48_gp2"
/note="S1/S2 cleavage region; other site"
/db_xref="CDD:411965"
misc_feature 23710..23766
/gene="S"
/locus_tag="BtCoVBM48_gp2"
/note="fusion peptide; other site"
/db_xref="CDD:411965"
misc_feature 23794..23847
/gene="S"
/locus_tag="BtCoVBM48_gp2"
/note="internal fusion peptide; other site"
/db_xref="CDD:411965"
misc_feature 24100..24297
/gene="S"
/locus_tag="BtCoVBM48_gp2"
/note="heptad repeat 1 [structural motif]; Region: heptad
repeat 1"
/db_xref="CDD:411965"
misc_feature 24832..24957
/gene="S"
/locus_tag="BtCoVBM48_gp2"
/note="heptad repeat 2 [structural motif]; Region: heptad
repeat 2"
/db_xref="CDD:411965"
misc_feature 25045..>25140
/gene="S"
/locus_tag="BtCoVBM48_gp2"
/note="Coronavirus spike glycoprotein S2, intravirion;
Region: CoV_S2_C; pfam19214"
/db_xref="CDD:437051"
gene 25179..25994
/gene="ORF3"
/locus_tag="BtCoVBM48_gp3"
/db_xref="GeneID:9714825"
CDS 25179..25994
/gene="ORF3"
/locus_tag="BtCoVBM48_gp3"
/codon_start=1
/product="ORF3 protein"
/protein_id="YP_003858585.1"
/db_xref="GeneID:9714825"
/translation="MDLFLNIFTLGSITRQPGKVENVSPASSFHSTASIPLQATLPFG
WLVVGVAFLAVFQSAAKLIPFNSLWQRCLYQSFQLLCNVLLIALTVYSHLLLVAAGLE
APFLYLLALIYFLQCVVFGRLLVRCWLCWKCKSKNPLIYDSNYFVCWHTHTHDYCIPY
NSITNTIVLTAGDGVTIPIRTQDYQIGGYFEKWESGVKDYLTLIGPFTEVYYQLESTQ
ISTDTGINNATFFLFSKNDEREQESVQVHTIDGSSGVVNPIYDEPTPTTSVPL"
misc_feature 25191..25988
/gene="ORF3"
/locus_tag="BtCoVBM48_gp3"
/note="accessory protein ORF3a of severe acute respiratory
syndrome-associated coronavirus and similar proteins from
related betacoronavirus; Region: SARS-CoV-like_ORF3a;
cd21648"
/db_xref="CDD:439223"
misc_feature 25293..25355
/gene="ORF3"
/locus_tag="BtCoVBM48_gp3"
/note="putative TM segment [structural motif]; Region:
putative TM segment"
/db_xref="CDD:439223"
misc_feature order(25314..25319,25326..25331,25338..25343,25347..25355,
25359..25364,25371..25376,25431..25433,25443..25445,
25500..25505,25512..25514,25521..25526,25530..25535,
25542..25544,25608..25613,25656..25661,25668..25670,
25680..25682,25731..25733,25737..25745,25821..25826,
25830..25832,25842..25847,25851..25853,25866..25868,
25872..25874,25878..25880)
/gene="ORF3"
/locus_tag="BtCoVBM48_gp3"
/note="dimer interface [polypeptide binding]; other site"
/db_xref="CDD:439223"
misc_feature 25413..25475
/gene="ORF3"
/locus_tag="BtCoVBM48_gp3"
/note="putative TM segment [structural motif]; Region:
putative TM segment"
/db_xref="CDD:439223"
misc_feature 25491..25553
/gene="ORF3"
/locus_tag="BtCoVBM48_gp3"
/note="putative TM segment [structural motif]; Region:
putative TM segment"
/db_xref="CDD:439223"
misc_feature order(25569..25571,25578..25580,25584..25586,25626..25637,
25641..25643)
/gene="ORF3"
/locus_tag="BtCoVBM48_gp3"
/note="putative tetramer interface [polypeptide binding];
other site"
/db_xref="CDD:439223"
gene 26018..26248
/gene="E"
/locus_tag="BtCoVBM48_gp4"
/db_xref="GeneID:9714826"
CDS 26018..26248
/gene="E"
/locus_tag="BtCoVBM48_gp4"
/codon_start=1
/product="envelope protein"
/protein_id="YP_003858586.1"
/db_xref="GeneID:9714826"
/translation="MYSFVSEETGTLIVNSVLLFLAFVVFLLVTLAILTALRLCAYCC
NIVNVSLVKPTFYVYSRVKSLNSSQEVPEFLV"
misc_feature 26021..26245
/gene="E"
/locus_tag="BtCoVBM48_gp4"
/note="Severe acute respiratory syndrome coronavirus 2
Envelope small membrane protein; Region: SARS-CoV-2_E;
cd21536"
/db_xref="CDD:394862"
misc_feature order(26039..26041,26060..26065,26069..26074,26078..26104,
26108..26110,26156..26164,26186..26188,26195..26200,
26204..26212)
/gene="E"
/locus_tag="BtCoVBM48_gp4"
/note="homopentameric interface [polypeptide binding];
other site"
/db_xref="CDD:394862"
misc_feature 26234..26245
/gene="E"
/locus_tag="BtCoVBM48_gp4"
/note="PDZ binding motif; other site"
/db_xref="CDD:394862"
gene 26299..26982
/gene="M"
/locus_tag="BtCoVBM48_gp5"
/db_xref="GeneID:9714827"
CDS 26299..26982
/gene="M"
/locus_tag="BtCoVBM48_gp5"
/codon_start=1
/product="membrane protein"
/protein_id="YP_003858587.1"
/db_xref="GeneID:9714827"
/translation="MTNSSASPPTETITVEQLKHLLEQWNLVIGFLFFAWILLLQFAY
SNRNRFLYIIKLVFLWLLWPITLACFVLAAVYRINWVTGGIAIAMACIVGLMWLSYFV
ASFRLFARTRSWWSFNPETNILLNVPLRGTILTRPLLESELVIGAVIIRGHLRMAGHS
LGRCDIKDLPKEITVATSRTLSYYRLGASQRVASDSGFAVYHRYRIGNYKLNTDHIGS
DDNIALLVQ"
misc_feature 26332..26979
/gene="M"
/locus_tag="BtCoVBM48_gp5"
/note="coronavirus Membrane (or Matrix) protein; Region:
CoV_M; cl40475"
/db_xref="CDD:424106"
gene 26993..27181
/gene="ORF6"
/locus_tag="BtCoVBM48_gp6"
/db_xref="GeneID:9714828"
CDS 26993..27181
/gene="ORF6"
/locus_tag="BtCoVBM48_gp6"
/codon_start=1
/product="ORF6 protein"
/protein_id="YP_003858588.1"
/db_xref="GeneID:9714828"
/translation="MFSLVAFQVTVAELLILIMKSFGLALTHIQIGIVSLLKILTNRL
DRRYSKLDEEEPMEIDHP"
misc_feature 26993..27175
/gene="ORF6"
/locus_tag="BtCoVBM48_gp6"
/note="Open reading frame 6 from SARS coronavirus; Region:
Sars6; pfam12133"
/db_xref="CDD:432352"
gene 27188..27544
/gene="ORF7a"
/locus_tag="BtCoVBM48_gp7"
/db_xref="GeneID:9714829"
CDS 27188..27544
/gene="ORF7a"
/locus_tag="BtCoVBM48_gp7"
/codon_start=1
/product="ORF7a protein"
/protein_id="YP_003858589.1"
/db_xref="GeneID:9714829"
/translation="MKFLLLVAIVSIASAELYHYQECARGTTVLLKEPCQPNTYEGNS
PYHPLADNKFAITCTNTKFSFVCQDETRHVFQLRARSISPRLFASPKHHSDDFTPVIL
IIVTLLFVIYCCMKRQ"
misc_feature 27227..27475
/gene="ORF7a"
/locus_tag="BtCoVBM48_gp7"
/note="Severe Acute Respiratory Syndrome coronavirus
(SARS-CoV) structural accessory protein ORF7a and similar
proteins from related betacoronaviruses in the subgenera
Sarbecovirus (B lineage); Region: ORF7a_SARS-CoV-like;
cd21663"
/db_xref="CDD:394934"
misc_feature 27233..27259
/gene="ORF7a"
/locus_tag="BtCoVBM48_gp7"
/note="Ig strand A [structural motif]; Region: Ig strand
A"
/db_xref="CDD:394934"
misc_feature 27263..27286
/gene="ORF7a"
/locus_tag="BtCoVBM48_gp7"
/note="Ig strand B [structural motif]; Region: Ig strand
B"
/db_xref="CDD:394934"
misc_feature 27299..27313
/gene="ORF7a"
/locus_tag="BtCoVBM48_gp7"
/note="Ig strand C [structural motif]; Region: Ig strand
C"
/db_xref="CDD:394934"
misc_feature 27326..27337
/gene="ORF7a"
/locus_tag="BtCoVBM48_gp7"
/note="Ig strand D [structural motif]; Region: Ig strand
D"
/db_xref="CDD:394934"
misc_feature 27341..27361
/gene="ORF7a"
/locus_tag="BtCoVBM48_gp7"
/note="Ig strand E [structural motif]; Region: Ig strand
E"
/db_xref="CDD:394934"
misc_feature 27365..27391
/gene="ORF7a"
/locus_tag="BtCoVBM48_gp7"
/note="Ig strand F [structural motif]; Region: Ig strand
F"
/db_xref="CDD:394934"
misc_feature 27395..27427
/gene="ORF7a"
/locus_tag="BtCoVBM48_gp7"
/note="Ig strand G [structural motif]; Region: Ig strand
G"
/db_xref="CDD:394934"
gene 27541..27663
/gene="ORF7b"
/locus_tag="BtCoVBM48_gp8"
/db_xref="GeneID:9714830"
CDS 27541..27663
/gene="ORF7b"
/locus_tag="BtCoVBM48_gp8"
/codon_start=1
/product="ORF7b protein"
/protein_id="YP_003858590.1"
/db_xref="GeneID:9714830"
/translation="MIHLTLFDFYLCVLSLLLFLVIIMLIIFCFVLELQDLNEQ"
misc_feature 27541..27660
/gene="ORF7b"
/locus_tag="BtCoVBM48_gp8"
/note="Severe Acute Respiratory Syndrome coronavirus
structural accessory protein ORF7b and similar proteins
from related betacoronaviruses in the B lineage; Region:
ORF7b_SARS_bat-CoV-like; cd21598"
/db_xref="CDD:394937"
gene 27665..28918
/gene="N"
/locus_tag="BtCoVBM48_gp9"
/db_xref="GeneID:9714831"
CDS 27665..28918
/gene="N"
/locus_tag="BtCoVBM48_gp9"
/codon_start=1
/product="nucleocapsid protein"
/protein_id="YP_003858591.1"
/db_xref="GeneID:9714831"
/translation="MTDNGQSNSRNAPRITFGVSDTSDNNQNAERAGARPKQRRPQGP
PNNTASWFTALTQHGKEGLSFPRGQGVPVNTNSTRDDQIGYYRRATRRVRGGDGKMKE
LSPRWYFYYLGTGPEAALPYGANKDGIVWVATEGALNTPKDHIGTRNPNNNAAIVIQL
PQGTTLPKGFYAEGSRGGSQASSRSNSRSRGNSRNSTPSSSRGSSPARMAAGGDTALA
LLLLDRLNQLESKVSGKTPQQSQVVTKKTAAEASKKPRQKRTATKAYNVTQAFGRRGP
EPTQGNFGDQELIRLGTDYKNWPQIAQFAPSASAFFGMSRIGMEVTPTGTWLTYNGAI
KLDDKDPNFKDQVILLNKHIDAYKTFPPTEPKKDKKKKADEVQSLPQRQKKQATVTLL
PAADLDDFSKQLQNSMNASPDSTQA"
misc_feature 27779..28828
/gene="N"
/locus_tag="BtCoVBM48_gp9"
/note="Coronavirus nucleocapsid protein; Region:
Corona_nucleoca; pfam00937"
/db_xref="CDD:425955"
misc_feature order(27809..27826,27980..27982,27986..27988,27992..27994,
28106..28108,28127..28129)
/gene="N"
/locus_tag="BtCoVBM48_gp9"
/note="RNA binding site [nucleotide binding]; other site"
/db_xref="CDD:439219"
3'UTR 28919..29276
ORIGIN
1 tttaaaatct gtgtagctgt cacttggctg catgcccagt gcacttacgc agtatatctt
61 ataaactttt actgtcgttg acaggacacg agtaactcgt ctatcttctg caggctgctt
121 acggtttcgt ccgtgttgca gccgatcatc agcataccta ggtttcgtcc gggtgtgacc
181 gaaaggtaag atggagagcc ttgtccctgg tttcaacgag aaaacacacg tccaactcag
241 tttacctgtt ttgcaggttc gtgacgtgct cgtacgtgga ttcggtgaca ccgtagaaga
301 ggctgtcgct gaagcacgcc aacatttaat tgaaggaaca tgtggcattg ttgatctcca
361 gaagggtgtt ttaccccaac tggaacaacc ttacattttc cttaaacgct gtgatgcccg
421 tactgctccc cacggccatg ttatggtcga attggtggca gagcttgatg gcgtccagta
481 tggtaggagc ggagaatctc ttggtgtgtt agtcccgcat gtgggtgaaa caccaattgg
541 ttaccgcaag gttcttgtcc gtaagaacgg taataaggga gccggtggtc acttgtacgg
601 cgccgatcta aggttttacg atctaggtga cgaacttggc actgaccccc ttgatgactt
661 ccaacaagat tggaatacta agcatggcag tgggcttcgc cgcgatctct ttagggagct
721 caatggtggt gtctacacac gctatgttga taacaacttc tgcggaccag atggttatcc
781 tctggaatgc ataaaagact tgcttgcgcg agctggcaag tcaagcgcac ctcttgctga
841 acagcttgac tttttggagt ctaagagagg tgtgtactgt tgccgtgaac atgagcatga
901 gattgcttgg tacacggagc gctctgataa gagctatgag cttcagacac cttttgacat
961 tactaatgcc aaaaagtttg attctttcaa aggcgaatgt cctaaattcg tcttcccact
1021 taattccaca gttaaagtct tgcaaccacg tgttgaaaag aagaagactg agggattttt
1081 aggccgtatt cgtacggttt accaagttgc atcacctggt gagtgtaact ctatgcacct
1141 gtctacctac atgaattgta accattgtgg tgaaaagtca tggcagacat gtgatttctt
1201 aacagccact tgtgaaatgt gtggcaacca gaatactgtt gaagagggac ctaccacatg
1261 tgggtatgta cctagtaatg ctgtagtaaa gatggtttgc cctgcctgtc agaacccaga
1321 gattgggccc gaccatagtg tggcagatta ccacaacaac tcaaagattg aaactcgact
1381 ccgcaaggga ggtaggatta aatcttttgg tggctgtgtt ttctcttatg ttggctgcta
1441 caacaagcgt gccttctggg tgccgcgtgc cgcagccaat attggttcca accatactgg
1501 cgttgtcggt gaaggagttg aaaccatgaa tgaggacttg cttcagatct taagtcgtga
1561 acgtgttgtt atcaacattg ttggcgagtt ttgtctgaac gaggagattg ctatcttact
1621 tgcttctctt tcagcgtcta ctagtgcttt tgtagagaca gttaaaaatc ttgattttaa
1681 aactttcaag aaaatcattg agtcttgtgg taattataaa gtgaccaagg gcaagtttaa
1741 acctggtgtc tggaatattg gcactagtaa atcattgctt acacctctgc attgcttttc
1801 atcgcaggct gcaggtgtag ttcgctcaat cttctctcgc acacttgcta cagctaatca
1861 ttcaattgta gacctgcaca gagctgctat gattattttc tctgatatat cagatcaagc
1921 caatcgtgtg ctggatgcca tggtcaacac atctgactta gtcactgaaa gtgttgttgt
1981 catggcttat ctcacaggtg gattggtgca acaggttagt acctggttga gccaattact
2041 taatacttct gtagacaagt ttagcgcagt tttgcgctgg cttgagcaaa agctccaagg
2101 tggcattgat tttcttcgtc aagcttgggg cattcttaaa cttctagtca ctggcgctta
2161 tgtggttata agaggtaaaa ttcaagttgt taacactagt ctcatagagt gcgttacatc
2221 atttgttgac gtcgttaaca aggtttttga gctctgtact gattacatta ctgttgctgg
2281 tgctagagtg cgtgctataa attttggtga ggttttgatt gcgcaaagtc gaggcctcta
2341 ccgtcagtgt gtacgtgcca gagatcagct tcagttgttg atgcctttaa aatcgccaaa
2401 ggacgttgtt ttccttgacg gtgatgctta tgacacactt ttaacctctg aggaggtaac
2461 agttaaaaat ggaactcttg aagcacttga tcttgaactc agtgacgtag ttactggtgt
2521 tgcggaggga gtccctgttt gtgttaatgg tctaatgctc ttagaattga aggaaaagga
2581 gcaatactgt gcattgtctc cttctctact tgcaactaat aacgtcttta cattaaaagg
2641 aggtgcccct acaaagggag tcacatttgg tgaagacaca gttgtggaaa tccagggtta
2701 taaaagtgtt aaaatcactt ttgagttgga tgaacgtgtt gataaagttc ttaatgagaa
2761 gtgtgcatcc tatacagtag agaccggcac tacagctgaa gaacttgctt gtgttgtggc
2821 tgagtctgtt gtaaagactt tacaaccaat ctctgaatta ctaacaccca tgggtatcga
2881 ccttgatgaa tggagtgtag ctaaatttta cctgtttgat gagtctggtg aagcagttct
2941 ttcatcacac atgtattgtt ccttctatcc tcctgacgaa gaggaagaag aagatttgga
3001 agagtctgaa gatgtcgaat acggtactga agacgattac acaggcgccc cgctcgaatt
3061 cggtgctagt agcactgttg agcaggacga ggtccatgat gaagaggaag actggcttgc
3121 accacaggaa gagtccgaag tgttatatga ccagtttacc gattatcaca aactcacaga
3181 caatgtcttc ataaagtgtg ctgatattgt tgaagagtct ctgaaagtca acccaacagt
3241 tgtagtgaat gctgctaaca tacacttgaa acatggtggt ggtgttgcac gagcactaga
3301 taaagcaact ggcggtagta tgcaaaaaga atctaatgat tacatttcta ctaatggtcc
3361 tcttagagta ggaggttcct gtcttctctc aggccataac ttagctaaac attgtttgca
3421 tgttgttggc ccaaataaaa atgcaggaga ggatattaaa ctccttgatg cagcctatga
3481 gaacttcaat gcatatgaag tagttttatc accactactg tcagcaggta tctttggtgt
3541 aagtcctatt caatcactag agacttgcaa gcgcgtggtg cgtaacacag tctacattgt
3601 tgtaaacgac agtgtggtat ttgatcagct tttagccaaa acccctggaa agactaacga
3661 gaggcctgtt gttgaatctt cagaaatttg tgaggaggtc aaccagaagc ctgttgttga
3721 gttttcagaa actaaggagt tgcatgaaga gaccaatcag aagcttaagt cttcggaaga
3781 gcctgttaaa acccgcattg aggagttgaa cacaactgtt gatgaggcta aatttctaac
3841 cactaagttg ctattatatg cagatgtcaa tggtaatctt agtgaagatt caaaggtgtt
3901 aattggcaat gatggcgcgt cctttaaaaa gggtgctcct tacattgttg gtgacatcat
3961 tagcgaaggt gaattgacct gtgttgtttt accaactaaa gctgtgggtg gtacaactca
4021 tatgcttact agggctctga agaatgtacc ttcagacacc tatcttacaa cctatcctgg
4081 tcagggtgtt tcaggctaca ctttggatga agctaaagca gctcttaaaa agtccagatc
4141 agtcttttac atattacctt cagcaaatgt taatgctaag gaagaggttt taggcactgt
4201 tgcctggaac ttgcgtgaaa tgttagctca cgctgaggaa actaggaaag ttatgccagt
4261 ctgtatggat gttagggcga ttatttccac aattcagcgt aagtataaag gaattggaat
4321 acaggaaggt cttgtagatt ataaagtaag attttacttt tactctagca aaacacctat
4381 tgctagggta atctcaaacc ttaattctct tggagagccc ctgattacta tgcctttggg
4441 ctatgtcaca catggactca atttagaaga atcagcgcgc tacatgcgtt ctgttaaggt
4501 tcctgttgta gtttctgtgt cttcacctga tgcagtcact tcttacaacg gttatgttac
4561 atctgcttct aagagtgctg aagaacactt tattgaaaca gtttccttag cgggttcata
4621 caaggattgg tcatattctg gccaacgtac tgaacttggt gttgaattcc tcaagagagg
4681 tgacaaaatt gtctaccata cagtaggaaa tgttatagaa tttcacatgg aaggtgaagt
4741 tcttcctctt gagaagttaa aaactctctt agctttaaga gaggttaaaa ctattaaggt
4801 gttcacaact gtagataaca tcaacttaca cacacaagtc attgatatgt ctatgactta
4861 tggacaacag ctaggaccca cctatatgga cggtgctgat cttactaaag tcaaacctca
4921 tgctagtcat gagaacaaga ctttctttgt cctacctagt gatgatacgc tacgtattga
4981 agcttttgag tactatcata ccgtagacga gagttttttt ggtagataca tgtcagcatt
5041 aaaccatact aaaagatgga agtatcctca agttggtggt ttaacatcta taaaatgggc
5101 agataacaat tgttacttgt ctagtgttct tttgtcacta caacaaattg acattaagtt
5161 taatgcacca gcacttcagg atgcttatta tagagcgcgt gctggtgatg ctgctaactt
5221 ctgtgcactt gtgctcgcat acagcaagaa gactgtaggt gagctgggtg atgtacgtga
5281 aacaatggcc catttattac agcatgcaaa cttagagtcc gctaaacggg ttcttaatgt
5341 tgtgtgcaaa cactgtggac agaaaagcac tacacttagt ggtgttgaag ctgtcatgta
5401 catgggaacc ctctcttatg atcatcttaa gagaggtgtt aagatacctt gtgtatgtgg
5461 tcgtgaagct acacaatact tagtgaaaca agagtcaact tttgttatga tgtcagctcc
5521 acctgcagag tacactcttc aaaccggtga gtttttgtgt gctaatgagt acactggtaa
5581 ttaccagtgt ggtcattata cacatattac aaatagagaa actatctata aaattgatgg
5641 tgctctcttg actaaaatta ctgaatataa gggtcctgtt gctgatgttt tctataagga
5701 aacatcctac agtacagata taaaacctgt gtcatacaaa ctcgatggag tgacttacac
5761 agagataaat ccagatctaa atgggtatta caaaaaggac aatgcttact atacagaaca
5821 gcctattgac cttgtaccaa ctcaaccttt gcctaatgca agttttgaca atttcagatt
5881 tgtttgtgct aacaccaaat ttgctgatga cttgaaccag atgactggct ttaaaaagcc
5941 tccatctagg gatttaacaa ttacgttctt ccctgatttg aatggtgatg tggttgctat
6001 tgattataga cactacacac ctactttcaa aaagggtgct aaacttgtcc ataagccaat
6061 actgtggcat gttaatcaga ctactactaa gtcaacgttt aaacccaata tgtggtgtct
6121 gcgttgtctt tatagtacaa agcctgttcc cacttcaaat tcgtttgagg tgttaagttc
6181 agatgacgca caaggaatgg acaatcttgc ttgtgaaagt caacaaactg tcgctgaaga
6241 agtagtggat aatcctacca tacagaaaga catcatagag tgtgacgtga aaactaccga
6301 agttgtaggc aatgtcatac taaaaccatc agcagatggc attaaagtta catcagagtt
6361 ggaacatgag gatcttatgg ctgcttatgt gaatgaaact agcattacca ttaagaagcc
6421 caatgagctt tctatcatgt tgggtttaaa aacaattgct acacatggtg ctgctgctat
6481 taatagtgta ccctggatta agatttgtgc ttatgtcaag ccctttcttg gttacgttgc
6541 agagcaatct aagaattgta ttaagcgctg ttttaggcgt gtttttaatg attatatgcc
6601 attcttgttg acgcttttat tgcaattatg cacttttact aagagtacaa attttagaat
6661 aaaagctgct atgcccattg ttatagctag aaatagtgta ataggtggtg ttagattttg
6721 tctagatgct ttgactatgt atgttaaatc acctaagttt tctggaatac tcactgttgt
6781 tatgtggtta ttattattaa gtgtctgctt aggatgttta gtctatgcag tagcttcttt
6841 tggtgccatc ttatctggtt ttggtctgat gtcttattgt gatggcgtta gggcgggtta
6901 tgttaactcg tctaatgtca ctattcctga ctactgcgca ggcagtttac cttgtggtgt
6961 ttgtttgggt ggtttggatt ctttagatgc atacccagct ttagagacga ttcaggttac
7021 catttcttcc tacaagttag acttgacttt tgtgggaatg atggctgaat ggtttttggc
7081 atatatgttg tttactaaat ttttttattt attaggtctc tttgccttaa tgcagttgtt
7141 tttcggtctt tttgctacac actttgtgaa taattcctgg ttaatgtggc ttattataaa
7201 tgtagtgcaa atggctccca tttctgctat ggttagaatg tatgtgttct ttgcctcttt
7261 ctactatgtg tggaaagctt atatacacgt tattaatggc tgtacatcat ccacttgtat
7321 catgtgttac aagcgtaatc gtgcaacacg tgttgaatgc accaccattg tcaacggcat
7381 gaagaagtcc ttctatgttt atgctaatgg tggtcaaggt ttttgcaaac ttcacaactg
7441 gaattgtttg aattgtgaca ctttctgttc tggaagtaca tttatcagtg atgaggtagc
7501 acgtgaccta tcattacagt ttaagagacc tattaatcca acagaccagt cttcttacaa
7561 tgttgatagt gttacagtaa aagatggtac actctacttg tattttcaga aggctggtaa
7621 actcacctat gagagacatc cactttctta ctttgttaat ttggacaacc tgagagctaa
7681 taacgttaaa ggcactttgc ctattaatgt tatagttttt gatggtaagt ctaaatgtga
7741 agaagctgct gctaaatctg catctgttta ctatagtcag ttgatgtgtc aacctatatt
7801 actattagac caagctctta tttctgatgt tggcgatagt actgaagtgg ctgttaaaat
7861 gtttgatgcg tatgttaatg cattctcatc aacatttaac gctcctatgg aaaaactaaa
7921 gacattcatt gcgacagccc atgctgaaat agctaaaggt gtttctttgg atagtgtttt
7981 gtctacattt ttgtctgcag ccagacaagg atttgtggat tctgatgtag acactaagga
8041 tgtcatggag tgtcttaaac tgtcacacca ttctgatttg gagattacga gtgacagttg
8101 taacaatttt atgcttactt acaacaaggt tgaaaacatg acacctagag atctaggtgc
8161 ctgcattgat tgtagtgcgc gtcatattaa tgcacaagtg gcaaagagtc acaatgtttc
8221 ccttgtttgg aatgtaaagg actatatgtc tttgtcagaa cagctacgta aacaaatacg
8281 tagtgctgcc aaaaagaaca atataccatt taaactcact tgtgctacta ctagacaagt
8341 tgtgaatgtt ataacaacaa aaatatcact aaaaggtggt aagtttgtta gcaataattg
8401 gtttaggttc ctactcaaaa tgacagtttt gatggtattg gttgccttta tcttctattt
8461 tattacaccc acccatactt taatgggtca tgatgtgttt tcttctgaaa ttatcggtta
8521 taaagcaata cataatggtg ttaccagaga tgtgttgacc accgatgatt gttttgctaa
8581 caagcacact ggctttgacc attggttcag tcagcgtggt ggttcatata gaaatgacaa
8641 gacctgccct gttatagcgg ctgttattac gcgtgaggta ggctttatag tacctggtct
8701 tcctggtact gtaaggcgtg cttccaatgg tgactttttg catttcttac ctagagtttt
8761 tagcgctgtt ggtaacattt gttacacgcc agcgaaatta atagagtata ctgactttgc
8821 aacttcagcc tgcgtgcttg ctgctgaatg tactatcttt aaggatgctc aaggtaaacc
8881 tgtaccttat tgctatgata ctaatttgct tgagggttct atttcttata gtgagttacg
8941 ccccgacact agatatgtgt taatggatgg ctcaattata caattcccta gcacttacct
9001 tgaaggttct gtgagagtgg taacaacttt tgattctgaa tactgcagac atggtacttg
9061 cgaacgatca gacgcgggtg tgtgtttgtc tactaatggt agatgggttc ttaataatga
9121 ttattatcga tccattccag gtgtcttttg tggtgctgat gcttcagact tactctttaa
9181 catcttcaca cctcttgtta gaccagtagg cacacttgac atttcagctt ctgttgtagc
9241 aggtggcctt atagccatcc ttgttacatg tgttgcttac tactttatga agtttaggcg
9301 tgcgtttgga gagtacaacc atgttgtctt tgctaatgca cttttgtttt tactgtcttt
9361 tactatactc tgtttgacac ctgcgtacac atttttacca ggtatctatt cattgcttta
9421 cttgtacttg accttctatt ttactaatga cgtgtctttc ttggctcacc tgcaatggct
9481 agctatgttc tcaccaatag tgcctttctg gataacagtc acttatgttg tctgtatttc
9541 tattaagcat tgccattggt tctttagtaa ttacctcaag aagagagttg tttttaatgg
9601 agttacattt agcacttttg aggaggctgc tctgtgtacc tttcttttga ataaggaaat
9661 gtatcttaaa ttgcgtagtg agacactttt gccacttaca caatataata gataccttgc
9721 tctttataat aagtacaagt attttagtgg agctttagac acaactagct atagggaagc
9781 tgcatgctgt cacttagcga aagctctaaa tgacttcagt aactctggtg ctgatgttct
9841 ataccaacca ccacagactt ctattacctc agctgtttta cagagcggtt ttagaaagat
9901 ggcattcccc tcaggcaaag ttgagggatg tatggtacag gtcacatgtg gaacaacaac
9961 cctgaacggt ttgtggttag acgatgtggt ctattgccct agacatgtta tctgcacact
10021 agaagatatg cttaacccaa attatgatga cttacttatt agaaagtcta accataattt
10081 ccttgtgcag gctagtaatg tgcaattgcg tgttattggc catactatgc agaactgctt
10141 gctcaaactt aaggttgaca tagctaatcc taaaacacct aagtataagt ttgtacgtat
10201 tcaacctgga cagacttttt cagtgttagc ttgctacaat ggtgcaccct cgggtgttta
10261 ccagtgtgca atgaggtcca accacactat taagggttca ttccttaatg gttcttgtgg
10321 tagtgttggt tttaacattg actatgactg cgtgtccttc tgttacatgc accatatgga
10381 gcttcctaca ggagttcatg ctggtacaga cttggaaggt aacttctatg gaccatttgt
10441 tgatagacaa acagcacaag cagcaggaac tgatacaacc attacactta atgtgttggc
10501 ttggctctat gctgctgtta ttaatgggga aagatggttc cttaataggt ttacaactac
10561 cctaaatgat tttaatcttg ttgctatgaa gtacaactat gaacctctaa cacaagatca
10621 agttgacatc cttggaccgc tttctgccca aactggagtg gctgtcatgg atatgtgtgc
10681 agcactgaaa gaattgttgc aaaatggctt aaatggtcgt accatactcg gtagtaccat
10741 tttagaagat gagtttacac cttttgacgt tgttagacaa tgctcaggtg taacctttca
10801 aggtaaattc aagaaagtcg tcaaaggtac ccatcattgg ttgctgttga cactcttgac
10861 ttctttgtta atacttgtcc aaagtacaca gtggtcactg tttttctttg tgtatgaaca
10921 tgcctttttg ccgtttacaa tgggtgttgt gtgttttgct gcatgcgcta tggttcttgt
10981 taagcataag catgcatttt tgtgtttgtt tttgttacct tccttaataa ctgttgctta
11041 ttttaatatg atctacatgc ctgctagttg ggttatgcgt gtcatgacat ggttagattt
11101 agtcgacacc agcttgtctg gttatagact taaggactgt gttatgtatg cgttagctgc
11161 tttcttactc atccttatga cagctcgtac tgtttatgat gatgctgcta gacgtgtttg
11221 gacagttatg aatgttataa cacttgtcta caaggtctac tatggtaatt cgcttgatca
11281 agcacttgct atgtgggctc ttgttatttc tgtaacctct aactattctg gtgtcgttac
11341 gactatcatg tttttagcta gagctatagt gtttttgtgt gttgagtatt atcctatttt
11401 gtttattact ggcaacacct tacagtgtat aatgcttgtt tattgtttct tgggctattg
11461 ttgctgttgt tactttggtc ttttctgttt actcaaccgc tatttcagat taactcttgg
11521 tgtgtatgac tattttgtct ccacacaaga gtttaggtat atgaattcac agggactttt
11581 acctcctaag actagtttgg atgcctttaa actcaatgtt aaattattgg gtattggagg
11641 taagccttgt attaaagtgg ccactgttca gtctaaaatg tctgatataa agtgcacttc
11701 tgttgtattg ctttcagttc tacaacaact tagaattgaa tcctcatcca aattgtgggc
11761 acagtgcgtg caattgcaca atgacatctt acttgctaag gatacaactg aggcatttga
11821 aaagatggtc tcattgttat ctgttctgct ttctatgcaa ggcgctgtag atattaataa
11881 gttgtgtgat gaaatgctca acaatcgtgc tactttacaa gccattgctt cagagtttag
11941 ttctctacca tcttatgcag cttatgctac agcccaggag gcttatgagc aggctgttgc
12001 taatggagac tctgaagttg ttcttaagaa attgaaaaag tctttaaatg tggctaaatc
12061 tgaatttgac agggatgccg ccatgcaacg taagttggaa aagatggcgg accaggccat
12121 gacccaaatg tacaagcagg ctagatctga agacaagagg gcaaaagtta ctagtgccat
12181 gcagacaatg ctattcacta tgcttagaaa gcttgataat gatgctttga acaatattat
12241 taacaatgca cgtgatggtt gtgtaccact caacatcata ccattgacaa ctgcagccaa
12301 actcatggtt gttgtccccg attataacac ctacaagaat acttgtgatg gcaacacatt
12361 tacgtatgct tccgctctct gggaaatcca gcaggttgtg gatgcagata gtaaagttgt
12421 tcagttgagt gaaattaaca tggacaattc tcaaaacctt gcttggcctc ttattgttac
12481 agcattgagg tccaattctg cagtcaaatt acagaataat gaactgagtc ctgttgcact
12541 gcgccagatg tcgtgtgccg caggtactac acaaacagct tgcactgatg acaatgcact
12601 tgcctattac aacacttcta agggaggtag gtttgtgctt gcattattat cagaccacca
12661 agatctcaaa tgggcacgtt tcccaaagag tgatggtaca ggtactatat acacagaact
12721 ggaaccacca tgtaggtttg ttacagacac accaaaaggc cctaaagtga agtacttgta
12781 ctttatcaag ggccttaaca acctaaatag aggtatggta ctgggtagtt tagctgctac
12841 agtacgttta caagctggca atgctacgga agttcctgcc aattctactg tgctttcttt
12901 ttgtgcgttt gctgtggatc cagctaaggc atataaagat tacctagcta gtggtggaca
12961 accaattacc aattgcgtaa agatgctgtg cacacacaca ggtacaggac aggctataac
13021 tgtaatacca gaagccaata tggaccaaga gtcctttggt ggtgcttcat gttgcttgta
13081 ttgtagatgc cacattgatc atccaaatcc taagggattt tgtgacttga agggtaagta
13141 tgtccaaata cctaccacat gcactaatga ccccgtgggt tttattctta gaaacacagt
13201 ctgtactgtc tgcggtatgt ggaaaggtta tggctgtagt tgtgatcaac tccgcgagcc
13261 cgtgatgcag gcagctgatg ccccagcgtt tttaaacggg tttgcggtgt aagtgcggcc
13321 cgtcttacac cgtgcggcac aggcacaagc actgatgtcg tttacagggc ttttgatatt
13381 tataatgaga aagttgctgg ttttgcaaag ttcctaaaaa caaattgttg ccgtttccag
13441 gaagttgatg aagagggcaa cttattagac tcctattttg ttgttaagag acatactatg
13501 tctaattatc aacatgagga gactatgtat aatttagtta aagagtgtcc agctgttgct
13561 gtgcacgact tctttaaatt tagagtagat ggtgacatgg taccacacat atcacgccag
13621 cgtcttacta aatacacaat ggcagactta gtctatgcac ttcgtcattt tgatgaaggt
13681 aattgtgaca ccttaaaaga aatattagtc acatacaatt gttgtgatga cgcatatttc
13741 aataaaaagg attggtacga ctttgtggaa aatcctgata tactacgcgt atacgcatgc
13801 ctaggtgagc gtgtgcgcca agctttgtta aagactgtac agttctgcga tgccatgcgc
13861 gatgcgggca ttgttggtgt actcaccttg gataatcaag atctgaatgg gaattggtac
13921 gatttcggtg acttcgtaca agtggcacca ggtgcaggta ttcctattgt agattcttat
13981 tattcattgc tgatgcccat tcttacgtta acgaaggcat tggcagccga gtcccatatg
14041 gactgtgata ctacaaagcc tctcattaag tgggacttgt tgaagtatga tttcacggaa
14101 gaaagattat gtctttttaa ccgttatttc aagtattggg atcaaacata ccaccctaat
14161 tgtattaact gtttggatga taggtgtatc ctacactgtg caaactttaa tgttttattt
14221 tccacggtgt ttccgccaac aagttttggc ccacttgtga gaaaaatttt tgtggatggt
14281 gttccttttg ttgtatcaac aggctaccat ttccgtgagt tgggagttgt acataatcag
14341 gatgtaaact tacacagctc acgtctcagt tttaaggaac ttttagtgta cgctgctgat
14401 cctgctatgc atgctgcatc aggcaacctg ttgcttgata aacgcactac atgcttttca
14461 gtggctgcac tgacaaatag tgttgctttt caaactgtca aacctggtaa ttttaataaa
14521 gacttttatg actttgctgt gtctaaaggt ttcttcaagg aaggaagttc tgttgaattg
14581 aaacacttct tctttgcaca ggatggcaat gccgctatta gtgattatga ttactatcgt
14641 tataatcttc ctacaatgtg tgacatcaga caactgcttt ttgtggttga ggtggtcgac
14701 aaatactttg attgttacga tggcggttgc ataaatgcta accaagtcat tgttaacaat
14761 ttggataaat cagctggatt cccctttaat aaatggggaa aggctagact ttattatgat
14821 tctatgagtt atgaagatca ggatgcgttg ttcgcttata ctaagcgcaa tgtgatccct
14881 accattactc agatgaatct taaatatgcc attagtgcta agaatagagc gcgcaccgta
14941 gctggtgttt ctatctgtag cactatgacc aatagacagt tccatcagaa attattaaag
15001 tctatagccg ctacaagagg tgccacagtt gtaataggca ctagtaaatt ctatggtggc
15061 tggcataaca tgttaaaaac tgtttacagt gatgttgaaa ctcctaacct tatgggttgg
15121 gattacccaa aatgtgatag agccatgcct aacatgctta ggataatggc atcacttgtt
15181 cttgctcgca aacatagtac ttgttgtaac ctttcacacc gtttctacgg gttagctaat
15241 gagtgtgctc aggtacttag tgaaatggtt atgtgtggcg gttcactcta tgtgaaacca
15301 ggcggtacat cttcaggaga tgccaccact gcttatgcta atagtgtctt taacatttgt
15361 caagctgtta cagctaatgt taatgcactt ttgtctactg atggtaataa aattgctgac
15421 aagtatgtcc gcaatttaca acatagactt tatgaatgtc tctatagaaa tagagacgtt
15481 gatcatgaat ttgtagaaga attttacgct tatttgcgta aacacttttc tatgatgatt
15541 ctctctgatg atgctgttgt ttgctataat agcaactatg cagctcaagg tttagtagct
15601 agcattaaga actttaaagc agttctttat tatcaaaaca atgtttttat gtctgaggca
15661 aaatgctgga ctgagaccga ccttactaaa ggacctcatg aattttgctc tcagcataca
15721 atgctagtta aacaaggaga tgattacgtg tacctgcctt acccagaccc atctagaatt
15781 ttaggcgctg gttgttttgt tgatgatatc gtcaaaaccg atggtacact tatgatagaa
15841 cggtttgtgt ccctagcgat agacgcctac ccacttacaa agcaccctaa ccaggagtac
15901 gctgatgtct tccatttgta tttgcaatac attaggaagt tgcatgatga gcttactgga
15961 cacatgttag acatgtattc agtcatgcta acaaatgata acacttctag gtattgggaa
16021 cctgagtttt atgaggctat gtacacacca catacagtct tgcaggctgt aggcgcgtgt
16081 gtgttatgca attcacagac ttcacttcgt tgcggctcat gcatcagacg accattcctg
16141 tgttgcaagt gctgctatga ccatgtcatt tcgacttcgc ataaattagt gctgtccgtt
16201 aatccctatg tttgcaatgc ccccggttgt gatgtcacag acgtgacgca actttattta
16261 ggaggtatga gctactactg caagtcgcac aagccaccta ttagctttcc tttgtgtgct
16321 aatggtcagg tttttggtct ttataagaac acttgtgttg gcagcgataa cgtaactgat
16381 ttcaatgcca tagccacatg tgactggact aatgccggtg attacatact tgctaacacc
16441 tgcactgaga gattgaaact ctttgctgct gaaactttaa aagctaatga agagacattt
16501 aaactatcct atggcatcgc cactgtgcgt gaagtgctgt ctgatagaga attacatcta
16561 tcttgggaga ttgggaagcc tcgacctccc ttgaatagaa attatgtctt tactggctat
16621 agagttacta agaacagtaa agtgcagata ggagagtaca cctttgaaaa aggtgactat
16681 ggtgatgctg ttgtgtatag aggtactaca acttataagt taaatgtggg cgattacttt
16741 gtgttaacat cacacactgt aatgcccttg actgcaccta ctttagtgcc acaagagcac
16801 tatgtgagaa taactggctt ataccctaca cttaacatct ctgatgagtt ttctagcaat
16861 gttgctaact atcaaaaagt aggtatgcag aagtattcta ctttgcaagg accaccaggt
16921 acaggtaaga gccactttgc cattgggttg gcattgtact atccatctgc acgcatagtc
16981 tacacggcat gctcacacgc ggctgtggat gctctatgcg agaaggcgct aaaatacttg
17041 ccaatagaca agtgtagcag aataatacct gcgcgagctc gcgtggagtg cttcgacaaa
17101 ttcaaggtta attcaacact tgaacagtat gttttctgta cagtcaatgc gctgcctgaa
17161 actactgctg atattgtagt ctttgacgag gtttcaatgg ccacaaatta tgacttgagc
17221 gtcgttaatg ctagattacg tgctaagcat tatgtctaca ttggtgatcc tgctcaatta
17281 cctgcaccac gcacattgct tacaaagggc acactagaac ctgaatattt taactctgtg
17341 tgtcgtctaa tgaaaacaat aggtcccgac atgttccttg gtacgtgtcg ccgatgtcct
17401 gctgaaatag tcgacactgt cagtgcttta gtttatgata ataaacttag ggcacataaa
17461 ggcaagtcat cacaatgttt taaaatgttt tataaaggag tgattacaca tgacgtgtca
17521 tctgcaatca acagaccaca gattggcgtg gttagagaat ttctgacacg caaccctgct
17581 tggagaaaag ctgtttttat ttcaccttat aactcacaga atgctgtggc ttcaaaaata
17641 cttggactgc ctacgcaaac tgtagattct tcacaaggtt ctgaatatga ctacgtcata
17701 tttgctcaga ccacagaaac agctcattca tgcaatgtta atagatttaa tgttgctatt
17761 acaagagcca aagtaggtat tttgtgcata atgtccgata aggacctcta tgataaatta
17821 caatttacta gtctggaagt cccacgtaga agtgtggctg tattgcaatc agagaatgta
17881 actggacttt ttaaggactg tagtaagcta ataactggct tacatcctac acaagcacct
17941 acatacctta gtgttgatac taaattcaaa actgaaggtt tgtgtgtcga cataccagga
18001 ataccaaagg acatgaccta tcgtaggctc atctctatga tgggttttaa aatgaactac
18061 caagttaatg gttaccctaa catgtttatt acccgtgatg aagcaatcaa gcatgttcgt
18121 gcttggattg gctttgatgt agagggttgt catgcaacta gggatgccgt aggtacaaac
18181 ctaccactcc agttagggtt ttcaactggt gttaacttag tagctgttcc tacaggctat
18241 gttgacacaa gtgcagccac agagttctct agagtaaatg caaaaccacc acctggggac
18301 cagtttaaac atctaatacc gcttatgtac aagggtttac cttggaacat agtgcgtgtt
18361 aagattgtac aaatgcttag tgatacacta aaagaccttt cagatagagt cgtgttcgtc
18421 ctttgggcac atggctttga acttacttca atgaagtatt ttgtcaagat tggaccagaa
18481 cggacgtgtt gtctgtgtga caagcgcgca acttgctttt caacttcatc agatacatac
18541 gcttgctggc accactctgt gggttttgac tatgtctata atccatttat gattgatgtc
18601 cagcagtggg gatttactgg caatttgcag agtaaccatg accaacattg ccaagttcat
18661 ggcaatgcac atgttgctag ttgtgatgcc atcatgactc gttgtcttgc cattcacgag
18721 tgctttgtga agcgcgtgga ttggtctgta gaatacccta ttataggtga cgagctgaga
18781 attaatgtag catgcagaaa agtacaacat atggttgtaa agtctgcttt gcttgcggat
18841 aagtttccag ttcttcacga tattggtaat ccaaaggcta taaagtgtgt ccctcaggct
18901 gatgtagaat ggaagttcta cgatgtgcaa ccttgtagtg acaaagctta caaaatagaa
18961 gagttgttct attcttatgc aacccatcat gataaattta cagatggcgt gtgtttgttt
19021 tggaactgta acgtggatcg ttacccttct aatgcaattg tttgccggtt tgatactaga
19081 gtgttatcta acttgaatct gcctggctgt gatggtggta gtttgtatgt aaataaacat
19141 gcattccaca cacctgcctt tgataaaggt gcttttgcta acttgaagca attaccattt
19201 ttctattatt ctgacagtcc ttgcgagtca catggtaagc aagtcgtgtc agacattgat
19261 tatgtgcctc ttaaatctgc tacgtgtatt acacgatgca acttaggcgg tgccgtttgt
19321 cgtcatcatg catctgagta cagacagtat ttagatgctt ataacatgat gatttcggcc
19381 ggctttagcc tttggattta caagcagttt gacacttata atctctggaa tacctttact
19441 aggttacaga gtttagagaa tgtggcttac aatgttgtta ataaaggaca ttttgatggt
19501 caagctggtg aaaaaccagt ttccatcatt aataataccg tctacacaaa ggtggatggt
19561 gttgatgtag aaatctttga aaataaaacg actttgcctg ttaatgttgc atttgagctt
19621 tgggctaaac gtaacattaa acctgttcca gaaataaaga tactcaataa tttgggtgtt
19681 gatattgctg ctaatactgt tatttgggat tataaaagag aatcaccagc ctatatttca
19741 acaataggtg tctgtacaat gactgacatt gctaagaaac ctactgaaaa cgcttgttcc
19801 tcactcaccg tcttttttga tggtagagtt gatggacagg ttgattcttt tagaaatgca
19861 cgtaatggtg ttttaattac agaaggctca gtgaaagggt taaacccttc taaggggcca
19921 ccacaggcta gtcttaatgg agtcacattg attggagaat ctgtaaaaac acagtttaat
19981 tactttaaaa aagtagatgg cgttgttcaa caactgccag aaacctactt tactcagagc
20041 agaagtttag atgatttcaa acccaggtca caaatggagg ttgatttcct acaacttgca
20101 atggatgaat tcatagagcg gtataagctc gagggttacg cctttgagca tatcgtctat
20161 ggagatttta gtcatggaca attaggtggg ctacatctta tgattggtct cgccaaaagg
20221 tctttagaat cactactgaa acttgaggat tttatcccga ttgacagtac tgtgaaaaat
20281 tattttgtaa cggatgcaca aacaggttca tctaaatgtg tgtgctctgt cattgatctt
20341 ttacttgacg attttgttga aataataaaa tctcaggatt tgtctgtcgt ttcaaaagtg
20401 gtcacggtca ccattgacta tgctgaaatt tcatttatgc tttggtgtaa agatggacat
20461 gttgagacat tttacccaaa actgcaagca aatcaaacat ggcaacctgg tgtcgccatg
20521 cccaatttgt ataagatgca aagaatgctt cttgataagt gcgaccttca caattatggt
20581 gaaaatgctg tgataccaaa aggaataatg atgaatgtcg ctaaatatac tcaactgtgt
20641 caatatttaa atacacttac tatagcagtg ccttataaca tgcgagttat acattttggt
20701 gcgggatctg ataaaggtgt cgcaccaggc tctgctgtac tcaaacaatg gttgccagtt
20761 ggcacgttgt tggttgattc agacataaat gattttgtgt ctgatgctga ttctacatta
20821 ataggagact gctctactgt ttatacagct aataaatggg atcttattat tagtgatatg
20881 tacgatccga agacaaagca catattaaaa gaaaacgact ccaaggaagg atttttcact
20941 tacttatgtg gttttattaa acaaaagctt gccttgggag gttccgtggc tataaagata
21001 acagaacatt cttggaatgc cgatctttat aagctcatgg gatatttctc atggtggact
21061 gcttttgtca ctaatgtaaa cgcttcttct tcagaggctt tcttaatagg tgttaactac
21121 cttggtaaac agaaagaatc cattgacgga tataccatgc atgctaacta catattttgg
21181 aggaacacaa accctataca attgtcttcc tactctcttt tcgacatgag taaattccca
21241 ctaaagctta ggggaactgc tgtcatgtcc ttaaaagata atcagatcaa cgatatgatc
21301 tgttctcttt tagaaaaggg tagacttatc attagagaga ataataaagt tgttttctct
21361 agtgatgtcc tagtaaataa ttaaacgaac atgaaatttt tggcttttct ctgtcttctt
21421 ggctttgcta acgctcaaga tggcaagtgt ggtacactat ctaataaaag tccatctaag
21481 cttactcaga ctccttcttc taggaggggt ttttattatt ttgatgacat ttttaggtct
21541 tcaattcgtg tgcttaccac tggccatttt cttcctttta atactaacct tacttggtat
21601 ttgactttaa agtctaatgg taagcagagg atttattatg ataatcccaa cattaacttt
21661 ggtgatggtg tttattttgg tctaaccgag aaatctaatg tttttcgagg ttggattttt
21721 ggttcgacat tagacaacac aactcagtct gctgttctct ttaataatgg tacacacatt
21781 gttatagatg tgtgtaactt taatttttgt gctgatccaa tgtttgctgt caatagtgga
21841 cagccttata aaacctggat ttatactagt gcggctaatt gcacttacca cagagcacat
21901 gcatttaata ttagcactaa tatgaatcca ggtaagttta aacattttag ggagcacctg
21961 tttaagaatg tagacggctt cctatatgtc tatcataact atgaacccat tgatcttaac
22021 agtggttttc cttctggctt ttctgtttta aaaccaatac ttaagctgcc ttttggtctc
22081 aacattacat atgttaaggc cataatgaca ttgttttctt ccactcaaag taattttgat
22141 gctgacgctt ctgcttactt tgtgggccat ctaaaacctc tcaccatgct tgttgacttt
22201 gacgagaatg gcaccattat tgatgctata gattgctctc aagatccact ctcagagctt
22261 aagtgtacca ctaagagttt tacagttgaa aaaggaattt atcaaacctc taacttccgt
22321 gttacaccaa ccactgaagt tgttaggttt cctaacatta cacagctttg tccttttaac
22381 gaagttttca atataacctc tttcccatcc gtttacgcgt gggagagaat gcgcattact
22441 aattgtgttg cggactactc agtgctttac aattcttctg cctccttctc aacatttcag
22501 tgttatggcg tttcacctac aaagctcaac gatttatgct ttagcagtgt ttacgcagac
22561 tactttgttg tgaagggtga tgatgtacgc caaattgcac ctgctcagac aggtgtgatt
22621 gctgattaca attacaaatt gcctgatgat tttacaggtt gtgtaatagc ctggaataca
22681 aattctttgg acagttccaa cgaattcttt tacaggagat tcagacatgg aaagattaaa
22741 ccttatgggc gtgacctttc caatgttctt tttaaccctt caggtggtac atgttcagct
22801 gaaggtctta attgttacaa accacttgcc tcctatggat ttacacagtc ctctggaatt
22861 ggctttcaac catacagagt ggttgtgctt tcttttgagt tgttaaacgc acctgctaca
22921 gtttgtgggc ctaaacagtc tactgagcta gttaagaaca agtgtgttaa cttcaatttc
22981 aacggactta caggcactgg tgtgcttact aattctacta aaaagttcca accttttcaa
23041 cagtttgggc gtgacgtttc agattttacg gactccgtca gagaccctaa aacccttgag
23101 attcttgaca ttgcaccttg ttcatacggc ggtgtcagtg ttataactcc tggtacaaat
23161 gcttctagtt cagtggctgt tttgtatcag gatgttaatt gtacagatgt gcctactatg
23221 ttacatgctg atcaaatttc tcatgattgg cgtgtgtatg ccttccgtaa tgatggcaac
23281 atattccaaa cacaggctgg ttgtttgatt ggtgctgctt atgacaactc atcttatgag
23341 tgtgatattc ctataggagc tggcatttgt gctaagtata cgaatgtttc tagcacactt
23401 gtgcgctccg gtggacactc catactagct tacaccatgt ctcttggtga caatcaagac
23461 attgtttatt ctaacaacac cattgctatt ccaatgaatt ttagtattag tgtcactact
23521 gaggtcttgc ctgtttcaat gactaagact tcagtagatt gtaacatgta tatttgcggt
23581 gactccactg aatgcagtaa tttgctgcta cagtatggta gtttctgcac gcagttaaac
23641 agagctcttg ccggtatagc tgtggaacaa gacagaaata ctcgagatgt ctttgcacaa
23701 actaaggcca tgtacaagac tccttctttg aaggattttg gtggttttaa tttttcacag
23761 attttgccag accccgctaa accgtctagt agatctttta ttgaggactt gctttacaac
23821 aaagtcacac ttgctgaccc aggttttatg aagcagtatg gtgattgttt aggtggtgtt
23881 aatgctcgtg acctcatttg tgcacaaaag ttcaatgggc tcacagtact cccaccccta
23941 ctcactgatg aaatgattgc ggcatacacg gcagcactaa taagtggaac ggctacggca
24001 ggttttactt ttggtgcagg tgctgcgctt cagatacctt ttgcgatgca aatggcttac
24061 agatttaatg gcattggtgt cactcaaaat gttttgtatg agaaccagaa acaaattgct
24121 aatcagttca ataaggctat ctcacaaatt caggattcct taagtactac tactacagca
24181 cttggcaaat tacaggatgt gattaaccaa aatgccatag cccttaacac actagttaaa
24241 cagcttagct ccaattttgg tgctatttct agtgtactga atgatattct gtctcgactt
24301 gacaaagtag aggccgaagt tcaaattgac aggcttataa caggacgttt acagagcttg
24361 cagacttatg ttacacagca acttatcaga gccgcagaaa ttagagcctc tgctaatctt
24421 gctgctacaa aaatgtccga gtgtgtactt ggccagtcta agagagtaga cttttgtgga
24481 aaaggatatc atttgatgtc cttccctcag gctgctcctc atggtgtagt tttcttacat
24541 gttacttatg taccatcgca ggaacaaaac ttcactactg cacctgctat ttgtcatgaa
24601 ggtaaagcac actttcctcg tgaaggcgtc ttcgtcacaa atggcacaca ctggtttatc
24661 actcagcgaa atttttattc gcctcagcct attactacag acaatacatt tgtgtcaggc
24721 aattgtgatg ttgtcattgg cattgttaat aacactgtct acgacccact acagcctgaa
24781 ctagactcat ttaaagaaga acttgacaag tattttaaaa accatacttc acagaatgtt
24841 agtcttgatg gtcttaacaa cataaatgct tcagttgtgg acattaaaaa ggaaattgaa
24901 catctcaatg agattgccaa aagcctaaat gaatcactca tcgacctaca agaactaggc
24961 aagtatgagc agtacattaa atggccgtgg tatgtgtggc ttggctttat tgccggtctc
25021 attgccatcg tcatggctac aattatgttg tgttgcatga ccagctgttg tagttgtctt
25081 aaaggtgttt gctcatgtgc ttcatgttgc aaattcgatg aagaccactc cgaaccagtg
25141 cttactggag tgaagttaca ttacacataa acgaacttat ggatttgttt ttgaacatct
25201 tcactttagg atctattact agacaacctg gtaaagttga aaatgtttct cctgcaagtt
25261 cttttcattc tacagcgtcc atccctttac aggccactct acctttcgga tggcttgttg
25321 ttggcgttgc atttcttgct gtttttcaaa gcgctgcgaa attaatacct tttaacagtc
25381 tttggcagcg ttgcttatac cagagctttc aattgctttg caatgtgctt cttattgctt
25441 tgacagttta ctcgcactta ctgcttgttg ctgcagggct tgaagcacct ttcctttatc
25501 tacttgcttt gatttacttc ttacagtgcg ttgtatttgg caggcttctt gtcagatgct
25561 ggctgtgctg gaaatgcaaa tcaaagaatc cattaattta tgactcaaac tattttgttt
25621 gctggcatac tcacactcat gactattgta ttccttacaa tagcattaca aacactatcg
25681 tcctcactgc aggtgatggt gtcactattc ccattcggac acaagactac caaattggtg
25741 gttacttcga aaaatgggaa tctggtgtta aggactatct tacacttatt ggtcctttca
25801 ctgaagttta ttaccagctt gaatctaccc agatttccac agacactggt attaataatg
25861 cgacattctt cctcttctca aagaatgatg aaagagaaca ggaaagtgtc caagttcaca
25921 caatcgacgg ctcatcagga gttgtaaacc caatttacga tgagccgacg ccgactacta
25981 gcgtgcctct ttaagcacat tgattgagta cgaacttatg tactcattcg tttcagaaga
26041 aaccggtacg ttaatagtta atagcgtact tctttttcta gcttttgtgg tattcttgct
26101 agtcacccta gccatcctta ctgcgcttcg attgtgtgca tactgctgca atattgttaa
26161 cgtgagttta gtaaaaccga ctttttacgt ttactcacgt gtaaaaagct tgaattcctc
26221 tcaggaggtt cctgaatttc tggtctaaac gaactaatta ttatttttat tcttttagga
26281 actttaatat tgctctctat gactaacagt agtgcttctc ctcctacgga gaccattacc
26341 gtagagcagt taaaacacct acttgagcaa tggaacctag ttataggttt tctgtttttc
26401 gcttggattc tgctactaca gtttgcttac tccaacagga acaggtttct ttacataata
26461 aagcttgtgt ttctctggct tctttggcca attacactag cctgctttgt gcttgctgcc
26521 gtctacagaa ttaactgggt tacaggaggc atagctatag cgatggcctg cattgtgggt
26581 ctcatgtggc ttagctactt tgtggcttca ttcaggcttt tcgcacggac caggtcttgg
26641 tggtctttta acccagaaac caacattttg cttaacgtgc cactacgtgg taccattctg
26701 accagaccgc ttcttgagag tgaacttgtc attggtgctg tgatcattcg tggtcacctc
26761 cgtatggctg gacactccct tggacgctgt gacattaagg acctccctaa agaaatcact
26821 gttgctacat cacgaactct atcttattac agattaggag cctcccagcg tgtagcatct
26881 gattcaggtt ttgctgttta ccaccgctat cgtatcggta attacaagct aaataccgac
26941 cacataggca gtgacgacaa tattgctttg ctagtacagt aagagacaac agatgtttag
27001 tctagttgct ttccaagtta ccgtagcaga gttgttaatt ttaattatga aatcttttgg
27061 attggcactt actcatatcc aaattggtat agtttcatta ttaaaaatcc taacaaaccg
27121 tctagataga aggtattcta aactagacga agaagaacct atggaaattg atcatcctta
27181 aacgaacatg aaatttcttt tactcgtggc aattgtaagt atagcatcag cagaacttta
27241 ccattaccaa gagtgtgcta gaggtacaac cgtactctta aaggagcctt gccaacctaa
27301 tacttacgaa ggcaactcac cttatcaccc tttggctgac aacaagtttg ctatcacttg
27361 tacaaacacc aaatttagtt ttgtttgtca ggacgagaca agacacgtat ttcaattacg
27421 tgcccggtct atttcaccca gactttttgc cagtccaaaa catcatagtg acgatttcac
27481 cccggtgatc cttattattg tcacattgct ctttgtaatc tactgttgca tgaagagaca
27541 atgattcatt taactttgtt tgatttctac ctttgtgtcc tatctttgct acttttcttg
27601 gtcattataa tgctaatcat cttttgtttt gtgttagaat tacaagatct aaacgaacaa
27661 taaaatgact gataatggac aatcaaactc gcgtaatgcg cctcgcatta cgtttggtgt
27721 ctcagatacc tcagacaata atcagaatgc agaacgtgct ggagcgcggc caaagcaaag
27781 aagaccgcaa ggccctccta acaacacagc atcctggttc acagctctca ctcagcatgg
27841 taaagaaggt ctctcctttc cgcgaggaca gggagtgccc gttaatacca atagtaccag
27901 ggacgaccaa attggctact atcgcagagc tacccgacga gttcgtggtg gtgatggtaa
27961 gatgaaagaa ctcagcccgc gctggtactt ctactatcta ggaactggac cagaggccgc
28021 attaccttat ggtgctaaca aagatggcat agtttgggtc gctacagaag gagccctaaa
28081 cacgcctaaa gatcacattg gcacgcgcaa tcccaacaac aatgctgcca ttgtcataca
28141 gttaccacaa ggtactacct tgccaaaagg cttctacgct gaaggaagtc gtggtggcag
28201 tcaagcctcc tcgcgttcta actcacgtag ccgtggtaat tccagaaatt caacacctag
28261 cagcagcaga ggttcatcac ctgcacgcat ggctgccgga ggagatacgg cacttgcatt
28321 attgctgtta gacaggctga accagcttga gagcaaagtt tcaggtaaga caccacaaca
28381 atcacaggtt gtcacaaaga aaacagctgc tgaggcttct aaaaagccca gacagaaaag
28441 aacagctacc aaagcctata atgttactca ggcttttggt aggcgaggtc ccgaacctac
28501 acagggaaat ttcggtgacc aggaattaat cagattaggt actgattaca aaaattggcc
28561 acagattgca cagtttgcac ccagtgcttc tgcattcttt ggcatgtccc gtataggaat
28621 ggaagtcaca cctacaggga cttggttaac ctataatggt gccataaaat tggatgataa
28681 agacccaaat ttcaaagacc aagttattct gcttaataag cacattgatg cttataagac
28741 atttccacct acagaaccta aaaaggacaa gaagaaaaag gctgatgaag tacagtcact
28801 gccgcagcgt cagaagaaac aggcaactgt gactctgtta cctgcagcag atttggatga
28861 tttttccaaa caacttcaga attccatgaa tgcttcacct gattctactc aggcctaaat
28921 tcatgttgac cacacaaggc agatgggcta tgtaaacgtt ttcgctattc cgtttacgat
28981 acatagtcta ctcttgtgca gaatgaattc tcgtagctaa acagcacaag taggtttagt
29041 taactttaat ctcacatagc aatctttaat caatgtgtaa cattagggag gactggaaag
29101 agccaccaca tagtcaccga ggccacgcgg agtacgatcg agggtacagt gactaatgct
29161 agggagagct gcctatatgg aagagcccta atgtgtaaaa ttattttagt agtgctatcc
29221 ccatgtgatt ttaatagctt cttaggagaa tgacaaaaaa aaaaaaaaaa aaaaaa
//