The Three Faces of Riboviral Spontaneous Mutation: Spectrum, Mode of Genome Replication, and Mutation Rate

Download PDF České info

Riboviruses (RNA viruses without DNA replication intermediates) are the most abundant pathogens infecting animals and plants. Only a few riboviral infections can be controlled with antiviral drugs, mainly because of the rapid appearance of resistance mutations. Little reliable information is available concerning i) kinds and relative frequencies of mutations (the mutational spectrum), ii) mode of genome replication and mutation accumulation, and iii) rates of spontaneous mutation. To illuminate these issues, we developed a model in vivo system based on phage Qß infecting its natural host, Escherichia coli. The Qß RT gene encoding the Read-Through protein was used as a mutation reporter. To reduce uncertainties in mutation frequencies due to selection, the experimental Qß populations were established after a single cycle of infection and selection against RT ⁻ mutants during phage growth was ameliorated by plasmid-based RT complementation in trans. The dynamics of Qß genome replication were confirmed to reflect the linear process of iterative copying (the stamping-machine mode). A total of 32 RT mutants were detected among 7,517 Qß isolates. Sequencing analysis of 45 RT mutations revealed a spectrum dominated by 39 transitions, plus 4 transversions and 2 indels. A clear template•primer mismatch bias was observed: A•C>C•A>U•G>G•U> transversion mismatches. The average mutation rate per base replication was ≈9.1×10⁻⁶ for base substitutions and ≈2.3×10⁻⁷ for indels. The estimated mutation rate per genome replication, μ_g, was ≈0.04 (or, per phage generation, ≈0.08), although secondary RT mutations arose during the growth of some RT mutants at a rate about 7-fold higher, signaling the possible impact of transitory bouts of hypermutation. These results are contrasted with those previously reported for other riboviruses to depict the current state of the art in riboviral mutagenesis.

Published in the journal: . PLoS Genet 8(7): e32767. doi:10.1371/journal.pgen.1002832
Category: Research Article
doi: https://doi.org/10.1371/journal.pgen.1002832

Summary

Introduction

Riboviruses (RNA viruses with no DNA replication intermediates) infect organisms from prokaryotes to higher eukaryotes and frequently cause deadly diseases. The mortality, morbidity, and economic burden of ribovirus-borne diseases strongly impact human society, especially in developing countries where neither sanitation nor treatment may be adequate [1]. Although extensive efforts have focused on developing countermeasures to prevent or treat riboviral diseases, only a few of these diseases can be effectively controlled by vaccination or antiviral drugs. In addition, control or eradication of riboviral diseases is soon balanced by the emergence of new riboviral pathogens or treatment-resistant strains of old ones (reviewed in [2]). Thus, we seek to understand which special features of these viruses contribute to their success. One key feature is their high mutation rate (reviewed in [3]).

Although the evolutionary forces driving high riboviral mutation rates remain unclear (reviewed in [1]), three mechanistic factors play important roles: the higher error-insertion rates of RNA replicases compared to DNA replicases, the lack of proofreading activity in RNA replicases, and the nonexistence of post-replicative RNA mismatch repair. The estimated mean rate per infection cycle is about 1.3 for several common single-stranded RNA (ssRNA) human pathogens [4], roughly 0.1 for ssRNA tobacco viruses [5], and 0.03 for the double-stranded RNA (dsRNA) bacteriophage φ6 [6]. Unfortunately, most of these estimates were based on studies in which small, potentially unrepresentative sequences were used as mutation reporters. In some cases, estimated rates in excess of 1 per infection cycle are probably incompatible with viability [7]. A further problem is the scarce information on the mode (linear, exponential, or mixed) by which riboviruses replicate their genomes within the host cell. Distinct modes of genome replication impact the pattern of intra-cell mutation accumulation in the riboviral genome (and hence the mutation rate per infection cycle) differently. The only two empirical studies published to date on riboviral replication strategy, one conducted with the phage φ6 [8] and the other with the ssRNA turnip mosaic virus [9], suggest that riboviruses replicate their genome mostly in a linear fashion, but further results are needed based on other riboviral systems.

In addition, there are limited data on the kinds and relative frequencies of spontaneous mutations (the mutation spectrum) in riboviruses, again a reflection of mutation reporters that do not sufficiently sample the genome. Only three spontaneous mutation spectra based on a cognate riboviral gene of adequate size are available and, unfortunately, none seems to be fully illustrative. The tobacco mosaic virus rate and spectrum [10] were derived under conditions of multiple sequential infections. The tobacco etch potyvirus spectrum [11] probably contains a large fraction of mutations resulting from methodological manipulations rather than from virus replication errors. Finally, the phage φ6 spectrum [6] was obtained from a mutation-accumulation experiment in the absence of gene complementation in trans, which tends to discriminate against strongly deleterious mutations.

A complete portrait of spontaneous mutagenesis in riboviruses is important not only for understanding their prevalence but also for improving ways to prevent and to treat riboviral diseases. For instance, accurate information on riboviral mutation kinds and rates may facilitate the creation of more stable attenuated vaccines [12]. Similarly, it seems likely that antiviral treatments based on mutagenic base analogs may prove to be more effective if the base analogs specifically increase the rate of those errors that riboviral replicases already generate most frequently. Although pathway-directed mutagenesis is unlikely to prevent the appearance of riboviral resistance to specific base analogs, it may enlighten the development of more efficient combinatory therapies [13] and at least slow disease progression, thus enhancing the immune response.

The main aims of the present study were to characterize the mutation spectrum, to determine the mode of genome replication, and to estimate the spontaneous mutation rate of a ribovirus using the bacteriophage Qß as an experimental model. Qß has been well characterized physiologically [e.g.], [ 14]–[17], physiochemically [e.g.], [ 18]–[21], structurally [e.g.], [ 22]–[27], and molecularly [e.g.], [ 28]–[32]. It is a linear (+)-strand ssRNA phage whose natural host is Escherichia coli, although it can also propagate in other gram-negative bacteria with an F pilus. Its 4217-nt long genome is organized in three cistrons that encode (from 5′ to 3′) the A2 or Maturation protein, which mediates both the binding of Qß to the host and post-replicative host lysis; the Coat protein and its elongated A1 or Read-Through (RT) protein, which is required for Qß capsid assembly and for host infection; and the catalytic ß subunit of the Qß replicase. (RT is translated when a ribosome incorporates tryptophan at the natural UGA stop codon of the Coat-coding gene at a frequency of ≈3% [33].) Qß's life cycle may be summarized as follows: i) the phage attaches to the F pilus of E. coli and the genome enters the cytoplasm; ii) cellular components translate the ß subunit of the phage replicase, which then polymerizes with four host subunits (the ribosomal protein S1, the translation elongation factors EF-Tu and EF-Ts, and the host factor HF) and binds the Qß genome; iii) the ß subunit copies the (+)-strand genome to produce a (−)-strand RNA that in turn is used as template to produce more (+) strands; iv) (+) strands serve as templates for the production of the phage proteins; v) 40–60 minutes after infection, by which time the host cell is filled with phage particles, partially assembled virions, and phage-specific side products, the cell lyses, releasing (10–40)×10³ particles of which only 10–50% are infectious (reviewed in [14], [34]).

Here, we used the gene encoding the RT protein (excluding the portion that encodes the Coat protein) as an in vivo mutation reporter. Selection against RT⁻ mutants was ameliorated by using a complementing system in trans based on a plasmid that encodes the entire Coat/RT mRNA with the natural UGA stop codon replaced with a TGG tryptophan codon [35]. To further reduce the effect of selection, the experimental Qß populations were established after a single cycle of infection. We assessed the Qß genome mutation rate (μ_g) in three different ways: i) a forward-mutation test in which mutants carrying phenotypically detectable RT mutations were isolated and sequenced and μ_g was estimated from the frequency of observed nonsense mutations and indels; ii) single-burst reversion tests in which two different RT⁻ mutants were employed (one carrying a single-base substitution and the other a four-base insertion) and μ_g was estimated from the corresponding reversion rates; and iii) a phenotype-blind forward-mutation test in which some first-generation progeny of the RT mutants detected by the first method were isolated and sequenced and μ_g was estimated from the frequency of all secondary RT mutations generated de novo. The distributions of RT⁺ revertants observed in the reversion tests were used to infer the mode in which Qß replicates its genome, and the spontaneous Qß mutation spectrum was obtained from the RT mutations collected through the forward-mutation tests.

Results

Description of the System

The basics of the experimental system and the strains used in this study are described in Figure 1 and Table 1, respectively.

Schematic representation of the experimental system used to isolate <i>RT</i> mutants. — **Fig. 1. Schematic representation of the experimental system used to isolate RT mutants.**

The Mutation Spectrum

Mutations arising in a mutation-reporter (target) sequence can be of two types. “Detectable” mutations are those that display the mutant phenotype when present as a single mutation. “Undetectable” mutations lack the mutant phenotype when present as a single mutation but may nevertheless be observed when they arise in the presence of a detectable mutation, in which case they are sometimes called “hitchhiker” mutations (and their detectable partner may be called a “driver” mutation). Sometimes, especially with mutants with equivocal phenotypes, no mutation is found in the target, reflecting either some imperfection in the screening method or a mutation elsewhere in the genome whose effect mimics that of the reference mutation; such isolates are thereafter included in the non-mutant total. Another distinction is often relevant: some mutations produce a fully mutant phenotype but others produce an intermediate phenotype (and are therefore often called “leaky” mutations or are said to produce a “weak” mutant phenotype). In this study, yet another dimension is added. Each Qß mutant originally isolated as requiring a helper host to generate a plaque or each of a number of non-mutant control plaques was re-plated and up to four next-generation plaques were harvested and sequenced. When all members of such a family contain the same mutation, we call it the “primary” mutation, and if some of the next-generation plaques contain additional mutations, we call them “secondary” mutations, which may arise when mutation rates are sufficiently high.

One-step growth curves of wild-type (wt) Qß in RT-helper (RTH) cells, which complement RT⁻ mutations, indicated that Qß requires ≈75 min to lyse an infected RTH cell (Figure 2). Thus, to limit the number of infection cycles to one before seeking RT mutants, RTH lysates were generated by adding chloroform 75 min after infection with wt Qß. Samples of these lysates were plated with RTH cells and the resulting plaques were harvested and tested for the RT⁻ phenotype (impaired growth on non-complementing NR16205 cells but normal growth on RTH cells). Among 7517 plaques tested in four independent experiments, 47 candidate RT mutants were recovered and sequenced. Of these, 30 contained at least one primary RT mutation (Table 2). (The 17 candidates with no primary RT mutation may have carried an RT⁻-mimicking mutation elsewhere in the genome or, because Qß grows better on RTH cells than on NR16205 cells, might have carried weak non-RT mutations and showed enhanced growth on RTH cells.) Most of the primary mutations were missense but two (one in mutant RT23 and one in RT37) were indels consisting of single-base additions. Two mutants each carried two primary mutations; in RT18, both were missense; in RT41, one was missense and the other was a synonym. Three mutants (RT10, RT40 and RT46) each carried a nonsense mutation that generated a stop codon; RT40 is a special case because it converted the leaky UGA codon that terminates the Qß Coat protein to a far less leaky UAA stop codon. In two cases, the primary mutation displayed at most a very weak phenotype upon re-plating: the primary mutation of RT27 was a synonym and that of RT33 was missense. These mutants are included in Table 2 and dependent calculations because their mutations could in principle produce a deleterious effect but would have no significant impact if disregarded. The 13 RT secondary mutations (Table 2) presumably arose sufficiently early during the growth of the screened plaques on RTH lawns to be detected among the next-generation progeny. They include 6 missense mutations and 7 synonyms, a ratio that deviates from the approximately 3.3∶1 ratio expected from the set of RT codons. Applying the binomial distribution, finding 6 missense among a total of 13 mutations has P = 0.014 and finding ≤6 has P = 0.018. This result presumably signals selection against RT mutations with strong effect during plaque growth on RTH lawns, consistent with the smaller burst sizes of RT_IN (a mutant Qß strain carrying a four-bases insertion in RT; see Table 1) than wt Qß in helper cells. (The average burst sizes of RT_IN and wt Qß are 328 and 847, respectively, estimated from three different one-step curves per phage type.)

**Fig. 2. Representative one-step growth curves of wt Qß in RTH cells.**

Spontaneous <i>RT</i> mutants and mutations. — **Tab. 2. Spontaneous RT mutants and mutations.**

Table 3 lists the kinds of mutations in the entire set of 45. The 2 indels are strikingly less frequent than the 43 single-base substitutions. The general expectation that frameshifting indels generate a detectable mutant phenotype when arising in a protein-coding sequence reduces the chances of having missed other indels during the scoring of mutants. In addition, pQßRT, the RT-expressing plasmid used in this study, can complement RT deletions comprising up to 447 nt [36], which reduces the probability of having missed indels >1-nt long. The 39 transitions were almost 10-fold more frequent than the 4 transversions; if both transversions and transitions were to arise at equal frequencies among base-substitution pathways, the expected ratio would be 1∶0.5 (each site being able to generate two kinds of transversions and one kind of transition), a 20-fold difference from the observed ratio. Transitions, when ranked in decreasing order of observed numbers, were U→C (16)>G→A (10)>A→G (8)>C→U (5). The numbers of the four bases in the target decrease in the same order, U(175)>G(147)>C(139)>A(130), but this trend cannot quantitatively explain the normalized frequencies of mutated bases, which is 0.091>0.068>0.058>0.038. Thus, the intrinsic mutability of the four bases, presumably reflecting the error propensities of the Qß replicase, is likely to be the main determinant of the relative frequencies of observed mutations.

Sequence changes in <i>RT</i>. — **Tab. 3. Sequence changes in RT.**

The mutations were widely distributed over the target (Figure 3). Because only 4 RT positions out of the observed 38 hosted more than one substitution, the spectrum is clearly far from saturation. Both indels arose within short homopolymeric runs, a common pattern in mutation spectra that presumably reflects misaligned primer-templates [37], [38]. The substitutions showed no correlation with their nearest neighbors either individually or as purines versus pyrimidines (analyses not shown). However, because a tendency towards enhanced mutability of any base within a G/C-rich sequence has been observed in both E. coli [39], [40] and the T-even coliphage RB69 [41], we also examined the base composition (G+C versus A+T) of the local sequence environments where substitutions were observed. G•C base pairs are more stable than A•U pairs, so that G/C-rich sequences might help to stabilize secondary structures containing hairpin loops, where unpaired bases may be more sensitive to oxidative damage. In addition, duplexes richer in G•C pairs may be slower to unwind, which might render replication more error-prone in currently unknown but perhaps generally applicable ways. A recent description of the structure of the Qß replicase [31] suggested that the replicating Qß genome (template+complement) forms a 6–7 base-pair duplex in the internal cavity of the replicase before both the single-stranded product and template exit the enzyme. Accordingly, we analyzed the base composition of the sequences six and seven bases upstream of the observed substitutions (Figure 4). Both the 6-mers and the 7-mers contain more (G+C) than expected from the target content of bases. The difference for the 6-mers has P = 0.059 and for the 7-mers has P = 0.034 (replicated G-test for goodness-of-fit, P-values for “pooled G”, G_P, 1 df). Nevertheless, these small differences, combined with the homogeneity in base composition of the analyzed sequences, made the “total G” (G_T) non-significant in both analysis (G_T = 0.409, 6 df, and G_T = 0.420, 7 df, for 6 -⁠ and 7-mers, respectively). Overall, a larger sample of mutations would probably indicate more clearly the existence (or absence) of any effect of the G/C content of the local sequence on Qß-replicase error tendencies.

**Fig. 3. Spectrum of spontaneous Qß mutations.**

**Fig. 4. Frequencies of (G+C) and (A+T) upstream of base substitutions.**

The Mode of Genome Replication

To determine how mutations accumulate in the Qß (+)-strand progeny during replication and thus to estimate the rate of spontaneous mutation per genome replication in Qß, it is necessary to know the mode by which Qß produces its progeny during cell infection. Two distinct modes are possible. One is linear, wherein the infecting (+)-strand genome is used repeatedly as a template and then at least some of the resulting (−)-strand RNAs are each used repeatedly as templates; consequently, at the end of the infection cycle, each of the many (+)-strand progeny has experienced only two replications, from (+) to (−) and from (−) to (+). In this model, due to the many (+)-strand progeny contributed by the fewer (−)-strand templates, most replication errors will produce a single mutant during the second round of replication and only a small fraction of errors will generate a clone of mutants when a replication error occurs in the first round of replication and is further repeatedly copied in the second round [4]. The other mode is classical exponential replication, in which case the numbers of mutants recovered from single viral bursts display an exponential distribution [42]. Intermediate models combining linear and exponential replication in different proportions are also conceivable.

To determine which model best fits the distribution of mutants in Qß, two separate single-burst reversion tests were conducted (see Materials and Methods), one using the mutant RT_IN and the other using RT_SUB (described in Table 1). The tests involved plating cultures, containing bursts from infected RTH cells, onto NR16205 cells, aiming to deliver roughly 1 revertant-yielding burst on each of many plates. With RT_IN, among the 250 cultures plated, six did not form plaques well and were discarded, and five had the following number of Qß plaques: 1254, 585, 345, 342 and 105. Because the plating efficiency of wt Qß with NR16205 is ≈1.5-fold lower than with the RTH strain, those numbers correspond to about 1881, 818, 518, 513 and 158 plaques, respectively. Since the frequency of RT⁺ revertants in the starting RT_IN population was (4.63±1.50)×10⁻⁶ (mean ± SD, n = 5) and roughly 4560 infected cells were introduced into each of the 250 experimental cultures, the expected total number of revertant bursts produced by preexisting RT⁺ phages was ≈5.3, in close agreement with the observed five large revertant bursts. The variation in numbers of revertants among these five bursts is consistent with the observed variation in burst sizes of wt Qß growing in the RTH host, 847±308 (mean ± SD, n = 3 one-step curves), and individual bursts with sizes from 300 to 3000 have been observed. In the case of RT_SUB, among the 500 cultures assayed, one had to be discarded and another contained 935 plaques; in this case, the expected number of bursts from preexisting RT⁺ revertants was 3.5. None of the cultures containing bursts attributable to preexisting RT⁺ phages were included in the analyses.

The revertant distributions differed for the two mutants (Table 4). With RT_IN, the distribution closely fitted a Poisson, supporting a linear mode of genome replication for Qß and strongly inconsistent with an exponential mode. With RT_SUB, the distribution deviated significantly from a Poisson, showing an excess of plates containing ≥3 revertants. Even within a linear replication mode, however, these results may reflect either or both of two causes. The first is that different reversion pathways during the first and second rounds of replication will tend to have different rates at any particular site. With RT_IN, the reversion target for the first replication consists of 5′-UCUUAAUUAAGU-3′ where the target is underlined and reversion to wild-type would probably occur by the deletion of UUAA or, perhaps less likely, by pseudoreversion by the loss of one base from any of the four homo-dinucleotides, producing a gene with one extra codon. Unusually, the second-replication target is 5′-ACUUAAUUAAGA-3′, which is identical to the first-replication target except for the outermost flanking bases. Thus, the two error rates might be very similar and the ratio of (+)-strand to (−)-strand products might have been large enough so that errors accumulated mostly during the second replication and the resultant revertant bursts were largely composed of clones of size 1. With RT_SUB, however, reversion must have occurred along the available single-base-substitution pathways (up to 8 for a UAG stop codon, depending on the functional competence of the encoded amino acids), each of which differs between replications and which might therefore have displayed large rate asymmetries, which can easily exceed 100-fold in the case of DNA genomes [e.g., 43]. The second cause is that the total number of copying events probably differ between (−)-strand and (+)-strand synthesis during cell infection; in Qß, for instance, the number of accumulated (+) strands was estimated to be about 10 times greater than the number of (−) strands [14], [44], so that revertant bursts of size 1 from the (+)-strand synthesis would then be more frequent than the larger bursts from the first rounds of replication. Notably, however, these larger bursts, once they appear, are expected to exhibit variable sizes that depend on, among other factors, the growth conditions [45], and that might therefore impact the observed distribution.

Observed and expected distributions of <i>RT</i><sup>+</sup> revertants in RT<sub>IN</sub> and RT<sub>SUB</sub> bursts. — **Tab. 4. Observed and expected distributions of RT⁺ revertants in RT_IN and RT_SUB bursts.**

The Rate of Spontaneous Mutation

To estimate the rate of spontaneous mutation per genome replication (μ_g) for a ribovirus, it is necessary to know (i) the mutation frequency f, (ii) the number of infection cycles c that elapse between the initial infection and the scoring of mutants, (iii) the average number of times n that each genome is replicated per infection cycle, (iv) the number of detectably mutable bases in the mutational target (T), and (v) the genome size (G). In the present case, G = 4217 nt, c = 1, and, from our results, n≈2. Although T = 591 RT bases for estimating the indel mutation rate (μ_I), that number cannot be used when estimating the corresponding base-substitution rate (μ_SUB) because, while nearly all indels are detectable, many substitutions fail to produce a mutant phenotype. Instead, μ_SUB may be estimated from the number of substitutions that generate a stop codon (nonsense mutation) because, like indels, nonsense mutations are generally detectable. When considering nonsense mutations, T equals one-third of the number of paths in the mutational target that may generate a stop codon (one-third because each base can mutate by three different paths) [46]. In this study, 3 nonsense mutations were found among 7517 Qß isolates, and T = 66 paths leading to a stop codon in the RT target. Thus, f_path = 3/(7517)(66) = 6.047×10⁻⁶, f_SUB = 3f_path = 1.814×10⁻⁵ per base, and μ_SUB = f_SUB/cn = 9.0704×10⁻⁶. Because 2 indels were found, f_I = 2/(7517)(591) = 4.502×10⁻⁷, and μ_I = f_I/cn = 2.251×10⁻⁷. Hence, μ_g = (μ_I+μ_SUB)G = 0.039.

In addition to the primary mutations detected by their phenotypes, some hitchhiking mutations were found. These secondary mutations may be used for an independent estimate of μ_g, in which case T = 591 bases. A total of 9 secondary mutations (all base substitutions) were detected among 112 sequenced sub-isolates. (The remaining secondary mutations from the 13 described in Table 2 were observed in RT⁻ isolates lacking any detectable primary RT mutation and thus were excluded from these calculations.) Thus, f_SUB = 9/(112)(591) = 1.36×10⁻⁴, μ_SUB = f_SUB/cn = 6.80×10⁻⁵, and μ_SUBg = μ_SUBG = 0.287. This value is greater than the corresponding value from the nonsense-mutation method by 7.4-fold and may, as discussed later, signal the impact of transient hypermutation.

Mutation rates can also be estimated for the reversion of the mutants RT_IN and RT_SUB using the results of the single-burst reversion tests. First, some definitions are needed: the number of cultures = C; the average number of infected cells per tube = N; the average burst size = B; the number of initial [(+)-strand→(−)-strand] copies = c₁ with an error rate μ₁ per copy; the number of succeeding [(−)-strand→(+)-strand] copies = c₂ with an error rate μ₂ per copy and a burst size B = c₂ that ignores unpackaged genomes; and there are n = 2 two rounds of replication per infection. Then the average total number of mutational events per infected cell will be c₁μ₁+c₂μ₂; however, these components cannot be disentangled with our data, so we will assume that c₂μ₂≫c₁≥₁ (e.g., most of the mutations are generated in the second replication, as indicated by the results from the single-burst reversion tests), in which case the average total number of mutations per infected cell will be c₂μ₂ = Bμ₂.

For a set of cultures of which some contain 0 mutants, the fraction of null tubes is e^−m where m is the average number of mutational events per culture [47]. The total number of replication events per culture≈NB, whence μ₂≈μ = m/NB. For RT_IN, the fraction of null tubes was 31/239, m = 2.04, N≈4560 infected cells per tube, and B = 328±93 (mean ± SD, n = 3 one-step curves), so that μ(RT_IN) = (1.37±0.39)×10⁻⁶. For RT_SUB, the fraction of null tubes was 238/498, m = 0.738, N≈35, B = 859±165 (mean ± SD, n = 3 one-step curves), and μ(RT_SUB) = (2.46±0.47)×10⁻⁵. The ratio μ(RT_SUB)∶μ(RT_IN)≈18 which, given the indel sample size of 2, agrees well with the corresponding ratio of the two kinds of mutations (substitutions and indels) in the spectrum (43∶2≈22) or by rate (9.07×10⁻⁶)∶(2.25×10⁻⁷)≈40).

Another way to estimate these reversion rates is to use μ = f/2 but, as directly above, to assume that all detected mutations arose in the second round of replication, those arising in the first round being too infrequent to be readily observed, in which special case, μ = f as above. Here, f is simply the sum of all observed RT⁺ revertants divided by all the Qß progeny in all tubes, NBC. For RT_IN, the total number of revertants was 510 (Table 4), so that μ(RT_IN) = 510/(4560)(328±93)(239) = (1.43±0.40)×10⁻⁶, a value close to the null-class value because of the excellent agreement between the observed distribution and the Poisson expectations (Table 4). For RT_SUB, the total number of revertants was 446 (Table 4), so that μ(RT_SUB) = 446/(35)(859±165)(498) = (2.98±0.58)×10⁻⁵, again a value close to the null-class value but slightly higher due to the occurrence of a small excess of plates with larger numbers of revertants compared to the expectations of the Poisson distribution (Table 4). Because the number of paths in which the RT_SUB mutated codon (UAG) may change producing an RT⁺ revertant is not known, the estimated μ(RT_SUB) is an upper limit corresponding to 8 paths or 2⅔ substitutions.

Discussion

The Mutation Spectrum

We have obtained a spontaneous mutation spectrum for the RNA coliphage Qß using a cognate mutational target, the RT-coding gene minus the portion encoding the Coat protein. This 591-nt target generously samples the 4217-nt Qß genome, and the RT and genome base compositions are indistinguishable (G-test of independence, P = 0.9719, 3 df). The spectrum, based on 45 single-base changes, is a mixture of 32 primary mutations plus 11 secondary mutations found hitchhiking on some primary mutations, plus 2 single synonymous mutations (at target sites 18 and 294) arising during sequencing that showed no primary mutation. This spectrum has three defining characteristics. One is its strikingly low frequency of indels, only 2 among 30 RT mutants and 45 mutations, thus representing only about 4% of the total mutations, while in spectra from several DNA-based microbes (phages λ and T4, E. coli, Saccharomyces cerevisiae, and Schizosaccharomyces pombe), indels comprise about 40% of the mutations (average 41%, range = 27–59% [39], [46], [48]–[51]). Another characteristic is its unusually high transition∶transversion ratio (39∶4 = 9.75) compared to a random expectation of 1∶2 = 0.5. This transition bias contrasts with the transition∶transversion ratios observed for the same DNA-based microbes mentioned above (mean 0.87, range 0.08–1.67). Finally, normalized to target-base frequencies, the spectrum reveals a biased mutation tendency consisting of U→C>G→A>A→G>C→U> all transversions. Taking into account the dynamics of Qß genome replication with most mutations arising during the second round of replication, this mutation bias reflects a mismatch formation/extension bias in the template•progeny sense of A•C>C•A>U•G>G•U> transversions mismatches. This bias does not seem to reflect either cytosine deamination (which promotes C→U) or guanine oxidation (which promotes G→U), but rather the insertion of ionized, tautomerized, wobbled or syn-conformation bases.

Several other spectra of spontaneous riboviral mutations have been described previously.

The first, using (+)-strand tobacco mosaic virus (TMV) and a target-complementation system in trans, reported a notable preponderance (24∶11) of indels over substitutions, similar numbers of transversions (6) and transitions (5), and a remarkably high frequency (9/23) of mutants with multiple mutations (“multiples”) [10]. These multiples may have arisen either because of transient hypermutability (which may also have been observed with Qß, as described below) or because the mutants were recovered after multiple sequential cell infections, estimated at c≈6. In this study, only mutants carrying lethal mutations were recovered. Thus, if many of the TMV base-substitutions were leaky, then, given c≈6, the high frequency of indels might have reflected their fully mutant phenotypes in contrast to the often leaky substitutions; however, although the TMV target sequence contained 115 paths to stop codons, no nonsense mutations were recovered in the small sample of substitutions. Perhaps some other factor impinged on this system, such as photodynamic mutagenesis.
Another spectrum, obtained in a mutation-accumulation experiment using almost the entire genome as a mutational target for the dsRNA phage φ6, also exhibited a strong transition∶transversion bias (46∶5 = 9.2) and only one indel [6], although selection may have reduced the recovery of indels. In this phage, the parental genome can be written as [+/−]. Upon infection, the (−) strand is copied repeatedly into (+) strands, some of which are translated and others of which are encapsulated. Within the nascent viroid, the (+) strand is copied once to produce a [+/−] progeny particle. In this pathway, a mutation such as U→C can arise by an A•C mispair in the first round of replication, or by a U•G mispair in the second round. Both paths would produce clones of size 1 in a single-burst reversion test, the predominantly observed result [8], but the culpable mispair remains unknown. The observed mutation bias was A→G(14)≈C→U(13)≥G→A(10)≈U→C(9)>all transversions, or, when normalized to the numbers of target bases, A→G(0.0052)>C→U(0.0034)≥G→A(0.0027)≈U→C(0.0028). While this is approximately the opposite of the Qß bias, it could turn out to be very similar in terms of preferred mispairs depending on which mispairs generated the φ6 mutations.
A (+)-strand tobacco etch potyvirus spectrum [11], obtained by means of a target-complementation system in trans, was probably contaminated with mutations arising during reverse transcription and PCR amplification of the isolated virus, with no way to sort out which mutations were of viral and which of processing origin because the existence of any relation between the detected mutations and a mutant phenotype was not examined.
For the (+)-strand hepatitis C virus, mutation sampling in vivo based on deep sequencing of plasma samples from untreated patients revealed an approximately 75-fold ratio of transitions to transversions with all four transitions present at similar frequencies, whereas kinetic studies in vitro of the viral replicase revealed a strong bias in favor of G•U and U•G mismatches [52], although mismatch extension efficiencies were not much explored. This contrast resembles that seen in numerous reversion systems, which are typically limited to small numbers of templates that tend towards often strongly site-specific mismatching biases.

Taken together, the informative parts of these spectra indicate that riboviral mutation spectra differ from those characteristic of DNA viruses and cellular organisms in displaying many more transitions than transversions and an even smaller proportion of indels.

The Mode of Genome Replication

With the aims of determining the way in which mutations accumulate during riboviral replication and estimating the rate of spontaneous mutation per genome replication, we investigated the mode in which Qß replicates its genome. Results from two independent single-burst reversion tests indicated that this mode is essentially linear, with the genome of each Qß progeny resulting from only two replications: from the original parental (+) strand to a (−) strand and then to a new (+) strand. Our results further suggest that most replication errors occur during the second round of replication, which in turn reveals the specific mismatches that produced the substitutions in the mutational spectrum.

An interesting result is that the distribution of RT⁺ revertants deviated significantly from the expected Poisson distribution for RT_SUB but not for RT_IN. With no reason to suspect that the two strains replicate their genomes differently, this discrepancy may reflect intrinsic differences between their reversion targets. The reversion target in RT_IN is the same in both rounds of replication, while reversion in RT_SUB may occur through up to 8 different single-base-substitution pathways in each round of replication. Thus, reversion rate asymmetries between the two rounds of replication may be anticipated for RT_SUB, allowing some reversion to occur during the first round of replication and thus producing some revertant clones of size >1 during the second round.

The mode of genome replication in riboviruses has been addressed in only a few instances. Using the single-burst reversion test, a predominantly linear mode was reported for the phage φ6 [8]. In that study, however, the observed distribution of mutants (i.e., revertants) differed somewhat from the expected Poisson for a linear mode of replication, suggesting an exponential component in the replication dynamics that was estimated to generate ≈1% of the total progeny [8]. Such discrepancies between observed and Poisson distributions may occur because of sampling errors or the presence of a small exponential component. A way to discriminate among these is to plot the logarithm of the cumulative frequency distribution of observed mutants against the logarithm of the sizes of the mutant classes [53]. Figure 5 shows such plot for T2, φ6, RT_IN, and RT_SUB. In a log-log plot, exponential replication will display a linear relationship between the cumulative distribution of mutants and the number of mutants per class, with a slope close to −1. In linear replication, however, the plot will not be linear and the slope for the cumulative distribution will be steeper because most mutants arise in clones of size 1. In agreement with this reasoning, the data for T2, which replicates exponentially [42], exhibits a linear relationship with slope −1.20±0.03, based on the sum of the r and the w mutants and excluding all classes containing ≥16 mutants (i.e., classes starting to approach the T2 burst size), while the plots for φ6, RT_IN, and RT_SUB display nonlinear relationships.

Double-logarithmic plots of the relative cumulative frequency distributions of mutants for T2, φ6, RT<sub>IN</sub>, and RT<sub>SUB</sub>. — **Fig. 5. Double-logarithmic plots of the relative cumulative frequency distributions of mutants for T2, φ6, RT_IN, and RT_SUB.**

In a recent report [9], the dynamics of (+)-strand and (−)-strand accumulation during cell infection were quantitatively analyzed for the (+)-strand RNA turnip mosaic virus using strand-specific quantitative real-time PCR. The results indicated that the virus replicates its genome in a mostly linear mode, in agreement with other quantitative results from in silico modeling of the optimal riboviral replication strategy in response to the error rate and the availability of resources, among other parameters [45], [54]. However, the continuous accumulation of turnip mosaic virus (−) strands throughout infection suggests that a purely linear mode of replication may have been unlikely; indeed, the occurrence of a trace of exponential replication was reported. While we cannot exclude a trace of exponential replication in the case of Qß, our results suggest that the RT_SUB revertant distribution may depart from a Poisson distribution mostly due to asymmetries in the reversion rates at the first and the second rounds of replication.

Overall, the empirical data gathered to date on the riboviral mode of replication indicate that, regardless of the single -⁠ or double-stranded genome structure of the virus, the strategy of preference is mainly linear. The advantages that this mode of replication may confer to riboviruses over an exponential mode have been evaluated previously (e.g., [8]).

The Spontaneous Mutation Rate

Our results provide several independent estimates of the spontaneous Qß mutation rate per genome replication (μ_g). The first is based mainly on the small set of three nonsense mutations detected among 30 RT mutants. Because many base substitutions do not produce a detectable phenotype, the estimation of the μ_SUB fraction of μ_g = (μ_SUB+μ_IN)G from the frequency of nonsense mutations is a preferred method because nonsense mutations are highly detectable and their target size is easy to determine from the codon composition of the mutation target. However, this method has two drawbacks: nonsense mutations are typically a small fraction of all substitutions, so that sufficient mutants must be harvested and sequenced for a reliable estimate [46]; and no substitutions to C (in the coding strand) can generate a stop codon, so that the average rate per base from the nonsense-generating pathways must be assumed to apply to all pathways. Using the nonsense-mutation method and adding the small component due to indel mutagenesis, the Qß genomic rate was estimated to be μ_g = 0.039 per replication or about 0.08 per infection cycle. For this nonsense-based estimate of μ_g, RT mutants were collected after one-step growth of wt Qß, so that c = 1 in the calculations. While some prior RT mutations may have arisen during the growth of the wt Qß stocks in non-complementing NR16205 lawns, lethal indels and nonsense mutations would have been subjected to strong negative selection. In some riboviruses, mutants bearing lethal mutations can grow in the presence of complementation in trans provided by a plasmid (e.g., this study) or by a gene inserted into a host chromosome (e.g., [10]), and thus it may be assumed that complementation can also be provided by a co-infecting wild-type phage, which means that even de novo Qß mutants carrying an RT⁻ mutation may have expanded during the growth of the original wt Qß stocks, rendering 1<c≤3. In the case where only one RT⁻ mutant and one wild-type co-infect the same host cell and up to 50% of the resulting progeny are RT⁻ mutants, the consequent selection coefficient (s) of the RT⁻ mutant will be 0.50 per infection cycle. Applying the method described in Burch et al. [6] to estimate the effect of selection within the plaque and considering μ_g = 0.039, the probability of loss of an RT⁻ mutant with s≥0.50 arising in the first infection cycle of wt Qß on a host lawn would be ≥42% at the end of the growth phase (Figure S1). However, previous reports [22], [55] indicate that co-infection by distinct Qß mutants and consequent complementation occur at low to undetectable frequencies. Even if any RT⁻ mutation arose during the last cycle of growth in NR16205 lawns, the small fraction of each wt Qß isolate used to establish the one-step lysates further reduced the frequency of preexisting RT⁻ mutations in our starting wt Qß populations.

The second method for estimating μ_g is based on the single-burst reversion tests. Here, the mutation rate is based on the null-class method [47]. For RT_IN, μ_target = 1.37×10⁻⁶ and the number of mutating bases in the four-base duplication may be taken as 4 (although it may be argued to be 1); then μ_INg = (4217/4)(1.37×10⁻⁶) = 0.0014. For RT_SUB, μ_target = 2.46×10⁻⁵ and the potential number of mutating bases is 3; then μ_SUBg = (4217/3)(2.46×10⁻⁵) = 0.035. The sum of the indel and the substitution rates is μ_g = 0.036 (0.041 if the indel reversion target size is taken as 1), a value (perhaps deceptively) close to that of μ_g = 0.039 calculated from the spectrum.

The third method for estimating μ_g applies only in cases where the mutations are not required to produce a mutant phenotype, which can arise when a target is sequenced without regard to phenotype (provided the mutation is not a dominant lethal) or when, as in the present case, hitchhiker mutations arise secondarily to and in combination with a driver mutation, the target then consisting of the entire sequence of the mutation reporter. Hitchhikers could arise during any of the roughly 3 infection cycles that generate a Qß plaque but, in order to be detected, most would have to arise in the first cycle with μ_g = 0.287. This value may be an underestimate because the low ratio (6/7) of missense mutations to synonymous mutations among the secondary RT mutations suggests significant selection against missense mutations during plaque growth even in RTH lawns; if the hitchhikers were a random set, then missense mutations would comprise about ¾, or 9.75, of 13 substitutions. In fact, RT_IN has an average burst size in RTH cells that is 2.9-fold smaller than that of wt Qß, a difference that implies a selection coefficient of s≈0.65 per infection cycle. An RT mutant with s≥0.65 arising with μ_g = 0.287 in the first infection cycle on a RTH lawn would have a ≥40% probability of being lost by the end of the growth phase (Figure S1). Thus, μ_g = 0.287 estimated from hitchhikers might be significantly underestimated.

The presence of more mutants with multiple mutations than expected from a random distribution is remarkably widespread among DNA and RNA genomes and is probably more often due to transient hypermutation caused by some temporary perturbation of replication-fidelity factors than due to mutator mutations [56], [57]. A notable example is the considerably higher frequency of mutations than expected among mutants already carrying a driver mutation produced by the replicase of the DNA phage RB69 [56]. However, because the Qß replicase gene occupies 42% of the genome and the estimated μ_g is high, we considered that some mutants might have arisen in a mutator background and then gone on to produce hitchhikers at an increased frequency. Therefore, we examined whether the gene encoding the ß subunit of the Qß replicase harbored mutations in the 28 RT mutants carrying detectable primary mutations and their four parental wild-types. We observed three T→C substitutions (at ß-subunit position 75 of mutant RT32, position 550 of RT42, and position 1668 of RT20) but all were synonyms, so that replicase mutators were apparently not impacting our set of RT mutations. Instead, the excess of secondary mutations among our RT mutants may have arisen by the action of an abnormal replicase ß subunit produced by an error of translation or protein conformation.

Among the three values, our best estimate of μ_g was obtained by the nonsense-mutation method. While the rate obtained from the reversion tests was similar, its accuracy depends on the extent to which the two mutants fairly sample the whole genome, and the similarity may have been fortuitous. Both selection and transient hypermutation may have played an important role in the production of the RT mutations considered in the third method. Unfortunately, even our favored μ_g estimate is based on small samples of mutations (3 nonsense and 2 indels), which enlarges the margin of potential sampling error. When our first and second μ_g estimates are combined with the fraction (0.4) of random mutations that are lethal for the (−)-strand-RNA vesicular stomatitis virus [58], (0.075 mutations per infection cycle ×591 nt per target ×7517 targets tested ×0.4 of mutations detectable)/4217 nt per genome = 32 RT mutants, in close agreement with the 30 observed and providing modest further support for μ_g≈0.04.

While the mutation rates per genome replication estimated here and reported for TMV [10] and phage φ6 [6] are all in the neighborhood of 0.04, rates for mammalian riboviruses center around 0.7 and display a wide range [7]. However, the latter rates were based on tiny targets often consisting of a single base or pathway and may have been reported because they were large and thus more easily measured; alternatively, as has been frequently suggested, immune surveillance in mammals may drive higher mutation rates. It is interesting that while the mutation frequency can be increased over the background with nitrous acid by up to 80-fold in tobacco mosaic virus with retention of some viability [59], it can be increased only about 2.5-fold in poliovirus and vesicular stomatitis virus before extinction begins [60], suggesting that mammalian riboviruses do indeed sustain mutation rates substantially higher than those of phage and plant riboviruses. Finally, although co-infection and complementation do not seem to occur at a detectable frequency with Qß, it may occur with other riboviruses, perhaps somewhat elevating mutant frequencies and thus causing mutation rates to be overestimated.

Materials and Methods

Plasmids, Bacterial Strains, and Growth Media

Plasmids and bacterial strains are listed in Table 1. All three pQß plasmids express the indicated Qß components constitutively and have been described [35], [36], [61]. The RT_IN mutant carries a tandem duplication of 2158-UUAA-2161 that corresponds to 416–419 in the target sequence. Cell transformations with the plasmids were performed using CaCl₂ [62]. Unless otherwise indicated, RTH cells were grown in Luria-Bertani medium (LB) supplemented with 2 mM CaCl₂ and 100 µg/ml trimethoprim (TMP), while NR16205 cells were grown in LB containing 15 µg/ml tetracycline. Cells and phages were plated using LB bottom agar with 2.0% Bacto agar. The top agar was always made up in distilled water. For counting plaques or scoring mutants, the top agar contained 0.4% Sigma-Aldrich Noble agar; for other uses, it contained 0.8% Bacto agar. All growth was at 37°C.

One-Step Growth

NR16205 cells were transformed with pQßm100 (which expresses wt Qß) and plated on NR16205 lawns to yield wt Qß plaques, which were independently harvested into tubes containing 1 ml D broth (0.2% Bacto tryptone, 0.5% NaCl) and 25 µl of chloroform. For one-step growth curves in RTH cells, 10 µl of phage suspension from a wt Qß isolate was mixed with 1 ml of cells at OD₆₀₀≈0.5 (10⁸ cells/ml) at a multiplicity of infection (MOI)≈0.01 for 20 min at room temperature, centrifuged to remove non-adsorbed phages, resuspended, and serially diluted in LB+TMP. Samples diluted 10³ -⁠ and 10⁵-fold were held for 3 h at 37°C with gentle shaking, and 100-µl aliquots were removed from each dilution every 10 min and plated with RTH cells. Plates were incubated overnight and the follow-on titers were used to estimate Qß densities over time. Three one-step growth experiments were conducted in parallel for each wt Qß isolate used to generate one-step lysates. Visual inspection of the resulting curves sufficed to determine the time (≈75 min) for Qß to complete one infection cycle in RTH cells. These one-step curves were also used to estimate the burst size of wt Qß in RTH cells according to the protocol detailed for RT_IN and RT_SUB (see Table S1).

Qß Replication Mode

The distribution of RT⁺ revertants among RT⁻ bursts was monitored as in a previous study [8] using two different RT⁻ mutants (RT_IN and RT_SUB as described in Table 1). Preliminary measurements provided their burst sizes (Figure S2, Table S1) and revertant frequencies, which are needed to conduct the burst experiments. Ten and five independent experiments were carried out with RT_SUB and RT_IN, respectively, and ≈500 RT⁺ revertants were scored per mutant. In each experiment, ≈10⁶ phages were added to 1 ml of RTH cells at OD₆₀₀≈0.5. After 20 min of adsorption at room temperature, the mixture was centrifuged for 1 min at 8,000 g and the pellet was resuspended in 1 ml LB broth. From the supernatant, 100 µl were collected to estimate the amount of non-adsorbed phages. The resuspended pellet was further diluted and 50 aliquots of 100 µl each were distributed into individual tubes, where infection was allowed to continue for ≈75 min at 37°C and then stopped with 15 µl of dichloromethane. Lysates were aerated for 30 min at 37°C to allow the dichloromethane to evaporate and their entire volumes were then independently plated on NR16205 lawns. The observed distributions of RT⁺ revertants were compared to the expected Poisson distributions using G-tests for goodness-of-fit.

Isolating and Sequencing Spontaneous RT Mutants

To limit the number of infection cycles to one before seeking spontaneous mutants, RT mutants were scored among the progeny of one-step growth of wt Qß in RTH cells. RTH cells were infected with wt Qß (MOI≈0.01) as above and one-step lysates were recovered by adding chloroform after 75 min of growth. Samples from the lysates were plated on RTH lawns at ≈70 plaques per plate and well-isolated plaques were independently sampled into 96-well plates containing 0.6 ml D-broth per well (reserving six un-inoculated wells as cross-contamination controls). For each of four independent lysates, three different rounds of 630 isolations each were performed. In each round, a control plate containing 8 wt and 82 RT_IN isolates was also established to confirm the ability of RTH cells to complement RT⁻ mutants and the inability of any RTH cells remaining in the isolates to grow in LB supplemented with tetracycline. Isolates were spotted in parallel on lawns of NR16205 and RTH cells using a 6×8-array replica plater. After a few losses, a total of 7517 plaques were tested. Isolates that grew poorly in NR16205 cells were re-tested in both bacterial strains and the RT-coding genes of two independent sub-isolates per putative RT mutant were sequenced. After this first round of sequencing, two additional sub-isolates as well as the original isolate were sequenced for each verified RT mutant. The original wt Qß isolate used to develop each lysate and two sub-isolates of it were also sequenced.

Plate lysates were prepared from RTH cells (0.25 ml at OD₆₀₀≈0.5) mixed with phages at MOI≈0.1 in Noble top agar. After overnight incubation, the plates were covered with 7 ml of SM buffer with gelatin [62] and were gently rocked for 30 min. The SM buffer was recovered and 100 µl of chloroform were added to each sample. Cell debris was removed by centrifugation at 12,000 g for 10 min. The supernatant was supplemented with polyethylene glycol (PEG 8000) to 10% w/v and NaCl to 1 M, incubated for 1 h on ice, and centrifuged at 3,000 g for 15 min at 4°C [63]. The pellets were resuspended in 2 ml of 10 mM MgSO₄, 10 mM Tris-HCl, pH 8, and the resulting concentrated phages were used as sources for RNA purification. Phage RNA was isolated using the QIAamp Viral RNA Mini Kit. From the extracted RNAs, 10 µg were then treated with DNase I (New England BioLabs) to degrade residual host DNA. The DNase-treated product was purified using the RNAeasy Mini Kit. From the purified RNA, 1 µg was subjected to reverse transcription with the Omniscript RT Kit and about 25 ng of the RT product was amplified with PfuTurbo DNA polymerase (Stratagene). PCR products were confirmed by agarose gel electrophoresis, purified with the QIAquick PCR Purification Kit, and sequenced using BigDye Terminator v3.1 (Applied Biosystems). All kits were purchased from Qiagen and were used according to the manufacturer's recommendations. Sub-isolates showing secondary mutations were subjected to a second round of RT, amplification and sequencing. The primers utilized in the RT, PCR, and sequencing reactions and the PCR cycling parameters are listed in Table S2.

Statistical Analyses

The RT and genome base compositions were compared using the G-test of independence. This test was also applied to compare the observed distributions of RT⁺ revertants among the single-burst reversion tests conducted with each of two different RT⁻ mutants, RT_SUB and RT_IN. The G-test for goodness of fit was used to compare the observed and expected Poisson distributions of RT⁺ revertants among RT⁻ single-bursts, and the replicated G-test for goodness of fit was applied to compare the G+C content of the local sequence environment (six to seven bases upstream) of the base substitutions observed in RT with the expected content according to the base composition of the whole gene. When applying this last test, each upstream position (from +1 to +6 or +7) was considered as an independent replicate. All tests were performed as per Sokal and Rohlf [64].

Supporting Information

Zdroje

1. HolmesEC 2009 The evolution and emergence of RNA viruses Oxford University Press 254

2. DomingoE 2010 Mechanisms of viral emergence. Vet Res 41 38

3. DomingoEHollandJJ 1997 RNA virus mutations and fitness for survival. Annu Rev Microbiol 51 151 178

4. DrakeJWHollandJJ 1999 Mutation rates among RNA viruses. Proc Natl Acad Sci U S A 96 13910 13913

5. SanjuánRNebotMRChiricoNManskyLMBelshawR 2010 Viral mutation rates. J Virol 84 9733 9748

6. BurchCLGuyaderSSamarovDShenH 2007 Experimental estimate of the abundance and effects of nearly neutral mutations in the RNA virus φ6. Genetics 176 467 476

7. DrakeJW 1993 Rates of spontaneous mutation among RNA viruses. Proc Natl Acad Sci U S A 90 4171 4175

8. ChaoLRangCUWongLE 2002 Distribution of spontaneous mutants and the inferences about the replication mode of the RNA bacteriophage φ6. J Virol 76 3276 3281

9. SardanyésJMartínezFDaròsJ-AElenaSF 2011 Dynamics of a plant RNA virus intracellular accumulation: stamping machine vs. geometric replication. Genetics 188 637 646

10. MalpicaJMFraileAMorenoIObiesCIDrakeJW 2002 The rate and character of spontaneous mutation in an RNA virus. Genetics 162 1505 1511

11. TromasNElenaSF 2010 The rate and spectrum of spontaneous mutations in a plant RNA virus. Genetics 185 983 989

12. DasATBerkhoutB 2010 HIV-1 evolution: frustrating therapies, but disclosing molecular mechanisms. Phil Trans R Soc B 365 1965 1973

13. AndersonJPDaifukuRLoebLA 2004 Viral error catastrophe by mutagenic nucleosides. Ann Rev Microbiol 58 183 205

14. WeissmannC 1974 The making of a phage. FEBS Lett 40 S10 S18

15. WoodyMACliverDO 1995 Effects of temperature and host cell growth phase on replication of F-specific RNA coliphage Qß. Appl Environ Microbiol 61 1520 1526

16. WoodyMACliverDO 1997 Replication of coliphage Qß as affected by host cell number, nutrition, competition from insusceptible cells and non-FRNA coliphages. J Appl Microbiol 82 431 440

17. TsukadaKOkazakiMKitaHInokuchiYUrabeI 2009 Quantitative analysis of the bacteriophage Qß infection cycle. Biochim Biophys Acta 1790 65 70

18. GarwesDSilleroAOchoaS 1969 Virus-specific proteins in Escherichia coli infected with phage Qß. Biochim Biophys Acta 186 166 172

19. RadloffRJKaesbergP 1973 Electrophoretic and other properties of bacteriophage Qß: the effect of a variable number of Read-Through proteins. J Virol 11 116 128

20. BlumenthalTCarmichaelGG 1979 RNA replication: function and structure of Qß-replicase. Ann Rev Biochem 48 525 548

21. EigenMBiebricherCKGebinogaMGardinerWC 1991 The hypercycle. Coupling of RNA and protein biosynthesis in the infection cycle of an RNA bacteriophage. Biochemistry 30 11005 11018

22. HoriuchiKMatsuhashiS 1970 Three cistrons in bacteriophage Qß. Virology 42 49 60

23. VollenweiderHJKollerTWeberHWeissmannC 1976 Physical mapping of Qß replicase binding sites on Qß RNA. J Mol Biol 101 367 377

24. EdlindTDBasselAR 1977 Secondary structure of RNA from bacteriophages f2, Qß and PP7. J Virol 24 135 141

25. TakamatsuHIsoK 1982 Chemical evidence for the capsomeric structure of phage Qß. Nature 298 819 824

26. SchuppliDBarreraIWeberH 1994 Identification of recognition elements on bacteriophage Qß minus strand RNA that are essential for template activity with Qß replicase. J Mol Biol 243 811 815

27. GolmohammadiRFridborgKBunduleMValegårdKLiljasL 1996 The crystal structure of bacteriophage Qß at 3.5 Å resolution. Structure 4 543 554

28. BeekwilderMJNieuwenhuizenRvan DuinJ 1995 Secondary structure model for the last two domains of single-stranded RNA phage Qß. J Mol Biol 247 903 917

29. BeekwilderJNieuwenhuizenRPootRvan DuinJ 1996 Secondary structure model for the first three domains of Qß RNA. J Mol Biol 256 8 19

30. InokuchiYKajitaniM 1997 Deletion analysis of Qß replicase. J Biol Chem 272 15339 15345

31. KidmoseRTVasilievNNChetverinABRom AndersenGKnudsenCR 2010 Structure of the Qß replicase, an RNA-dependent RNA polymerase consisting of viral and host proteins. Proc Natl Acad Sci U S A 107 10884 10889

32. TakeshitaDTomitaK 2010 Assembly of Qß viral RNA polymerase with host translational elongation factors EF-Tu and -Ts. Proc Natl Acad Sci U S A 107 15733 15738

33. HofstetterHMonsteinH-JWeissmannC 1974 The readthrough protein A1 is essential for the formation of viable Qß particles. Biochim Biophys Acta 374 238 251

34. Van DuinJTsarevaN 2006 Single-stranded RNA phages. CalendarR The bacteriophages. 2nd edition Oxford University Press 175 196

35. PrianoCAroraRButkeJMillsDR 1995 A complete plasmid-based complementation system for RNA coliphage Qß: three proteins of bacteriophages Qß (Group III) and SP (Group IV) can be interchanged. J Mol Biol 249 283 297

36. AroraRPrianoCJacobsonABMillsDR 1996 cis-Acting elements within an RNA coliphage genome: fold as you please, but fold you must!! J Mol Biol 258 433 446

37. StreisingerGOkadaYEmrichJNewtonJTsugitaA 1968 Frameshift mutations and the genetic code. Cold Spring Harbor Symp Quant Biol 31 77 84

38. BebenekKKunkelTA 2000 Streisinger revisited: DNA synthesis errors mediated by substrate misalignments. Cold Spring Harbor Symp Quant Biol 65 81 92

39. HallidayJAGlickmanBW 1991 Mechanisms of spontaneous mutation in DNA repair-proficient Escherichia coli. Mutat Res 250 55 71

40. DrakeJW 2012 Contrasting mutation rates from specific-locus and long-term mutation-accumulation procedures. G3 2 483 485

41. BebenekADressmanHKCarverGTNgSPetrovV 2001 Interacting fidelity defects in the replicative DNA polymerase of bacteriophage RB69. J Biol Chem 276 10387 10397

42. LuriaSE 1951 The frequency distribution of spontaneous bacteriophage mutants as evidence for the exponential rate of phage reproduction. Cold Spring Harbor Symp Quant Biol 16 463 470

43. RonenARahatA 1976 Mutagen specific and position effects on mutation in T4rII nonsense sites. Mut Res 34 21 34

44. BilleterMALibonatiMViñuelaEWeissmannC 1966 Replication of viral ribonucleic acid. X. Turnover of virus-specific double-stranded ribonucleic acid during replication of phage MS2 in Escherichia coli. J Biol Chem 241 4750 4757

45. ThébaudGChadoeufJMorelliMJMcCauleyJWHaydonDT 2010 The relationship between mutation frequency and replication strategy in positive-sense single-stranded RNA viruses. Proc R Soc B 277 809 817

46. DrakeJW 2009 Avoiding dangerous missense: thermophiles display especially low mutation rates. PLoS Genet 5 e1000520 doi:10.1371/journal.pgen.1000520

47. LuriaSEDelbrückM 1943 Mutations in bacteria from virus sensitivity to virus resistance. Genetics 28 491 511

48. WagnerJNohmiT 2000 Escherichia coli DNA polymerase IV mutator activity: genetic requirements and mutational specificity. J Bacteriol 182 4587 4595

49. SchultzGEJrCarverGTDrakeJW 2006 A role for replication repair in the genesis of templated mutations. J Mol Biol 358 963 973

50. LangGIMurrayAW 2008 Estimating the per-base-pair mutation rate in the yeast Saccharomyces cerevisiae. Genetics 1789 67 82

51. FraserJLANeillEDaveyS 2003 Fission yeast Uve1 and Apn2 function in distinct oxidative damage repair pathways in vivo. DNA Repair 2 1253 1267

52. PowdrillMHTchesnokovEPKozakRARussellRSMartinR 2011 Contribution of a mutational bias in hepatitis C virus replication to the genetic barrier in the development of drug resistance. Proc Natl Acad Sci U S A 108 20509 20513

53. DenhardtDTSilverRB 1966 An analysis of the clone size distribution of φX174 mutants and recombinants. Virology 30 10 19

54. RegoesRRCrottySAntiaRTanakaMM 2005 Optimal replication of poliovirus within cells. Am Nat 165 364 373

55. LingCMHungPPOverbyLR 1970 Independent assembly of Qß and MS2 phages in doubly infected Escherichia coli. Virology 40 920 929

56. DrakeJWBebenekAKisslingGEPeddadaS 2005 Clusters of mutations from transient hypermutability. Proc Natl Acad Sci U S A 102 12849 12854

57. DrakeJW 2007 Mutations in clusters and showers. Proc Natl Acad Sci U S A 104 8203 8204

58. SanjuánRMoyaAElenaSF 2004 The distribution of fitness effects caused by single-nucleotide substitutions in an RNA virus. Proc Natl Acad Sci U S A 101 8396 8401

59. GiererAMudryKW 1958 Production of mutants of tobacco mosaic virus by chemical alteration of its ribonucleic acid in vitro. Nature 182 1457 1458

60. HollandJJDomingoEde la TorreJCSteinhauerDA 1990 Mutation frequencies at defined single codon sites in vesicular stomatitis virus and poliovirus can be increased only slightly by chemical mutagenesis. J Virol 64 3960 3962

61. MillsDRPrianoCMerzPABinderowBD 1990 Qß RNA bacteriophage: mapping cis-acting elements within an RNA genome. J Virol 64 3872 3881

62. SambrookJRussellDW 2001 Molecular cloning: a laboratory manual. 3rd edition Cold Spring Harbor (New York) Cold Spring Harbor Laboratory Press 999

63. YamamotoKRAlbertsBMBezingerRLawhorneLTreiberG 1970 Rapid bacteriophage sedimentation in the presence of polyethylene glycol and its application to large-scale virus purification. Virology 40 734 744

64. SokalRRRohlfFJ 1995 Biometry. 3rd edition W.H. Freeman and Company 887

65. KozminSGPavlovYIDunnRLSchaaperRM 2000 Hypersensitivity of Escherichia coli Δ(uvrB-bio) mutants to 6-hydroxylaminopurine and other base analogs is due to a defect in molybdenum cofactor biosynthesis. J Bacteriol 182 3361 3367