Abstract
Stochastic phenotype switching has been suggested to play a beneficial role in microbial populations by leading to the division of labour among cells, or ensuring that at least some of the population survives an unexpected change in environmental conditions. Here we use a computational model to investigate an alternative possible function of stochastic phenotype switching – as a way to adapt more quickly even in a static environment. We show that when a genetic mutation causes a population to become less fit, switching to an alternative phenotype with higher fitness (growth rate) may give the population enough time to develop compensatory mutations that increase the fitness again. The possibility of switching phenotypes can reduce the time to adaptation by orders of magnitude if the “fitness valley” caused by the deleterious mutation is deep enough. Our work has important implications for the emergence of antibiotic-resistant bacteria. In line with recent experimental findings we hypothesise that switching to a slower growing but less sensitive phenotype helps bacteria to develop resistance by exploring a larger set of beneficial mutations while avoiding deleterious ones.
Introduction
Biological evolution relies on two mechanisms which are instrumental in natural selection: preferential survival of better adapted individuals (selection) and variations among individuals (phenotypic variation). One of the sources of phenotypic variability is genetic alteration due to mutations and recombination. However, even genetically identical organisms will often behave differently because the same genotype may lead to many different phenotypes - observable traits of an organism - as a result of environmental factors and the organism's history. Although ubiquitous and easily observed in animals and plants, phenotypic diversity can already be amply demonstrated in microorganisms. Examples range from different cell sizes depending on growth medium [48], through bistability in utilization of different food sources [34], to diversification between motile/non-motile cells [20]. Microorganisms are often able to switch between these phenotypes in response to a change in external conditions such as the arrival of a new food source or depletion of the currently used one. A typical example is diauxic shift - a switch to another food source, for example from glucose to cellobiose in L. lactis when glucose becomes depleted [40], which involves altering gene expression levels without changing the genetic code.
Some microorganisms switch seemingly randomly between two or more phenotypes even in the absence of external stimuli. This causes the population to become phenotypically heterogeneous. Several explanations have been proposed as to why stochastic phenotype switching has evolved [1]. One of them is the division of labour [2] in which different microbial cells perform different functions and this maximizes the benefit to the population. Another theory, called bet hedging [6, 25], proposes that in a fluctuating and unpredictable environment it pays to have a fraction of the population in a different state, which is perhaps maladapted to the present environment but better suited to possible future environments. Since only a small fraction of the population expresses the maladapted phenotype at any one time, this strategy conserves resources while allowing the population to stay prepared for an unexpected change. Examples include bacterial persisters [4, 24], flu(Ag43)/fim switch [3, 4, 19, 43] and competence to non-competence switching in the bacterium B. subtilis [28]. Animal cells also exhibit this behaviour. Epithelial-to-mesenhymal transition - a landmark in cancer progression - is thought to be a phenotypic, epigenetic change [32]. There is also some evidence that phenotypic switching may be involved in resistance to chemotherapy [21], although this remains controversial [49].
Here we investigate an alternative possible role for the evolution of phenotype switching in microbes: the facilitation of genetic evolution in a static environment. Previous theoretical work [27] has demonstrated that switching to a persistent phenotype can provide a larger pool of bacteria to evolve genetic resistance to antibiotics. That work only addressed the situation in which resistance is caused by a single, beneficial mutation. However, a significant change of fitness, defined here as a measure of a microorganism’s growth rate, often requires multiple mutations. Although many fitness landscapes have at least one accessible pathway [14, 45], along which the fitness increases monotonically, in some cases a fitness valley - a genotype with a lower fitness than all its neighbours - is unavoidable. For example, developing resistance to the antibiotic streptomycin involves a fitness cost which must be counteracted by compensatory mutations, i.e. subsequent mutations that increase the microorganism’s fitness to at least that of the ancestral strain [29, 37, 38].
Here we study evolution in the presence of such a fitness valley, in which a population must acquire two mutations to reach the best-adapted genotype. The first mutation is deleterious and corresponds to a valley in the fitness landscape that is crossed by a second, compensatory mutation. Theory and computer modelling suggests that fitness valleys significantly affect biological evolution [9, 41, 46, 47], often, but not always, slowing it down. However, in our model cells can also switch to an alternative phenotype in which the fitness landscape is flat. We show that this switching enables the population to avoid the costly valley, even if the alternative phenotype is less fit than the initial state. We also demonstrate the existence of an optimal range for the switching rate. In this range the time to evolve the best-adapted genotype can be reduced by many orders of magnitude compared to the case without switching. Finally, we show that if the switching rate is allowed to evolve it will converge to values within this optimal range.
MODEL
We consider a population of haploid cells. Each cell can be in one of two phenotypic states, A and B, and can assume one of three genotypes (“genetic states”), as shown in Fig. 1A,B. Cells replicate stochastically with rate ri (1 – N/K), where i labels one of the six possible states, N is the total population size and K is the carrying capacity of the environment. The logistic-like factor (1 – N/K) causes growth to cease when N becomes as large as K, which limits the total population size. During replication a cell will produce a mutated offspring of an adjacent genotype with probability μ. Mutation does not change the phenotypic state. All cells switch randomly between the states A and B at the symmetric switching rate α. Cells are randomly removed from the population at a constant death rate d. If the carrying capacity K is small (K = 10 · · · 104) the model is appropriate to describe a small microbial population growing in a microfluidic chemostat with constant dilution rate [5]. For larger K (K = 104 · · · 109) the model is relevant to populations cultured in mesoscopic (cm-size) chemostats.
The population initially consists of all individuals of type 1A, i.e. of genotype 1 in phenotypic state A, which has the growth rate r1A = 1 (arbitrary units). The initial population size, unless otherwise stated, is equal to the equilibrium size (1 – d)K. State 3A is the global maximum of both fitness landscapes with the growth rate r3A = 1 + S, where S > 0 is its selective advantage over the ‘wild-type’ 1A (Fig. 1C). Such a beneficial mutant has the probability of fixation, from a single individual in a population of cells in state 1A, approximately given by [33] when 1/N ≪ S ≪ 1. We are interested in the adaptation time T that it takes for the population to evolve the first individual in state 3A, conditioned on the population not going extinct (otherwise the time would be infinite).
We shall begin by considering the case in which the growth rate r2A = 1 – δ, while the growth rates r1B = r2B = r3B = 1, which leads to a “fitness valley” in phenotype A and a flat fitness landscape in phenotype B (Fig. 1C). In the absence of phenotype switching the only way a population in state 1A can evolve an individual in state 3A is to go through the low-fitness state 2A. However, phenotype B allows the population to traverse an alternative route that avoids state 2A.
RESULTS
Optimal range of the switching rate exists in which evolution is fastest
Figure 2A shows simulation results examining the effect of phenotype switching on the adaptation time T for a range of the parameters of the model. In all cases the presence of low frequency switching, i.e. small α, decreases the adaptation time by orders of magnitude. Holding all other parameters fixed and changing only the switching rate reveals an optimal range for α (Fig. 2B) in which T is minimized provided μ is sufficiently small. For the smallest mutation probability (μ = 10−5) considered in Fig. 2B the minimal adaptation time is approximately two orders of magnitude lower than in the absence of switching (α = 0), or when switching is frequent (α very large). Only if the mutation rate is unrealistically large does switching not speed up evolution (SI and Fig. S1).
In the absence of transitions between states 2A and 2B (i.e. removing this step from the model in Fig. 1A) the adaptation time decreases monotonically with α (SI and Fig. S2).
Fastest evolutionary trajectories avoid the valley
To understand the evolutionary trajectory selected in the optimal range of switching frequency we examined the histories of successful cells, i.e. the states from Fig. 1A visited during evolution from state 1A to the final state 3A. We then represented each cell's trajectory in the state space as a sequence of links connecting the visited states (Fig. 3A). These were then classified as one of 21 classes by grouping sequences with multiple (back-and-forth) transitions between the states together with those without (Fig. 3A). Fig. 3B shows the most probable trajectory as a function of μ and α. As expected, simulated evolution favours different trajectories depending on the values of μ, α. The (μ, α)-space can be approximately separated into three regions, each corresponding to one or more trajectory classes. Region 1 corresponds to trajectories that go through the fitness valley at 2A. In this region mutations are frequent enough to offset the loss of fitness incurred when passing through the valley. In region 2 the dominant trajectory avoids the valley by switching to phenotype B. This region corresponds to the fastest adaptation times from Fig. 2B. Region 3 is characterised by a mixture of trajectory types using both phenotype B and visiting state 2A. This region corresponds to large α which not only enables transitions to phenotype B but also from state 2B to the deleterious state 2A. The latter is responsible for the increase in the adaptation time for large a in Fig. 2B. A similar decomposition into three regions can be obtained from the probability of individual transitions occurring between the states of the system (Fig. S3).
We next examined which trajectories would be preferred if we varied K instead of μ, as the carrying capacity is more likely to vary significantly in real microbial populations and is much easier to control experimentally than the mutation rate. Since we ascertained earlier (cf. the previous paragraph) that the time to adaptation is most significantly affected by whether the trajectory visits state 2A or not, we further reduced the number of trajectory classes to just two (Fig. 3C): the “u” (upper) trajectory type that avoids state 2A or the “v” (valley) trajectory type that visits it. We found a large region in the space (K, α) that favours trajectories that avoid the state 2A (Fig. 3D). This region extends to large values of K ∽ 109. The adaptation time depends non-monotonically on the population size K (Fig. S4); similar non-monotonic behaviour has been seen in models without phenotype switching [13, 31].
Results are robust to small fitness costs of phenotype switching
So far we have assumed that the alternative phenotype B has the same fitness as the wild-type state 1A. We shall now consider the case in which phenotype B is less adapted to the environment than the state 1A. This is a common scenario; for example, phenotypes more resistant to antibiotics often have a lower growth rate than susceptible phenotypes do in the absence of the drug [4]. To model this, we assume that the growth rates r1B = r2B = r3B = 1 – c, where c > 0 is the fitness cost of switching to phenotype B (see Fig. 1C). A cell that switches from 1A to 1B grows more slowly but switching can still be advantageous if the alternative route by phenotype B is faster than crossing the fitness valley at state 2A. Fig. 4 shows the adaptation time T as a function of switching rate α for different fitness costs. For sufficiently small c the same qualitative behaviour as for c = 0 is observed.
The same optimal switching rate range as in Fig. 2 can be seen in Fig. 4 for costs c ≤ 0.05 of state 1A’s fitness, although it reduces in significance as c increases. This is due to the increased time taken by trajectories that avoid state 2A as a result of the reduced growth rates of the phenotype B states. For large c, such as c = 0.1 in Fig. 4, there is no optimal switching range but the adaptation time T decreases monotonically with α. Stochastic phenotype switching thus speeds up evolution even in the presence of (moderate) fitness costs.
Switching rate evolves until it reaches the optimal range
So far we have shown that there often exists an optimal range for the switching rate that results in the shortest adaptation time. This range coincides with successful cells avoiding the deleterious state 2A by switching to phenotype B. To see whether a variable switching rate would evolve to be in this optimal range, thus optimising the evolutionary process, we simulated a population of cells that started at 1A with α ≈ 0 initially. The switching rate was subsequently allowed to evolve: during replication, offspring were assigned a new a value with probability μ, the same as for mutations between genotypes 1 ↔ 2 ↔ 3. The new α was randomly and uniformly selected from a fixed set of possible values (α = 2.56 × 10−10 × 5i for integer i ∈ [0,15]); similar results were obtained when α evolved incrementally (Fig. S5). In this extended model the evolutionary trajectory of successful cells spans three dimensions: two for the genotype, one of which evolves the switching rate, and one for the two phenotypes A and B (Fig. 5A). The end point of the trajectory remains a cell produced at state 3A regardless of its switching rate. We collected trajectories of successful cells in the expanded state space, including states with different α, and calculated transition probabilities between any two states. Fig. 5B shows these probabilities as links of different thickness between the states in state space. We observe that successful trajectories are unlikely to feature genotype mutations in phenotype A (very few red links). Instead, most successful cells first switch to state 1B, mutate to 3B, and switch back to state 3A. The evolved switching rate often falls within the optimal range found in Fig. 2B for the same set of parameters K, μ and d. This is further illustrated in Fig. 5C where we overlay the adaptation time from Fig. 2B with a bar chart that shows the probability that a particular α was selected by evolution.
Fig. 5C reveals two more interesting features. First, we see that cells that have evolved α above the optimal range (α ≥ 0.02) are unlikely to be successful due to fast transitions between states 2B and 2A (fitness valley). Such transitions effectively lower the fitness of state 2B and create a new fitness valley at this state. Second, we see that a large number of trajectories cross genotype space at α values below the optimal α range. However, the plot of T in Fig. 5C suggests this will be a slow process. The latter observation can be explained through these cells first evolving a large switching rate, allowing them to switch quickly to phenotype B, before their switching rate decreases again. This picture is corroborated by observing that at low a there is little phenotype switching (purple links in Fig. 5B) and yet the likelihood of mutations in phenotype B across genotype space is large (green links in Fig. 5B). This provides an alternative efficient trajectory for a population to cross the fitness valley. However, such trajectories need at least one extra mutation compared to those that evolve a directly to within the optimal range. Therefore evolution is dominated by the latter trajectories, in particular for low mutation probabilities μ.
DISCUSSION
Phenotypic plasticity has been recently shown to affect Darwinian evolution of animals [15]. Our theoretical research suggests that stochastic phenotypic switching is also able to significantly speed up biological evolution of microbes, perhaps by many orders of magnitude. Switching phenotypes enables cells to evade a fitness valley which may be difficult to cross otherwise, particularly for small populations. Although our model uses only three genotypes, we believe the results presented here are much more general and apply to realistic, multi-dimensional fitness landscapes [35, 42, 45]. Epistasis causes such landscapes to contain many local fitness maxima separated by genotypes with lower growth rate [23]. In a small population, in the absence of phenotype switching, evolution can spend much time in a local maximum before “tunnelling” through one of the fitness valleys. Phenotypic switching opens up a second fitness landscape in which minima and maxima can be located at different genotypes compared to the first landscape. Phenotype switching provides alternative routes between otherwise isolated fitness maxima in both landscapes, and thus enables evolution to proceed faster. This also adds another dimension to the complexity of evolutionary pathways [35, 36, 45] which is relevant for the predictability of evolution [10].
We now discuss how plausible our computational results are for real microbial populations. Although our theory is quite general, we focus on antibiotic resistance evolution. We believe our model represents best a situation in which a population of microbes is exposed to sub-lethal concentrations of an antibiotic. In this scenario, state 1A corresponds to a ‘wild-type’ genotype whose growth rate is slightly lowered by the presence of the antibiotic, but cells are still able to reproduce. The fitness landscape (Fig. 1C) for phenotype A then corresponds to a situation in which an initial mutation further lowers the growth rate by e.g. making the target enzyme less susceptible to the drug but also less efficient [30, 37, 38], while a second mutation compensates for the loss of efficiency and increases the growth rate beyond that of the ancestral strain. In the same example the flat landscape for phenotype B can result from switching to a state with higher gene expression of multidrug efflux pumps or antibiotic-degrading enzymes. This has been observed for a biofilm-forming bacterium P. aeruginosa which can switch to a resistant phenotype in the presence of antibiotics [8, 12]. Alternatively, phenotype B can be a persister - although the conventional view is that persistent cells do not replicate, this has been recently challenged [44]. Slower (but non-zero) replication rates can enable persisters to mutate and explore a larger set of genotypes than the non-persister (phenotype A) would be able to do in the absence of switching.
In microbes, the probability of single nucleotide mutation is 10−10 – 10−9 per replication [11, 26] but can be higher (10−7) in mutator strains [26, 39] or in the presence of antibiotics [18]. Assuming μ = 10−6 which would correspond to ∽ 10 point mutations in a mutator strain or any loss-of-function mutation in a non-mutator strain (assuming a typical gene length 1kbp [22]), Fig. 3D indicates that phenotype switching would be the preferred mode of evolution for α < 10−4 in a population of K = 103 cells, for 10−7 < α < 5 × 10−3 for intermediate size K = 106, and for α ≪ 10−3 for K = 109. For lower mutation probabilities μ we expect this behaviour to extend to lower switching rates, consistent with the increasing optimal ranges seen in Fig. 2B, while maintaining approximately the same upper limit of α ≪ 10−3,…, 10−2. Switching rates encountered in nature can be as large as this [3, 43] and thus we believe that stochastic phenotypic switching can be a significant feature affecting the biological evolution of microbes.
The model we have proposed is a mathematical idealisation. Although it may be possible to test the quantitative predictions of our model in a carefully designed experiment, many of the assumptions we made would have to be relaxed to model naturally occurring populations of bacteria. However, recent work suggests that mechanisms similar to what we described here may be at play in real microbial populations. In particular, Refs. [7, 50] show that the bacterium E. coli switches to a filamentous phenotype (“phenotype B”) as a result of exposure to sub-inhibitory concentration of the antibiotic ciprofloxacin, and filamentous cells occasionally produce normal-length cells (“phenotype A”) resistant to ciprofloxacin. It has been hypothesized that phenotype B provides a “safe niche” giving bacteria enough time to evolve resistance. Although the switching between the phenotypes in this experiment is not entirely random but a response to stress, the observed behaviour is similar to that predicted by our model.
METHODS
Custom written Java programs were used to simulate the model. For smaller K (K ≤ 100) each cell was assigned a current type (one of the 6 possible states from Fig. 1A) and a sequence of past states beginning at state 1A. A variant of the Gillespie kinetic Monte Carlo algorithm[17] was used to evolve the system. In each time step, a random cell and action (replication, death, or phenotype switch) was selected with a probability proportional to the rate with which that action occurs. A second random number drawn from an exponential distribution, with mean equal to the inverse of the total rate of all possible actions in the system, was used as a measure of time elapsed during that step. For larger K > 100 (or when c > 0) the evolution of the system was approximately modelled using a type of τ–leaping algorithm [16].
Statistical analysis was performed in Mathematica using a custom-written code. Mean values presented in all plots were obtained by averaging over 104 runs, except for Fig. 3D where points were averaged over 102 – 103 runs. Error bars are s.e.m.
Acknowledgments
We thank Rosalind Allen for helpful discussions and many suggestions at the beginning of this project. BW was supported by a RSE/Scottish Government Personal Research Fellowship. MRE and AT thank the EPSRC for support through grant number EP/J007404/1 and a studentship respectively. This work has made use of the resources provided by the Edinburgh Compute and Data Facility (ECDF) (http://www.ecdf.ed.ac.uk/).