Polymorphism and the Predictability of Evolution in Fisher’s Geometric Model

Sandeep Venkataram; Diamantis Sellis; Dmitri A. Petrov

doi:10.1101/001016

ABSTRACT

Predicting the future evolutionary state of a population is a primary goal of evolutionary biology. One can differentiate between forward and backward predictability, where forward predictability is the probability of the same adaptive outcome occurring in independent evolutionary trials, and backward predictability is the likelihood of a particular adaptive path given the knowledge of the starting and final states. Most studies of evolutionary predictability assume that alleles along an adaptive walk fix in succession with individual adaptive mutations occurring in monomorphic populations. However, in nature, adaptation generally occurs within polymorphic populations, and there are a number of mechanisms by which polymorphisms can be stably maintained by natural selection. Here we investigate the predictability of evolution in monomorphic and polymorphic situations by studying adaptive walks in diploid populations using Fisher’s geometric model, which has been previously found to generate balanced polymorphisms through overdominant mutations. We show that overdominant mutations cause a decrease in forward predictability and an increase in backward predictability relative to diploid walks lacking balanced states. We also show that in the presence of balanced polymorphisms, backward predictability analysis can lead to counterintuitive outcomes such as reaching different final adapted population states depending on the order in which mutations are introduced and cases where the true adaptive trajectory appears inviable. As stable polymorphisms can be generated in both haploid and diploid natural populations through a number of mechanisms, we argue that natural populations may contain complex evolutionary histories that may not be easily inferred without historical sampling.

INTRODUCTION

Predicting evolution is one of the fundamental challenges of evolutionary biology (reviewed in de Visser and Krug (2014)). This question became particularly prominent with Gould’s famous thought-experiment on “replaying the tape of life” (Gould, 1990). Gould wondered whether we would regenerate the observed evolutionary history of the world if we reset our evolutionary history to any point in the past and let evolution retake its course from there. More generally, we can ask whether it is possible to predict the path or the final destination of the evolutionary process from a given starting point. It is also possible, however, to ask whether we can reconstruct the true evolutionary trajectory given the final adapted state (Weinreichet al., 2006). This distinction between types of predictability is rarely made (however see Nourmohammadet al. (2013) and Szendro et al. (2013)), so we formalize the methods for studying predictability and utilize these distinctions to study the impact of polymorphism on the predictability of evolution.

Forward predictability of evolution

We define forward predictability as the probability of observing a particular future evolutionary outcome from a known starting state. Previous experimental evolution studies have generally (but not always) focused on the forward predictability of evolution. This type of analysis can be done at a number of levels, including the predictability of overall fitness changes, phenotypic shifts and different levels of genotypic changes (pathways, genes, and individual mutations).

For example, Fereaet al. (1999), Cooperet al. (2003) and Fong et al. (2005) evolved independent replicates of microbes and observed similar changes in gene expression and growth rate in the evolved clones. A large study of 145 parallel long-term experimental evolutions with Escherichia coli grown at elevated temperature showed that the same genes and pathways were repeatedly targeted for mutations in independent populations (Tenaillonet al., 2012) as did a study of 40 replicate Saccharomyces cerevisiae batch culture evolutions (Langet al., 2013) and a study that sequenced clones from 10 replicate evolutions for each of 13 different genetic backgrounds (Kryazhimskiyet al., 2014). Tenaillonet al. (2012) also observed a high degree of parallel evolution at the level of individual nucleotides, but nucleotide level parallelism was rarely observed by Langet al. (2013). Herron and Doebeli (2013) evolved E. coli under multiple carbon sources and repeatedly observed the evolution of two distinct ecotypes with differential ability to grow on each carbon source. By sequencing independent replicate clones of both ecotypes, they found the same genes, and sometimes the same exact mutations invading these replicate populations and differentiating the ecotypes. These studies suggest that evolution is indeed forward predictable to a surprising degree.

Repeated evolution has been observed at both the genetic and morphological levels in natural systems as well (reviewed in Stern (2013)). Kviteket al. (2008) showed that highly divergent yeast strains isolated from oak trees had similar growth rates across a panel of diverse growth conditions. Studies of Anolis lizards in the Caribbean show repeated independent adaptive radiations into similar niches across the islands (Losos, 1998). In addition, a study of the adaptive radiation of cichlid fish in Lake Tanganyika showed convergent morphological evolution when the skeletal morphology of the various species was compared to their phylogeny (Muschicket al., 2012).

Backward predictability of evolution

In addition to Gould’s thought-experiment, one can study predictability in a historical manner. Given the current state, we can try to predict the ancestral state or the evolutionary path that resulted in the current state of the study system. We call this backward predictability, as it requires us to look backward in time. For example, one can try to predict exactly how corn or rice became domesticated from one or more wild ancestors (Matsuokaet al., 2002; Molinaet al., 2011), identify the ancestral species that gave rise to Darwin’s Finches (Darwin, 1872; Satoet al., 2001), or reconstruct the ancestral state of a particular protein (Ortlundet al., 2007).

Alternatively, if we already know the ancestral state, we can try to predict the particular order of mutations or phenotypic states that led to the evolution of the current state. Weinreichet al. (2006) conducted a seminal study of backward predictability in this sense, using a combinatorially complete reverse genetic study design pioneered by Malcolmet al. (1990). Weinreichet al. (2006) reconstructed every possible combination of five mutations in the beta-lactamase gene in E. coli which are known to lead to high levels of resistance to the drug rifampicin. They then assayed each genotype’s resistance to the drug, which they used as a proxy for fitness. Using this data, they determined the fitness changes involved in every step of each of the 5! = 120 possible mutational paths that converts the wild-type genotype to the resistant five-mutant genotype. A mutational path was deemed viable if fitness monotonically increased with every step, that is, there were no mutations along the path that decreased resistance to the drug.

Weinreichet al. (2006) found that only 18 of the 120 possible paths were viable, suggesting high backward predictability of evolution. In contrast, Khanet al. (2011) performed an analysis of five adaptive mutations from experimentally evolved bacterial lineages using identical methodology and found that a majority of the orders were viable. Finally, Frankeet al. (2011) studied backward predictability in all subsets of two to six mutations in an empirical eight-locus system and found that the number of viable paths varied widely for a given subset size. For example, they observed both zero and nine viable paths (out of 24 possible) in different four-locus subsets. The varying degrees of backward predictability found in these different systems does not yet allow us to draw general conclusions, and the laborious nature of the experiments makes it challenging to study more than a few mutations at a time. In addition, without knowing the true order in which the mutations arose in the population, it is unclear how accurate backward predictability analysis actually is.

Predictability in Fisher’s Geometric Model

Overall, there seems to be no consensus on whether evolution is backward predictable using the method of Weinreichet al. (2006). It is also unclear how forward and backward predictability are correlated with each other. In principle, one would want to conduct forward evolution and then conduct backward predictability analysis on the same system to understand their relationship. However such studies would be extremely laborious, and given the disparate answers coming out of different experimental systems, a large number of independent experiments in many systems would need to be conducted to give a convincing answer.

Another difficulty in experimental evolution studies of predictability are practical limitations in sampling adaptive mutations. As most studies can only afford to sample a few adapted individuals from a given experiment, mutations must be at high frequency to be observed and a common assumption is that each of these mutations fixed in the population in succession (Gillespie, 1983, 1984; Orr, 2002; Weinreichet al., 2006; Khanet al., 2011; Frankeet al., 2011). However, we know that mutations can be maintained in a polymorphic state by a number of mechanisms. These include negative frequency-dependent selection (Levinet al., 1988; Iserbytet al., 2013), spatial and temporal fluctuations in selection (Rainey and Travisano, 1998; Kasumovicet al., 2008; Saltz and Nuzhdin, 2014) and heterozygote advantage (also called overdominance, Takahata and Nei (1990)). Polymorphisms can also be present in an unstable form through clonal interference (Desai and Fisher, 2007; Herron and Doebeli, 2013; Kvitek and Sherlock, 2013; Langet al., 2013). The presence of functionally consequential polymorphisms in a population can in principle significantly alter predictability analysis as the selective effect of a new mutation may be dependent on other alleles segregating in the population (fitness epistasis). Many of these polymorphisms are either lost by the end of the experiment or are not observed in the sampled adapted individuals, leading to incorrect inferences of predictability. Additional complications can arise when estimating predictability as mutations can occur in multiple backgrounds in a given population, so the likelihood of each mutation occurring in a particular background also has to be taken into account, as well as any epistatic interactions the mutation has with the rest of that background.

Due to the challenges of isolating sufficient numbers of independent adaptive mutations from experimental populations to study predictability, we utilize a simulation-based approach to study the impact of polymorphisms on forward and backward predictability. We employ Fisher’s geometric model (FGM, Fisher (1930)), which is a well-studied (Orr, 1999, 2005) phenotypic model that treats individuals and alleles as a phenotype that is a vector in coordinate space with a fitness that is determined by the distance of the individual’s phenotype from a predefined optimal phenotype using a gaussian function (Figure 1a). Selliset al. (2011) showed that adaptive mutations in diploid FGM simulations are frequently overdominant if the mutations are sufficiently large in phenotypic space, resulting in balanced polymorphisms. Such overdominant mutations are stable but can be driven out of the population by subsequent adaptive mutations. As we are interested in the interaction between balanced polymorphic states and the predictability of evolution, we select the distribution of mutational effects such that some evolutionary trajectories contain overdominant mutations, generating stable polymorphisms, and others do not. We then compare both types of trajectories to understand how polymorphisms influence predictability. We conclude that the presence of polymorphic states has a substantial qualitative effect on the predictability of evolution, such that at least in this model, forward and backward predictability are inversely correlated.

Figure 1. Fisher’s geometric model description and confirmation of accurate separation of simulations into those with and without overdominant mutations.

(A) Modified from Figure 2A Sellis et al 2011. Two orthogonal axes represent independent character traits. Fitness is determined by a symmetrical Gaussian function centered at the origin. Consider a population initially monomorphic for the wild-type allele . A mutation m gives rise to a mutant phenotype vector . The phenotype of the mutant heterozygote assuming phenotypic codominance (h = 1/2) is . The different circles specify the areas in which mutations are adaptive (i.e. successfully invade the population, α_dip) and replacing (i.e. fix in the population, γ) in diploids. (B) Density plot of all phenotypes of homozygous individuals observed in the adaptive walks of FGM simulations that do not contain overdominant mutations. Note that all observed phenotypes lie within γ, as all mutations must be replacing and not balancing in this group of simulations. Circles denote α_dip and γ as described in (A). (C) Homozygous phenotypes for simulations that do contain overdominant mutations. Note that a large number of phenotypes lie outside of γ, as expected for overdominant mutations, confirming that we are correctly separating walks with and without overdominant mutations. When comparing B and C, we observe that simulations with overdominant mutations are less forward predictable than those without such mutations.

METHODS

Simulations

We model adaptive walks in diploid populations with Wright-Fisher simulations using Fisher’s geometric model (FGM) as in Selliset al. (2011). In FGM, alleles are represented as a vector in n-dimensional phenotype space (Figure 1a). The simulations use code modified from Sellis et al. to allow for more than 2 dimensions. We perform 10,000 replicate simulations with population size N = 5, 000 for 10,000 generations. We explore two models, one with two dimensions and one with 25 dimensions. We partition our adaptive walks into those that do and those that do not contain overdominant mutations to study the impact of balanced states on predictability. For the remainder of our analysis, we identify the most frequent allele in each simulated population at the end of 10,000 generations of evolution and study the mutations present on that allele. We limit our analysis to studying the first five mutations of each adaptive walk and ignore simulations with fewer than 5 mutations in order to control for the length of the adaptive walk when studying predictability.

Forward Predictability Analysis

We calculate the forward predictability of the adaptive trajectory using two metrics. In both of these metrics, we only consider homozygous phenotypes. Our first metric, maximum pairwise distance, considers pairs of adaptive walks. We compute the maximum of the phenotypic distances between the observed single mutant phenotypes of the two adaptive walks, the double mutant phenotypes, the triple mutant phenotypes etc. Our second metric measures the maximal deviation from the optimal trajectory. For each adaptive walk, we compute the maximal phenotypic distance of any encountered (homozygous) phenotype from the line segment connecting the ancestral phenotype and the optimum.

Backward Predictability Analysis

We compute backward predictability on adaptive walks of exactly five mutations. We calculate the probability of all possible mutational orders for the given set of mutations in a manner similar to Weinreichet al. (2006), but generalized to allow balanced states as the experimental protocol of Weinreichet al. (2006) assumes that every mutation along each mutational order fixes in succession. We summarize the set of possible mutational orders for a given set of mutations through the effective number of trajectories statistic, which we define as where p is the probability of each mutational order possible for a given set of mutations. If no mutational order is viable (has nonzero probability), the effective number of trajectories is defined to be 0. Please see the Supplementary Methods for full methodological details.

RESULTS

We explore the predictability of evolution in the framework of Fisher’s geometric model (FGM) of adaptation. In FGM, alleles are represented as vectors in coordinate space, with individuals having a phenotype that is the average of the phenotypes of their constituent alleles. Mutations are vectors that modify the phenotype of an allele, and fitness is a guassian function of the distance of the individual’s phenotype from the optimal phenotype (which we define as the origin).

In order to focus on the effect of polymorphic states on the predictability of evolution, we choose a parameter regime that generates simulations both with and without overdominant mutations after a number of trial simulations with various parameter values. We perform 10,000 replicate simulations of adaptation under FGM in diploids with N = 5000 individuals. Mutational magnitudes are drawn from an exponential distribution with mean and the population is initiated at two units from the optimum. The mutation rate is 5 ∗ 10⁻⁶, which results in a mutation-limited regime (significantly less than one mutation per generation as 2 ∗ N ∗ µ = 0.05), in order to minimize the generation of polymorphic states by clonal interference so that we can focus on only those polymorphic states generated by overdominant mutations.

We conduct our simulations using an FGM of two dimensions, and show that our qualitative results also hold at 25 dimensions. In the 25 dimension regime, we need to rescale our mutational magnitude mean to 5 in order to obtain a sufficient number of walks with five mutations over our 10,000 generation simulations for statistical analysis. For all of our statistical analyses, we consider only those mutations that are present on the most frequent allele at generation 10,000. Such mutations are typically the only ones available for analysis in a natural system. We additionally limit our analysis to studying the first five mutations of each adaptive walk, and ignore simulations with fewer than five mutations in order to compare adaptive walks of equal lengths. We partition the resulting five-mutation adaptive walks into those that do (n = 4975, 1548 in simulations with two and 25 dimensions, respectively) and do not (n = 1251, 10) contain overdominant mutations to study the impact of balanced polymorphisms on the predictability of evolution. The presence of overdominant mutations in an observed five-mutation adaptive trajectory is detected by the observation of a set of alleles during the FGM simulation that are capable of being maintained as a balanced polymorphism (Kimura, 1956). For details, please see the Supplementary Methods.

Predictability of Adaptive Walks

We first consider the forward predictability of phenotypic paths, which we define as the tendency of independent adaptive walks to explore similar portions of phenotypic space. The ability of adaptive walks with overdominant mutations to explore a larger phenotypic space compared to walks without overdominance (α-dip vs γ, Figure 1a) should lead to lower predictability of the phenotypic intermediates along the adaptive walk, which is confirmed by visual inspection of our simulations (Figure 1b,c) and is consistent with the findings of Selliset al. (2011).

We quantify forward predictability by measuring the distribution of maximal phenotypic distances between pairs of independent adaptive trajectories. Pairs of walks with overdominant states are, on average, 40% further apart than walks without overdominant mutations and are therefore less forward predictable (Figure 2, Kolmogorov-Smirnov test p ≪ 10⁻¹⁰). We also measure forward predictability as the maximal phenotypic distance of each observed trajectory from the optimal trajectory - the vector from the ancestral phenotype to the optimal phenotype. We observe that the presence of overdominant mutations in a walk increases the average distance from the optimal trajectory by 5% (Figure 3, Kolmogorov-Smirnov test p ≪ 10⁻¹⁰), again suggesting that overdominant mutations decrease forward predictability.

Figure 2. Overdominant mutations decrease forward predictability by 40% using the maximum pairwise distance metric.

Shown are the cumulative distributions of the maximum phenotypic distance between independent pairs of adaptive walks, excluding the ancestral state. This is a measure of the phenotypic forward repeatability of independent walks on the same evolutionary landscape. The maximum phenotypic distance in simulations without overdominant states is significantly less than in simulations with such states (Kolmogorov-Smirnov test p ≪ 10⁻¹⁰).

Figure 3. Overdominant mutations decrease forward predictability by 5% using the maximum distance from the optimal trajectory metric.

Shown are the cumulative distributions of the maximum distance from the optimal trajectory of adaptive walks. This is a measure of the phenotypic forward predictability of walks. The maximum distance from the optimal trajectory in simulations without overdominant mutations is significantly less than those with such mutations (Kolmogorov-Smirnov test p ≪ 10^-10).

We then study backward predictability in a manner similar to Weinreichet al. (2006). As before, we limit our analysis to adaptive walks of exactly five mutations, which is comparable to many recent experimental studies of backward predictability (Weinreichet al., 2006; Khanet al., 2011; Frankeet al., 2011). Backward predictability analysis requires knowledge of the five mutations that occurred during the FGM simulations and computes the likelihood of every possible order of those five mutations in generating the observed adapted five-mutation allele (e.g. see Weinreichet al. (2006) Figure 2). In order to conduct this analysis, we compute the probability of every possible path to the five-mutant state by successively introducing each of the five mutations into the population and assessing the probability of each of these mutations to successfully invade the population (see Supplementary Methods). Although we artificially constrain the available phenotypes to only those generated by combinations of the five mutations under consideration, this analysis is a model for studying predictability in situations where there are only a few possible adaptive mutations, such as the drug resistance mutations used by Weinreich et al. We compute the effective number of adaptive trajectories for each adaptive walk, with a higher number suggestive of a lower backward predictability.

The results of our backward predictability analysis are shown in Figure 4. We find that in contrast to forward predictability, overdominant states decrease the effective number of paths (and thus increase backward predictability) in a walk by 30%, on average (Kolmogorov-Smirnov test p ≪ 10⁻¹⁰). In other words, conditional on reaching a particular five-mutant state, it is more probable that independent trials of a walk that experienced at least one overdominant state will use the same mutational order in repeated trials relative to a walk without overdominant states. We also utilize the mean path divergence of Lobkovskyet al. (2011) to study backward predictability and find that overdominant states resulted in walks that were 10% less divergent (and thus more backward predictable), on average (Kolmogorov-Smirnov test p ≪ 10⁻¹⁰).

Figure 4. Overdominant mutations increase backward predictability by 30% using the effective number of paths metric.

Shown are the cumulative distributions of the effective number of paths for adaptive walks with five mutations. This is a metric of backward predictability of evolution. Each mutation is introduced into the ancestral background in every possible order, and the number of viable mutational orders, weighted by their probabilities, determines the effective number of paths. The effective number of paths in simulations without overdominant mutations is significantly greater than in simulations with such mutations (Kolmogorov-Smirnov test p ≪ 10^-10).

Multiple End States

In addition to studying the probability of a given mutational order in our backward predictability analysis, we also study the adapted population state that results from each viable mutation order. In particular, we observe that when mutations are introduced in different orders, the population encounters different intermediate alleles, resulting in instances where the final adapted five-mutant allele can balance against different intermediate alleles depending on the order in which the mutations were introduced into the population. We also observe instances where walks that did not experience balanced states in the FGM simulations generate balanced states when introduced in a different order.

We find that 53% of all walks have at least two different end population states containing the final adapted allele, with a maximum of 19 different population states for a single set of five mutations. We also find that the presence of overdominant mutations in the FGM simulation has a significant effect on whether there are multiple end states observed. The presence of an overdominant mutation in the observed walk increases the frequency of multiple end states from 30% to 60%. Our results suggest that adaptation occurring in the same genetic background, in response to the same selection pressure and using the same mutations, can result in significantly different final population states depending on the historical order in which the adaptive mutations occurred.

Qualitative categorization with regard to backward predictability

We analyze our backward predictability results to discern qualitative categorizations of our simulations. We find four broad categorizations of simulations: 1) simulations whose backward predictability reconstructions of the five-mutant allele by introducing the mutations in the order observed in the FGM simulation generate no balanced states, 2) those reconstructions that do generate balanced states, 3) reconstructions where the order of mutations that was observed in the simulation was impossible to reconstruct due to deleterious intermediate states during the reconstructions and 4) reconstructions where every possible order of mutations was impossible due to deleterious intermediate states (which is a subset of category 3).

We observe 2326, 3898, 89 and 5 simulations in each of these four categories, respectively. We can further separate these categories by conditioning on our original definitions of whether or not a simulation contained an overdominant intermediate state (i.e. whether there was a set of alleles that could be maintained in a stable balanced state at any point during the FGM simulation before the 5-mutant state reached 5% frequency). We find 1187, 62, 2 and 0 simulations in each of these four categories, respectively, among the simulations that we had previously identified as not containing overdominant intermediate states while we observe 1139, 3836, 87 and 5 simulations in each of these four categories, respectively, among simulations that we had previously identified as containing overdominant intermediate states.

The presence of backward predictability reconstructions where the observed order (and in a few cases, every order) of mutations is impossible is surprising. We hypothesize that this is due to the presence of adaptive alleles that are generated and stably maintained during a walk that are transient and do not survive until the end of the simulation. We call these “hidden alleles”, as they are hidden from almost all modern experimental studies of adaptation. Lack of knowledge of hidden alleles appear to decrease the computed probability of the true adaptive path observed in the FGM simulations, and in extreme cases, can make the true path impossible to reconstruct. Visual inspection of adaptive trajectories that are unable to be successfully reconstructed confirms this intuition (Figure 5). Backward predictability reconstructions that incorporate all mutations present at ≥ 1% frequency at any point in the simulation, regardless of whether the mutation was present on the allele sampled at the end of the simulation, can successfully reconstruct the observed adaptive trajectory of this previously impossible evolutionary outcome, confirming the necessity of hidden alleles for the viability of the observed adaptive trajectory in these instances.

Figure 5. Example simulation with a hidden allele where the observed most frequent allele was impossible to reconstruct by our method to compute backward predictability.

The frequency of the two mutational lineages that reached at least 1% frequency in the population are shown throughout the 10,000 generations of the simulation. The main lineage, ending with allele ABCDGH, is at high frequency at the end of the simulation, while the minor lineage, ending with allele ABCDEF (a “hidden allele”) is at low frequency at the end of the simulation.

In the simulation, four mutations initially fix in quick succession, resulting in allele ABCD fixed in the population. At this point, mutations causing balanced polymorphisms result in branched mutational lineages. Mutation E is the first mutation to occur on allele ABCD, generating a balanced polymorphism between alleles ABCD and ABCDE and allowing both alleles to be stably maintained in the population at intermediate frequency. Mutation F then quickly occurs on the background of allele ABCDE, generating allele ABCDEF which also balances with allele ABCD. Mutation G then occurs on the background of allele ABCD generating allele ABCDG soon afterwards, which balances with allele ABCDEF. Finally, mutation H occurs on allele ABCDG generating allele ABCDGH, which outcompetes all other alleles and is nearly fixed by the end of the simulation.

In our backward predictability reconstructions, we consider only the first five mutations of the most frequent allele at the end of the simulation, that is, we consider only mutations A, B, C, D and G as these were the first five mutations on allele ABCDGH. In attempting to reconstruct this observed order of mutations, we find that we can successfully introduce mutations A, B, C and D in order, but mutation G, which results in allele ABCDG, is not beneficial if allele ABCD is the only other allele in the population (data not shown). Therefore, the true order of mutations is impossible to reconstruct in this case when only sampling allele ABCDGH at the end of the simulation. However, if we also consider mutations E and F, we are able to successfully reconstruct the intermediate steps of the observed adaptive trajectory, suggesting that the presence of allele ABCDEF is necessary for allele ABCDG to be beneficial (data not shown).

We then compare the forward and backward predictability metrics described above on the different categories of simulations. In particular, we compare the simulations that were initially defined as not containing overdominant states at any point to those that did not have balanced states in the backward predictability analysis but did have balanced states during the FGM simulation. We find no significant difference between these sets of simulations by any of our predictability metrics (maximum pairwise distance, maximum distance from optimal trajectory and effective number of paths Kolmogorov-Smirnov test p > 0.05). This result suggests that the signal in our predictability metrics is being driven by the presence of balanced states between intermediate alleles along the adaptive trajectory to the five-mutant allele rather than a general feature of observing balanced states in our simulations as a whole.

High Dimensionality

In our implementation of Fisher’s Model, balanced states arise when mutations are overdominant. The presence of additional phenotypic dimensions, which seems realistically plausible from observed rates of pleiotropy (Dudleyet al., 2005; Albertet al., 2008), increases the frequency of overdominant mutations (Selliset al., 2011). However, this concordantly decreases the fitness advantage of the average new beneficial mutation, decreasing the number of adaptive mutations that successfully invade the population over our 10,000 generation FGM simulations. To study the impact of high dimensional landscapes on predictability, we conducted simulations using 25 dimensions with a mean mutation size of 5. The increase in mean mutation size relative to our original two dimensional simulations is necessary to generate a sufficient number of walks containing at least 5 mutations within 10,000 generations. We again partitioned the simulations into those with (n = 1548) and without (n = 10) overdominant mutations at any point of the FGM simulation before the time when the five-mutant allele reached 5% frequency.

We observe the same qualitative results in 25 dimensions as in 2 dimensions (see Supplementary Figures 1-4). In general, it appears that our conclusions about predictability of adaptive walks do not depend on the dimensionality of the system, and only on the presence of overdominant mutations in the adaptive walk.

DISCUSSION

In this study, we explored the predictability of evolution using Fisher’s geometric model. We distinguished between forward and backward predictability, where forward predictability measures the likelihood of the same or a similar adaptive trajectory occurring in independent evolutions, while backward predictability measures the likelihood of a particular order of adaptive mutations given the ultimate adapted state. We knew from prior work that diploids frequently generate overdominant mutations under Fisher’s geometric model (Selliset al., 2011), so we studied predictability using walks with and without overdominant mutations to understand the impact of balanced polymorphisms on predictability.

We found that simulations without overdominant mutations are more forward predictable than simulations with overdominance, while the reverse is true for backward predictability. The anti-correlation between forward and backward predictability can be intuitively understood by considering the the nature of adaptation in Fisher’s geometric model. In walks without overdominant mutations, mutations are confined to within γ (Figure 1a), leading to high forward predictability. There is minimal opportunity for deviation from the optimal trajectory, and most of the adaptive mutations that occur during these walks have similar direction vectors to the optimal trajectory. Therefore, regardless of the order of mutations, each step will move the population closer to the optimum, making most of the trajectories viable, and resulting in low backward predictability. The reverse is true in walks with overdominant mutations, which explore a much larger portion of phenotypic space (α_dip). Overdominant mutations tend to overshoot the optimum and are frequently followed by compensatory mutations. The larger amount of phenotypic space explored generates lower forward predictability, while the high frequency of compensatory mutations, and thus the importance of the order in which the mutations are introduced, results in high backward predictability. While Fisher’s geometric model is a useful tool to consider adaptation under phenotypic stabilizing selection, further work is required to determine the extent to which this anti-correlation is generalizable to biological systems. Nevertheless, the anti-correlation we observe between forward and backward predictability highlights the importance of distinguishing between types of predictability in future studies.

In natural populations, stable polymorphisms can be due to overdominance or other types of balancing selection, such as negative frequency dependent selection (Levinet al., 1988; Iserbytet al., 2013), and spatially or temporally variable selection (Rainey and Travisano, 1998; Kasumovicet al., 2008; Saltz and Nuzhdin, 2014). Transient functional polymorphisms at intermediate frequencies can also be generated via clonal interference (Desai and Fisher, 2007; Herron and Doebeli, 2013; Kvitek and SHERLOCK, 2013; Langet al., 2013). Both frequency dependent selection and clonal interference can occur in both haploid and diploid populations. Our work shows that the presence of polymorphisms in the population, regardless of source, significantly complicates analysis of adaptive trajectories, and these complications must be considered in all natural systems.

One such complication is the existence of simultaneous mutational lineages, which can result in hidden alleles (i.e. alleles that are not present at the end of the evolution) and transient population states that nevertheless significantly impact the future course of evolution. Ignoring hidden alleles can significantly modify the inferred backward predictability, and in extreme cases, can incorrectly suggest that the true order of mutations is impossible. Different orders of mutations can also generate different sets of heterozygous genotypes and different end population states, requiring the consideration of the state of the entire adapted population rather than the presence of a particular adapted allele.

Polymorphic states also drastically increase the number of possible adaptive paths. In systems where adaptation proceeds through sequential fixation, one only needs to consider the fitness of the 2ⁿ possible genotypes relative to the ancestral background for an n-mutation system. This is the methodology used in the experimental backward predictability studies of Weinreichet al. (2006), Khanet al. (2011) and Franke et al. (2011). However, in regimes where polymorphic states are frequently generated, the fitness of an invading mutation can vary depending on the alleles already present in the population. Within each adaptive trajectory, every mutation along the trajectory needs to be introduced into the prior population at low frequency on every available allele and tracked until the frequency of the new mutation has been stabilized in order to establish that the mutation is truly beneficial. Such a study would be extremely laborious, and to our knowledge, has never been conducted in any system.

Experimental Implications

In an experimental setting, high forward predictability means it is likely that the same set of mutations will be generated in independent adaptive walks, which make the probabilities generated through backward predictability analysis meaningful for predicting future events. This can occur by either a small mutational target size such as mutations that cause resistance to drugs, or a large mutational input into the population which makes rare but extremely beneficial mutations dominate the adaptive process (e.g. Desai and Fisher (2007); Kvitek and Sherlock (2011); Gersteinet al. (2012); Pennings (2012)). A study in FGM also suggests that a multi-locus FGM where each locus only influences a subset of the independent phenotypic dimensions (restricted pleiotropy) also promotes forward predictability, which the authors call parallel evolution (Chevinet al., 2010). Despite the large number of replicates required to achieve statistical significance, experimentally determining forward predictability has been shown to be feasible.

On the other hand, the possibility of hidden alleles makes accurate estimates of backward predictability impossible in both natural and artificial experimental systems. Since we do not have access to hidden alleles from natural populations, it is impossible to accurately compute the backward predictability of the adaptive walk leading to the current population state. Studying backward predictability using forward evolutions and constant sampling is equally infeasible. Even if we could sample every mutation that rises to reasonable frequency in a population, almost all of these mutations will be lost, and there may be far too many to determine the subset which are non-neutral. As mentioned above, there is also the problem of combinatorially many adaptive walks possible for even a few mutations, making complete experimental analysis of even a five mutation system extremely challenging. As others have mentioned, sampling a few high-fitness mutations and conducting backward predictability experiments may not generate a correct representation of the probability of any particular adaptive walk, as there may be alternative adaptive peaks (Weinreichet al., 2006). Additionally, there is the possibility of adaptation and potential epistatic interactions at sites not under consideration, and spatial or temporal fluctuations in selection pressures can further complicate accurate assessments of backward predictability in natural systems, and calls into question the accuracy of reconstructed ancestral states.

Finally, the impact of hidden alleles on evolutionary trajectories depends on the rate at which stable polymorphic states are generated. Rainey and Travisano (1998), for example, observed adaptive radiation by niche construction in every replicate evolution experiment they conducted. Under these conditions, we may expect hidden alleles to be frequent in a large evolving population. The adapted state of natural populations may thus experience a strong historical dependence on transient mutations that are eventually lost and impossible to sample, decreasing the forward predictability of evolution and making the inference of backward predictability impossible. The rate at which polymorphic states are generated in natural systems and potential differences between types of polymorphic states and their impact on forward and backward predictability should be further explored to improve our understanding of the predictability of evolution.

Literature Cited

↵
Albert, A. Y. K., S. Sawaya, T. H. Vines, A. K. Knecht, C. T. Miller, et al., 2008 The genetics of adaptive shape shift in stickleback: pleiotropy and effect size. Evolution; international journal of organic evolution 62: 76–85.
OpenUrl CrossRef PubMed Web of Science
↵
Chevin, L.-M., G. Martin, and T. Lenormand, 2010 Fisher’s model and the genomics of adaptation: restricted pleiotropy, heterogenous mutation, and parallel evolution. Evolution 64: 3213–31.
OpenUrl CrossRef PubMed Web of Science
↵
Cooper, T. F., D. E. Rozen, and R. E. Lenski, 2003 Parallel changes in gene expression after 20,000 generations of evolution in Escherichiacoli. Proceedings of the National Academy of Sciences of the United States of America 100: 1072–7.
OpenUrl Abstract/FREE Full Text
↵
Darwin, C., 1872 The Origin of Species. John Murray, London, 6th edition.
↵
de Visser, J. A. G. M., and J. Krug, 2014 Empirical fitness landscapes and the predictability of evolution. Nature reviews. Genetics 15: 480–90.
OpenUrl CrossRef PubMed
↵
Desai, M. M., and D. S. Fisher, 2007 Beneficial mutation selection balance and the effect of linkage on positive selection. Genetics 176: 1759–98.
OpenUrl Abstract/FREE Full Text
↵
Dudley, A. M., D. M. Janse, A. Tanay, R. Shamir, and G. M. Church, 2005 A global view of pleiotropy and phenotypically derived gene function in yeast. Molecular systems biology 1: 2005.0001.
↵
Ferea, T., D. Botstein, P. O. Brown, and R. F. Rosenzweig, 1999 Systematic changes in gene expression patterns following. Proceedings of the National Academy of Sciences of the United States of America 96: 9721–9726.
OpenUrl Abstract/FREE Full Text
↵
Fisher, R., 1930 The genetical theory of natural selection. Oxford at the Clarendon Press, Oxford, 1st edition.
↵
Fong, S. S., A. R. Joyce, and B. O. Palsson, 2005 Parallel adaptive evolution cultures of Escherichia coli lead to convergent growth phenotypes with different gene expression states. Genome Research 15: 1365–72.
OpenUrl Abstract/FREE Full Text
↵
Franke, J., A. Klozer, J. A. G. M. de Visser, and J. Krug, 2011 Evolutionary Accessibility of Mutational Pathways. PLoS Computational Biology 7.
↵
Gerstein, A. C., D. S. Lo, and S. P. Otto, 2012 Parallel Genetic Changes and Non-parallel Gene-environment Interactions Characterize the Evolution of Drug resistance in Yeast. Genetics 192: 241–252.
OpenUrl Abstract/FREE Full Text
↵
Gillespie, J., 1983 A Simple Stochastic gene substitution model. Theoretical Population Biology 23: 202–215.
OpenUrl CrossRef PubMed Web of Science
↵
Gillespie, J., 1984 Molecular evolution over the mutational landscape. Evolution 38: 1116–1129.
OpenUrl CrossRef Web of Science
↵
Gould, S. J., 1990 Wonderful Life: The Burgess Shale and the Nature of History. W. W. Norton & Company.
↵
Herron, M. D., and M. Doebeli, 2013 Parallel Evolutionary Dynamics of Adaptive Diversification in Escherichia coli. PLoS Biology 11: e1001490.
OpenUrl CrossRef PubMed
↵
Iserbyt, A., J. Bots, H. Van Gossum, and T. N. Sherratt, 2013 Negative frequency-dependent selection or alternative reproductive tactics: maintenance of female polymorphism in natural populations. BMC evolutionary biology 13: 139.
OpenUrl
↵
Kasumovic, M. M., M. J. Bruce, M. C. B. Andrade, and M. E. Herberstein, 2008 Spatial and temporal demographic variation drives within-season fluctuations in sexual selection. Evolution 62: 2316–25.
OpenUrl CrossRef PubMed Web of Science
↵
Khan, A. I., D. M. Dinh, D. Schneider, R. E. Lenski, and T. F. Cooper, 2011 Negative epistasis between beneficial mutations in an evolving bacterial population. Science 332: 1193–6.
OpenUrl Abstract/FREE Full Text
↵
Kimura, M., 1956 Rules for testing stability of a selective polymorphism. Proceedings of the National Academy of Sciences of …1966: 336–340.
OpenUrl
↵
Kryazhimskiy, S., D. P. Rice, E. R. Jerison, and M. M. Desai, 2014 Global Epistasis Makes Adaptation Predictable Despite Sequence-Level Stochasticity. Science 344: 1519–1522.
OpenUrl Abstract/FREE Full Text
↵
Kvitek, D. J., and G. Sherlock, 2011 Reciprocal Sign Epistasis between Frequently Experimentally Evolved Adaptive Mutations Causes a Rugged Fitness Landscape. PLoS Genetics 7: e1002056.
OpenUrl CrossRef PubMed
↵
Kvitek, D. J., and G. Sherlock, 2013 Whole Genome, Whole Population Sequencing Reveals That Loss of Signaling Networks Is the Major Adaptive Strategy in a Constant Environment. PLoS Genetics 9: e1003972.
OpenUrl
↵
Kvitek, D. J., J. L. Will, and A. P. Gasch, 2008 Variations in stress sensitivity and genomic expression in diverse S. cerevisiae isolates. PLoS Genetics 4: e1000223.
OpenUrl CrossRef
↵
Lang, G. I., D. P. Rice, M. J. Hickman, E. Sodergren, G. M. Weinstock, et al., 2013 Pervasive genetic hitchhiking and clonal interference in forty evolving yeast populations. Nature.
↵
Levin, B. R., J. Antonovics, and H. Sharma, 1988 Frequency-Dependent Selection in Bacterial Populations [and Discussion]. Philosophical Transactions of the Royal Society B: Biological Sciences 319: 459–472.
OpenUrl CrossRef PubMed
↵
Lobkovsky, A. E., Y. I. Wolf, and E. V. Koonin, 2011 Predictability of evolutionary trajectories in fitness landscapes. PLoS Computational Biology 7: e1002302.
OpenUrl
↵
Losos, J. B., 1998 Contingency and Determinism in Replicated Adaptive Radiations of Island Lizards. Science 279: 2115–2118.
OpenUrl Abstract/FREE Full Text
↵
Malcolm, B., K. Wilson, and B. Matthews, 1990 Ancestral lysozymes reconstructed, neutrality tested, and thermostability linked to hydrocarbon packing. Nature 345: 86–89.
OpenUrl CrossRef PubMed Web of Science
↵
Matsuoka, Y., Y. Vigouroux, M. M. Goodman, J. Sanchez G, E. Buckler, et al., 2002 A single domestication for maize shown by multilocus microsatellite genotyping. Proceedings of the National Academy of Sciences of the United States of America 99: 6080–4.
OpenUrl Abstract/FREE Full Text
↵
Molina, J., M. Sikora, N. Garud, J. M. Flowers, S. Rubinstein, et al., 2011 Molecular evidence for a single evolutionary origin of domesticated rice. Proceedings of the National Academy of Sciences of the United States of America 108: 8351–6.
OpenUrl Abstract/FREE Full Text
↵
Muschick, M., A. Indermaur, and W. Salzburger, 2012 Convergent evolution within an adaptive radiation of cichlid fishes. Current Biology 22: 2362–8.
OpenUrl CrossRef PubMed
↵
Nourmohammad, A., T. Held, and M. Lässig, 2013 Universality and predictability in molecular quantitative genetics. Current opinion in genetics & development 23: 684–93.
OpenUrl
↵
Orr, H. A., 1999 The evolutionary genetics of adaptation: a simulation study. Genetical Research 74: 207–14.
OpenUrl CrossRef PubMed Web of Science
↵
Orr, H. A., 2002 The population genetics of adaptation: the adaptation of DNA sequences. Evolution; international journal of organic evolution 56: 1317–30.
OpenUrl CrossRef PubMed Web of Science
↵
Orr, H. A., 2005 The genetic theory of adaptation: a brief history. Nature reviews. Genetics 6: 119–27.
OpenUrl CrossRef PubMed Web of Science
↵
Ortlund, E. A., J. T. Bridgham, M. R. Redinbo, and J. W. Thornton, 2007 Crystal structure of an ancient protein: evolution by conformational epistasis. Science (New York, N.Y.) 317: 1544–8.
OpenUrl
↵
Pennings, P. S., 2012 Standing Genetic Variation and the Evolution of Drug Resistance in HIV. PLoS Computational Biology 8: e1002527.
↵
Rainey, P., and M. Travisano, 1998 Adaptive radiation in a heterogeneous environment. Nature 394: 69–72.
OpenUrl CrossRef PubMed Web of Science
↵
Saltz, J. B., and S. V. Nuzhdin, 2014 Genetic variation in niche construction: implications for development and evolutionary genetics. Trends in ecology & evolution 29: 8–14.
OpenUrl
↵
Sato, A., H. Tichy, C. O’hUigin, P. Grant, B. R. Grant, et al., 2001 On the origin of Darwin’s finches. Molecular Biology and …: 299–311.
↵
Sellis, D., B. Callahan, D. A. Petrov, and P. W. Messer, 2011 Heterozygote advantage as a natural consequence of adaptation in diploids. Proceedings of the National Academy of Sciences of the United States of America 2011: 1–6.
OpenUrl
↵
Stern, D. L., 2013 The genetic causes of convergent evolution. Nature reviews. Genetics 14: 751–64.
OpenUrl CrossRef PubMed
↵
Szendro, I. G., J. Franke, J. A. G. M. de Visser, and J. Krug, 2013 Predictability of evolution depends nonmonotonically on population size. Proceedings of the National Academy of Sciences of the United States of America 110: 571–6.
OpenUrl Abstract/FREE Full Text
↵
Takahata, N., and M. Nei, 1990 Allelic genealogy under overdominant and frequency-dependent selection and polymorphism of major histocompatibility complex loci. Genetics 124: 967–78.
OpenUrl Abstract/FREE Full Text
↵
Tenaillon, O., A. Rodriguez-Verdugo, R. L. Gaut, P. McDonald, A. F. Bennett, et al., 2012 The Molecular Diversity of Adaptive Convergence. Science 335: 457–461.
OpenUrl Abstract/FREE Full Text
↵
Weinreich, D. M., N. F. Delaney, M. A. Depristo, and D. L. Hartl, 2006 Darwinian evolution can follow only very few mutational paths to fitter proteins. Science 312: 111–4.
OpenUrl Abstract/FREE Full Text

View the discussion thread.

Posted August 21, 2014.

Download PDF

Supplementary Material

Citation Tools

Subject Area

Evolutionary Biology

Subject Areas

All Articles

Animal Behavior and Cognition (5215)
Biochemistry (11753)
Bioengineering (8752)
Bioinformatics (29201)
Biophysics (14974)
Cancer Biology (12100)
Cell Biology (17413)
Clinical Trials (138)
Developmental Biology (9422)
Ecology (14182)
Epidemiology (2067)
Evolutionary Biology (18309)
Genetics (12245)
Genomics (16804)
Immunology (11869)
Microbiology (28098)
Molecular Biology (11596)
Neuroscience (60975)
Paleontology (451)
Pathology (1871)
Pharmacology and Toxicology (3238)
Physiology (4959)
Plant Biology (10427)
Scientific Communication and Education (1683)
Synthetic Biology (2886)
Systems Biology (7340)
Zoology (1651)

[1] ↵
Albert, A. Y. K., S. Sawaya, T. H. Vines, A. K. Knecht, C. T. Miller, et al., 2008 The genetics of adaptive shape shift in stickleback: pleiotropy and effect size. Evolution; international journal of organic evolution 62: 76–85.
OpenUrl CrossRef PubMed Web of Science

[2] ↵
Chevin, L.-M., G. Martin, and T. Lenormand, 2010 Fisher’s model and the genomics of adaptation: restricted pleiotropy, heterogenous mutation, and parallel evolution. Evolution 64: 3213–31.
OpenUrl CrossRef PubMed Web of Science

[3] ↵
Cooper, T. F., D. E. Rozen, and R. E. Lenski, 2003 Parallel changes in gene expression after 20,000 generations of evolution in Escherichiacoli. Proceedings of the National Academy of Sciences of the United States of America 100: 1072–7.
OpenUrl Abstract/FREE Full Text

[4] ↵
Darwin, C., 1872 The Origin of Species. John Murray, London, 6th edition.

[5] ↵
de Visser, J. A. G. M., and J. Krug, 2014 Empirical fitness landscapes and the predictability of evolution. Nature reviews. Genetics 15: 480–90.
OpenUrl CrossRef PubMed

[6] ↵
Desai, M. M., and D. S. Fisher, 2007 Beneficial mutation selection balance and the effect of linkage on positive selection. Genetics 176: 1759–98.
OpenUrl Abstract/FREE Full Text

[7] ↵
Dudley, A. M., D. M. Janse, A. Tanay, R. Shamir, and G. M. Church, 2005 A global view of pleiotropy and phenotypically derived gene function in yeast. Molecular systems biology 1: 2005.0001.

[8] ↵
Ferea, T., D. Botstein, P. O. Brown, and R. F. Rosenzweig, 1999 Systematic changes in gene expression patterns following. Proceedings of the National Academy of Sciences of the United States of America 96: 9721–9726.
OpenUrl Abstract/FREE Full Text

[9] ↵
Fisher, R., 1930 The genetical theory of natural selection. Oxford at the Clarendon Press, Oxford, 1st edition.

[10] ↵
Fong, S. S., A. R. Joyce, and B. O. Palsson, 2005 Parallel adaptive evolution cultures of Escherichia coli lead to convergent growth phenotypes with different gene expression states. Genome Research 15: 1365–72.
OpenUrl Abstract/FREE Full Text

[11] ↵
Franke, J., A. Klozer, J. A. G. M. de Visser, and J. Krug, 2011 Evolutionary Accessibility of Mutational Pathways. PLoS Computational Biology 7.

[12] ↵
Gerstein, A. C., D. S. Lo, and S. P. Otto, 2012 Parallel Genetic Changes and Non-parallel Gene-environment Interactions Characterize the Evolution of Drug resistance in Yeast. Genetics 192: 241–252.
OpenUrl Abstract/FREE Full Text

[13] ↵
Gillespie, J., 1983 A Simple Stochastic gene substitution model. Theoretical Population Biology 23: 202–215.
OpenUrl CrossRef PubMed Web of Science

[14] ↵
Gillespie, J., 1984 Molecular evolution over the mutational landscape. Evolution 38: 1116–1129.
OpenUrl CrossRef Web of Science

[15] ↵
Gould, S. J., 1990 Wonderful Life: The Burgess Shale and the Nature of History. W. W. Norton & Company.

[16] ↵
Herron, M. D., and M. Doebeli, 2013 Parallel Evolutionary Dynamics of Adaptive Diversification in Escherichia coli. PLoS Biology 11: e1001490.
OpenUrl CrossRef PubMed

[17] ↵
Iserbyt, A., J. Bots, H. Van Gossum, and T. N. Sherratt, 2013 Negative frequency-dependent selection or alternative reproductive tactics: maintenance of female polymorphism in natural populations. BMC evolutionary biology 13: 139.
OpenUrl

[18] ↵
Kasumovic, M. M., M. J. Bruce, M. C. B. Andrade, and M. E. Herberstein, 2008 Spatial and temporal demographic variation drives within-season fluctuations in sexual selection. Evolution 62: 2316–25.
OpenUrl CrossRef PubMed Web of Science

[19] ↵
Khan, A. I., D. M. Dinh, D. Schneider, R. E. Lenski, and T. F. Cooper, 2011 Negative epistasis between beneficial mutations in an evolving bacterial population. Science 332: 1193–6.
OpenUrl Abstract/FREE Full Text

[20] ↵
Kimura, M., 1956 Rules for testing stability of a selective polymorphism. Proceedings of the National Academy of Sciences of …1966: 336–340.
OpenUrl

[21] ↵
Kryazhimskiy, S., D. P. Rice, E. R. Jerison, and M. M. Desai, 2014 Global Epistasis Makes Adaptation Predictable Despite Sequence-Level Stochasticity. Science 344: 1519–1522.
OpenUrl Abstract/FREE Full Text

[22] ↵
Kvitek, D. J., and G. Sherlock, 2011 Reciprocal Sign Epistasis between Frequently Experimentally Evolved Adaptive Mutations Causes a Rugged Fitness Landscape. PLoS Genetics 7: e1002056.
OpenUrl CrossRef PubMed

[23] ↵
Kvitek, D. J., and G. Sherlock, 2013 Whole Genome, Whole Population Sequencing Reveals That Loss of Signaling Networks Is the Major Adaptive Strategy in a Constant Environment. PLoS Genetics 9: e1003972.
OpenUrl

[24] ↵
Kvitek, D. J., J. L. Will, and A. P. Gasch, 2008 Variations in stress sensitivity and genomic expression in diverse S. cerevisiae isolates. PLoS Genetics 4: e1000223.
OpenUrl CrossRef

[25] ↵
Lang, G. I., D. P. Rice, M. J. Hickman, E. Sodergren, G. M. Weinstock, et al., 2013 Pervasive genetic hitchhiking and clonal interference in forty evolving yeast populations. Nature.

[26] ↵
Levin, B. R., J. Antonovics, and H. Sharma, 1988 Frequency-Dependent Selection in Bacterial Populations [and Discussion]. Philosophical Transactions of the Royal Society B: Biological Sciences 319: 459–472.
OpenUrl CrossRef PubMed

[27] ↵
Lobkovsky, A. E., Y. I. Wolf, and E. V. Koonin, 2011 Predictability of evolutionary trajectories in fitness landscapes. PLoS Computational Biology 7: e1002302.
OpenUrl

[28] ↵
Losos, J. B., 1998 Contingency and Determinism in Replicated Adaptive Radiations of Island Lizards. Science 279: 2115–2118.
OpenUrl Abstract/FREE Full Text

[29] ↵
Malcolm, B., K. Wilson, and B. Matthews, 1990 Ancestral lysozymes reconstructed, neutrality tested, and thermostability linked to hydrocarbon packing. Nature 345: 86–89.
OpenUrl CrossRef PubMed Web of Science

[30] ↵
Matsuoka, Y., Y. Vigouroux, M. M. Goodman, J. Sanchez G, E. Buckler, et al., 2002 A single domestication for maize shown by multilocus microsatellite genotyping. Proceedings of the National Academy of Sciences of the United States of America 99: 6080–4.
OpenUrl Abstract/FREE Full Text

[31] ↵
Molina, J., M. Sikora, N. Garud, J. M. Flowers, S. Rubinstein, et al., 2011 Molecular evidence for a single evolutionary origin of domesticated rice. Proceedings of the National Academy of Sciences of the United States of America 108: 8351–6.
OpenUrl Abstract/FREE Full Text

[32] ↵
Muschick, M., A. Indermaur, and W. Salzburger, 2012 Convergent evolution within an adaptive radiation of cichlid fishes. Current Biology 22: 2362–8.
OpenUrl CrossRef PubMed

[33] ↵
Nourmohammad, A., T. Held, and M. Lässig, 2013 Universality and predictability in molecular quantitative genetics. Current opinion in genetics & development 23: 684–93.
OpenUrl

[34] ↵
Orr, H. A., 1999 The evolutionary genetics of adaptation: a simulation study. Genetical Research 74: 207–14.
OpenUrl CrossRef PubMed Web of Science

[35] ↵
Orr, H. A., 2002 The population genetics of adaptation: the adaptation of DNA sequences. Evolution; international journal of organic evolution 56: 1317–30.
OpenUrl CrossRef PubMed Web of Science

[36] ↵
Orr, H. A., 2005 The genetic theory of adaptation: a brief history. Nature reviews. Genetics 6: 119–27.
OpenUrl CrossRef PubMed Web of Science

[37] ↵
Ortlund, E. A., J. T. Bridgham, M. R. Redinbo, and J. W. Thornton, 2007 Crystal structure of an ancient protein: evolution by conformational epistasis. Science (New York, N.Y.) 317: 1544–8.
OpenUrl

[38] ↵
Pennings, P. S., 2012 Standing Genetic Variation and the Evolution of Drug Resistance in HIV. PLoS Computational Biology 8: e1002527.

[39] ↵
Rainey, P., and M. Travisano, 1998 Adaptive radiation in a heterogeneous environment. Nature 394: 69–72.
OpenUrl CrossRef PubMed Web of Science

[40] ↵
Saltz, J. B., and S. V. Nuzhdin, 2014 Genetic variation in niche construction: implications for development and evolutionary genetics. Trends in ecology & evolution 29: 8–14.
OpenUrl

[41] ↵
Sato, A., H. Tichy, C. O’hUigin, P. Grant, B. R. Grant, et al., 2001 On the origin of Darwin’s finches. Molecular Biology and …: 299–311.

[42] ↵
Sellis, D., B. Callahan, D. A. Petrov, and P. W. Messer, 2011 Heterozygote advantage as a natural consequence of adaptation in diploids. Proceedings of the National Academy of Sciences of the United States of America 2011: 1–6.
OpenUrl

[43] ↵
Stern, D. L., 2013 The genetic causes of convergent evolution. Nature reviews. Genetics 14: 751–64.
OpenUrl CrossRef PubMed

[44] ↵
Szendro, I. G., J. Franke, J. A. G. M. de Visser, and J. Krug, 2013 Predictability of evolution depends nonmonotonically on population size. Proceedings of the National Academy of Sciences of the United States of America 110: 571–6.
OpenUrl Abstract/FREE Full Text

[45] ↵
Takahata, N., and M. Nei, 1990 Allelic genealogy under overdominant and frequency-dependent selection and polymorphism of major histocompatibility complex loci. Genetics 124: 967–78.
OpenUrl Abstract/FREE Full Text

[46] ↵
Tenaillon, O., A. Rodriguez-Verdugo, R. L. Gaut, P. McDonald, A. F. Bennett, et al., 2012 The Molecular Diversity of Adaptive Convergence. Science 335: 457–461.
OpenUrl Abstract/FREE Full Text

[47] ↵
Weinreich, D. M., N. F. Delaney, M. A. Depristo, and D. L. Hartl, 2006 Darwinian evolution can follow only very few mutational paths to fitter proteins. Science 312: 111–4.
OpenUrl Abstract/FREE Full Text