Fitness dependence of the fixation-time distribution for evolutionary dynamics on graphs

David Hathcock; Steven H. Strogatz

doi:10.1101/496380

Abstract

Evolutionary graph theory models the effects of natural selection and random drift on structured populations of competing mutant and non-mutant individuals. Recent studies have found that fixation times in such systems often have right-skewed distributions. Little is known, however, about how these distributions and their skew depend on mutant fitness. Here we calculate the fitness dependence of the fixation-time distribution for the Moran Birth-death process in populations modeled by two extreme networks: the complete graph and the one-dimensional ring lattice, obtaining exact solutions in the limit of large network size. We find that with non-neutral fitness, the Moran process on the ring has normally distributed fixation times, independent of the relative fitness of mutants and non-mutants. In contrast, on the complete graph, the fixation-time distribution is a fitness-weighted convolution of two Gumbel distributions. When fitness is neutral the fixation-time distribution jumps discontinuously and becomes highly skewed on both the complete graph and the ring. Even on these simple networks, the fixation-time distribution exhibits rich fitness dependence, with discontinuities and regions of universality. Extensions of our results to two-fitness Moran models, times to partial fixation, and evolution on random networks are discussed.

I. INTRODUCTION

Reproducing populations undergo evolutionary dynamics. Mutations can endow individuals with a fitness advantage, allowing them to reproduce more quickly and outcompete non-mutant individuals [1]. Two natural questions arise: If a single mutant individual is introduced into a population, what is the probability that the mutant lineage will spread and ultimately take over the population (an outcome known as fixation)? And if fixation occurs, how much time does it take?

These questions have been addressed, in part, by evolutionary graph theory, which studies evolutionary dynamics in structured populations. Thanks to this approach, fixation probabilities are now well understood for various models on various networks [2–12]. Less is known about fixation times. Given a model of evolutionary dynamics, one would like to predict the mean, variance, and ideally the full distribution of its fixation times.

Of these quantities, the mean is the best understood. Numerical and analytical results exist for mean fixation times on both deterministic [4, 6, 11–17] and random [16–19] networks. Yet although mean fixation times are important to study, the information they provide can be misleading, because fixation-time distributions tend to be broad and skewed and hence are not well characterized by their means alone [11, 20–23]. Initial analytical results have determined the asymptotic fixation-time distribution for several simple networks, but only when the relative fitness of the mutants is infinite [24–26]. For other values of the relative fitness, almost nothing is known. Preliminary results suggest that at neutral fitness (when mutants and non-mutants are equally fit), the fixation-time distribution becomes highly right-skewed [26].

In this paper we investigate the full fitness dependence of fixation-time distributions for the Moran process [27, 28], a simple model of evolutionary dynamics. In the limit of large network size, we derive asymptotically exact results for the fixation-time distribution and its skew for two network structures at opposite ends of the connectivity spectrum: the complete graph, in which every individual interacts with every other individual; and the one-dimensional ring lattice, in which each individual interacts only with its nearest neighbors on a ring.

The specific model we consider is the Moran Birth-death (Bd) process [29], defined as follows. On each node of the network there is an individual, either mutant or non-mutant. The mutants have a fitness level r, which designates their relative reproduction rate compared to non-mutants. When r > 1, the mutants have a fitness advantage, whereas when r = 1 they have neutral fitness. At each time step we choose a node at random, with probability proportional to its fitness, and choose one of its neighbors with uniform probability. The first individual gives birth to an offspring of the same type. That offspring replaces the neighbor, which dies. The model population is updated until either the mutant lineage takes over (in which case fixation occurs) or the mutant lineage goes extinct (a case not considered here).

As mentioned above, the distribution of fixation times is often skewed. The skew emerges from the stochastic competition between mutants and non-mutants through multiple mechanisms. For instance, when the mutants have neutral fitness the process resembles an unbiased random walk. We find that the asymptotic fixation-time distribution for a simple random walk is only skewed when the walk is unbiased. The lack of bias allows for occasional long recurrent excursions (that are suppressed in biased walks) during successful runs to fixation. The fixation-time distribution is strongly skewed because there are many ways to execute such walks that are much longer than usual, but comparably few ways for mutants to sweep through the population much faster than usual.

Depending on network structure, the fixation-time skew can also come from a second, completely separate mechanism, which involves characteristic slowdowns that arise because individuals do not discriminate between mutants and non-mutants during the replacement step of the Moran process. For example, when very few non-mutants remain, the mutants can waste time replacing each other. These slowdowns are reminiscent of those seen in a classic problem from probability theory, the coupon collector’s problem, which asks: How long does it take to complete a collection of N distinct coupons if a random coupon is received at each time step? The intuition for the long slowdowns is clear: when nearly all the coupons have been collected, it can take an exasperatingly long time to collect the final few, because one keeps acquiring coupons that one already has. The problem was first solved by Erdős and Rényi, who proved that for large N, the time to complete the collection has a Gumbel distribution [30]. In fact, for evolutionary processes with infinite fitness there exists an exact mapping onto coupon collection [25, 26]. Remarkably, while this correspondence breaks down for finite fitness, the coupon collection heuristic still allows us to predict correct asymptotic fixation-time distributions for non-neutral fitness.

In the following sections we show that for N ≫ 1, the neutral-fitness Moran process on the complete graph and the one-dimensional ring lattice has highly skewed fixation-time distributions, and we solve for their cumulants exactly. For non-neutral fitness the fixation-time distribution is normal on the lattice and a weighted convolution of Gumbel distributions on the complete graph. These results are novel; apart from the infinite fitness limit and some partial results at neutral fitness (noted below), the fitness dependence of these distributions was previously unknown.

We begin by developing a general framework for computing fixation-time distributions and cumulants of birth-death Markov chains, and then apply it to the Moran process to prove the results above. We also consider the effects of truncation on the process and examine how long it takes to reach partial, rather than complete, fixation. The fixation-time distributions have rich dependence on the fitness level and the degree of truncation, with both discontinuities and regions of universality. To conclude, we discuss extensions of our results to two-fitness Moran models and to more complicated network topologies.

II. GENERAL THEORY FOR BIRTH-DEATH MARKOV PROCESSES

For simplicity, we restrict attention to network topologies and initial mutant populations for which the probability of adding or removing a mutant in a given time step depends only on the number of existing mutants, not on where the mutants are located on the network. The state of the system can therefore be defined in terms of the number of mutants, m = 0, 1, …, N, where N is the total number of nodes on the network. The Moran process is then a birth-death Markov chain with N + 1 states, transition probabilities b_m and d_m determined by the network structure, and absorbing boundaries at m = 0 and m = N. In this section we review several general analytical results for absorbing birth-death Markov chains, explaining how they apply to fixation times in evolutionary dynamics. We also develop an approach, which we call visit statistics, that enables analytical estimation of the asymptotic fixation time cumulants.

On more complicated networks, the probability of adding or removing a mutant depends on the configuration of existing mutants. For some of these networks, however, the transition probabilities can be accurately estimated using a mean-field approximation [19, 23, 25, 26]. Then, to a good approximation, the results below apply to such networks as well.

A. Eigendecomposition of the birth-death process

Assuming a continuous-time process, the state of the Markov chain described above evolves according to the master equation, where p(t) is the probability of occupying each state of the system at time t and Ω is the transition rate matrix, with columns summing to zero. In terms of the transition probabilities b_m and d_m, the entries of Ω are where m and n run from 0 to N, δ_m,n is the Kronecker delta, and b₀ = d₀ = b_N = d_N = 0. The final condition guarantees the system has absorbing boundaries with stationary states p_m = δ_m,0 and p_m = δ_m,N when the population is homogeneous. Thus we can decompose the transition matrix into stationary and transient parts, defining the transient part Ω_tr as in Eq. (2), but with m, n = 1, …, N − 1. The transient transition matrix acts on the transient states of the system, denoted p_tr(t). The eigenvalues of Ω_tr are real and strictly negative, since probability flows away from these states toward the absorbing boundaries. To ease notation in the following discussion and later applications, we shall refer to the positive eigenvalues of −Ω_tr as the eigenvalues of the transition matrix, denoted λ_m, where m = 1, … , N − 1.

From the perspective of Markov chains, the fixation time T is the time required for first passage to state m = N, given m₀ initial mutants, . At time t, the probability that state N has been reached (i.e., the cumulative distribution function for the first passage times) is simply , where is the fixation probability given m₀ initial mutants. The distribution of first passage times is therefore . Since we normalize by the fixation probability, this is precisely the fixation-time distribu-tion conditioned on reaching N.

The solution to the transient master equation is the matrix exponential p_tr(t) = exp(Ω_trt) · p_tr(0), yielding a fixation-time distribution [31]. If we assume one initial mutant m₀ = 1 this becomes . The matrix exponential can be evaluated in terms of the eigenvalues λ_m by taking a Fourier (or Laplace) transform (for details, see Ref. [22]). For a single initial mutant, the result is that the fixation time T has a distribution f_T (t) given by

This formula holds as long as the eigenvalues λ_m are distinct, which for birth-death Markov chains occurs when b_m and d_m are non-zero (except at the absorbing boundaries) [32]. Generalizations of this result for arbitrarily many initial mutants have also recently been derived, in terms of eigenvalues of the transition matrix and certain sub-matrices [22].

The distribution in Eq. (3) is exactly that corresponding to a sum of exponential random variables with rate parameters λ_m. The corresponding cumulants equal . As our primary interest is the asymptotic shape of the distribution, we normalize T to zero mean and unit variance and study (T−μ)/σ, where μ and σ denote the mean and standard deviation of T. The standardized distribution is then given by σf_T(σt + μ). The rescaled fixation time has cumulants which, for many systems including those considered below, are finite as N → ∞. When the limit exists, we define the asymptotic cumulants by κ_n = lim_N→∞ κ_n(N). In particular, because we have standardized our distribution, the third cumulant κ₃ is the skew. In practice the limit N → ∞ is taken by computing the leading asymptotic behavior of both the numerator and denominator in Eq. (4). As we will see below the scaling of these terms with N depends on both the population network structure and the mutant fitness (see also asymptotic analysis in Supplemental Material, Sections S3 & S4 [33]). This approach allows us to characterize the asymptotic shape of the fixation-time distribution in terms of the constants κ_n. Since λ_m > 0, it is clear from this expression that, for finite N, the skew and all higher order cumulants must be positive, in agreement with results for random walks with non-uniform bias [34]. As N → ∞ this is not necessarily true; in some cases the cumulants vanish.

The eigendecomposition gives the fixation-time distribution and cumulants in terms of the non-zero eigenvalues of the transition matrix. In general the eigenvalues must be found numerically, but in cases where they have a closed form expression the asymptotic form of the cumulants and distribution can often be obtained exactly.

B. Analytical cumulant calculation: Visit statistics

In this section we develop machinery to compute the cumulants of the fixation time analytically without relying on matrix eigenvalues. For this analysis, we specialize to cases where b_m/d_m = r for all m, relevant for the Moran processes considered below. These processes can be thought of as biased random walks overlaid with non-constant waiting times at each state.

It is helpful to consider the Markov chain conditioned on hitting N, with new transition probabilities and so that the fixation probability . If X_t is the state of the system at time t, then with defined analogously. We derive explicit expressions for and in Supplemental Material, Section S1 [33]. Conditioning is equivalent to a similarity transformation on the transient part of the transition matrix: , where S is diagonal with S_mm = 1 − 1/r^m. Furthermore, since b_m/d_m = r, we can decompose Ω_tr = Ω_RWD, where D is a diagonal matrix, D_mm = b_m + d_m, that encodes the time spent in each state and Ω_RW is the transition matrix for a random walk with uniform bias,

Applying the results of the previous section and using the fact that the columns of Ω sum to zero, we can write there fixation-time distribution of the conditioned Markov chain as , where 1 is the row vector containing all ones. This distribution has characteristic function [31] and the derivatives (−i)ⁿϕ⁽ⁿ⁾(0) give the moments of T in terms of . This inverse has a nice analytical form because S and D are diagonal and Ω_RW is tridiagonal Toeplitz. We call this approach visit statistics because the elements V_ij of encode the average number of visits to state i starting from state j.

Each power of Eq. (7) produces products of (b_i + d_i) that arise in linear combinations determined by the visit numbers V_ij. Therefore, the cumulants of the fixation time have the general form where are weighting factors based on the visit statistics of the biased random walk, given the initial number of mutants m₀. In what follows, we always assume m₀ = 1 and suppress the dependence of the weighting factors on initial condition, writing instead. A detailed derivation of Eq. (8) and explicit expressions for and are given in the Appendix below.

To the best of our knowledge this representation of the fixation-time cumulants has not been previously derived, although a similar approach was recently used to computemean fixation times for evolutionary dynamics on complex networks [19]. This expression is equivalent to the well-known recurrence relations for absorption-time moments of birth-death processes [21, 35] but is easier to handle asymptotically, and can be useful even without explicit expressions for . Estimating the sums in Eq. (8) allows us to compute the asymptotic fixation time cumulants exactly.

C. Recurrence relation for fixation-time moments

Evaluation of the eigenvalues of the transition matrix for large systems can be computationally expensive, with the best algorithms having run times quadratic in matrix size. Numerical evaluation of the expression given in Eq. (8) is even worse, as it requires summing elements. If only a finite number of fixation time cumulants (and not the full distribution) are desired, there are better numerical approaches. Using standard methods from probability theory [36], we derive a recurrence relation that allows numerical moment computation with run time linear in system size N. For completeness we provide the full derivation of the reccurence for the fixation-time skew in Supplemental Material, Section S2 [33].

D. Equivalence between advantageous and disadvantageous mutations

In the following applications, we will generally speak of the mutants as having a fitness advantage, designated by the parameter r > 1. Our results, however, can be immediately extended to disadvantageous mutations. In particular, the fixation-time distributions (conditioned on fixation occurring) for mutants of fitness r and 1/r are identical. When a mutant with fitness 1/r is introduced into the population (and eventually reaches fixation), the non-mutants are r times as fit as the mutants. Therefore, this system is equivalent to another system that starts with N − 1 fitness r mutants which eventually die out (the mutants in the former system are the non-mutants in the latter). It has been shown that the times to go from one initial mutant to fixation (m = 1 → m = N) and from N − 1 initial mutants to extinction (m = N − 1 → m = 0) have identical distribution [22]. Thus indeed, the conditioned fixation-time distributions are identical for mutants of fitness r and 1/r. Of course the fixation probability is very different in the two cases: for the disadvantageous mutations it approaches 0 for large N [5].

III. ONE-DIMENSIONAL LATTICE

We now specialize to Moran Birth-death (Bd) processes, starting with the one-dimensional (1D) lattice. We assume periodic boundary conditions, so that the N nodes form a ring. The mutants have relative fitness r, meaning they give birth r times faster, on average, than non-mutants do.

Starting from one mutant, suppose that at some later time m of the N nodes are mutants. On the 1D lattice, the population of mutants always forms a connected arc, with two mutants at the endpoints of the arc. Therefore, the probability b_m of increasing the mutant population by one in the next time step is the probability of choosing a mutant node at an endpoint to give birth, namely 2r/(rm + N − m), times the probability 1/2 that the neighboring node to be replaced is not itself a mutant. (The latter probability equals 1/2 because there are two neighbors to choose for replacement: a mutant neighbor on the interior of the arc and a non-mutant neighbor on the exterior. Only the second of these choices produces an increase in the number of mutants.) Multiplying these probabilities together we obtain where the probability d_m of decreasing the mutant population by one is found by similar reasoning. Note that this derivation fails for m = 1 (m = N − 1) when the arc of mutants (non-mutants) contains only one node, but one can check Eq. (9) still holds for these cases. These quantities play the role of transition probabilities in a Markov transition matrix. The next step is to find the eigenvalues of that matrix.

A. Neutral fitness

First we work out the eigenvalues for the case of neutral fitness, r = 1. In this case, the transition probabilities are equal, b_m = d_m = 1/N, and independent of m. Therefore, the Moran process is simply a random walk, with events occurring at a rate of 2/N per time step.

The associated transition matrix is tridiagonal Toeplitz, which has eigenvalues given by

Applying Eq. (4) and computing the leading asymptotic form of the sums (see Supplemental Material, Section S3 [33]), we find that as N → ∞, the fixation-time distribution has cumulants where ζ denotes the Riemann zeta function. In particular, the skew , as previously calculated by Ottino-Löffler et al. [26] via martingale methods.The other cumulants (and characteristic function below) haven’t previously been computed for the Bd process on the 1D lattice. The largeness of the skew stems from the recurrent property of the random walk. As N → ∞, long walks with large fixation times become common and the system revisits each state infinitely often [37].

Knowledge of the cumulants allows us to obtain the exact characteristic function of the fixation-time distribution:

Although we cannot find a simple expression for the distribution itself, we can efficiently evaluate it by taking the inverse Fourier transform of the characteristic function numerically. Figure 1(a) shows that the predicted fixation-time distribution agrees well with simulations.

FIG. 1.

Fixation-time distributions on the 1D lattice obtained from 10⁶ simulation runs. All distributions are standardized to zero mean and unit variance. Solid curves are the theoretical predictions. Shown are the fixation-time distributions for (a) a 1D lattice of N = 100 nodes with neutral fitness r = 1 and (b) a 1D lattice of N = 5000 nodes with mutant fitnesses r = 1.1 and r = 2.0. For the neutral fitness case, the theoretical distribution was generated by numerical inverse Fourier transform of the characteristic function (Eq. (12)). The r = 1.1 distribution is slightly but visibly skewed due to finite network size.

B. Non-neutral fitness

Next, consider r ≠ 1 with the transition probabilities given by Eq. (9). Then the eigenvalues of the transition matrix are no longer expressible in closed form. If r is not too large, however, the probabilities b_m and d_m do not vary dramatically with m, the number of mutants. In particular, b_m ~ 1/N for all m when N is large. Therefore, as a first approximation we treat the Bd process on a 1D lattice as a biased random walk with b_m = r/(1 + r) and d_m = 1/(1+r). The eigenvalues of the corresponding transition matrix are

The cumulants again involve sums , which can be approximated in the limit N → ∞ by,

Since the integral is independent of N and converges for r ≠ 1, each of the sums scales linearly: S_n ~ N. Thus, using Eq. (4), we see that all cumulants past second order approach 0,

Hence the fixation-time distribution is asymptotically normal, independent of fitness level.

By evaluating the integrals in Eq. (14), we can more precisely compute the scaling of the cumulants. For the skew we find

The integral approximation becomes accurate when the first term in the sums S_n becomes close to the value of the integrand evaluated at the lower bound (x = 0). The fractional difference between these quantities is

Then we have Δ ≪ 1 when N ≫ N_c where (assuming r is near 1). For the skew, we require the sums with n = 2 and 3, giving N_c ≈ 10/(r − 1).

The above calculation fails for r ≫ 1, because when r = ∞ the transition probabilities b_m = 1/m have different asymptotic behavior as N → ∞. In particular, more time is spent waiting at states with large m. The process still has normally distributed fixation times [26], but the skew becomes for large N. Notice that the coefficient is different from that given by the infinite-r limit of Eq. (16), . We conjecture that there is a smooth crossover between these two scaling laws with the true skew given approximately by for some exponent q, where κ₃ is the skew given in Eq. (16). For small r this ansatz has skew similar to that of a random walk, but captures the correct large-r limit. We do not have precise theoretical motivation for this ansatz, but as discussed below, it works quite well.

Numerical calculation of the skew for the 1D lattice was performed using the recurrence relation method discussed in Section II C. The results are shown in Fig. 2 for a few values of r. This calculation confirms our initial hypothesis, near neutral fitness the waiting times are uniform enough that the process is well approximated by a biased random walk and the skew approaches 0, scaling in excellent agreement with Eq. (16). When N ≪ N_c, the bias is not sufficient to give the mutants a substantial advantage: the process is dominated by drift and the fixation-time distribution has large skew κ₃ ≈ 1.807, as found in the preceding section. For N ≫ N_c, selection takes over, the cumulants approach 0, and the distribution becomes normal. A similar crossover appears in the study of the fixation probability, where a transition from φ₁ ~ 1/N to φ₁ ~ 1−1/r is seen when N passes a critical system size (that is slightly different than N_c). For large fitness r ≫ 1, the ansatz Eq. (19) captures the scaling behavior if we use an exponent q = 1/2. Direct numerical simulations of the process confirm that, for any r > 1, the fixation time on the 1D lattice has an asymptotically normal distribution [Fig. 1(b)].

FIG. 2.

Scaling of the skew of the fixation-time distribution on the 1D lattice with non-neutral fitness. Data points show numerical calculation of the skew for various fitness levels. The solid lines are the predicted scaling given in Eq. (19) with exponent q = 1/2 for each value of fitness r. For small N (and small enough r), the skew is that of a random walk, namely κ₃ = 1.807, as shown by the dashed line. For large N, the skew with an r-dependent coefficient.

The random walk approximation allows us to find the asymptotic scaling of the fixation-time cumulants, but ignores the heterogeneity of waiting times present in the Moran process. Using visit statistics we can compute the cumulants exactly with Eq. (8) and rigorously prove they vanish as N → ∞, verifying that the waiting times have no influence on the asymptotic form of the distribution. For details, see Supplemental Material, Section S3 [33].

Our analysis of the 1D lattice reveals an intriguing universality property of its fixation-time distribution. For any value of relative fitness r other than r = 1, the fixation-time distribution approaches a normal distribution as N → ∞. Thus, for r ≠ 1 the asymptotic shape of the distribution is universal and independent of r (though bear in mind, its mean and variance do depend on r).

When r = 1, corresponding to precisely neutral fitness, the unbiased random walk yields a qualitatively different distribution with considerably larger skew. This qualitative change as r passes through unity leads to a discontinuous jump in the skew at r = 1.

As one might expect, the discontinuity stems from passage to the infinite-N limit. For finite but large N, the distribution varies continuously with r, though our numerical results indicate that the sharp increase in skew still occurs very close to r = 1. We will see in the next section that the discontinuity and highly skewed distribution at neutral fitness persist when we alter the network structure from a locally connected 1D lattice to a fully connected complete graph.

IV. COMPLETE GRAPH

Next we consider the Moran process on a complete graph, useful for modeling well-mixed populations in which all individuals interact. By similar reasoning to above, given m mutants the probability of adding a mutant in the next time step is while the probability of subtracting a mutant is

Interestingly, as we will see in this section, these transition probabilities give rise to a fitness dependent fixation-time distribution, in stark contrast to the universality of the normal distribution observed on the 1D lattice.

A. Neutral Fitness

Again we begin with neutral fitness r = 1. Now b_m = d_m = (Nm − m²)/(N² − N). The eigenvalues of this transition matrix also have a nice analytical form:

The asymptotic form of the sums , can be found by taking the partial fraction decomposition of (λ_m)⁻ⁿ and evaluating each term individually. The resulting cumulants are

Our knowledge of the eigenvalues also allows us to obtain a series expression for the asymptotic distribution using Eq. (3). For N → ∞ the standardized distribution is, where to leading order in N the mean and standard deviation are μ = N² and σ = c_σN², with . This distribution was previously found using a different approach by Kimura, who also computed the first few fixation-time moments [38]. We have extended these results, obtaining the cumulants to all orders. Figure 3 shows that the predicted asymptotic distribution agrees well with numerical experiments.

FIG. 3.

Fixation-time distributions on the complete graph with N = 100 nodes and neutral fitness (r = 1) obtained from 10⁶ simulation runs. The distribution is standardized to zero mean and unit variance. The solid curve is the theoretical distribution obtained by numerically evaluating the infinite series in Eq. (24) for each value of t.

The numerical value of the fixation-time skew for the Birth-death process on the complete graph is , slightly less than that for the 1D lattice. This decrease is the result of two competing effects contributing to the skew. First, since the birth and death transition probabilities are the same, the process is a random walk, which has a highly skewed fixation-time distribution, as shown above. The average time spent in each state, however, varies with m. For instance, when m = 1 or N − 1, b_m → 0 for large N. But if m = αN for some constant 0 < α < 1 independent of N, then b_m approaches a constant.

Intuitively, the beginning and end of the mutation-spreading process are very slow because the transition probabilities are exceedingly small. To start, the single mutant must be selected by chance to give birth from the N available nodes, a selection problem like finding a needle in a haystack. Similarly, near fixation the reproducing mutant must find and replace one of the few remaining non-mutants, again choosing it by chance from an enormous population.

The characteristic slowing down at certain states is reminiscent of “coupon collection”, as discussed earlier. Erdős and Rényi proved that for large N, the normalized time to complete the coupon collection follows a Gumbel distribution [30], which we denote by Gumbel(α, β) with density

For the Moran process, each slow region is produced by long waits for the random selection of rare types of individuals: either mutants near the beginning of the process or non-mutants near the end. In the next section we show that the two coupon collection regions of the Bd process on a complete graph lead to fixation-time distributions that are convolutions of two Gumbel distributions. In the case of neutral fitness, these Gumbel distributions combine with the random walk to produce a new highly skewed distribution with cumulants given by Eq. (23).

B. Non-neutral fitness

We saw in Section III B that when the average time spent in each state is constant or slowly varying the fixation-time distribution is asymptotically normal. Birth-death dynamics on the complete graph, however, exhibit coupon collection regions at the beginning and end of the process, where the transition probabilities vanish. We begin this section with a heuristic argument that correctly gives the asymptotic fixation-time distribution in terms of independent iterations of coupon collection.

Differentiating b_m with respect to m, we find the slope near m = 0 is (r + 1)/N, while the slope near m = N has magnitude (r + 1)/(rN) for N ≫ 1. The transition rates approach zero at each of these points, so we expect behavior similar to coupon collection giving rise to two Gumbel distributions. Since the slope is greater for m near 0 than for m near N, the Moran process completes its coupon collection faster near the beginning of the process than near fixation.

This heuristic suggests that the asymptotic fixation time should be equal in distribution to the sum of two Gumbel random variables, one weighted by r, which is the ratio of the slopes in the coupon collection regions. Specifically, if T is the fixation time with mean μ and variance σ², we expect where means convergence in distribution for large N. Here G and G′ denote independent and identically distributed Gumbel random variables with zero mean and unit variance. It is easy to check that the correct distribution is Gumbel, where γ ≈ 0.5772 is the Euler-Mascheroni constant.

Let us make this argument more rigorous. Previous theoretical analysis showed that in the infinite fitness limit, the fixation time has an asymptotically Gumbel distribution [26]. This result can be recovered within our framework, since when r = ∞ it follows that d_m = 0, so the eigenvalues of the transition matrix are just λ_m = b_m = (N − m)/(N − 1) and the cumulants can be directly calculated using Eq. (4).

For large (but not infinite) fitness, the number of mutants is monotonically increasing, to good approximation, since the probability that the next change in state increases the mutant population is r/(1 + r) ≈ 1. The time spent waiting in each state, however, changes dramatically, especially near m = 1. Here, b_m → 0 for large N, in stark contrast to the infinite fitness system where b₁ → 1. The time spent at each state, t_m is an exponential random variable, ε(b_m + d_m). In this approximation each state is visited exactly once, so the total fixation time is a sum of these waiting times:

But this sum of exponential random variables has density given by Eq. (3), with the substitution λ_m → b_m + d_m. Thus, the cumulants of (T − μ)/σ are which are exactly the cumulants corresponding to the sum of Gumbel random variables given in Eq. (26). In the limit r → ∞, the first term in Eq. (28) becomes 1, and the cumulants are those for a single Gumbel distribution, in agreement with previous results [26].

Remarkably, these cumulants are exact for any r > 1, not just in the large-r limit. We can see this directly for the skew κ₃ using the visit statistics approach, computing the asymptotic form of Eq. (8) with the complete graph transition probabilities, Eqs. (20) and (21). Details of the asymptotic analysis are provided in Supplemental Material, Section S4 [33]. Numerical simulations of the Moran process corroborate our theoretical results. As shown in Fig. 4, for r = 1.1 and r = 5 the agreement between simulated fixation times and the predicted convolution of Gumbel distributions is excellent, at least when N is sufficiently large. Again, our calculations show a discontinuity in the fixation-time distribution at r = 1. In particular, the r → 1 limit of the cumulants for non-neutral fitness in Eq. (28) is not the same as the cumulants for neutral fitness found in the preceding section [Eq. (23)].

FIG. 4.

Fixation-time distributions on the complete graph with N = 5000 nodes and non-neutral fitness (r > 1) obtained from 10⁶ simulation runs. All distributions are standardized to zero mean and unit variance. Solid curves are the theoretical predictions obtained by numerical convolution of two Gumbel distributions, one weighted by r. Distributions are shown for (a) r = 1.1 and (b) r = 5.0. For larger r, the distribution has larger skew and a slightly sharper peak.

For smaller networks, it is fascinating to see how the results converge to the asymptotic predictions as N grows. Figure 5 shows how the skew of the fixation-time distribution depends on r and N for the complete graph. As discussed in Section II D, the fixation-time distributions for these systems are invariant under r → 1/r. Therefore we show the skew for all r > 0, to emphasize the intriguing behavior near neutral fitness, where r = 1. We find that non-uniform convergence of the fixation-time skew leads to the discontinuity predicted at r = 1. For finite N, the skew is a non-monotonic function of r and has a minimum value at some fitness r_min(N). Furthermore, at fixed r, the convergence to the N = ∞ limit is itself non-monotone. Though beyond the scope of the current study, further investigation of this finite-N behavior would be worth pursuing.

FIG. 5.

Fitness dependence of fixation-time skew for the Moran Birth-death process on the complete graph. The skew is shown for r ≥ 0 and is invariant under r → 1/r. For finite N, the skew does not have a discontinuity, but does show non-monotonic dependence on fitness r. In particular, for a given N, there is a certain fitness level with minimum skew. As N → ∞, we see non-uniform convergence to the predicted skew given by κ₃ in Eq. (28), leading to the discontinuity at r = 1. Moreover, for fixed r, the convergence to the N = ∞

V. PARTIAL FIXATION TIMES

In many applications, we may be interested in the time to partial fixation of the network. For instance, considering cancer progression [39–41] or the incubation of infectious diseases [26], symptoms can appear in a patient even when a relatively small proportion of cells are malignant or infected. We therefore consider T_α, the total time to first reach αN mutants on the network, where 0 < α < 1. The methods developed in Section II apply to these processes as well. For the eigendecomposition approach we instead use the sub-matrix of Ω_tr containing the first αN rows and columns. In calculations involving the numerical recurrence relations or visit statistics, we simply cut the sums off at αN instead of N and for the latter, replace with .

A. One-dimensional lattice

Truncating the Moran Bd process on the 1D lattice by a factor α has no effect on the asymptotic shape of the fixation-time distributions. In both the neutral fitness system and the random walk approximation to the non-neutral fitness system, the transition matrix has no explicit dependence on the state or system size [aside from proportionality factors that cancel in Eq. (4)]. Thus, the eigenvalues are identical to those calculated previously, but correspond to a smaller effective system size αN. Taking the limit N → ∞ therefore yields the same asymptotic distributions found in Section III.

B. Complete graph: truncating coupon collection

The complete graph exhibits more interesting dependence on truncation. Since the transition probabilities have state dependence, the eigenvalues change with truncation (they don’t correspond to the same system with smaller effective N). Our intuition from coupon collection, however, lets us predict the resulting distribution.

First consider non-neutral fitness. Then there are two coupon collection stages, one near the beginning and another near the end of the process, and together they generate a fixation-time distribution that is a weighted convolution of two Gumbel distributions. The effect of truncating the process near its end should now become clear: it simply removes the second coupon collection. The truncated process stops before the mutants have to laboriously find and replace the last remaining non-mutants. Therefore, we intuitively expect the fixation time for non-neutral fitness to be distributed according to a single Gumbel distribution, regardless of fitness level.

The only exception occurs if r = ∞; then no coupon collection occurs at the beginning of the process either, as the lone mutant is guaranteed to be selected to give birth in the first time step, thanks to its infinite fitness advantage. Thus, when fitness is infinite and the process is truncated at the end, both coupon collection phases are removed and the fixation times are normally distributed.

Similar reasoning applies to the Birth-death process with neutral fitness. It also has two coupon collection regions, one of which is removed by truncation. In this case, however, the random walk mechanism contributes to the skew of the overall fixation-time distribution, combining non-trivially with the coupon collection- like process. We find that the skew of the fixation time depends on the truncation factor α, varying between when α = 1, and when α = 0. A derivation of this α → 0 limit is given in Supplemental Material, Section S4 [33].

C. Summary of main results

The main results from Sections III–V are summarized in Fig. 6, which shows the asymptotic fitness dependence of fixation-time skew for each network considered in this paper. We again show the skew for all r > 0 (not just r > 1) to emphasize the discontinuities at zero, neutral, and infinite fitness. On the 1D lattice, independent of the truncation factor α, the Bd process has normally distributed fixation times, except at neutral fitness where the distribution is highly skewed. The complete graph fixation-time distributions are the weighted convolution of two Gumbel distributions for r ≠ 1, again with a highly skewed distribution at r = 1. With truncation by a factor α < 1, the distribution for the complete graph is Gumbel for 1 < r < ∞, and normal for r = ∞. With neutral fitness the fixation distribution is again highly skewed, with skew dependent on the truncation factor α.

FIG. 6.

Variation of fixation-time skew κ₃ with fitness level r and truncation factor α for different network structures. (a) The skew of the fixation-time distribution is plotted versus fitness for the 1D lattice (black solid line), complete graph (red dashed line), and complete graph with truncation (green dotted line). The skew is shown for all r ≥ 0 and is invariant under r → 1/r. When r ≠ 1 and r < ∞, the fixation-time distribution is normal for the 1D lattice, and hence has zero skew (κ₃ = 0). The distribution becomes a fitness-weighted convolution of Gumbel distributions for the complete graph, and a single Gumbel distribution for the complete graph with truncation (for any α < 1). Each curve jumps discontinuously at r = 1, where the distributions become highly skewed with κ₃ > 1.5. The inset shows a blow-up of the neutral fitness results, specifying the skew for each case. On the complete graph with truncation, the skew is continuously variable at r = 1, taking on an interval of values between when α = 1, and when α = 0. This range is indicated by the green vertical line. The truncated fixation time on the complete graph has a second discontinuity at r = ∞ (shown here at r = 0, by exploiting the r → 1/r invariance). At this discontinuity the functional form of the distribution jumps from Gumbel to normal. (b) The fixation-time skew for the complete graph with neutral fitness, plotted versus the truncation factor α. These points correspond to the green vertical line in panel (a) at r = 1.

VI. EXTENSIONS

It is natural to ask whether our results are generic; do the same fixation-time distributions appear in other models of evolutionary dynamics? Here we explore the robustness of our results to various changes in the model update dynamics and the network topology. The main finding is that our results are insensitive to these changes, at least qualitatively. The distributions typically remain right-skewed and even follow the same functional forms derived above.

A. Other update dynamics

1. Two-fitness Moran process

The Moran Bd processes considered above require a single fitness level, designating the relative reproduction rates between mutants and non-mutants. Another common model is the Moran Birth-Death (BD) process [29], which has a second fitness level measuring the resilience of mutants versus non-mutants during the replacement step [9]. Taking this into account, when a mutant or non-mutant is trying to replace its neighbors, mutants are replaced with probability proportional to . Taking returns to the model used throughout the preceding sections. The two-fitness model may better capture the complexity of real-world evolutionary systems but does not generally give rise to qualitatively different fixation-time distributions. For brevity, we simply discuss the resulting fixation-time distributions for the BD model. Details supporting the results quoted below are provided in the Supplemental Material, Section S5 [33].

Writing down the transition probabilities for the Moran BD process, we find that as N → ∞. This motivates the definition of an effective fitness level, . When r_eff ≠ 1 our results from above translate to this model. On the 1D lattice the fixation times are normally distributed, while on the complete graph the fixation time distribution is a weighted convolution of Gumbel distributions , with relative weighting (instead of r). When r_eff = 1, the process is asymptotically unbiased and we expect a highly skewed fixation-time distribution. This is indeed the case, although numerical calculations indicate there is an entire family of distributions, dependent on .

It is interesting to contrast the above observations with a result in evolutionary dynamics known as the isothermal theorem. The theorem states that for , the Moran process on a large class of networks, known as isothermal graphs, has fixation probability identical to the complete graph [5]. Recent work has shown that this breaks down if ; the fixation probability develops new network dependence [9]. In contrast, even isothermal graphs (including the complete graph and 1D lattice) have fixation-time distributions that depend on network structure. The two-fitness BD model breaks the universality in fixation probabilities predicted by the isothermal theorem, but leads to the same family of fixation distributions that arise due to network structure.

2. The Death-Birth Moran process

A two-fitness Death-Birth (DB) Moran process [29] is also frequently used to study evolutionary dynamics. In this model, the birth and death events are reversed in order. At each time step a node is chosen at random, with probability proportional to , and one of its neighbors is chosen with probability proportional to r. The first individual dies and is replaced by an offspring of the same type as the neighbor. The process continues until the mutation either reaches fixation or goes extinct.

The BD and DB processes obey a duality property [9]. Starting from the BD transition probabilities, if we swap the two fitness levels and substitute m → N − m (which swaps mutants and non mutants), we obtain the DB transition probabilities. Therefore, the transition matrix for the DB model is identical to that for the corresponding dual BD process, but has the main-, super-, and sub-diagonal entries reversed in order. This leaves the matrix eigenvalues unchanged, so that the DB process has identical fixation-time distributions to those given in the preceding section for the dual BD process.

In principle, the correspondence between DB and BD fixation times could break down for the truncated process considered in Section V. In practice, however, the results are again generally identical. For the truncated DB process, the fixation times on the 1D lattice remain normally distributed. On the complete graph, one of two coupon collection regions is removed by truncation leading to fixation-times following a single Gumbel distribution.

One exception, where the dual models yield different results under truncation, is at infinite fitness. As in Section V, at infinite fitness (r → ∞) the BD model performs a single coupon collection near fixation, which is cut off by truncation, leading to a normal fixation-time distribution. In contrast, in the dual infinite-fitness DB model the coupon collection occurs at the beginning of the process and even under truncation the Gumbel fixation-time distribution is preserved. This effect was previously observed by Ottino-Löffler et al. [26].

B. Other networks: Approximate results via mean-field transition probabilities

While the 1D lattice and complete graph provide illustrative exactly solvable models of the fitness dependence of fixation-time distributions, other networks may be more realistic. On more complicated networks the analytical tools developed here fail because the transition probabilities (the probability of adding or subtracting a mutant given the current state) depend on the full configuration of mutants, not just the number of mutants. Such systems can still be modeled as a Markov process, but the state space becomes prohibitively large. Fortunately, for certain networks the effect of different configurations can be averaged over, giving a mean-field approximation to the transition probabilities. This approach has been used on a variety of networks to calculate fixation times [19, 23, 25, 26]. In this section we discuss how such mean-field approaches can be used to calculate fixation-time distributions for evolution on several different networks.

1. Erdős-Rényi random graph

We start with the Erdős-Rényi random graph, for which the mean-field transition probabilities were recently estimated [19]. The result is identical to the complete graph probabilities [Eqs. (20)–(21)] up to a constant factor 1 − 2/Np, which depends on the edge probability p for the network. This correction is important for computing the mean fixation time, but does not affect the shape of the fixation-time distribution, since proportionality factors cancel in Eq. (4). Therefore we expect the asymptotic fixation-time distribution will be a weighted sum of two Gumbel distributions. This prediction holds for infinite fitness, where the fixation time on an Erdős-Rényi network has a Gumbel distribution [26].

Preliminary simulations show that the Erdős-Réenyi network has the expected fixation-time distributions for p = 1/4 and r = 2 (see Figure 7). Further investigation is required to determine the range of fitness and edge probabilities for which this result holds asymptotically (as N → ∞). For constant p, the average degree is proportional to the system size ⟨k⟩ = pN, similar to the complete graph. It may be, however, that for some p and r the mean-field approximation is not sufficient to capture the higher-order moments determining the shape of the distribution. It is also traditional to consider N-dependent edge probabilities with p(N) chosen, for example, to fix ⟨k⟩. It is unclear whether such graphs will behave like the ring (due to their sparsity), like the complete graph (due to their short average path length), or somewhere in between these extreme cases. In the same vein, which other networks admit accurate mean-field approximations to the transition probabilities? Do many complex networks have fixation-time distributions identical to the complete graph?

FIG. 7.

Fixation-time distribution on an Erdős-Rényi random graph with N = 100 nodes, edge probability p = 1/4, and fitness r = 2, obtained from 106 simulation runs (the same graph is used for each run). The distribution is standardized to zero mean and unit variance. The solid curve is the theoretical prediction for the complete graph, obtained by numerical convolution of two Gumbel distributions, one weighted by r. For these parameters, the random graph fixation time is captured by the mean field approximation.

2. Stars and superstars: evolutionary amplifiers

Another nice approximation maps the Moran process on a star graph, a simple amplifier of selection, onto a birth-death Markov chain [15]. The resulting transition probabilities exhibit coupon collection regions, similar to the complete graph. The ratio of slopes near these regions (few mutants or non-mutants), however, is r². Our heuristic predicts the fixation-time distribution on the star is G+r²G′. In addition to amplifying fixation probability, the star increases fixation-time skew. This raises a broader question: do evolutionary amplifiers also amplify fixation-time skew? Computing fixation times for evolutionary dynamics on superstars (which more strongly amplify selection [5]) remains an open problem.

3. Growth of cancerous tumors: evolutionary dynamics on d-dimensional lattices

Mean-field arguments have also been applied to d-dimensional lattices in the infinite-fitness limit [25, 26]. In this limit the mutant population grows in an approximately spherical shape near the beginning of the process and the population of non-mutants is approximately spherical near fixation. The surface area to volume ratio of the d-dimensional sphere gives the probability of adding a mutant. With finite fitness, non-mutants can now replace their counterparts and the surface of the sphere of growing mutants roughens [39]. For near-neutral fitness, the configuration of mutants resembles the shape of real cancerous tumors. Perhaps mean-field approaches can draw connections between the fitness-dependent roughness of growing mutant populations and fixation-time distributions for evolution on lattices.

VII. SUMMARY

In this paper we have obtained the first closed-form solutions for the fitness dependence of fixation-time distributions of the Moran Birth-death process on the 1D lattice and complete graph. Previous analyses were restricted to the limit of infinite fitness, with some partial results for neutral fitness. To reiterate our new results: There is a dichotomy between neutral and non-neutral fitness. When fitness is neutral, the distribution always exhibits a discontinuity; whether the graph is complete or a 1D lattice, the skew jumps up discontinuously in either case. On the other hand, when fitness is non-neutral but otherwise arbitrary, the results depend strongly on network topology. Specifically, on the complete graph the fixation-time distribution is a fitness-weighted convolution of Gumbel distributions and hence is always skewed, whereas on the 1D lattice the distribution is normal and hence is never skewed.

Together with the mean and variance, the distributions derived here give a complete statistical description of the asymptotic fixation time (see Table I). Our analysis re-vealed that these results are robust in the sense that similar distributions arise under truncation, in some other models, and in some other network structures, including the Erdős-Rényi random graph.

View this table:

TABLE I.

Asymptotic fixation-time statistics for the Moran Birth-death and Death-birth processes on the complete graph and the 1D lattice. Together with the mean and variance, the standardized distributions give a complete statistical description of the fixation time. The mean and variance given are to leading order in N for each case.

VIII. FUTURE DIRECTIONS

Though the model we have focused on here (the Moran Birth-death model) is deliberately simplified, we expect our results will be useful in applications. For instance, the theory should allow a more refined analysis of the rate of evolution, by extending the seminal work by Kimura, whose neutral theory of evolution predicted a molecular clock [42]. In his model, neutral mutations become fixed at a constant rate, independent of population size. This result, with some refinements, is now used widely in estimating evolutionary time scales [43]. The fixation-time distributions discussed here should allow one to go beyond Kimura’s classic analysis to capture the full range of evolutionary outcomes, by providing information about the expected deviations from the constant-rate molecular clock, as well as how this prediction is affected by population structure. More generally, it would be interesting to study the implications of these distributions for rates of evolution at various fitness levels.

Furthermore, our results provide concrete predictions that are testable via bacterial evolution experiments. Does the same fitness and network structure dependence of fixation-time distributions arise in real systems?

Future theoretical studies could analyze random networks and lattices more deeply, as well as stars and superstars, the prototypical evolutionary amplifiers [5]. More sophisticated models involving evolutionary games are also of interest. These have skewed fixation-time distributions [22] whose asymptotic form remains unknown. Finally, we hope that methods developed here will prove useful in other areas, such as epidemiology [44], ecology [45], and protein folding [46], where stochastic dynamics may similarly give rise to skewed first-passage times.

ACKNOWLEDGMENTS

We thank Bertrand Ottino-Löffler and Jacob Scott for helpful discussions. Research supported by the Cornell Presidential Life Science Fellowship and NSF Graduate Research fellowship grant DGE-1650441 to D. H. and by NSF grants DMS-1513179 and CCF-1522054 to S.H.S.

Appendix Visit statistics

In this Appendix we formulate the visit statistics approach. We first provide further details in the derivation of the series expression for the fixation-time cumulants given in Eq. (8), and then explicitly compute the weighting factors that appear in this expression to third order. This result requires constant selection, b_m/d_m = r, as is the case for the Moran process. Under constant selection the transient transition matrix can be written as Ω_tr = Ω_RWD, where D is diagonal with elements D_mm = b_m + d_m and Ω_RW is the transition matrix for a random walk,

Since we are interested in the fixation-time distribution, we condition on fixation occurring. As discussed in Section II B (see also Supplemental Material, Section I [33]), the conditioned transition matrix , where S is diagonal with S_mm = 1 − 1/r^m. Combining these results, we have that where we have used the fact that both D and S are diagonal matrices, and therefore commute.

We found in Section II B that the moments of the fixation time T can be expressed as, where 1 is a row vector of ones and p_tr(0) is the initial state of the system, with for m₀ initial mutants. To compute these moments, we need the inverse . Since Ω_RW is a tridiagonal Toeplitz matrix, its inverse has a well-known form [47]:

Hence the matrix has elements

The matrix V, sometimes called the fundamental matrix, encodes the visit statistics of the conditioned random walk: V_ij is the mean number of visits to state i from state j before hitting the absorbing state N [48]. The Moran process has the same visit statistics, but on average spends a different amount of time, designated by (b_i + d_i)⁻¹, waiting in each state.

While one could now compute the moments μ_n in Eq. (A.3) directly, we find that the cumulants yield nicer expressions. Furthermore, the normal and Gumbel fixation-time distributions, predicted by our simulations and approximate calculations, are more simply described in terms of their cumulants. The non-standardized cumulants are linear combinations involving products of moments whose orders sum to n. Thus each term in the cumulants has n powers of D producing n factors of (b_i+d_i)⁻¹ with a weight designated by the visit statistics. With this observation, it is clear the standardized cumulants have the form given in Eq. (8), where are the weighting factors coming entirely from the visit statistics of a biased random walk (starting from m₀ initial mutants). As in the main text, we take the initial state to be a single mutant m₀ = 1, and will suppress the dependence of the weighting factors on initial condition, writing instead. Generalizations to other cases are straightforward and are discussed briefly below.

We emphasize that even without explicit knowledge of the factors , this formulation can be extremely useful. For instance when b_i + d_i is constant, these are just the cumulants for the (possibly biased) random walk, which were computed in Section III to approximate the Moran process on the 1D lattice. In particular, the sums over weighting factors obtained from setting b_i + d_i = 1 in Eq. (A.6) have leading asymptotic form given by Eq. (14). This fact can be used to bound the cumulants even when b_i + d_i ≠ 1, which in some cases is sufficient to determine the leading asymptotic behavior. When this is not possible, the weighting factors must be computed explicitly. We now turn our focus to derving and .

We can compute the weighting factors by writing out the matrix multiplication of . First note that

Then the first three moments of the fixation time are,

The corresponding non-standardized cumulants are given by the usual formulas, and . In terms of the visit numbers the non-standardized cumulants become

From here we can read off the weighting factors accordingly. For convenience, we can choose and to be symmetric by averaging the numerators in Eq. (A.9) over the permutations of the indices. Then, where Π₂ is the set of permutations of {i, j} and Π₃ are the permutations of {i, j, k}. We note that these expressions also hold for general initial condition by replacing the subscript 1 with m₀. Plugging Eq. (A.5) into this expression for we obtain, after some algebra, for i ≥ j. Since we have constructed to be symmetric, when j > i the formula is identical with i and j exchanged. Similarly, using Eq. (A.5) together with the expression for in Eq. (A.10) leads to for i ≥ j ≥ k. Again, the formula for different orderings of the indices i, j, k is the same with the indices permuted appropriately, so that is perfectly symmetric.

This completes the derivation of the visit statistics expression for the fixation-time cumulants. Together, Eqs. (A.6), (A.11) and (A.12) give a closed form expression for the fixation-time skew which is manageable for the purpose of asymptotic approximations. The diagonal terms in the higher-order weighting factors are also particularly simple, . While we will not explicitly compute them, the off diagonal weights can be found by a straightforward generalization of the above procedure. Example applications of this approach are given in Supplemental Material, Sections III and IV [33], where we show that all cumulants of the fixation time vanish for the Moran process on the 1D lattice and compute the asymptotic skew for the Moran process on the complete graph.

References

[1].↵
M. A. Nowak, Evolutionary Dynamics (Harvard University Press, 2006).
[2].↵
T. Maruyama, Genetics 76, 367 (1974).
OpenUrl Abstract/FREE Full Text
[3].
T. Maruyama, Theoretical Population Biology 5, 148 (1974).
OpenUrl CrossRef PubMed Web of Science
[4].↵
M. Slatkin, Evolution 35, 477 (1981).
OpenUrl CrossRef Web of Science
[5].↵
E. Lieberman, C. Hauert, and M. A. Nowak, Nature 433, 312 (2005).
OpenUrl CrossRef PubMed Web of Science
[6].↵
T. Antal and I. Scheuring, Bulletin of Mathematical Biology 68, 1923 (2006).
OpenUrl CrossRef PubMed Web of Science
[7].
B. Houchmandzadeh and M. Vallade, New Journal of Physics 13, 073020 (2011).
OpenUrl
[8].
J. Díaz, L. A. Goldberg, G. B. Mertzios, D. Richerby, M. Serna, and P. G. Spirakis, Algorithmica 69, 78 (2014).
OpenUrl
[9].↵
K. Kaveh, N. L. Komarova, and M. Kohandel, Royal Society Open Science 2, 140465 (2015).
OpenUrl CrossRef PubMed
[10].
A. Jamieson-Lane and C. Hauert, Journal of Theoretical Biology 382, 44 (2015).
OpenUrl
[11].↵
P. M. Altrock, A. Traulsen, and M. A. Nowak, Phys. Rev. E 95, 022407 (2017).
OpenUrl
[12].↵
J. Tkadlec, A. Pavlogiannis, K. Chatterjee, and M. A. Nowak, Communications Biology 2, 138 (2019).
OpenUrl
[13].
M. Kimura, Proceedings of the National Academy of Sciences 77, 522 (1980).
OpenUrl Abstract/FREE Full Text
[14].
P. M. Altrock and A. Traulsen, New Journal of Physics 11, 013012 (2009).
OpenUrl
[15].↵
M. Frean, P. B. Rainey, and A. Traulsen, Proceedings of the Royal Society B: Biological Sciences 280, 20130211 (2013).
OpenUrl CrossRef PubMed
[16].↵
M. Askari and K. A. Samani, Phys. Rev. E 92, 042707 (2015).
OpenUrl
[17].↵
M. Askari, Z. M. Miraghaei, and K. A. Samani, Journal of Statistical Mechanics: Theory and Experiment 2017, 073501 (2017).
OpenUrl
[18].
S. Farhang-Sardroodi, A. H. Darooneh, M. Nikbakht, L. Komarova, and M. Kohandel, PLOS Computational Biology 13, 1 (2017).
OpenUrl
[19].↵
M. Hajihashemi and K. Aghababaei Samani, Phys. Rev. E 99, 042304 (2019).
OpenUrl
[20].↵
D. Dingli, A. Traulsen, and J. M. Pacheco, Cell Cycle 6, 461 (2007).
OpenUrl CrossRef PubMed Web of Science
[21].↵
P. M. Altrock, A. Traulsen, and F. A. Reed, PLOS Computational Biology 7, 1 (2011).
OpenUrl
[22].↵
P. Ashcroft, A. Traulsen, and T. Galla, Phys. Rev. E 92, 042154 (2015).
OpenUrl
[23].↵
L.-M. Ying, J. Zhou, M. Tang, S.-G. Guan, and Y. Zou, Frontiers of Physics 13, 130201 (2018).
OpenUrl
[24].↵
D. Aldous, Bernoulli 19, 1122 (2013).
OpenUrl
[25].↵
B. Ottino-Löffler, J. G. Scott, and S. H. Strogatz, Phys. Rev. E 96, 012313 (2017)
OpenUrl
[26].↵
B. Ottino-Löffler, J. G. Scott, and S. H. Strogatz, eLife 6, e30212 (2017).
OpenUrl
[27].↵
P. A. P. Moran, Mathematical Proceedings of the Cambridge Philosophical Society 54, 60 (1958).
OpenUrl CrossRef
[28].↵
P. A. P. Moran, Mathematical Proceedings of the Cambridge Philosophical Society 54, 463 (1958).
OpenUrl
[29].↵
We use the convention that capital letters designate a fitness dependent step in the Moran process (e.g. for the Bd process nodes give birth at a rate proportional to their fitness, but die with uniform probability). See Ref. [26, Box 2] for a detailed explanation of this nomenclature.
[30].↵
P. Erdős and A. Rényi, Publ. Math. Inst. Hung. Acad. Sci. 6, 215 (1961).
OpenUrl
[31].↵
S. Asmussen, Applied Probability and Queues (Springer–Verlag, 2003).
[32].↵
J. Keilson, Markov Chain Models – Rarity and Exponentiality (Springer–Verlag, 1979).
[33].↵
See Supplemental Material at [URL will be inserted by publisher] for details of mathematical derivations.
[34].↵
Y. Bakhtin, Bulletin of Mathematical Biology (2018).
[35].↵
N. S. Goel and N. Richter-Dyn, Stochastic models in biology (Academic Press, 1974).
[36].↵
J. Keilson, Journal of Applied Probability 2, 405 (1965).
OpenUrl CrossRef
[37].↵
R. Durrett, Probability: Theory and Examples (Cambridge University Press, 2010).
[38].↵
M. Kimura, Genetical Research 15, 131 (1970).
OpenUrl CrossRef PubMed
[39].↵
T. Williams and R. Bjerknes, Nature 236, 19 (1972).
OpenUrl CrossRef GeoRef
[40].
A. Sottoriva, I. Spiteri, S. G. M. Piccirillo, A. Touloumis, V. P. Collins, J. C. Marioni, C. Curtis, C. Watts, and S. Tavaré, Proceedings of the National Academy of Sciences 110, 4009 (2013).
OpenUrl Abstract/FREE Full Text
[41].↵
I. Bozic, J. G. Reiter, B. Allen, T. Antal, K. Chatterjee, P. Shah, Y. S. Moon, A. Yaqubie, N. Kelly, D. T. Le, E. J. Lipson, P. B. Chapman, J. Diaz, Luis A., B. Vogelstein, and M. A. Nowak, eLife 2, e00747 (2013).
OpenUrl CrossRef PubMed
[42].↵
M. Kimura, Nature 217, 624 (1968).
OpenUrl CrossRef PubMed Web of Science
[43].↵
S. Kumar, Nature Reviews Genetics 6, 654 (2005), perspective.
OpenUrl CrossRef PubMed Web of Science
[44].↵
C. R. Doering, K. V. Sargsyan, L. M. Sander, and E. Vanden-Eijnden, Journal of Physics: Condensed Matter 19, 065145 (2007).
OpenUrl
[45].↵
C. R. Doering, K. V. Sargsyan, and L. M. Sander, Multiscale Modeling & Simulation 3, 283 (2005).
OpenUrl
[46].↵
R. Zwanzig, A. Szabo, and B. Bagchi, Proceedings of the National Academy of Sciences 89, 20 (1992).
OpenUrl Abstract/FREE Full Text
[47].↵
C. da Fonseca and J. Petronilho, Linear Algebra and its Applications 325, 7 (2001).
OpenUrl
[48].↵
J. G. Kemeny and J. L. Snell, Finite Markov Chains (Springer, 1983).

References

[1].↵
K. Kaveh, N. L. Komarova, and M. Kohandel, Royal Society Open Science 2, 140465 (2015).
OpenUrl CrossRef PubMed
[2].↵
J. Keilson, Journal of Applied Probability 2, 405 (1965).
OpenUrl CrossRef

View the discussion thread.

Posted June 01, 2019.

Download PDF

Citation Tools

Subject Area

Evolutionary Biology

Subject Areas

All Articles

Animal Behavior and Cognition (5217)
Biochemistry (11755)
Bioengineering (8757)
Bioinformatics (29209)
Biophysics (14980)
Cancer Biology (12103)
Cell Biology (17416)
Clinical Trials (138)
Developmental Biology (9425)
Ecology (14187)
Epidemiology (2067)
Evolutionary Biology (18314)
Genetics (12246)
Genomics (16807)
Immunology (11870)
Microbiology (28101)
Molecular Biology (11599)
Neuroscience (60995)
Paleontology (452)
Pathology (1872)
Pharmacology and Toxicology (3238)
Physiology (4961)
Plant Biology (10429)
Scientific Communication and Education (1683)
Synthetic Biology (2887)
Systems Biology (7341)
Zoology (1651)

[1] [1].↵
M. A. Nowak, Evolutionary Dynamics (Harvard University Press, 2006).

[2] [2].↵
T. Maruyama, Genetics 76, 367 (1974).
OpenUrl Abstract/FREE Full Text

[3] [3].
T. Maruyama, Theoretical Population Biology 5, 148 (1974).
OpenUrl CrossRef PubMed Web of Science

[4] [4].↵
M. Slatkin, Evolution 35, 477 (1981).
OpenUrl CrossRef Web of Science

[5] [5].↵
E. Lieberman, C. Hauert, and M. A. Nowak, Nature 433, 312 (2005).
OpenUrl CrossRef PubMed Web of Science

[6] [6].↵
T. Antal and I. Scheuring, Bulletin of Mathematical Biology 68, 1923 (2006).
OpenUrl CrossRef PubMed Web of Science

[7] [7].
B. Houchmandzadeh and M. Vallade, New Journal of Physics 13, 073020 (2011).
OpenUrl

[8] [8].
J. Díaz, L. A. Goldberg, G. B. Mertzios, D. Richerby, M. Serna, and P. G. Spirakis, Algorithmica 69, 78 (2014).
OpenUrl

[9] [9].↵
K. Kaveh, N. L. Komarova, and M. Kohandel, Royal Society Open Science 2, 140465 (2015).
OpenUrl CrossRef PubMed

[10] [10].
A. Jamieson-Lane and C. Hauert, Journal of Theoretical Biology 382, 44 (2015).
OpenUrl

[11] [11].↵
P. M. Altrock, A. Traulsen, and M. A. Nowak, Phys. Rev. E 95, 022407 (2017).
OpenUrl

[12] [12].↵
J. Tkadlec, A. Pavlogiannis, K. Chatterjee, and M. A. Nowak, Communications Biology 2, 138 (2019).
OpenUrl

[13] [13].
M. Kimura, Proceedings of the National Academy of Sciences 77, 522 (1980).
OpenUrl Abstract/FREE Full Text

[14] [14].
P. M. Altrock and A. Traulsen, New Journal of Physics 11, 013012 (2009).
OpenUrl

[15] [15].↵
M. Frean, P. B. Rainey, and A. Traulsen, Proceedings of the Royal Society B: Biological Sciences 280, 20130211 (2013).
OpenUrl CrossRef PubMed

[16] [16].↵
M. Askari and K. A. Samani, Phys. Rev. E 92, 042707 (2015).
OpenUrl

[17] [17].↵
M. Askari, Z. M. Miraghaei, and K. A. Samani, Journal of Statistical Mechanics: Theory and Experiment 2017, 073501 (2017).
OpenUrl

[18] [18].
S. Farhang-Sardroodi, A. H. Darooneh, M. Nikbakht, L. Komarova, and M. Kohandel, PLOS Computational Biology 13, 1 (2017).
OpenUrl

[19] [19].↵
M. Hajihashemi and K. Aghababaei Samani, Phys. Rev. E 99, 042304 (2019).
OpenUrl

[20] [20].↵
D. Dingli, A. Traulsen, and J. M. Pacheco, Cell Cycle 6, 461 (2007).
OpenUrl CrossRef PubMed Web of Science

[21] [21].↵
P. M. Altrock, A. Traulsen, and F. A. Reed, PLOS Computational Biology 7, 1 (2011).
OpenUrl

[22] [22].↵
P. Ashcroft, A. Traulsen, and T. Galla, Phys. Rev. E 92, 042154 (2015).
OpenUrl

[23] [23].↵
L.-M. Ying, J. Zhou, M. Tang, S.-G. Guan, and Y. Zou, Frontiers of Physics 13, 130201 (2018).
OpenUrl

[24] [24].↵
D. Aldous, Bernoulli 19, 1122 (2013).
OpenUrl

[25] [25].↵
B. Ottino-Löffler, J. G. Scott, and S. H. Strogatz, Phys. Rev. E 96, 012313 (2017)
OpenUrl

[26] [26].↵
B. Ottino-Löffler, J. G. Scott, and S. H. Strogatz, eLife 6, e30212 (2017).
OpenUrl

[27] [27].↵
P. A. P. Moran, Mathematical Proceedings of the Cambridge Philosophical Society 54, 60 (1958).
OpenUrl CrossRef

[28] [28].↵
P. A. P. Moran, Mathematical Proceedings of the Cambridge Philosophical Society 54, 463 (1958).
OpenUrl

[29] [29].↵
We use the convention that capital letters designate a fitness dependent step in the Moran process (e.g. for the Bd process nodes give birth at a rate proportional to their fitness, but die with uniform probability). See Ref. [26, Box 2] for a detailed explanation of this nomenclature.

[30] [30].↵
P. Erdős and A. Rényi, Publ. Math. Inst. Hung. Acad. Sci. 6, 215 (1961).
OpenUrl

[31] [31].↵
S. Asmussen, Applied Probability and Queues (Springer–Verlag, 2003).

[32] [32].↵
J. Keilson, Markov Chain Models – Rarity and Exponentiality (Springer–Verlag, 1979).

[33] [33].↵
See Supplemental Material at [URL will be inserted by publisher] for details of mathematical derivations.

[34] [34].↵
Y. Bakhtin, Bulletin of Mathematical Biology (2018).

[35] [35].↵
N. S. Goel and N. Richter-Dyn, Stochastic models in biology (Academic Press, 1974).

[36] [36].↵
J. Keilson, Journal of Applied Probability 2, 405 (1965).
OpenUrl CrossRef

[37] [37].↵
R. Durrett, Probability: Theory and Examples (Cambridge University Press, 2010).

[38] [38].↵
M. Kimura, Genetical Research 15, 131 (1970).
OpenUrl CrossRef PubMed

[39] [39].↵
T. Williams and R. Bjerknes, Nature 236, 19 (1972).
OpenUrl CrossRef GeoRef

[40] [40].
A. Sottoriva, I. Spiteri, S. G. M. Piccirillo, A. Touloumis, V. P. Collins, J. C. Marioni, C. Curtis, C. Watts, and S. Tavaré, Proceedings of the National Academy of Sciences 110, 4009 (2013).
OpenUrl Abstract/FREE Full Text

[41] [41].↵
I. Bozic, J. G. Reiter, B. Allen, T. Antal, K. Chatterjee, P. Shah, Y. S. Moon, A. Yaqubie, N. Kelly, D. T. Le, E. J. Lipson, P. B. Chapman, J. Diaz, Luis A., B. Vogelstein, and M. A. Nowak, eLife 2, e00747 (2013).
OpenUrl CrossRef PubMed

[42] [42].↵
M. Kimura, Nature 217, 624 (1968).
OpenUrl CrossRef PubMed Web of Science

[43] [43].↵
S. Kumar, Nature Reviews Genetics 6, 654 (2005), perspective.
OpenUrl CrossRef PubMed Web of Science

[44] [44].↵
C. R. Doering, K. V. Sargsyan, L. M. Sander, and E. Vanden-Eijnden, Journal of Physics: Condensed Matter 19, 065145 (2007).
OpenUrl

[45] [45].↵
C. R. Doering, K. V. Sargsyan, and L. M. Sander, Multiscale Modeling & Simulation 3, 283 (2005).
OpenUrl

[46] [46].↵
R. Zwanzig, A. Szabo, and B. Bagchi, Proceedings of the National Academy of Sciences 89, 20 (1992).
OpenUrl Abstract/FREE Full Text

[47] [47].↵
C. da Fonseca and J. Petronilho, Linear Algebra and its Applications 325, 7 (2001).
OpenUrl

[48] [48].↵
J. G. Kemeny and J. L. Snell, Finite Markov Chains (Springer, 1983).

Fitness dependence of the fixation-time distribution for evolutionary dynamics on graphs

Abstract

I. INTRODUCTION

II. GENERAL THEORY FOR BIRTH-DEATH MARKOV PROCESSES

A. Eigendecomposition of the birth-death process

B. Analytical cumulant calculation: Visit statistics

C. Recurrence relation for fixation-time moments

D. Equivalence between advantageous and disadvantageous mutations

III. ONE-DIMENSIONAL LATTICE

A. Neutral fitness

B. Non-neutral fitness

IV. COMPLETE GRAPH

A. Neutral Fitness

B. Non-neutral fitness

V. PARTIAL FIXATION TIMES

A. One-dimensional lattice

B. Complete graph: truncating coupon collection

C. Summary of main results

VI. EXTENSIONS

A. Other update dynamics

1. Two-fitness Moran process

2. The Death-Birth Moran process

B. Other networks: Approximate results via mean-field transition probabilities

1. Erdős-Rényi random graph

2. Stars and superstars: evolutionary amplifiers

3. Growth of cancerous tumors: evolutionary dynamics on d-dimensional lattices

VII. SUMMARY

VIII. FUTURE DIRECTIONS

ACKNOWLEDGMENTS

Appendix Visit statistics

References

References

Citation Manager Formats

Subject Area