Abstract
Investigating the emergence of a particular cell type is a recurring theme in models of growing cellular populations. The evolution of resistance to therapy is a classic example. Common questions are: when does the cell type first occur, and via which sequence of steps is it most likely to emerge? For growing populations, these questions can be formulated in a general framework of branching processes spreading through a graph from a root to a target vertex. Cells have a particular fitness value on each vertex and can transition along edges at specific rates. Vertices represents cell states, say phenotypes or physical locations, while possible transitions are acquiring a mutation or cell migration. We focus on the setting where cells at the root vertex have the highest fitness and transition rates are small. Simple formulas are derived for the time to reach the target vertex and for the probability that it is reached along a given path in the graph. We demonstrate our results on several applications which include: the orderings of resistance-conferring mutations in bacteria and the impact of imperfect drug penetration in cancer.
1 Introduction
The timing and manner in which a particular phenotype arises in a population is a central question of theoretical biology [4, 3, 54, 55, 17, 22, 58, 23, 12, 46, 29, 43, 41]. A typical scenario is to consider an initially monomorphic, wild type population, composed of cells that can acquire mutations, for example single site substitutions on the genome. The phenotype of interest comes to exist after a cell has accrued a specific set of mutations. The interpretations of this event are application dependent, but examples are the genesis of cancer instigated by mutations in a pair of tumour suppressor genes, or the emergence of multidrug resistance via alterations to the genes coding for the target proteins. Regardless of context, the questions of when, and how, the phenotype emerges are of significant interest.
Classically, the population under consideration has been assumed to be of a fixed size. However, with the aim of characterising disease progression, an increasing body of research has been developed to examine the evolutionary dynamics of a growing population. These studies have provided insights on a range of applications, including; cancer genetics [57, 6, 8, 21], metastasis formation [26, 25, 42, 14], drug resistance [7, 38, 34, 35, 19], phylogenetics [56], and the impact of poor drug penetration [24, 28, 20, 40].
Here we continue in the same vein by considering a stochastically growing cellular population, where cells can transition in such a fashion as to alter their, and their offsprings, reproductive capabilities. Such a transition might be the due to the acquisition of a (epi)genetic alteration or migration into a new environment. As before, suppose we have a cellular state of interest, for example; a given genotype, a spatial location, or a combination of both. Will this state ever be reached? If it is, when is it reached? And by which sequence of intermediate states?
To make the discussion clear, let us consider an example application: the emergence of multidrug resistant bacteria. Suppose an infection begins with a single pathogenic bacterium which is sensitive to two antibiotic therapies, drug A and drug B. In the absence of either drug, the initial bacterium initiates a growing colony. Within the colony, when a cell replicates one of the daughter cells can acquire a mutation yielding resistance to either of the drugs. Here our questions are: how long does it take for multidrug resistance to emerge? En route, is resistance to drug A or drug B more likely to arise first? The scenario is illustrated in Fig. 1a. There each vertex represents a cellular type, in this case its resistance profile. The edges represent cell transitions via mutation upon replication.
In this article, motivated by resistance costs and migrated cells exposed to toxins, we focus on the setting where the intermediate states have reduced reproductive ability (fitness) relative to the initial cells. As in many biological applications the relevant transition rates are small (recent measurements for the point mutation rate per cell division are 10-9 for cancer [50] and 10-10 for bacteria [37], while the dissemination rate of pancreatic cancer cells from the primary tumour was estimated as 10-7 per cell division in [25]), we concentrate our efforts on the regime of low transition rates. Our main contribution is to provide simple, intuitive formulas that answer the questions posed in the small transition rate limit. These formulas show explicitly the contribution of the model parameters, e.g. transition rates and fitness reductions, which allow them to be probed for biological insight. This provides relationships which would be difficult to deduce from simulation alone.
We now move to detail the general framework we study, which will be seen to encompass models introduced in previous works as special cases. Our main results concerning when and how a particular cell type emerges are then presented, and we follow this by demonstrating our method on several applications.
2 Results
2.1 General framework
Our interest will always be in the emergence of a particular cell type in a growing population which is modelled as a specific form of a multitype branching process [2]. It is convenient to picture the population on a finite, simple, directed graph G = (V, E), see Fig. 1b, where each vertex of the graph represents a cell type or state. The number of types in the system is denoted N and so we let V = {1, 2, …, N}. Thus E is a subset the set of ordered pairs {(i, j): i, j ∈V, i ≠ j}. For any given cell type there is an associated vertex in V. Henceforth we will refer to cells of type x as cells residing at vertex x. We will be concerned with the timing, and fashion, that vertex N, which we refer to as the target, is populated, when we initiate with cells at vertex 1, which we denote the root. In the example of multidrug resistance, illustrated in Fig. 1a, the cells at the root are sensitive bacteria while the target population is resistant to both drugs.
Let (x) represent a cell at vertex x and ∅ symbolise a dead cell. Then, with all cells behaving independently, our cell level dynamics can be graphically represented as:
That is: cells give birth at rate α(x), die at rate β(x), and transition (at replication) to a cell at vertex y at rate ν(x, y) if the edge (x, y) exists. We will denote the growth rate of cells at vertex x by λ(x) = α(x) -β(x). The parameters associated with the vertex 1 population feature prominently and so for convenience we let α = α(1), β = β(1) and λ = λ(1). As mentioned above, in this article we focus on the setting where the vertex 1 population is the most fit and has positive growth rate. That is, we assume that λ > 0 and for 2 ≤ x ≤ N − 1, λ(x) < λ. We do not specify the fitness of the target population (cells at vertex N). As the model is currently formulated, transitioned cells have deterministically fixed new birth and death rates. However including a finite distribution of fitness effects, for example the scenario that when a cell at vertex x transitions it has new birth or death rates with given probabilities, is actually covered by our model as stated. We discuss this point further in Section 2.4.
At a population level, the number of cells at vertex x at time t will be denoted Zx(t). We shall always assume that the system is initiated with only vertex 1 cells, at a quantity z, i.e. Zx(0) = zδx,1 with the Kronecker delta function δx,y.
We have two primary questions. The first is, having initiated the system with z vertex 1 cells, how long does it take for the population at the target vertex to arise? That is we concern ourself with the distribution of the target hitting time, defined as
Now assuming that the target vertex is populated by a founding cell, we ask from which path through the graph G did this founding cell come? This gives rise to a distribution over the set of paths (or walks) from the root vertex to the target vertex, which we aim to characterise. This second question is more precisely formulated in Section 2.3.
Our results are most clearly understood when G is acyclic, which amounts to excluding reverse transitions. Therefore in the following presentation this will be assumed. The setting when this assumption is relaxed will be discussed in Section 2.4.
Consider any path from the root to the target vertex. Intuitively, if the transition rates encountered along this path are larger, the target population will be seeded from this path quicker. Conversely, the smaller the growth rates along this path are, the slower the target will be reached. It will transpire that both the time to the target, and which path is taken, depends on these competing factors through the path weights which we now introduce.
Let the set of paths between the root and the target be be denoted 𝒫1,N. Any p ∈ 𝒫1,N, will be a sequence p = (p1, p2, …, pl, pl+1), where each 1≤ pi ≤ N, l is the path length (the number of edges in p), all pi are distinct, and p1 = 1, pl+1 = N. Along the path p, let us call the difference between the growth rate at the root and the vertices encountered the fitness costs. Then we define the weight of the path w(p) as that is the product of the transition rates along the path divided by the fitness costs along the path. Throughout the empty product is set to 1. Further, we let the total weight of the target be
We will take the case of a path graph as a running example. A path graph, as illustrated in Fig. 2, is the scenario in which the only edges are (i, i + 1), for 1 ≤i ≤N −1. There is only one path between the root and the target, say p*, and so in this case, .
2.2 Time until target vertex is populated
How long do we wait until the target vertex is populated? We answer this question by characterising the distribution of T, defined in (1), when the target seeding transition rates (the transition rates associated with edges into the target vertex) are small. The key tool required is the long-time population numbers at the initial and intermediate vertices, which is discussed further in Section 4. Using this we are able to prove the following theorem regarding the target hitting time.
As the target seeding transition rates tend to 0, we have where
Heuristically, the time until the target is populated is approximately µ plus some noise, where the distribution of the noise is given by the right hand side of (3). Note that this distribution depends only on parameters associated with vertex 1 cells, while all other parameters of the system are bundled into µ. For practical purposes the theorem yields the following approximation which is valid if the target seeding transition rates are small. We note that Theorem 1 has similarities with the results given in [16]. There the authors examined an approximation of the model studied in this paper, specifically in the path graph setting, but with each transition resulting in an increase in fitness to the transitioned cells.
Notice that it is possible that the target is never populated (T = ∞). In the supplementary materials (Proposition 1) we show that if the vertex 1 population survives forever then the target hitting time is finite with probability one. Furthermore in many relevant cases, namely a large initial population, low death rate, or small transition rates leaving vertex 1, that the target is eventually populated is essentially equivalent to the vertex 1 population surviving. Using Theorem 3, presented in Section 4, we have an exact expression for the probability that the vertex 1 population survives (by examining when W = 0 from Theorem 3). Therefore, this expression gives an approximation for the probability that the target population will ever arise, that is
This also explains the β/α term in (4), which arises due to lineages stemming from the initial vertex 1 cells going extinct. The corresponding approximate distribution for the target hitting time, when the vertex 1 population survives is given in the supplementary materials (Eq. (33)).
We now return to consider µ as given in Theorem 1. In the case of β = 0, z = 1, it can be easily seen (if we recall λ = α -β) that the median of our approximate distribution (4), is exactly µ. More generally, if we let t1/2 be the median time to hit the target, then conditional that the vertex 1 population survives, as the target seeding transition rates tend to 0, we have
The shift h(z), whose precise form can be found in the supplementary materials (Corollary (7.2)), has the behaviour
The shift exists as initiating the system with a larger number of cells at vertex 1, leads to the target being reached faster. In terms of notation f ∼ g means that f/g → 1, when the limit under consideration is taken.
Let us return to the running example of the path graph setting which was introduced at the end of Section 2.1. With z = 1, (6) yields that the median time for the target population to appear is for small ν(N − 1, N). The first term on the right hand side comes from waiting for the first transition from the vertex 1 population, while the second is due transitions between the remaining vertices. The first term is distinct as the vertex 1 growth is the main cause of stochasticity, as discussed in Section 4. The population growth and target hitting time is illustrated for the path graph case in Fig. 3.
2.3 Path distribution
We now move to our second question: which path leads to the target population arising? Our naive expectation is that paths with larger weights will be more likely and this is confirmed. So that we can formulate our question mathematically, we introduce some notation.
To any existing cell, say at vertex x, we may define the cell’s vertex lineage. This tracks the vertices of cells in the ancestral lineage of the cell under consideration and is a sequence of the form (1, …, x). For example a cell with vertex lineage (1, 3, 4) is at vertex 4, and descended from a vertex 3 cell, who in turn descended from a vertex 1 cell. Any vertex lineage is a path in G and we denote the number of cells with vertex lineage q at time t as X(q, t).
Now for any root to target path p ∈ 𝒫 1,N, that is a path of the form p = (1, p2, p3, …, N), the first time that p is traversed can be defined as
Observe that path p populated the target first if T (p) = T. The question of which path initiated the target only makes sense if the target is eventually populated. To ensure that this occurs we condition on a surviving vertex 1 population, and hence let
We can now state the answer to our second question, which is a special case of Theorem 2 presented below. The probability that the target is populated via path p is simply the path weight of p, suitably normalised, that is for small target seeding transition rates. One may ask not only if a particular path populated the target, but whether the target was initiated from a given set of paths, for an example see Section 3.2. In this case the probability that the target is populated via a particular set of paths is given by summing the normalised path weights over each path in the set.
If the vertex 1 population avoids extinction then T (p) will be finite for each root to target path p. Therefore instead of only asking which path populated the target first, one might be interested in whether a particular path is traversed significantly faster. For example, does multidrug resistance obtained via one of the paths in Fig. 1a occur t days before resistance from the other path. In order to also consider this case, for a root to target path p, we define that is the first time the target is reached along any path except p.
We can now quantify the probability of reaching the target via path p more than t time units before any other path. To avoid discussing a technical assumption, which excludes pathological scenarios, the theorem is stated in approximate form. The more precise version can be found in the supplementary material (Theorem 8).
For small target seeding transition rates, and t ∈ ℝ,
Note that letting t = 0 gives (10). In the running example of a path graph we cannot ask which path seeded the target vertex. However we briefly illustrate the usefulness of the above theorem by considering the question of whether the target vertex is populated via a fitness valley.
Let us consider the case depicted in Fig. 4a where two paths exist to the target, a direct path and an indirect path. Here N = 3, with paths p(1) = (1, 2, 3), p(2) = (1, 3) and ν = ν(1, 2) = ν(2, 3), ν(1, 3) = ν2. Naively it may be expected that as the product of transition rates along both these paths are equal, and the indirect path (p(1)) contains a vertex with a fitness cost, the direct path (p(2)) is more likely. However, for small ν, Theorem 2 informs us that the target is populated via the indirect path t time units before the direct path, with probability where T (i) = T (p(i)). Thus the indirect path is more probable, that is ℙ 1(T (1) < T (2)) > ℙ 1(T (2) < T (1)), if λ − 1 < λ(2).
Further applications of the results presented thus far will be given in Section 3. Before this, we discuss the case when G is cyclic, and some extensions to the initial model for which our results still hold.
2.4 Cyclic graphs, distribution of fitness effects and extensions
In the above exposition we have assumed that G is acyclic, that is no path exists which contains cycles. We now briefly summarise how the results are altered if G is cyclic, with details given in the supplementary materials. We will however maintain that the root is a source vertex (has no incoming edges) and the target is a sink (has no outgoing edges).
First we introduce the N − 1 × N − 1 matrix A with entries and we denote the largest real eigenvalue of A as λ*.
In order that a modified version of Theorem 1 holds it is required that λ* = λ and λ* is a simple eigenvalue. This condition is guaranteed for acyclic graphs, and will also be satisfied if the transition rates throughout the graph are small enough (see Lemma 5 in the supplementary materials). Assuming this eigenvalue condition, Theorem 1 holds but with ϕN replaced by summing particular elements of the λ-associated eigenvector of A (see Theorem 7). Theorem 3, which concerns the population growth at the initial and intermediate vertices and is presented below in Section 4, is similarly modified under the same condition (see Theorem 6).
Turning to Theorem 2, we can now consider walks that contain cycles as opposed to only paths in the graph G. If we consider a particular set of root to target walks of a finite length, analogously to Theorem 2, we can ask whether the target is reached via one particular walk more than t units before any other walk in the specified set. A near identical result to Theorem 2 is obtained (see Proposition 2 in the supplementary materials). However the path weights should now be replaced by the walk weights (defined analogously to the path weight; the product of the transition rates along the walk divided by the fitness costs along the walk). In particular for two given walks of finite length we can still approximate the probability the target population arises via one of the walks versus the other.
For example we can use this to explore when a walk, containing a cycle, will seed the target before a path. Consider the graph and edge parameters displayed in Fig. 5a. Let p = (1, 2, 4) and ω = (1, 2, 3, 2, 4).
Despite the extra length, by using the modified Theorem 2, for small ν1 we see
Hence the target can be seeded via the walk, if ν2 is large enough. On the other hand, by using the modified Theorem 2 it can be seen that when all the transition rates are small in the system, the target population will arise from a path and not a walk containing cycles (see Proposition 3) This further justifies our focus on the acyclic case.
Let us now expand on the comments in Section 2.1 where we claimed that having a finite distribution of fitness effects is covered by our model. Suppose we wish to consider cells transitioning from vertex x to vertex y but require that the new growth rate of cells to be λi(y) with probability πy(i), where πy(i) is a finite distribution and every λi(y) < λ. Then we can simply replace the vertex y with new vertices associated to each of the λi(y) and have cells transition from vertex x to the new vertices at rate ν(x, y)πy(i). We may freely apply the results of Section 2 to the graph containing the new vertices and edges. See figure 5b for an example.
We have focused on the setting where cells can birth, die or transition, however Theorem 2 would also hold for a slightly more general branching process. Keeping the transition process as it is, we can have a vertex x cell giving birth to j vertex x cells at a rate α(x)σx,j where for each x between 1 and N − 1, (σx,j)j≥0 is a probability distribution with finite second moment and mean . We now have no need for β(x) as cell death is contained in σx,0. In this setting if we redefine then the path distribution will be unchanged. This is due to the fact that the key result of [33] used in the proof of Theorem 3 holds then also. This may be relevant to the case of viral dynamics, where an infected cell can release multiple virions upon bursting [44].
Throughout we have assumed we wait for the first cell of vertex N to arise. However it may be that the population founded by the initial cell at the target vertex goes extinct. If instead one wishes to wait for the first ‘successful’ cell, that is the first cell of at vertex N to exist whose progeny survives, then all the results above hold so long as the mapping ν(x, N) ↦ ν(x, N) λ(N)/α(N) for all target adjacent vertices x, is applied. We have also only considered the case where only a single target vertex exists. It may be of interest to consider several target populations, and then ask which of these populations arises first. Generally these populations can be grouped together and the above results will still apply. However if we wait for the target vertices to be founded by ‘successful’ individuals to arise a minor modification occurs. For example if we have two target vertices labelled N1, N2 we should map ν(x, N1) ↦ ν(x, N1)λ(N1)/α(N1) for x a neighbour of N1 and ν(y, N2) ↦ ν(y, N2)λ(N2)/α(N2) for y a neighbour of N2. Other numbers of target vertices are treated analogously.
3 Discussion
We now demonstrate how to apply the results of Section 2 on some applications.
Firstly we consider the impact of imperfect drug penetration on the emergence of resistance. Two recent publications have explored how resistance spreads in this setting, and have shown that poor penetration can accelerate resistance [20, 40]. We are able to recover and extend their findings in a rigorous fashion. Next the ordering by which resistance causing mutations accrue is explored, firstly in the setting of cancer and then in bacterial infections. In the case of bacterial infections we explore how the risk of multidrug resistance depends on mutation rates, recovering simulation results in [19]. We will be investigating these applications with the approximate formulas implied by the previous section, and we now make a remark on these.
The results of Section 2 hold when the final transition rates in each path tends to 0. Therefore in this section all transition rates, in particular the mutation and migration rates discussed below, will be taken to be small and statements are to be interpreted as approximations that hold true in this limiting regime. Furthermore, note that the results of Section 2 are given for the case when all transition rates between vertices are distinct (the limits are taken only on the target seeding transition rates and not on all rates simultaneously). However below, and in several of the approximate formulas of Section 2, we will take the intermediate rates equal to the final transition rate. The results of this section could be presented with all transition rates distinct, and approximations given for when the final transition rates are small but non-zero. Then the numerical value for intermediate transition rates could be chosen to be equal to the final transition rates, where for the sake of interpretation they should be the same. We have chosen to avoid such a cumbersome presentation.
3.1 Imperfect drug penetration: monotherapy
Our first two applications concern imperfect drug penetration and so the language introduced here will hold in the next section also.
We consider a pathogenic population, with cancer cells or bacteria as example populations, being treated with one or more drugs with imperfect penetration profiles. The imperfect penetration results in low drug concentration in spatial locations, such that cells in these locations may still proliferate. We term the cumulation of these spatial regions the sanctuary. Sanctuaries have been observed in both bacterial infections [13, 53] and tumours [45]. We consider resistance to have arisen when cells with sufficiently many mutations such that all therapies are impotent come to exist in areas where the drugs have penetrated. The possibility of resistance occurring prior to treatment is excluded and we consider only the case when resistance arises from the sanctuary. Further, we do not consider cells acquiring mutations via gene transfer.
Throughout this section and the next we shall be comparing paths comprised of mutation and migration events. Growing cells in the sanctuary will be the vertex 1 cells from Section 2.1 and thus have birth and death rates α, β, with growth rate λ. The lack of exposure to drug motivates λ > 0. We fix the initial number of cells in the sanctuary to be z. We shall assume all cells have birth rate α. Across all cell types, i.e. regardless of location or drug presence, the per cell mutation and migration rates will be denoted ν and m respectively. We suppose each mutation carries a fitness cost which will be denoted s. Resistance costs may not always exist but are well documented and often assumed in viruses, bacteria and cancer [1, 30, 18]. Migration events bring cells into contact with drugs and each drug invokes an additive fitness cost d, unless resistance to the drug has already been acquired.
There are two paths from which resistance might arise. Firstly, upon replication a cell in the sanctuary may produce a mutant able to grow in the presence of the drug. Then a mutated cell in the sanctuary can migrate to the drug compartment. We term this the mutation-migration path. Analogously we have the migration-mutation path which involves a susceptible cell migrating from the sanctuary, from whose progeny a resistant mutant emerges. Due to the fitness cost of resistance the death rate of mutated cells in the sanctuary is increased to β + s, while the death rate of migrated susceptible cells in the drug compartment is β + d, with s, d > 0.
The first quantity to determine is the probability of escaping resistance, and its dependence on the initial number of cells in the sanctuary. The approximation of (5) shows the probability resistance never occurs decreases exponentially with the initial number of cells in the sanctuary as P(resistance never occurs) ≈ (β/α)z. For the rest of the discussion we will assume resistance does occur. Further we suppose z ≫ 1, so that we may use the approximate form for h(z) given in (7). The case when z is small could be treated similarly using the exact expression for h(z).
We now ask when, and via which path, will resistant cells come to exist in the drug compartment? From Section 2 we know the key quantities required to answer these questions are the path weights. For this system these are given by
If we now let t1/2 be the median time resistance occurs, then using the path weights with (6), we immediately have
Of note is that the sanctuary size, mutation and migration rate all have equivalent impact. This simple result can now be explored under different biological scenarios.
For example, suppose we are currently using drug A with fitness cost d(A) and penetration profile leaving z(A) initial cells in the sanctuary. We consider instead using a second drug which results in sanctuary size z(B) and has fitness cost d(B). Let the median resistance times under the first and second drug be denoted and respectively. Then, using (13) we see , and hence we should switch to drug B, if
This inequality is illustrated in Fig. 6b.
Note as d(B) → ∞, implying drug B is far more effective, the condition tends to δz > (1 + s/d(A))−1. This is due to the fact that when d(B) is large resistance will always arise via the mutation-migration path (which we show below), so this provides an upper bound to the gain that can be obtained from increasing the drug efficacy. The above is under the assumption resistance will occur but note the probability of no resistance would decrease exponentially with larger sanctuary size.
Having discussed the time resistance will occur we now ask by which of the two paths it will occur. Let Tνm be the first time resistance arises via the mutation-migration path and Tmν be defined analogously for the migration-mutation path. Then Theorem 2 gives us
Hence resistance arising via the mutation-migration path is more probable, that is the above probability is greater than 1/2, if d > s. As most drugs are designed to at least halt cell growth, while many resistant cell lines can still proliferate, we expect this condition to be satisfied for most drugs.
A similar question was asked in [20]. There they considered an approximate stochastic process and sought the parameter regime such that Tmν stochastically dominated Tνm under the approximate process. For the full process considered here, we can ask the same question but on the limit of the centred, in the sense of Theorem 7, resistance times. The resulting condition for stochastic dominance is again d > s. The difference in the condition given here (d > s) and that stated in equation (28) in [20] exists as there ν is defined as the probability of mutation per birth event. Accounting for this leads to the same condition.
3.2 Imperfect penetration: combination therapy
There are at least two reasons for extending our analysis to include two drugs, say drugs A and B, being used in tandem. Firstly, combination therapies are widely implemented in HIV and bacterial infections, and there is a growing appreciation for their use in cancer [9, 47, 7]. Secondly, interesting new phenomena emerges, especially when a region still exists with only one drug present.
We will only consider the case where the penetration profile for drug B is a subset of drug A, and thus a single drug region exists only for drug A. This allows us to clearly demonstrate the impact of unequal penetration profiles. In this setting there are 12 possible states that a cell may inhabit, defined by which of the drugs the cell is exposed to and the resistance profile it has obtained. These states are illustrated in Fig. 7a. We further assume that the mutation rate bringing resistance to either drug is ν. This is for simplicity only, and dealing with differing mutation rates is straightforward. Drug interactions and epistasis are also neglected but can be included.
To investigate the effect of the single drug region, we introduce further notation. Let nTot be the number of cells that may reside in our system of interest (e.g. the entire patient or tumour) in the absence of drug. Further, let the regions where both drugs act, the single drug (drug A) acts and the sanctuary have their respective cell capacities denoted by nDD, nD, and nS. We assume that all these capacities are large, and hence we may still investigate the system with our branching process model. These capacities enter the dynamics of the process in the following manner. When a cell leaves the region it is in, at rate m, it now may migrate to one of two regions. Of these two regions, we specify that it migrates to a particular region at a frequency proportional to the regions capacity. For example a cell in the sanctuary migrates to the double drug region at rate mnDD/(nD + nDD). Resistance mutations and the presence of the drug has the same additive effect as in Section 3.1.
Observe that there are 18 possible paths by which sensitive cells in the sanctuary can eventually produce fully drug resistant cells in the double drug compartment. To see this it is convenient to separate the paths into those that pass through the single drug compartment, which following [40] we term stepwise evolution, and those that involve migration from the sanctuary directly to the double drug compartment, which we collectively call direct evolution. For the direct evolution paths there are three transitions needed, mutations yielding resistance to drug A and B and then the migration to the double drug region. From the number of permutations of these events, this yields 6 direct evolution paths. In the stepwise evolution case four transitions are needed, both mutations and then the two requisite migration events. Notice that the migration from the sanctuary to the single drug region must precede the migration from the single drug region to the double drug region. This leads to 12 stepwise evolution paths. An exemplar stepwise evolution path is a mutation to drug A, followed by migration to the single drug region, then a mutation to drug B, and finally a migration to the double drug region. The edges of this path are red and dashed in Fig. 7a.
If the penetration of the drugs increase, then this will result in a reduced initial number of proliferating sanctuary cells, leading to a delay in resistance. However the effect of unequal penetration profiles opens up multiple new paths to resistance which we expect to accelerate the time until fully resistant cells exist in the double drug compartment. Hence a trade-off may exist. To study this we fix nTot = nS+nD+nDD and specify that the initial number of cells in the sanctuary is related to its size via z = γnS with 0 < γ ≪ 1. Further let t1/2(nD/nS) be the median time to resistance with fixed nD/nS. Using the approximation provided by (6) we can explore the behaviour of t1/2(nD/nS). As can be seen in Fig. 7b it is possible that the existence of the single drug compartment speeds up resistance. We now seek the conditions under which this is the case.
As mentioned previously, it is reasonable to expect the efficacy of either drug is greater than the cost to resistance. Also we may hope that both drugs penetrate the majority of the target system. This motivates taking s ≪ d and (nS + nD) ≪ nTot. Under this limit, by examining the sign of , we find a simple condition for when increasing the single drug compartment speeds up resistance, namely
However, when (15) holds, as nD/nS increases eventually a minimal resistance time is achieved. This represents a worse case scenario yielding fastest possible resistance. For s ≪ d, this minimal time occurs when
We now examine whether resistance arises via stepwise evolution or direct evolution. Let T SE be the first time resistance occurs via stepwise evolution, and T DE defined analogously for direct evolution. Then using Theorem 2, again with s ≪ d, we find
Therefore the larger the proportional size of the single drug region, the more likely resistance will arise via stepwise evolution. A qualitatively similar result was derived in [40] (e-page 2878) by comparing the most likely stepwise evolution path with the most likely directed evolution path.
Furthermore, in [40] a minimal resistance time, in the sense of (16), was observed via simulations for the mutationA-migration-mutationB-migration path (the red and dashed path in Fig. 7a). Using Theorem 2 we can show that this is the most probable (in the sense of the mode of the path distribution, see (10)) stepwise evolution path whenever d > s. Further, by differentiating the median time along this path, we see a minimal time occurs at nD ≈ nS, which recovers the simulation result in [40] (see their Fig. 3). That this path is the most probable stepwise evolution path also gives insight into the recurring s/m term in (15) and (17). Relative to direct evolution this path has an extra migration step, so higher migration should be beneficial. However fitness costs along this paths are incurred only via the cost of resistance, and a smaller s diminishes these fitness costs.
3.3 Path to resistance in chronic myeloid leukemia
Let us discuss the example application of [35], concerning the mechanism by which resistance to the targeted therapy imatinib occurs in chronic myeloid leukemia (CML). The two main resistance mechanisms are via gene amplification of the BCR-ABL fusion gene, which drives the CML, or a point mutation resulting in a modification to the target protein of imatinib. Approximately 100 point mutations have been identified conferring resistance, and a rough estimate for the probability of each of these occurring during a cell division is 10−7 [36]. Thus the probability of a point mutation conferring resistance is approximately 10−5 per division. The analogous quantity for resistance-causing gene amplifications, in a similar system, was found to be approximately 10−4 [49]. With the birth rate of leukemic stem cells as α = 0.008 day−1 [39] we have the rate of point mutations as νpm = 8 × 10−8 day−1, while the rate of resistance due to gene amplifications is νga = 8 × 10−7 day−1. Therefore, using this information alone, the probability of resistance arising from a point mutation first is estimated to be 1/11. However, in the majority of cases, the primary mechanism of resistance is found to be point mutations [5].
Within our framework there are two possible explanations we can explore, both related to the reproductive capabilities of resistant cells. The first is simply that more lineages of amplified cells go extinct. The final paragraph in Section 2.4 indicates how to treat this case. Indeed, if ρpm is the average survival probability for a population initiated by a single cell with a point mutation and ρga the analogous quantity for gene amplification, then we find resistance arising first from a point mutation is more probable if ρpm > 10ρga. This is possible as point mutations may be deleterious or advantageous [36] whereas gene amplifications appear deleterious [48].
The second explanation, that suggested in [35], is that several gene amplifications are required to attain resistance. Suppose a gene amplification increases the death rate of cells by s per day. Then if two gene amplifications are required, ignoring the survival probability from before, the probability that resistance arises via point mutation is now
Resistance via points mutations is now more likely if s > 10−6. Note that Fig. 4a serves as a schematic for this scenario, reinterpreting the fitness valley path as resistance arising via gene amplifications and the direct path relating to resistance via point mutations. Further explanations for the primacy of point mutation mediated resistance exist and the assumptions above may not hold. But when they do hold, our results offer a clear framework to investigate such questions.
3.4 Antibacterial multidrug resistance
A timely problem is that posed by the emergence of multidrug resistant bacteria. We now explore some specific examples of our general framework that pertain to this issue. As before we assume resistance occurs a fitness cost and thus the (unmutated) wild-type has the highest growth rate. Using Theorem 2 and measured fitness costs and mutation rates we can predict the most likely path to multidrug resistance.
We consider the acquisition of resistance to both rifampicin and streptomycin in P. aeruginosa. Recently the fitness costs associated with resistance to rifampicin [27] and streptomycin [52] have been reported. Fitness costs were determined by the maximum growth rate of the bacteria undergoing exponential growth on media-rich plates. From [52] the ratio of streptomycin resistant bacteria to the wild-type’s growth rate was 0.71 (here the wild-type’s growth rate is estimated to be 5.1 mOD min−1 from Fig. 2.a in [52]). The same value for resistance to rifampicin, in [27], was 0.88. Let TRS be the time when multidrug resistance emerges assuming rifampicin resistance is acquired first, and TSR defined analogously for streptomycin resistance first. Then from Theorem 2, we have that
Hence, under this model, we would predict that rifampicin resistance emerges first en route to multidrug resistance. We note that above we have taken the average fitness costs associated with the differing antibiotics. Fluctuations have not been taken into account.
Instead of considering the the acquisition of resistance to different drugs within a bacterial strain, we can also examine the impact of differing mutation rates across strains. In [19] the heightened occurrence of multidrug resistance in lineage 2 vs lineage 4 from M. tuberculosis was investigated. Considering in particular resistance to rifampicin and isoniazid, via simulations they deduced that the heightened occurrence of resistance was due to elevated mutation rates in lineage 2 strains. The model simulated can be considered a discrete time counterpart to ours in the case of two paths of length 2 (see Fig. 1a). One of the main quantities of interest was the relative probability of resistance, when the only difference between the strains was the mutation rates. For a particular strain, initiated with one wild-type cell with wild-type birth and growth rates, α, λ, suppose we have mutation rate νR, νI to rifampicin and isoniazid, each incurring the same fitness cost s. Then from Theorem 1, we have
For lineage-i (i = 2, 4) let and be the mutation rates to rifampicin and isoniazid. As in [19], keeping all other parameters equal between the lineages we have the relative probaility of resistance
For times such that , we see the relative probability is simply . Taking the mutation rates measured in [19], this gives the relative probability of 22.06. This agrees with the simulation results in [19] who reported an approximately 22 fold increase. If in (18), we replace the λ2/α with λ and eλt with M, our expression is the same, to leading order in the mutation rates, as that given in [10] (equation 2) for the probability of resistance in a population of size M. This difference between fixed population size and fixed time results is the same as given in [8, 16].
3.5 Concluding remarks
In this article we have considered a continuous time birth-death process with transitions. The time until a population of a particular target state arises was investigated. This target state is accessible via one or more transitions from the initial population. Which sequence of transitions leads to the target population arising was explored. This gave rise to a probability distribution over the paths composed of transitions between differing vertices. Motivated by applications, the setting when the initial population was most fit was focused on.
The important factor in determining whether the target was reached via a specific path was seen to be the weight of the path. This is comprised of the transition rates and fitness costs associated with the intermediate vertices along the considered path. Both the target hitting time and the distribution of paths to the target have simple explicit formulas for small transition rates. The utility of these formulas was demonstrated on a variety of applications. The biological relations revealed would have been difficult to deduce by a simulation based approach.
Many previous studies have considered the probability of particular cell type emerging in a growing population, typically by a fixed time after the process starts or when the total population reaches a given size. However the majority deal with the target population being one or two transitions away from the initial population (the root vertex in this paper) [38, 32, 7, 10, 26, 8]. Two notable exceptions are [35] and [16]. In [35], the same model as that presented here is numerically explored when all vertices have the same fitness, and the implications on multidrug therapy failure is emphasised. With the motivation of driver mutations in cancer, [16] examines an approximation of the model studied in this paper, in the case where each transition leads to a fitness increase.
Justified criticisms have been made on the use of birth-death processes as modelling framework. In particular the potential for unlimited growth and the lack of interactions between the cells is unrealistic. However in systems with large carrying capacities and abundant resources, they provide a convenient approximation and are widely used. Whenever this framework is deemed appropriate, this article provides quick and accessible results which may be used to guide and develop biological insight.
4 Materials and Methods
4.1 Population growth
The main ingredient in proving Theorem 1, and consequently Theorem 2, is the long-time behaviour of the population of the initial and intermediate vertices. In this section we discuss this asymptotic behaviour.
Firstly we extend the definition of the total path weight given in Section 2.1. Let the set of paths between the root and vertex x be denoted 𝒫1,x. Then using the definition of path weight (2), we let the total weight for vertex x be for 2 ≤x ≤N. We further set ϕ1 = 1 which will allow us to conveniently describe the growth at vertex 1 also.
The target population can only be founded by transitions from cells residing at the neighbouring vertices (vertices connected to the target by an edge). Therefore, understanding the population growth of cells at these vertices is needed to discuss the timing and manner in which cells at the target arise. At large times, this understanding is provided by the following theorem, concerning the population at the initial and intermediate vertices. It follows from a more general result due to Janson [33], which we have tailored to our particular setting.
With probability one
Here W is distributed as the sum of K independent exponential random variables with parameter λ/α, where K is binomial with z trials and success probability λ/α.
The distribution of W may be called a binomial-Erlang mixture, being an Erlang distribution with a binomially distributed shape parameter. For the running example of a path graph, we see that asymptotically Zi+1(t) is related to Zi(t) by a multiplicative factor of . This is demonstrated in Fig. 3.
Using Theorem 3 and that the immigration rate into the target vertex from any target neighbouring vertex x is Zx(t)ν(x, N) we are able to prove the results of Section 2. Full proofs are provided in the supplementary materials.
Acknowledgments
We thank David Cheek, Stefano Avanzini and the Edinburgh physics population dynamics group for helpful discussions and feedback. MDN acknowledges support from EPSRC via a studentship.
References
- [1].↵
- [2].↵
- [3].↵
- [4].↵
- [5].↵
- [6].↵
- [7].↵
- [8].↵
- [9].↵
- [10].↵
- [11].↵
- [12].↵
- [13].↵
- [14].↵
- [15].↵
- [16].↵
- [17].↵
- [18].↵
- [19].↵
- [20].↵
- [21].↵
- [22].↵
- [23].↵
- [24].↵
- [25].↵
- [26].↵
- [27].↵
- [28].↵
- [29].↵
- [30].↵
- [31].↵
- [32].↵
- [33].↵
- [34].↵
- [35].↵
- [36].↵
- [37].↵
- [38].↵
- [39].↵
- [40].↵
- [41].↵
- [42].↵
- [43].↵
- [44].↵
- [45].↵
- [46].↵
- [47].↵
- [48].↵
- [49].↵
- [50].↵
- [51].↵
- [52].↵
- [53].↵
- [54].↵
- [55].↵
- [56].↵
- [57].↵
- [58].↵