Isolation-By-Distance-and-Time in a Stepping-Stone model

Nicolas Duforet-Frebourg; Montgomery Slatkin

doi:10.1101/024133

Abstract

With the great advances in ancient DNA extraction, population genetics data are now made of geographically separated individuals from both present and ancient times. However, population genetics theory about the joint effect of space and time has not been thoroughly studied. Based on the classical stepping–stone model, we develop the theory of Isolation by Distance and Time. We derive the correlation of allele frequencies between demes in the case where ancient samples are present in the data, and investigate the impact of edge effects with forward–in–time simulations. We also derive results about coalescent times in circular/toroidal models. As one of the most common way to investigate population structure is to apply principal component analysis, we evaluate the impact of this theory on plots of principal components. Our results demonstrate that time between samples is a non-negligible factor that requires new attention in population genetics.

1 Introduction

Geography plays a central role in the pattern of genetic differentiation within a species. Seminal work on describing the evolution of continuous populations was done by Wright and Malécot. They studied genetic differentiation and inbreeding in continuously distributed population (Wright, 1943; Malécot 1948). The resulting idea is that, under the assumption of local dispersion, genetic differentiation accumulates with distance. This pattern of genetic structure is called Isolation–By–Distance (IBD), which is detected by computing measures of differentiation such as F_ST (Wright, 1943; Nei, 1973; Weir and Cockerham, 1984), or correlation coefficients (Malécot 1955; Kimura and Weiss, 1964). Understanding the effect of geographic distance on population structure is an important task for population geneticist, as it is a source of neutral genetic variation (Slatkin, 1985; Rousset, 1997). Furthermore, IBD has been observed in humans and many other species (Sharbel et al., 2000; Castric and Bernatchez, 2003; Ramachandran et al., 2005; Hellberg, 2009; Karakachoff et al., 2015).

The role of geography in neutral genetic variation has been widely studied partly because of the existence of many population genetic studies of individuals living at the present time and sampled in different locations. Because of the development of methods for sequencing DNA from fossils, genomes of individuals alive at previous times are now available to bring new information about the evolutionary processes that affected a species in the past. Since the first studies of ancient DNA (aDNA) three decades ago (Higuchi et al., 1984; Pääbo, 1985), techniques to retrieve DNA molecules from ancient bones have tremendously developed (pääbo, et al., 2004).

In modern evolutionary biology, the similarity of differentiation in space and time is acknowledged (Depaulis et al., 2009; Andrello et al., 2011; Teacher et al., 2011). Theoretical developments predict the effect of time on F_ST and related quantities (Skoglund et al., 2014). Epperson (2000) studied patterns of isolation by distance and time in ecology by using stochastic spatial time series and Identity by descent probabilities However such theoretical studies remain scarce.

The effect of separation in time can be studied using classical statistical methods in population genetics, such as principal component analysis (PCA) (Patterson et al., 2006). PCA is widely used to determine relatedness between individuals, and is a convenient way to represent geographic patterns (Novembre et al., 2008). But PCA can also capture the differentiation between ancient and modern samples: the percentage of variance explained by time can be expressed on the same scale as the percentage of variance explained by geography (Skoglund et al., 2014). Unfortunately, PCA does not give a complete picture of how differentiation quantities such as F_st and correlations of allele frequencies evolve in time and space.

In this article we generalize the theory of IBD to allow for difference in the times at which different individuals are sampled. We call this the theory of isolation by distance and time (IBDT). We base our work on the stepping–stone model of Kimura (1953) and add to the theoretical results known for this model (Kimura and Weiss, 1964; Weiss and Kimura, 1965; Maruyama, 1971a; Nagylaki, 1983; Cox et al., 2002; De and Durrett, 2007). We start by briefly reviewing the original results for the infinite stepping– stone model at equilibrium and the decay of correlation of allele frequencies with distance. Then, we extend the original work to derive the correlation between individuals separated by distance and time. We perform simulations that show the validity of the analytic results, even in the case of a finite number of populations where some demes are subject to edge effect. We also derive the expected coalescence times between samples separated by time and space in circular and toroidal models (Slatkin, 1991, 1993). Finally we consider the consequences of IBDT on PCA in the common case of a dataset made of a large proportion genomes from present–day individuals and few ancient genomes.

2 The stepping–stone model

The stepping–stone model describes the distribution of allele frequencies in an infinite set of demes in different locations of the space represented by Cartesian coordinates. We start by describing the 1-Dimensional case. Let p(k) be the frequency of one allele at a bi-allelic locus in population k and be the overall allele frequency. In each generation, p(k) is updated with the following three steps (Crow et al., 1970):

Exchange a proportion m_i of migrants with demes at a distance i.
Exchange a proportion m_∞ of migrants with a deme that has fixed allele frequency . The meaning of this step is discussed later.
Sample gametes of the next generation in the population.

In the case considered by Kimura and Weiss (1964), migrants are exchanged only between neighboring locations in the first step, so that m_i = 0, i > 1. The second step consists of exchanges of migrants with an external population at rate m_∞. This event is equivalent to reversible mutations occurring. The formulation of the model states that every locus is bi-allelic, and the number of loci is fixed. As a consequence, the mutation model is a reversible mutation model with probability m_∞, and m_i > > m_∞. Random sampling of step 3 is represented by a random change in the allele frequency ϵ(k), with E[ϵ(k)] = 0, and E[ϵ(k)²] = p(k)(1 - p(k))/2N_e, where N_e is the effective population size of a deme (Wright, 1940; Kimura and Crow, 1963).

Our interest is in the changes in allele frequency in one generation. We consider , the deviation from the average frequency. Given these three steps,

To simplify the notation, we define the operators S and L, so that,

The quantity of interest in this model is the correlation of allele frequencies between two demes at locations k₁ and k₂. Let r(k) be the correlation coefficient of allele frequencies between populations that are k steps apart. Assuming equilibrium, we have where ρ(k) is the covariance in frequencies in demes k steps apart. The mathematical treatment of equation (5) by Weiss and Kimura (1965) using the spectral representation of a correlation (Doob, 1953) gives the general formula where C is the normalizing constant.

Equation (6) can be approximated by an exponential function of k:

This simple formula conveys the important idea that in one dimension, the correlation of allele frequencies between populations decays exponentially with distance. In the 2–Dimensional and 3–Dimensional cases, the correlation function is more difficult to approximate. Using modified Bessel function, it is shown that correlation at a given distance is lower in these cases than in the 1–Dimensional case (Weiss and Kimura, 1965).

3 Isolation–by–Distance–and–Time

3.1 1–Dimensional case

We are here interested in the case where genetic samples are collected from demes that are in different locations and at different times (measured in generations). Let ρ(k, t) be the covariance between allele frequencies of two demes separated by k steps and t generations. We denote the coordinates of these demes by (k₁, t₁) and (k₂, t₂), and the deviations in allele frequencies and Since we assume the distribution of allele frequencies is stationary in both time (equilibrium distribution) and space (all migration rates are equal), we can consider these coordinates to be (0, 0) and (k, t) with no loss of generality. Following previous notation

To characterize the evolution of the covariance between allele frequencies with respect to time t, we iteratively apply the operator L defined in equation (3). This operation describes the potential trajectories of an allele, and results in a quantity similar to a propagator. This process leads to with ρ(k) = ρ(k, 0) (see Appendix A).

Let r(k, t) be the correlation between allele frequencies of two demes separated by k steps and t generations, equations (5) and (9), combined with the general formula of equation (6) gives and the constant C is set such that r(0, 0) = 1 (Appendix B).

This equation reduces to in the standard stepping–stone model, where demes only exchange migrants with their closest neighbors at rate m₁/2. An exact formula for this integral can be calculated and is notable for its size and lack of utility (Appendix C).

One noteworthy feature of equation (10) is that the decay of the correlation with time is not affected by the effective population size N_e. This result is different from what is expected for an isolated population: the level of differentiation as a function of the number of generations separating two samples is larger when the effective population size is small, reflecting the increased magnitude of genetic drift. However, in the particular case of an equilibrium stepping–stone model, the covariance of allele frequencies between the demes is not a function of the effective population size, a result already known in the spatial context (see equation (7)) (Kimura and Weiss, 1964). This result becomes clear when considered in terms of coalescence times. Between the time the first and second samples are taken, the trajectory of the first sample depends only on the migration process. There is no possibility of coalescence.

3.2 Two dimensions and more

So far, we have focused on the 1-Dimensional case for the sake of simplicity. However, it is important to investigate the decay in higher dimensions as it is common in practice to have samples taken from a 2-Dimensional or even 3-Dimensional habitat. The general formula for the correlation in higher dimensions can be obtained with no more theoretical development. In their work on the stepping–stone model, Kimura and Weiss derived a general formula for the correlation that can be extended to any number of dimensions. In their work they only gave approximations for 1, 2 or 3 dimensions as these are the practical cases. Using general formula (3.11) of Weiss and Kimura (1965), we can write the correlation 10 in 2 dimensions

The generalization to obtain the correlation in n dimensions is straight– forward (Appendix D).

We perform a numerical integration of equation (12) to investigate the decay of correlation with distance and time in one dimension and higher. Correlation decreases as a function of distance and time in 1, 2 and 3 dimensional models (Figure 1). In addition, for equal values of the migration and mutation rates the correlation decrease is much larger with respect to time and geography in higher dimension models, consistently with previous results (Maruyama, 1970a, 1971a). Numerical integration is done using the R package cubature.

Figure 1:

Correlation as a function of distance between demes k steps appart in 1, 2 and 3-Dimensional models. The correlation is evaluated for different number of generations t between the demes. The migration and mutation rates are used for all models, and m₁ = .01 and m_∞ = 4.10⁻⁴.

3.3 Simulations in one dimension and two dimensions

When considering realistic examples, a finite number of demes is present in the data. As a consequence, correlation patterns are affected by the proximity of the edge of the sampling scheme (Maruyama, 1970b). Another effect of the finite number of demes is that the overall allele frequency can drift away from the theoretical allele frequency. An alternative is to consider a finite, non-circular model, and to deal with edge issues independently (Felsenstein, 2015). To investigate to which extent the analytic theory developed in the previous section is valid in a finite stepping–stone model with temporal sampling, we perform simulations.

Backward in time simulation software such as ms (Hudson, 2002), or fastsimcoal (Excoffier and Foll, 2011), are usually used to investigate IBD in a stepping–stone models (Novembre et al., 2008). Temporal sampling can be investigated in such model by simulating gene trees where lineages from isolated demes are joined to the stepping–stone demes at a chosen time in the past (Skoglund et al., 2014). Mutations are then randomly placed on the gene tree. Such a simulation is needed to understand the influence of time and distance on genetic differentiation, but does not precisely reproduce the process of the above model which assumes reversible mutation rather than the infinite site model. The infinite site model does not have a true equilibrium for any one site, only a pseudo–equilibrium.

We wrote a C program that performs forward in time simulations. The simulation program precisely follows the model presented in the previous section. At the initial time, the allele frequencies in all the demes are equal to the allele frequencies in the outside infinite–sized population. Then the program runs for a large number of generations until the stationary distribution of the allele frequencies is reached.

In the 1-Dimensional case, we simulate 100 demes. For the 2-Dimensional case, we simulate a total of 2500 demes on a 50 × 50 grid. We assume all the demes have the same effective population size. We sample the allele frequencies at several times in the past. Correlation between demes fit very closely the theory of equations (11) and (12) provided that demes are taken sufficiently far away from the edge of the grid (Figure 2). The edge effect directly increases the correlation between demes, and is present when com paring present and ancient samples. In both 1 and 2 dimensions, the edge effect disappears in the simulations (Figure 3). As predicted by Maruyama, the edge effect is less strong with lower migration rates.

Figure 2:

Comparison between theoretical results and simulations in the 2 dimensional case with m₁ = .02 and m_∞ = 10⁻⁵. The solid lines represent the theory prediction. The dots represent the simulation results evaluated for demes at a distance 4, 10 or 16 from the edges. Since in the simulations several pairwise comparisons between demes have the same distance in space and time, we keep the median of these pairwise correlations.

Figure 3:

Mean squarred error between simulations and theory in 1 and 2 Dimensions as a function of the distance to the edge. The error is evaluated for m_∞ = 10⁻⁵ and m₁ = .01, .005, .001, .0005.

4 Coalescence times

4.1 Coalescence times in one dimension

Coalescence times in a stepping–stone model can be derived under some assumptions. In particular, we consider a case with migration only between neighboring demes and low mutation rate. Expected coalescence times between genes that are in different demes is a function of the locations of these demes. These coalescence times are of interest because they closely related to F_ST and coefficients of identity–by–descent (Slatkin, 1991). Under the assumption of a circular 1-Dimensional stepping-stone model with n_d demes, two genes A₁ and A₂ have an expected coalescence time where N_e is the effective population size per deme, m the migration rate between neighboring demes (previously m₁), and k is the distance between the two demes (Slatkin, 1991). Considering a circular arrangement of the demes makes the analysis simpler, as only the distance between the demes matters, and there are no edge effects. In addition it has been shown that linear/planar and circular/toroidal stepping stone models are very similar when considering population away from the edges (Maruyama, 1971a,b). To study a case similar to the infinite stepping–stone model, we assume n_d is large.

We extend the previous theoretical result in the case where two genes are sampled at different times. Let us assume that the sampled genes are in population k₁ and k₂. The number of generations between the two sampling times is t = t₁ - t₂, and we assume, with no loss of generality, that t₁ = 0 and t₂ = t generations in the past. The coalescence process between these two genes can be divided in three phases. The first phase corresponds to the genealogy that traces back to the ancestor of the present gene, called , at generation t. This ancestor is in population . The two other parts correspond to the time until the coalescence event between and A₂. They are respectively the time until the gene and A₂ are in the same deme, then the time to the common ancestor of these two genes. This part has already been described, and the expectation is given in equation (13) (Slatkin, 1991). The expected coalescence time between A₁ and A₂ is then written

The variable is the coalescence time between a random gene in the unknown population and a random gene in population k₂. To represent the uncertainty about the population , we derive the probability distribution of the position at time t, given position k₁ at time 0. Using this probability distribution we rewrite the expectation (14) as

To describe the probability distribution of position at time t given that a gene is in population k₁ at time 0, we consider a random walk with transition matrix

Using standard results about Markov chains (Ross et al., 1996), we know that the vector of probabilities for the position at time t, is expressed such as with P_k₁ is the initial probability distribution of gene A₁’s position. The initial probability distribution is trivial and P_k₁ is a vector of 0 with a 1 in the entry. Exact formula for this matrix power can be obtained using tridiagonal matrix properties (Al-Hassan, 2012). However we can also express an approximation for the probability distribution of this process at time t. This random process is symmetrical, centered in k₁, and using classical results about Brownian motion, has a variance proportional to t. We can approximate the probability distribution by a Normal distribution, and

The accuracy of this approximation can be verified with simulations using equation (17). The approximation is relevant for sufficiently large values of t, depending on the migration rate. The expected coalescence time in a 1-Dimensional circle can then be written

Coalescence time between genes is an increasing function of distance and time between demes (Figure 4). Asymptotically, when t is large, the expected time for two genes to be in the same population can be approximated by a linear function of time between the samples (Figure 4). The right part of equation (19) is the integral of a product of a positive function that depends only on the distance between demes and a Gaussian kernel with variance mt. As the time gets large, relatively to m, the Gaussian kernel becomes flat, and the integral is almost constant (Figure 4). In practice, this implies that in a population at equilibrium, the geography does not matter when the sample is very old.

Figure 4:

Top row: Expected time for two genes to be in a same deme in a 1–Dimensional circular stepping–stone model with N_e = 100, m = .01, and n_d = 51 demes. Bottom row: Expected time for two genes to be in a same deme in a 2–Dimensional toroidal stepping–stone model with N_e = 100, m = .01, and n_d = 51×51 demes. Left column: Expected times as a function of the time between the samples. Colors indicate the geographic distance between samples. Right column. Expected times as a function of geographic distance between the samples. Colors indicate the time between samples. Sampling consists in 45 time points evenly separated by 50 generations.

4.2 Coalescence times in two dimensions

In the case of a 2-Dimensional habitat with n_d1 × n_d2 demes, the expected coalescence time between two genes A₁ and A₂ is where S(i, j) is a function of i and j, the number of demes between the two genes. We assume in this case that the migration in each direction is the same.

Using the same conditioning as in equation (14), we can derive the expectation for the coalescence time of genes A₁ in population k₁ and A₂ in population k₂, t generations in the past. We have

The probability distribution of the position of gene A₁ at time t, is known using the same random walk as in the 1-Dimensional case. The distribution can be approximated by a bivariate Normal distribution with mean k₁, and covariance matrix Σ, where Σ is diagonal with terms mt/2 in the diagonal. In the anisotropic case where migration rate would be different in the two dimensions, m₁ and m₂, Σ would have m₁t and m₂t as diagonal terms. The evaluation of this function for samples separated in distance and time shows a similar pattern to the 1-Dimensional case (Figure 4). However for a same migration rate, the expected times for two genes to be in the same deme in the 2–Dimensional toroidal model are smaller than in the 1– Dimensional circular model. Then, if there is the same number of demes, with same effective population sizes, e.g. n_dN_e = n_d1n_d2N_e, the expected coalescence times are smaller in the 2–Dimensional case. This result is already known when comparing samples taken at the same generation and remains true when t is positive (Slatkin, 1993).

5 Connection with PCA

Because there is a close connection between PCA and coalescence times (McVean, 2009), our results are relevant to using PCA to compare ancient and modern samples. PCA is a useful way to represent the main axes of variation in the data and has proven to be a powerful tool to infer genetic relationships when applied to ancient DNA data(Skoglund et al., 2012; Haak et al., 2015).

5.1 Ancient samples are shrunk towards 0

In population genetics, PCA is usually performed by computing the eigenvectors, and eigenvalues of the matrix of covariances in the genotypes of different individuals. Although there are other ways to compute principal components, this one is convenient in population genetics because the number of variables is usually larger by several orders of magnitude than the number of samples. The effect of differences in the sampling times can be evaluated using the dependence of the covariance matrix described by equation (10). To illustrate, consider a 2-Dimensional even repartition of 10 × 10 demes, and ancient samples taken in several randomly chosen demes at t = 1000 generations in the past (Figure 5A). By calculating the theoretical covariance matrix and its first two eigenvectors, we obtain the first two principal components that reproduce geography of the demes (Novembre et al., 2008; Engelhardt and Stephens, 2010). Figure (5B) shows that principal components mimic the geography of the present demes, but ancient demes are not superposed on the corresponding present-day sample from the same deme. Instead, ancient samples move towards the center of the first and second principal components.

Figure 5:

Panel A. Sampling scheme of a 10×10 grid of demes. Brown triangles represent demes where ancient individuals are sampled 1000 generations in the past. Panel B. First 2 eigenvectors of the covariance matrix between populations of Panel A. Parameters used are m₁ = .01 and m_∞ = 10⁻⁵. Color code is the same as in Panel A. Brown arrows start from the position of the present deme where an ancient sample is taken, and end where the ancient sample is projected on the principal components.

Using 100 demes from a 1-Dimensional simulation described above, we apply PCA to the allele frequencies at the 6000 simulated loci. To remove the edge effect, we simulate 200 demes, and consider only the 100 demes in the center. We also include allele frequencies from past generations for several demes. PC1 shows the 1-Dimensional pattern of isolation–by–distance as expected, and ancient samples are closer to 0 (Figure 6A). The distance between the scores of ancient individuals and the center of the principal component decreases as the sampling time increases. In practice, the true allele frequencies are not known, and the covariance matrix is estimated on individuals. When working with sampled individuals instead of allele frequency, the same pattern is still visible. A subsampling of 10 diploid individuals for each deme at the present time, and 1 diploid individual for each ancient deme shows the same shrinkage of PC scores for ancient individuals (Figure 6B).

Figure 6:

Panel A. First principal component for the 1-Dimensional simulation described above, with m₁ = .01 and m_∞ = 4.10⁻⁵. PCA is performed on allele frequency data from each of the 100 demes, and ancient allele frequencies are taken in 5 populations at 8 times in the past. Panel B. First principal component for the 1-Dimensional simulation described above. In each deme, 10 diploid individuals are sampled at the present time. One diploid individual is sampled in 5 demes at 8 times in the past. Panel C. Sampling scheme of a 10 × 10 grid of populations. Demes marked by a triangle are demes where ancient individuals were sampled. Panel D. plot of P C1 and P C2 for the 2-Dimensional simulation with m₁ = .001 and m_∞ = 10⁻⁵. Ancient samples are taken at different times in the past for 4 demes.

When applying PCA on allele frequencies from the 2-Dimensional simulations, the time effect is visible on the first two components. We study the case of a 10 × 10 grid, with no edge effects, and ancient samples taken from 4 demes at different times in the past (Figure 6C). The first and second principal components reproduce the geography of the samples, and the ancient samples are moved towards the center of the plot (Figure 6D).

This shrinkage effect of time can be understood considering the shape of the covariance function. The first and second principal components represent the 2–Dimensional Isolation–By–Distance pattern. This pattern causes the covariance matrix at time t = 0 to have a “block Toeplitz with Toeplitz blocks” form (Novembre and Stephens, 2008). However the pairwise covariance between present-day individuals (t = 0) and between ancient and present-day individuals (t > 0) does not have the same shape (Figure 1). Equation (10) implies that in a stepping–stone model the covariance as a function of distance flattens when comparing present and ancient individuals. As a consequence, the scores of ancient samples are moved towards the center of the principal components reproducing the local correlation pattern. Thus ancient samples can cluster with present samples at different locations, even in an equilibrium stepping–stone model.

5.2 One component for the time differentiation

Links between PCA and population genetics quantities, such as coalescence times and F_ST have been studied (McVean, 2009; Duforet-Frebourg et al., 2015; Baran and Halperin, 2015) and show that these values can be estimated from principal components. In the 2–population case, McVean (2009) showed that the distance between individuals on the appropriate principal component is approximately a linear function of the square root of the time, ∆, until the lineages of the two individuals are in the same deme. If there are ancient and present samples, they can be considered as two groups, and ∆ is the time corresponding to the first two parts of the coalescence process between the lineages, described in the previous section. The time separating the individuals is a source of variance important enough to be reflected in the principal components (Skoglund et al., 2014). In this case, one component separates the two groups and the distance between groups is approximately proportional to . In Appendix E, we compute the expectation of ∆ if there are several present-day and one ancient individuals sampled.

We analyze the case with 50 contiguous populations sampled from a circular 1-Dimensional stepping–stone model with n_d = 1000. We assume m₁ = 0.1, and one deme is sampled in the past. We apply PCA by computing the eigenvectors of the individuals correlation matrix. The first principal component represents the IBD pattern between the present demes (Figure 7A). The second principal component corresponds to the differentiation between the ancient deme, and the present demes. The average distance on PC2 between the two groups (present and ancient) is an increasing function that can be approximated by a linear function of the square root of ∆ (Figure 7B).

Figure 7:

Panel A. Principal components for a 1-Dimensional stepping–stone with 50 present demes, and 1 ancient deme. The PCA is performed several times, with an ancient deme sampled at different times. Panel B. Average distance between present demes and ancient deme on PC2 as a function of

6 Conclusions and discussion

We have generalized the Kimura–Weiss theory of a stepping–stone model to the case where samples are taken at different times, a theory we call Isolation-by distance-and-time (IBDT). The correlation between individuals decreases as a function of both geographic distance and time. This result is accentuated in higher dimensions. When considering IBDT patterns, the edge effect applies when considering a linear model with a finite number of demes, similarly to the standard stepping–stone model. However simulations shows that in both 1 and 2 dimensions, this effect vanishes at a rate depending of the migration rate. We have also derived the expected coalescence times under the assumption of a circular, or toroidal model and low mutation rate. As the time between samples increases, the coalescence time between samples can be approximated by a linear function of time.

The connection between IBDT theory and PCA is of interest as it gives insights about what to expect from the PC plots that compare ancient and present-day samples. When considering only principal components reproducing geography, scores of the ancient samples may not cluster with the population at the same location. Such a result can occur even in the case of a population at equilibrium in a stepping–stone model, with no complex demographic history. This behavior of PCA is important to note as it could result in the inference of a non-existent ancient demographic event. The genetic differentiation created by time can be observed on another principal component. An important question that remains is in which conditions the proportion of variance explained by time is larger than the proportion of variance explained by Geography. In this unlikely event, the first principal component would not reproduce geography of the samples but rather the time line of the samples.

The limitations of PCA to investigate population structure in a spatio– temporal context highlights the need for new theoretical developments to analyze population structure when present-day and ancient samples are combined. This is especially apparent when considering the complex demographic scenarios already inferred about the history of modern humans (Pickrell and Reich, 2014). Important theoretical work has already been done to test specific hypothesis (Durand et al., 2011; Loh et al., 2013). Another way to test different past demographic events is with intensive simulation procedures, such as Approximate Bayesian Computations (Beaumont et al., 2002; Csilléry et al., 2010). In this case, theoretical developments on mechanistic models such as the stepping–stone model are important to perform simulations efficiently (Baird and Santos, 2010).

We studied the classical stepping–stone model under the assumptions of a stationary distribution of the allele frequencies in both time and space. These assumptions are not valid in all cases. The time–stationary distribution is not reached when recent events such as range expansions occurred, causing asymmetry in the site frequency spectrum (Hallatschek et al., 2007; Peter and Slatkin, 2013). Spatial non–stationarity and anisotropy can occur when migration pattern is uneven between all populations, or migration is favored in one direction (Jay et al., 2013; Duforet-Frebourg and Blum, 2014; Petkova et al., 2014). The correlation of allele frequencies is then not only a function of space and time, but also of the locations of each deme in the habitat.

A stepping–stone model is not the only model to describe spatial population structure. As an alternative to discrete models, continuous models can also be considered to study evolutionary processes (Maruyama, 1972; Barton et al., 2002, 2010). Isolation–by-Distance–and–Time can be studied in continuous framework. In the same way, results about coalescence times in a stepping–stone model can be connected to previous theory on coalescence in a continuous population (Wilkins and Wakeley, 2002).

Acknowledgement

This work was supported by NIH grant R01-6M40282 to M. Slatkin.

Appendix A

Using the notations in Weiss and Kimura (1965), we calculate the covariance of the allele frequencies ρ(k) between two populations that are spatially separated by k units of distance. This quantity is defined by

In the case where the demes are also separated by t units of time, we define and in the particular case of t = 1,

By induction, we show that for any value of t > 0

Let’s assume that for a time t > 0 equation (24) is true,

Then to obtain the correlation of allele frequencies r(k, t) between two demes, we have ρ(0, 0) = ρ(0) and

Appendix B

We established in equation (11) that r(k, t) = L^tr(k), and using the general expression in equation (6) we have,

It is now demonstrated that where for the convenience of the notation we denote m₀ = (1 − m_∞ − . In the particular case of t = 1 we have

Now assuming that formula (26) holds for any value t > 0, we have

We can conclude by induction that formula (26) is true for any positive t. Then, using equation (26), a general formula for r(k, t) can be expressed

Constant C is set such that r(0, 0) = 1. We do not analytically investigate this constant, however details about the case t = 0 can be found in Weiss and Kimura (1965).

Appendix C

Let’s assume the particular stepping-stone model: . Now the correlation between 2 demes k steps apart and t generations is

The fraction can be decomposed in two parts r(k, t) = C/(2π)(A₁(k, t) + A₂(k, t)) using partial fraction expansion, where

. Let α = (1 - m₁ - m_∞)/m₁, we can expand A₁ and A₂,

To get rid of the integral, we can use the fact that where and as given in Weiss and Kimura (1965)

This leads us to the expressions for A₁ and A₂,

Appendix D

The 2-Dimensional case of the analysis can be detailed by changing the operators L and S. We note the cartesian coordinates of each deme with the couple (i₁, i₂), and we define the operators S₁ and S₂ such as

The operator L in two dimensions becomes where is the migration rate between demes separated by i₁ and i₂ steps. The correlation in 2 dimensions can be written using the spectral decomposition and for two demes we have for two populations that are separated by k₁ and k₂ steps at the same generation. Using the same trigonometric properties as in appendix B, we have and . As a consequence, the correlation of allele frequencies in 2 dimensions between two populations separated by k₁ and k₂ steps, and t generations is

To go further, and especially investigate the 3-Dimensional case that can be relevant in practice, it is possible to extend the calculations in n-dimensional models, where two populations are separated by t generations and a vector of steps (k₁,… k_n). Redefining the operators S and L, we can show that the correlation is

Appendix E

We detail the case where two groups are present in the data, the present demes and the ancient deme. The quantity ∆ is the time for two genes in different groups to be in the same group. In the case where there is one ancient deme k₂ and one present deme k₁, using equation (19) we have

In the practical case we consider several present time demes 1… n_p, and one ancient deme. The expectation of ∆ has to be conditioned by the probability that A₁ is in a given present population k₁.

Since we consider a stepping–stone model where all the populations have the same effective population size, we have p(k₁ = j) = 1/n_p, j = 1… n_p.

References

↵
Al-Hassan, Q. (2012). On powers of tridiagonal matrices with nonnegative entries. Journal of Applied Mathematical Sciences, 6(48):2357–2368.
OpenUrl
↵
Andrello, M., Bevacqua, D., Maes, G. E., and De Leo, G. A. (2011). An integrated genetic-demographic model to unravel the origin of genetic structure in european eel (anguilla anguilla l.). Evolutionary applications, 4(4):517–533.
OpenUrl CrossRef PubMed Web of Science
↵
Baird, S. J. and Santos, F. (2010). Monte carlo integration over stepping stone models for spatial genetic inference using approximate bayesian computation. Molecular ecology resources, 10(5):873–885.
OpenUrl
↵
Baran, Y. and Halperin, E. (2015). A note on the relations between spatiogenetic models. Journal of Computational Biology.
↵
Barton, N. H., Depaulis, F., and Etheridge, A. M. (2002). Neutral evolution in spatially continuous populations. Theoretical population biology, 61(1):31–48.
OpenUrl CrossRef PubMed Web of Science
↵
Barton, N. H., Etheridge, A. M., and Véber, A. (2010). A new model for evolution in a spatial continuum. Electron. J. Probab, 15(7).
↵
Beaumont, M. A., Zhang, W., and Balding, D. J. (2002). Approximate bayesian computation in population genetics. Genetics, 162(4):2025–2035.
OpenUrl Abstract/FREE Full Text
↵
Castric, V. and Bernatchez, L. (2003). The rise and fall of isolation by distance in the anadromous brook charr (salvelinus fontinalis mitchill). Genetics, 163(3):983–996.
OpenUrl Abstract/FREE Full Text
↵
Cox, J. T., Durrett, R., et al. (2002). The stepping stone model: New formulas expose old myths. The Annals of Applied Probability, 12(4):1348– 1377.
OpenUrl
↵
Crow, J. F., Kimura, M., et al. (1970). An introduction to population genetics theory. An introduction to population genetics theory.
↵
Csilléry, K., Blum, M. G., Gaggiotti, O. E., and François, O. (2010). Approximate bayesian computation (abc) in practice. Trends in ecology & evolution, 25(7):410–418.
OpenUrl CrossRef PubMed
↵
De, A. and Durrett, R. (2007). Stepping-stone spatial structure causes slow decay of linkage disequilibrium and shifts the site frequency spectrum. Genetics, 176(2):969–981.
OpenUrl Abstract/FREE Full Text
↵
Depaulis, F., Orlando, L., and Hänni, C. (2009). Using classical population genetics tools with heterochroneous data: time matters! PLoS One, 4(5):e5541.
OpenUrl CrossRef PubMed
↵
Doob, J. L. (1953). Stochastic processes, volume 101. New York Wiley.
↵
Duforet-Frebourg, N. and Blum, M. G. (2014). Nonstationary patterns of isolation–by–distance: inferring measures of local genetic differentiation with bayesian kriging. Evolution, 68(4):1110–1123.
OpenUrl
↵
Duforet-Frebourg, N., Laval, G., Bazin, E., and Blum, M. G. (2015). Detecting genomic signatures of natural selection with principal component analysis: application to the 1000 genomes data. arXiv preprint arXiv:1504.04543.
↵
Durand, E. Y., Patterson, N., Reich, D., and Slatkin, M. (2011). Testing for ancient admixture between closely related populations. Molecular biology and evolution, 28(8):2239–2252.
OpenUrl CrossRef PubMed Web of Science
↵
Engelhardt, B. E. and Stephens, M. (2010). Analysis of population structure: a unifying framework and novel methods based on sparse factor analysis. PLoS Genet, 6(9):e1001117.
OpenUrl CrossRef PubMed
↵
Epperson, B. K. (2000). Spatial and space–time correlations in ecological models. Ecological modelling, 132(1):63–76.
OpenUrl
↵
Excoffier, L. and Foll, M. (2011). Fastsimcoal: a continuous-time coalescent simulator of genomic diversity under arbitrarily complex evolutionary scenarios. Bioinformatics, 27(9):1332–1334.
OpenUrl CrossRef PubMed Web of Science
↵
Felsenstein, J. (2015). Covariation of gene frequencies in a stepping-stone lattice of populations. Theoretical population biology, 100:88–97.
OpenUrl
↵
Haak, W., Lazaridis, I., Patterson, N., Rohland, N., Mallick, S., Llamas, B., Brandt, G., Nordenfelt, S., Harney, E., Stewardson, K., et al. (2015). Massive migration from the steppe was a source for indo-european languages in europe. Nature.
↵
Hallatschek, O., Hersen, P., Ramanathan, S., and Nelson, D. R. (2007). Genetic drift at expanding frontiers promotes gene segregation. Proceedings of the National Academy of Sciences, 104(50):19926–19930.
OpenUrl Abstract/FREE Full Text
↵
Hellberg, M. E. (2009). Gene flow and isolation among populations of marine animals. Ecology, Evolution, and Systematic.
↵
Higuchi, R., Bowman, B., Freiberger, M., Ryder, O. A., and Wilson, A. C. (1984). Dna sequences from the quagga, an extinct member of the horse family. Nature.
↵
Hudson, R. R. (2002). Generating samples under a wright–fisher neutral model of genetic variation. Bioinformatics, 18(2):337–338.
OpenUrl CrossRef PubMed Web of Science
↵
Jay, F., Sjödin, P., Jakobsson, M., and Blum, M. G. (2013). Anisotropic isolation by distance: the main orientations of human genetic differentiation. Molecular biology and evolution, 30(3):513–525.
OpenUrl CrossRef PubMed
↵
Karakachoff, M., Duforet-Frebourg, N., Simonet, F., Le Scouarnec, S., Pellen, N., Lecointe, S., Charpentier, E., Gros, F., Cauchi, S., Froguel, P., et al. (2015). Fine-scale human genetic structure in western france. European Journal of Human Genetics, 23(6):831–836.
OpenUrl CrossRef PubMed
↵
Kimura, M. (1953). stepping stonem odel of population. Ann. Rept. Nat. Inst. Genetics Japan, pages 62–63.
↵
Kimura, M. and Crow, J. F. (1963). The measurement of effective population number. Evolution, pages 279–288.
↵
Kimura, M. and Weiss, G. H. (1964). The stepping stone model of population structure and the decrease of genetic correlation with distance. Genetics, 49(4):561.
OpenUrl FREE Full Text
↵
Loh, P.-R., Lipson, M., Patterson, N., Moorjani, P., Pickrell, J. K., Reich, D., and Berger, B. (2013). Inferring admixture histories of human populations using linkage disequilibrium. Genetics, 193(4):1233–1254.
OpenUrl Abstract/FREE Full Text
↵
Malécot, G. (1948). mathéematiques de l’héereéeditée. Paris: Masson etCie.
↵
Malécot, G. (1955). The decrease of relationship with distance. In Cold Spring Harbor Symp. Quant. Biol, volume 20, pages 52–53.
OpenUrl
↵
Maruyama, T. (1970a). Rate of decrease of genetic variability in a subdivided population. Biometrika, 57(2):299–311.
OpenUrl CrossRef
↵
Maruyama, T. (1970b). Stepping stone models of finite length. Advances in Applied Probability, pages 229–258.
↵
Maruyama, T. (1971a). Analysis of population structure: Ii. twodimensional stepping sone models of finite length and other geographically structured populations*. Annals of human genetics, 35(2):179–196.
OpenUrl PubMed Web of Science
↵
Maruyama, T. (1971b). The rate of decrease of heterozygosity in a population occupying a circular or a linear habitat. Genetics, 67(3):437.
OpenUrl FREE Full Text
↵
Maruyama, T. (1972). Rate of decrease of genetic variability in a twodimensional continuous population of finite size. Genetics, 70(4):639–651.
OpenUrl Abstract/FREE Full Text
↵
McVean, G. (2009). A genealogical interpretation of principal components analysis. PLoS Genet, 5(10):e1000686.
OpenUrl CrossRef PubMed
↵
Nagylaki, T. (1983). The robustness of neutral models of geographical variation. Theoretical Population Biology, 24(3):268–294.
OpenUrl CrossRef Web of Science
↵
Nei, M. (1973). Analysis of gene diversity in subdivided populations. Proceedings of the National Academy of Sciences, 70(12):3321–3323.
OpenUrl Abstract/FREE Full Text
↵
Novembre, J., Johnson, T., Bryc, K., Kutalik, Z., Boyko, A. R., Auton, A., Indap, A., King, K. S., Bergmann, S., Nelson, M. R., et al. (2008). Genes mirror geography within europe. Nature, 456(7218):98–101.
OpenUrl CrossRef PubMed Web of Science
↵
Novembre, J. and Stephens, M. (2008). Interpreting principal component analyses of spatial population genetic variation. Nature genetics, 40(5):646–649.
OpenUrl CrossRef PubMed Web of Science
↵
Pääbo, S. (1985). Molecular cloning of ancient egyptian mummy dna. Nature.
↵
Pääbo, S., Poinar, H., Serre, D., Jaenicke-Després, V., Hebler, J., Rohland, N., Kuch, M., Krause, J., Vigilant, L., and Hofreiter, M. (2004). Genetic analyses from ancient dna. Annu. Rev. Genet., 38:645–679.
OpenUrl CrossRef PubMed Web of Science
↵
Patterson, N., Price, A. L., and Reich, D. (2006). Population structure and eigenanalysis. PLoS Genetics.
↵
Peter, B. M. and Slatkin, M. (2013). Detecting range expansions from genetic data. Evolution, 67(11):3274–3289.
OpenUrl CrossRef PubMed Web of Science
↵
Petkova, D., Novembre, J., and Stephens, M. (2014). Visualizing spatial population structure with estimated effective migration surfaces. bioRxiv, page 011809.
↵
Pickrell, J. K. and Reich, D. (2014). Toward a new history and geography of human genes informed by ancient dna. Trends in Genetics, 30(9):377–389.
OpenUrl CrossRef PubMed
↵
Ramachandran, S., Deshpande, O., Roseman, C. C., Rosenberg, N. A., Feldman, M. W., and Cavalli-Sforza, L. L. (2005). Support from the relationship of genetic and geographic distance in human populations for a serial founder effect originating in africa. Proceedings of the National Academy of Sciences of the United States of America, 102(44):15942–15947.
OpenUrl Abstract/FREE Full Text
↵
Ross, S. M. et al. (1996). Stochastic processes, volume 2. John Wiley & Sons New York.
↵
Rousset, F. (1997). Genetic differentiation and estimation of gene flow from f-statistics under isolation by distance. Genetics, 145(4):1219–1228.
OpenUrl Abstract/FREE Full Text
↵
Sharbel, T. F., Haubold, B., and Mitchell-Olds, T. (2000). Genetic isolation by distance in arabidopsis thaliana: biogeography and postglacial colonization of europe. Molecular Ecology, 9(12):2109–2118.
OpenUrl CrossRef PubMed Web of Science
↵
Skoglund, P., Malmström, H., Raghavan, M., Storå, J., Hall, P., Willerslev, E., Gilbert, M. T. P., Götherström, A., and Jakobsson, M. (2012). Origins and genetic legacy of neolithic farmers and hunter-gatherers in europe. Science, 336(6080):466–469.
OpenUrl Abstract/FREE Full Text
↵
Skoglund, P., Sjödin, P., Skoglund, T., Lascoux, M., and Jakobsson, M. (2014). Investigating population history using temporal genetic differentiation. Molecular biology and evolution, 31(9):2516–2527.
OpenUrl CrossRef PubMed
↵
Slatkin, M. (1985). Gene flow in natural populations. Annual review of ecology and systematics, pages 393–430.
↵
Slatkin, M. (1991). Inbreeding coefficients and coalescence times. Genetical research, 58(02):167–175.
OpenUrl CrossRef PubMed Web of Science
↵
Slatkin, M. (1993). Isolation by distance in equilibrium and non-equilibrium populations. Evolution, pages 264–279.
↵
Teacher, A. G., Thomas, J. A., and Barnes, I. (2011). Modern and ancient red fox (vulpes vulpes) in europe show an unusual lack of geographical and temporal structuring, and differing responses within the carnivores to historical climatic change. BMC evolutionary biology, 11(1):214.
OpenUrl
↵
Weir, B. S. and Cockerham, C. C. (1984). Estimating f-statistics for the analysis of population structure. evolution, pages 1358–1370.
↵
Weiss, G. H. and Kimura, M. (1965). A mathematical analysis of the stepping stone model of genetic correlation. Journal of Applied Probability, pages 129–149.
↵
Wilkins, J. F. and Wakeley, J. (2002). The coalescent in a continuous, finite, linear population. Genetics, 161(2):873–888.
OpenUrl Abstract/FREE Full Text
↵
Wright, S. (1940). Breeding structure of populations in relation to speciation. American Naturalist, pages 232–248.
↵
Wright, S. (1943). Isolation by distance. Genetics, 28(2):114.
OpenUrl FREE Full Text

View the discussion thread.

Posted August 07, 2015.

Download PDF

Citation Tools

Subject Area

Genetics

Subject Areas

All Articles

Animal Behavior and Cognition (5201)
Biochemistry (11715)
Bioengineering (8723)
Bioinformatics (29128)
Biophysics (14935)
Cancer Biology (12049)
Cell Biology (17359)
Clinical Trials (138)
Developmental Biology (9406)
Ecology (14144)
Epidemiology (2067)
Evolutionary Biology (18268)
Genetics (12221)
Genomics (16767)
Immunology (11843)
Microbiology (28014)
Molecular Biology (11560)
Neuroscience (60810)
Paleontology (450)
Pathology (1864)
Pharmacology and Toxicology (3231)
Physiology (4940)
Plant Biology (10384)
Scientific Communication and Education (1680)
Synthetic Biology (2878)
Systems Biology (7333)
Zoology (1642)

[1] ↵
Al-Hassan, Q. (2012). On powers of tridiagonal matrices with nonnegative entries. Journal of Applied Mathematical Sciences, 6(48):2357–2368.
OpenUrl

[2] ↵
Andrello, M., Bevacqua, D., Maes, G. E., and De Leo, G. A. (2011). An integrated genetic-demographic model to unravel the origin of genetic structure in european eel (anguilla anguilla l.). Evolutionary applications, 4(4):517–533.
OpenUrl CrossRef PubMed Web of Science

[3] ↵
Baird, S. J. and Santos, F. (2010). Monte carlo integration over stepping stone models for spatial genetic inference using approximate bayesian computation. Molecular ecology resources, 10(5):873–885.
OpenUrl

[4] ↵
Baran, Y. and Halperin, E. (2015). A note on the relations between spatiogenetic models. Journal of Computational Biology.

[5] ↵
Barton, N. H., Depaulis, F., and Etheridge, A. M. (2002). Neutral evolution in spatially continuous populations. Theoretical population biology, 61(1):31–48.
OpenUrl CrossRef PubMed Web of Science

[6] ↵
Barton, N. H., Etheridge, A. M., and Véber, A. (2010). A new model for evolution in a spatial continuum. Electron. J. Probab, 15(7).

[7] ↵
Beaumont, M. A., Zhang, W., and Balding, D. J. (2002). Approximate bayesian computation in population genetics. Genetics, 162(4):2025–2035.
OpenUrl Abstract/FREE Full Text

[8] ↵
Castric, V. and Bernatchez, L. (2003). The rise and fall of isolation by distance in the anadromous brook charr (salvelinus fontinalis mitchill). Genetics, 163(3):983–996.
OpenUrl Abstract/FREE Full Text

[9] ↵
Cox, J. T., Durrett, R., et al. (2002). The stepping stone model: New formulas expose old myths. The Annals of Applied Probability, 12(4):1348– 1377.
OpenUrl

[10] ↵
Crow, J. F., Kimura, M., et al. (1970). An introduction to population genetics theory. An introduction to population genetics theory.

[11] ↵
Csilléry, K., Blum, M. G., Gaggiotti, O. E., and François, O. (2010). Approximate bayesian computation (abc) in practice. Trends in ecology & evolution, 25(7):410–418.
OpenUrl CrossRef PubMed

[12] ↵
De, A. and Durrett, R. (2007). Stepping-stone spatial structure causes slow decay of linkage disequilibrium and shifts the site frequency spectrum. Genetics, 176(2):969–981.
OpenUrl Abstract/FREE Full Text

[13] ↵
Depaulis, F., Orlando, L., and Hänni, C. (2009). Using classical population genetics tools with heterochroneous data: time matters! PLoS One, 4(5):e5541.
OpenUrl CrossRef PubMed

[14] ↵
Doob, J. L. (1953). Stochastic processes, volume 101. New York Wiley.

[15] ↵
Duforet-Frebourg, N. and Blum, M. G. (2014). Nonstationary patterns of isolation–by–distance: inferring measures of local genetic differentiation with bayesian kriging. Evolution, 68(4):1110–1123.
OpenUrl

[16] ↵
Duforet-Frebourg, N., Laval, G., Bazin, E., and Blum, M. G. (2015). Detecting genomic signatures of natural selection with principal component analysis: application to the 1000 genomes data. arXiv preprint arXiv:1504.04543.

[17] ↵
Durand, E. Y., Patterson, N., Reich, D., and Slatkin, M. (2011). Testing for ancient admixture between closely related populations. Molecular biology and evolution, 28(8):2239–2252.
OpenUrl CrossRef PubMed Web of Science

[18] ↵
Engelhardt, B. E. and Stephens, M. (2010). Analysis of population structure: a unifying framework and novel methods based on sparse factor analysis. PLoS Genet, 6(9):e1001117.
OpenUrl CrossRef PubMed

[19] ↵
Epperson, B. K. (2000). Spatial and space–time correlations in ecological models. Ecological modelling, 132(1):63–76.
OpenUrl

[20] ↵
Excoffier, L. and Foll, M. (2011). Fastsimcoal: a continuous-time coalescent simulator of genomic diversity under arbitrarily complex evolutionary scenarios. Bioinformatics, 27(9):1332–1334.
OpenUrl CrossRef PubMed Web of Science

[21] ↵
Felsenstein, J. (2015). Covariation of gene frequencies in a stepping-stone lattice of populations. Theoretical population biology, 100:88–97.
OpenUrl

[22] ↵
Haak, W., Lazaridis, I., Patterson, N., Rohland, N., Mallick, S., Llamas, B., Brandt, G., Nordenfelt, S., Harney, E., Stewardson, K., et al. (2015). Massive migration from the steppe was a source for indo-european languages in europe. Nature.

[23] ↵
Hallatschek, O., Hersen, P., Ramanathan, S., and Nelson, D. R. (2007). Genetic drift at expanding frontiers promotes gene segregation. Proceedings of the National Academy of Sciences, 104(50):19926–19930.
OpenUrl Abstract/FREE Full Text

[24] ↵
Hellberg, M. E. (2009). Gene flow and isolation among populations of marine animals. Ecology, Evolution, and Systematic.

[25] ↵
Higuchi, R., Bowman, B., Freiberger, M., Ryder, O. A., and Wilson, A. C. (1984). Dna sequences from the quagga, an extinct member of the horse family. Nature.

[26] ↵
Hudson, R. R. (2002). Generating samples under a wright–fisher neutral model of genetic variation. Bioinformatics, 18(2):337–338.
OpenUrl CrossRef PubMed Web of Science

[27] ↵
Jay, F., Sjödin, P., Jakobsson, M., and Blum, M. G. (2013). Anisotropic isolation by distance: the main orientations of human genetic differentiation. Molecular biology and evolution, 30(3):513–525.
OpenUrl CrossRef PubMed

[28] ↵
Karakachoff, M., Duforet-Frebourg, N., Simonet, F., Le Scouarnec, S., Pellen, N., Lecointe, S., Charpentier, E., Gros, F., Cauchi, S., Froguel, P., et al. (2015). Fine-scale human genetic structure in western france. European Journal of Human Genetics, 23(6):831–836.
OpenUrl CrossRef PubMed

[29] ↵
Kimura, M. (1953). stepping stonem odel of population. Ann. Rept. Nat. Inst. Genetics Japan, pages 62–63.

[30] ↵
Kimura, M. and Crow, J. F. (1963). The measurement of effective population number. Evolution, pages 279–288.

[31] ↵
Kimura, M. and Weiss, G. H. (1964). The stepping stone model of population structure and the decrease of genetic correlation with distance. Genetics, 49(4):561.
OpenUrl FREE Full Text

[32] ↵
Loh, P.-R., Lipson, M., Patterson, N., Moorjani, P., Pickrell, J. K., Reich, D., and Berger, B. (2013). Inferring admixture histories of human populations using linkage disequilibrium. Genetics, 193(4):1233–1254.
OpenUrl Abstract/FREE Full Text

[33] ↵
Malécot, G. (1948). mathéematiques de l’héereéeditée. Paris: Masson etCie.

[34] ↵
Malécot, G. (1955). The decrease of relationship with distance. In Cold Spring Harbor Symp. Quant. Biol, volume 20, pages 52–53.
OpenUrl

[35] ↵
Maruyama, T. (1970a). Rate of decrease of genetic variability in a subdivided population. Biometrika, 57(2):299–311.
OpenUrl CrossRef

[36] ↵
Maruyama, T. (1970b). Stepping stone models of finite length. Advances in Applied Probability, pages 229–258.

[37] ↵
Maruyama, T. (1971a). Analysis of population structure: Ii. twodimensional stepping sone models of finite length and other geographically structured populations*. Annals of human genetics, 35(2):179–196.
OpenUrl PubMed Web of Science

[38] ↵
Maruyama, T. (1971b). The rate of decrease of heterozygosity in a population occupying a circular or a linear habitat. Genetics, 67(3):437.
OpenUrl FREE Full Text

[39] ↵
Maruyama, T. (1972). Rate of decrease of genetic variability in a twodimensional continuous population of finite size. Genetics, 70(4):639–651.
OpenUrl Abstract/FREE Full Text

[40] ↵
McVean, G. (2009). A genealogical interpretation of principal components analysis. PLoS Genet, 5(10):e1000686.
OpenUrl CrossRef PubMed

[41] ↵
Nagylaki, T. (1983). The robustness of neutral models of geographical variation. Theoretical Population Biology, 24(3):268–294.
OpenUrl CrossRef Web of Science

[42] ↵
Nei, M. (1973). Analysis of gene diversity in subdivided populations. Proceedings of the National Academy of Sciences, 70(12):3321–3323.
OpenUrl Abstract/FREE Full Text

[43] ↵
Novembre, J., Johnson, T., Bryc, K., Kutalik, Z., Boyko, A. R., Auton, A., Indap, A., King, K. S., Bergmann, S., Nelson, M. R., et al. (2008). Genes mirror geography within europe. Nature, 456(7218):98–101.
OpenUrl CrossRef PubMed Web of Science

[44] ↵
Novembre, J. and Stephens, M. (2008). Interpreting principal component analyses of spatial population genetic variation. Nature genetics, 40(5):646–649.
OpenUrl CrossRef PubMed Web of Science

[45] ↵
Pääbo, S. (1985). Molecular cloning of ancient egyptian mummy dna. Nature.

[46] ↵
Pääbo, S., Poinar, H., Serre, D., Jaenicke-Després, V., Hebler, J., Rohland, N., Kuch, M., Krause, J., Vigilant, L., and Hofreiter, M. (2004). Genetic analyses from ancient dna. Annu. Rev. Genet., 38:645–679.
OpenUrl CrossRef PubMed Web of Science

[47] ↵
Patterson, N., Price, A. L., and Reich, D. (2006). Population structure and eigenanalysis. PLoS Genetics.

[48] ↵
Peter, B. M. and Slatkin, M. (2013). Detecting range expansions from genetic data. Evolution, 67(11):3274–3289.
OpenUrl CrossRef PubMed Web of Science

[49] ↵
Petkova, D., Novembre, J., and Stephens, M. (2014). Visualizing spatial population structure with estimated effective migration surfaces. bioRxiv, page 011809.

[50] ↵
Pickrell, J. K. and Reich, D. (2014). Toward a new history and geography of human genes informed by ancient dna. Trends in Genetics, 30(9):377–389.
OpenUrl CrossRef PubMed

[51] ↵
Ramachandran, S., Deshpande, O., Roseman, C. C., Rosenberg, N. A., Feldman, M. W., and Cavalli-Sforza, L. L. (2005). Support from the relationship of genetic and geographic distance in human populations for a serial founder effect originating in africa. Proceedings of the National Academy of Sciences of the United States of America, 102(44):15942–15947.
OpenUrl Abstract/FREE Full Text

[52] ↵
Ross, S. M. et al. (1996). Stochastic processes, volume 2. John Wiley & Sons New York.

[53] ↵
Rousset, F. (1997). Genetic differentiation and estimation of gene flow from f-statistics under isolation by distance. Genetics, 145(4):1219–1228.
OpenUrl Abstract/FREE Full Text

[54] ↵
Sharbel, T. F., Haubold, B., and Mitchell-Olds, T. (2000). Genetic isolation by distance in arabidopsis thaliana: biogeography and postglacial colonization of europe. Molecular Ecology, 9(12):2109–2118.
OpenUrl CrossRef PubMed Web of Science

[55] ↵
Skoglund, P., Malmström, H., Raghavan, M., Storå, J., Hall, P., Willerslev, E., Gilbert, M. T. P., Götherström, A., and Jakobsson, M. (2012). Origins and genetic legacy of neolithic farmers and hunter-gatherers in europe. Science, 336(6080):466–469.
OpenUrl Abstract/FREE Full Text

[56] ↵
Skoglund, P., Sjödin, P., Skoglund, T., Lascoux, M., and Jakobsson, M. (2014). Investigating population history using temporal genetic differentiation. Molecular biology and evolution, 31(9):2516–2527.
OpenUrl CrossRef PubMed

[57] ↵
Slatkin, M. (1985). Gene flow in natural populations. Annual review of ecology and systematics, pages 393–430.

[58] ↵
Slatkin, M. (1991). Inbreeding coefficients and coalescence times. Genetical research, 58(02):167–175.
OpenUrl CrossRef PubMed Web of Science

[59] ↵
Slatkin, M. (1993). Isolation by distance in equilibrium and non-equilibrium populations. Evolution, pages 264–279.

[60] ↵
Teacher, A. G., Thomas, J. A., and Barnes, I. (2011). Modern and ancient red fox (vulpes vulpes) in europe show an unusual lack of geographical and temporal structuring, and differing responses within the carnivores to historical climatic change. BMC evolutionary biology, 11(1):214.
OpenUrl

[61] ↵
Weir, B. S. and Cockerham, C. C. (1984). Estimating f-statistics for the analysis of population structure. evolution, pages 1358–1370.

[62] ↵
Weiss, G. H. and Kimura, M. (1965). A mathematical analysis of the stepping stone model of genetic correlation. Journal of Applied Probability, pages 129–149.

[63] ↵
Wilkins, J. F. and Wakeley, J. (2002). The coalescent in a continuous, finite, linear population. Genetics, 161(2):873–888.
OpenUrl Abstract/FREE Full Text

[64] ↵
Wright, S. (1940). Breeding structure of populations in relation to speciation. American Naturalist, pages 232–248.

[65] ↵
Wright, S. (1943). Isolation by distance. Genetics, 28(2):114.
OpenUrl FREE Full Text