Detecting spatial dynamics of range expansions with geo-referenced genomewide SNP data and the geographic spectrum of shared alleles

Diego F. Alvarado-Serrano; Michael J. Hickerson

doi:10.1101/457556

Abstract

Uncovering the spatial dynamics of range expansions is a major goal in studies of historical demographic inference, with applications ranging from understanding the evolutionary origins of domesticated crops, epidemiology, invasive species, and understanding specieslevel responses to climate change. Following the surge in advances that make explicit use of the spatial distribution of genetic data from georeferenced SNP variants, we present a novel summary statistic vector, the geographic spectrum of shared alleles (GSSA). Using simulations of twodimensional serial expansion, we find that the information from the GSSA, summarized with Harpending’s Raggedness Index (RI), can accurately detect the spatial origins of a range expansion under serial founder models, even with sparse sampling of only ten individuals. When applying to SNP data from two species of the holarctic butterfly genus Lycaeides, the suggested origins of expansion are consistent with hindcasts obtained from ecological niche models (ENMs). These results demonstrate the GSSA to be a useful exploratory tool for generating hypotheses of range expansion with genomewide SNP data. Our simulation experiments suggest high performance even with sampling found in studies of nonmodel organisms (one sampled individual per location, no outgroup information, and only 5,000 SNP loci).

Introduction

Geographic range shifts are a prevalent feature of virtually all species’ histories. They often follow or accompany the origin of species (Gaston 2003), and frequently happen in response to changes in climate, landscape, or biotic opportunities (Konečný et al. 2013 Roberts & Hamann 2015). Despite range expansions being a pervasive feature across many species histories, the net outcomes of the interactions between speciesspecific traits and changes in suitable habitats are poorly understood with regards to changes in the geographic distribution of genetic diversity, adaptability, and community structure (Pharo & Zartman 2007 Thuiller et al. 2008 Phillips et al. 2010). For example, although serialrange expansions are likely a common feature of species histories in the context of past global changes (Excoffier et al. 2009), there is still uncertainty about how they affect differentiation and diversification, or how they influence the risk of extinction under future environmental shifts (Charles & Dukes 2008 Fordham et al. 2014). Tools that uncover the spatiotemporal dynamics of species’ ranges are therefore critical for understanding how geographic range dynamics affect the spatial distribution of genetic diversity through drift and selection (Slatkin 1987 Kirkpatrick & Barton 1997 Peischl et al. 2013). They are also central to forecast species’ responses to ongoing and future environmental changes (Brown et al. 2016).

Among recent attempts to couple genetic and spatial information to infer historical range dynamics, Ramachadran et al. (2005) proposed to identify the geographic origin of an expansion using heterozygosity clines that result from the reduction of heterozygosity associated with the sequential founder events that are characteristic of range expansions (Ramachandran et al. 2005 Li et al. 2008 DeGiorgio et al. 2009 Henn et al. 2012). Based on the premise that heterozygosity should continuously decrease with distance from the expansion source, this method fits a linear regression between expected heterozygosity at multiple sampled locations and their geographic distance from all possible candidate sources. While this is simple to estimate, this approach has been found to perform well under a relatively narrow range of circumstances, with its accuracy decreasing as the speed of the expansion decreases, or as the time since the end of the expansion increases (Peter & Slatkin 2015). Previously, researchers have used principal component analysis (PCA) of georeferenced samples to identify the direction of expansion under the assumption that the first principal component—which accounts for the greatest amount of variation—aligns with the expansion sourcefront axis (Menozzi et al. 1978). However, this assumption has been challenged by recent evidence indicating that the geographic alignment of the first component may be driven by the geographic distribution of samples, allele surfing, or mathematical artifacts (Novembre & Stephens 2008 François et al. 2010 Pemberton et al. 2013).

More recently, Peter and Slatkin (2013) proposed a new statistic, the directionality index (ψ), to detect the origins of range expansions using joint georeferenced and genomewide information. This statistic takes advantage of allele frequency changes driven by the sequential founder events that often characterize expansions. Specifically, ψ detects asymmetries in 2D site frequency spectra of shared neutral nonancestral alleles between pairs of populations, under the theoretical expectation that populations further away from the expansion have experienced more genetic drift than populations near the expansion source (Slatkin & Excoffier 2012 Peter & Slatkin 2013). This metric has been also incorporated into an ABC framework, whereby He et al. (2017) used it to estimate the source of range expansion in the Collared pika in Alaska.

While analyses using the directionality index (Peter & Slatkin 2015) have been highly promising, the relatively high number of sampled individuals, and the requirement of polarized SNPs, make its implementation difficult in non-model organisms, which are typically characterized by limited numbers of georeferenced genotypes, or for which a reliable outgroup for polarizing SNPs has not been identified. Here we develop a summary statistic vector, which we call the geographic spectrum of shared alleles (GSSA, hereafter), to harvest information from the spatial distribution of shared allelic variants to help identify the geographical origin of range expansions. Our approach is targeted for nonmodel organisms as it does not require information about ancestral states, requires a single individual per sampled location, and can accommodate a reasonable number of SNPs.

We formally describe the GSSA summary statistic vector and use two-dimensional simulations of different expansion histories to statistically explore its behavior. We also demonstrate its applicability with genomewide SNP data sampled from a set of taxa of the holartic butterfly genus Lycaeides distributed in Western North America (Gompert et al. 2010). Our analyses demonstrate that the GSSA can be a powerful exploratory tool to identify the relative position of samples along an expansion axis. In Lycaeides, the GSSA places the origin of expansions in geographical areas that are consistent with species distribution models for hypothesized ranges shifts during the last glacial maximum.

Materials and Methods

The geographic spectrum of shared alleles (GSSA)

The Geographic Spectrum of Shared Alleles (GSSA) is a property of—and calculated for—each sampled locality. It captures the geographic distribution of shared coancestry with other localities by summarizing the geographic distances between copies of the minor allele observed at each SNP site (that is, the allele with the smallest frequency in the global sample for each site Nielsen et al. 2012). As such, the GSSA of a given location is a histogram that depicts the cumulative frequency distribution of the geographic distances separating each minor allele shared between the focal location and all other sampled locations across all SNPs (Fig. 1). On an intuitive level, this new summary statistic vector can be understood as a spatial structure function that captures the relative change in allelic similarity from a focal location as a function of geographic distance. Importantly, the GSSA is a location-specific statistic which differs from other structure functions such as spatial correlograms that are globally calculated. These latter statistics quantify the spatial dependence of an observed variable across the entire set of observations using a covariance matrix partitioned into distance classes (Smouse & Peakall 1999 Legendre & Legendre 2012). Instead, each location-specific GSSA is directly based on distances between minor alleles shared with a focal locality.

Fig. 1. Schematic cartoon describing the construction of the geographic spectrum of shared alleles (GSSA) across five diploid genotypes that are each sampled from a unique location. Location-specific genotype matrices (G_i), which capture the presence/absence of minor alleles at each location, locus and DNA strand (panel a), and cartoon representation of location-specific vectors (

), which capture the geographic distances between each focal location and all of the other locations in the sample (panel b). Based on the G_i matrices and

distance vectors, corresponding vectors (

) are constructed from the aggregated relative spatial distribution of minor alleles for each location (panel c). To facilitate interpretation, the color of each

element follows the location colors in panel a, and indicates their derivation with respect to each one of the sampled locations. The

vectors are then converted into corresponding geo-genetic histograms (h_{gen_i}), by applying a common binning scheme based on Sturge’s (1926) equation (panel d). For each location, a corresponding geographic histogram (h_{geo_i}) is constructed, using the same binning scheme as before, from the geographic-distances separating the location from all other sampled locations (panel e). The number of observations for corresponding distance classes in both histograms,h_{gen_i} and, are then regressed against each other for each location (panel f). From these regressions, the GSSA vector for each individual is built by taking the absolute value of the regression residuals for each histogram’s distance class (panel g).

The information captured in the GSSA helps identify the relative position of each sampling location with respect to the expansion sourcefront axis, because the similarity in genetic constitution between locations depends on the extent of shared history under an expansion history (Peter & Slatkin 2015 Bradburd et al. 2016). Because the amount of genetic drift increases with distance to the source of an expansion (Slatkin & Excoffier 2012 Peter & Slatkin 2013), and because colonization through different geographic paths should lead to allelic segregation (Hallatschek et al. 2007 Knowles & Alvarado-Serrano 2010 François et al. 2010), the amount of shared genetic variants can potentially further inform about the direction of the expansion. As such, if this information is captured by our summary statistic vector, it could be useful in a simulationbased statistical method—such as approximate Bayesian computation or supervised machine learning—for testing alternative expansion histories (Pudlo et al. 2015 Joseph et al. 2016 Schrider & Kern 2018).

GSSA overview

The calculation of the set of GSSAs, one per unique sampled georeferenced location, requires two sets of data: i) the geographic distances between all sampled locations, and ii) a record of the counts of minor allele copies of each SNP locus at each sampled location (Fig. 1a). Here, we restrict our discussion to geographic distance between samples, but note that users could use an alternative distance metric, such as riverdistance, when appropriate. Below we verbally summarized the steps involved in the calculation of the GSSA for one sampled location (a formal mathematical description is presented in the next section).

First, using the entire set of geographic distances between all sampled locations, the Sturges (1926) equation is deployed to identify the optimal number of geographic distance classes and breakpoints for the construction of all of the location-specific GSSAs. This Sturges binning scheme is then applied to the set of geographic distances associated with the ith focal sampled location, such that there will be a specific histogram (h_{geo_i}) for each of the locations (Fig. 1e). Second, a matrix of minor allele’s presence/absence (G_i) is constructed for the ith focal sampled location (Fig. 1a). Third, a vector () is constructed for the ith focal sampled location from the geographic distribution of minor alleles. Specifically, each location-specific vector lists the geographic distances between each minor SNP allele present at the focal sampling location and all other copies of the same minor allele in the entire sample (Fig. 1c). Fourth, this location-specific vector is then converted into a corresponding location-specific histogram (h_{gen_i}) using the Sturges’ (1927) binning scheme previously defined (Fig. 1d). Finally, to make our statistic independent of the specific sampling scheme (i.e., sampling locations and their influence on the binning scheme), the location-specific histogram constructed from the vector of distances between minor alleles (h_{gen_i}) is regressed against the values from the corresponding “geography-only” histogram (h_{geo_i} i.e., the “null” distribution of geographic distances). The residuals from this regression are finally used to build the location-specific histogram for the ith focal location (GSSA_i).

GSSA calculation

Formally, the two sets of data extracted from the series of georeferenced sampled genotypes can be defined as two matrices. With one individual per sampled location, I locations, M sets of chromosomes (i.e., ploidy), and L loci (i.e., SNPs), the arrangement of minor alleles in the entire sample relative to each location can be characterized by a M x L genotype matrix, G_i, where each element G_{i_m,l} takes the value of 0 or 1, depending on whether a copy of the minor allele is present (1) or not (0) at each individual’s SNP locus and DNA strand (note that phasing is not necessary because the method treats each locus independently Fig. 1a). In turn, the set of Euclidean or effective (Shirk & Cushman 2014 Davis et al. 2018) geographic distances between all locations can be characterized by an I x I square distance matrix, D, where D_i,i = 0 and D_i,j = D_j,i. From this latter matrix, a set of location-specific geographic-distances vectors, , can be obtained by subsetting the D matrix so that (Fig. 1b). Altogether, these data are used to construct a GSSA summary statistic vector for each sampled location following five steps:

Step 1. First Sturges’ (Sturges 1926) equation is applied to the set of pairwise geographic distances between all localities contained in matrix D to identify an optimal series of consecutive distance classes that are later used for histogram binning. We chose Sturges’ (1926) equation for its simplicity and common use in biological research (RamírezGarcıa et al. 1998 Fagua & Gonzalez 2007 Pires et al. 2016 Cardozo et al. 2018). Although alternative binning schemes would in theory be possible, an exploration of a coarser binning scheme (Scott 1979) and the more granular Rice rule (Jones et al. 2001--) showed that although Sturge’s scheme resulted in relatively better performance with regards to the dynamics of the GSSA and identification of expansion colonization history, accuracy was good across all three binning schemes (Supp. Fig. 1).

Step 2. Each genotype matrix G_i is used to construct a location-specific vector, , which summarizes the aggregated relative spatial distribution of minor alleles at location i (Fig. 1c). The elements of this vector are defined as: where Ø = null, G_{i_m,l} = element of individual i’s genotype matrix that denotes the presence or absence of a minor allele at locus l, strand m, and D_i,j = geographic distance between localities i and j. Elements in the vector with a value of zero correspond to instances in which the same minor allele at a SNP locus is homozygous within an individual or present in more than one individual sampled from the same locality because the geographic distance between these allele copies would be zero (D_i,i = 0). On the other hand, nonzero elements correspond to the geographic distances between individuals from different localities sharing the same minor allele at a SNP locus (Fig. 1c). Effectively, can also be derived from the frequency of the minor allele at each locus and location by aggregating over DNA strands so that the set of elements of this vector are defined as: where f_i is the frequency of copies of the minor allele at location i, locus l and D_i,j as in eq. [1a].

Step 3. After removing all null elements (Ø) from vector (Fig. 1c), which number depends on the number of loci at which each locality does not contain a minor allele copy (Fig. 1a), the size of each varies among locations. Therefore, it is necessary to compress each into a vector with an equal number of elements for all individual sampled locations. To do this, each vector (Fig.1c) is converted into a histogram by calculating the frequency of elements that fall within each of the distance classes previously determined in Step 1. The resulting location-specific geo-genetic histogram (h_{gen_i}) summarizes the relative spatial distribution of minor alleles copies across loci present at each (i^th) sampling location (Fig. 1d).

Step 4. To correct for the “null” geographic expectation introduced by the specific geographic position of each sample, each location-specific vector of geographicdistances, , is first converted into a histogram by again estimating the frequency of observations that fall within each of the same set of distance classes previously used to create the geo-genetic histograms. This generates a unique geographic histogram (h_{geo_i}) for each (i^th) location that is analogous to the corresponding geo-genetic histograms (h_{gen_i}) (Fig. 1e), yet that only containing geographic information.

Step 5. Finally, the elements of each geo-genetic histogram (h_{gen_i}) are regressed against the corresponding elements of the geographicdistance histogram (h_{geo_i}) (Fig. 1f), as defined here: where h_{gen_i} = geo-genetic histogram for location i, h_{geo_i} = geographic histogram for location i, [b_k] = histogram’s distance class, b, ranging from k = 1 to k = K (i.e., the maximum number of bins as determined by Sturges’ equation), β = simple regression coefficient, and ε = error term.

The vector of absolute residuals resulting from each regression (eq. [2]) constitutes the location-specific spatial summary statistic vector we named the GSSA and it is defined for each location i as the absolute difference between the location-specific regressionpredicted values (eq. [2]) and the location-specific geo-genetic histogram values: with ĥ_{gen_i}, h_{gen_i}, and [b_k] defined as in eq. [2].

GSSA Implementation

All five steps described above can be implemented from a list of genotypes and a list of individuals’ sampling coordinates, or optionally a set of distances between genotypes in case users choose to use effective distances instead (Shirk & Cushman 2014 Davis et al. 2018). The aggregated relative spatial distribution of minor alleles vector for each locality,, can be obtained from the multisite frequency spectrum (multiSFS), which summarizes the frequency of shared allele variants across populations, and the set of distances among all sampled locations. This multiSFS can be calculated from the list of genotypes in available programs such as δaδi, whereby the multiSFS should be polarized based on the minor allele where each sampled georeferenced location is considered a “population” in the multiSFS (Gutenkunst et al. 2009). Scripts to estimate the multiSFS along the rest of necessary steps to calculate the GSSAs for each location are available at https://bitbucket.org/diegofalvarados/gssa-v0.0/src. If multiple individuals per location are available, the method can be adapted by aligning all genotypes per location into a single “polyploid individual” so that individuals’ genotypes are treated as different DNA strands. Similarly, polyploid individuals are likewise accommodated as the method treats each SNP locus independently and thus, no phasing is necessary.

Raggedness index

A useful property of the GSSA is that its shape can potentially carry information about a location’s relative age of colonization and its historical connectivity with other populations given an expansion history. To explore and summarize the overall behavior of the GSSA with respect to expansion sources, we use Harpending’s raggedness index (Harpending 1994). Harpending’s raggedness index (RI) was originally introduced to quantify the shape of the histogram obtained from the average pairwise genetic differences amongst individuals in the context of inferring the history of population size change due to the predictive relationships between the shape and the timing and magnitude of size change. Here we repurpose it to condense the GSSA elements into a single variable that is correlated with the shape of the GSSA and thus, summarizes the behavior of the GSSA and its spatial and temporal relationship with the expansion source. In our calculation of each localityspecific RI, we disregard all distance classes within each GSSA_ithat are equivalent to the distance classes in the corresponding h_{geo_i} that have a value of zero (which arise for distance classes not involving the focal locality). We then correct the RI for the number of comparisons used for its calculation: where b_k indicates a distance class and K′ the total number of distance classes used in the calculation.

Locations further away from the serialexpansion source are expected to share more genetic variants with nearby locations that were colonized through the same expansion path (Knowles & Alvarado-Serrano 2010), and hence should tend to present a leftskewed and more ragged GSSA (Supp. Fig. 2a). This is because, in this case, short distance bins should be overly represented with a marked drop at intermediate bins, thereby inflating the RI from the “neighborhood effect” resulting from allele surfing (Hallatschek & Nelson 2008 François et al. 2010). This process can cause genetic variants at the expansion front to increase in frequency within constrained areas of derived populations (Excoffier et al. 2008). In contrast, locations closer to the source should tend to have a more uniform and nonskewed GSSA histogram because these locations are expected to retain the shared genetic variants present at different frequencies in other locations in the sample. In this latter scenario (Supp. Fig. 2b), most variants come from standing genetic variation in the source population due to the comparatively smaller amount of drift experienced by these closetosource populations (Excoffier et al. 2008 Peter & Slatkin 2013).

Simulation experiments

To evaluate the behavior of the GSSA, we simulated 5,000 unlinked SNPs sampled from 10 spatially separated individuals under different serial range expansion histories, where 99 demes are colonized by a single source deme. Spatially implicit simulations were conducted using fastsimcoal v2.5.2 (Excoffier & Foll 2011 Excoffier et al. 2013), with each simulation starting with a source deme at either of four different starting locations (Fig. 2). At each sequential expansion step going forward in time, we set a proportion of individuals within each deme (f) to move out from the demes they occupied into an immediate neighboring deme, excluding diagonal movements (i.e., 4-neighbors). We set newly colonized demes to then grow to a size N_K (based on the carrying capacity parameter), in τ_r generations. After τ_c generations, we set the next set of demes to be colonized from previously colonized populations, and cycles of serial colonization proceed until the last of 99 demes is colonized. We set the simulations to then run for τ additional generations after the last colonized deme has reached N_K. Through the entire simulation, we allowed colonized demes to exchange individuals with neighboring colonized demes (4-neighbors) according to a migration parameter (m) that reflects the per individual per generation probability of migration. After all available demes are colonized, we let the simulations continue under a steppingstone model (Kimura & Weiss 1964) for a number of generations determined by parameter (τ), which signifies the time after initial expansion has ended. Table 1 lists all parameters controlling the simulations.

Fig. 2. Schematic of the sequential range expansion models used to quantify the spatial association of the GSSA and expansion source location. Each panel correspond to simulations started from a different source (marked in red and denoted by letter). Demes are shaded according to their colonization time, with darker colors corresponding to earlier colonization times. Sampled demes are surrounded by a square.

View this table:

Table 1.

Parameters used in simulations. Parameter values in bold were used for all simulation experiments except those in which the parameter is being tested.

The particular effect of each parameter on the ability of the GSSA vector (summarized with the RI) to locate the expansion source was quantified independently. For that, we kept all parameters—except the one of interest of interest—fixed across simulations (Table 1). For each scenario/parameter combination, we ran 1,000 simulations for a total of 48,000 simulations (1 scenario x 4 possible sources x 4 parameters x 3 parameter values x 1,000 simulations). After each simulation was completed (as defined by τ), we gathered a dataset of 5,000 homologous biallelic SNPs from one individual from each of the 10 locations that were randomly selected before the simulations, including the source location. We then used our custom python pipeline to calculate a GSSA (eqs. [2, 3]) and associated RI (eq. [4]) for each of the 10 sampled locations. The source of the expansion was identified based on the distribution of magnitudes of the RI across sampling locations, with the smallest RI value assumed to correspond to the expansion source. Accuracy was measured as the proportion of simulations that correctly identified the source. To capture the empirical reality that the actual expansion source may not have been sampled, we repeated our inferences while excluding the source location from our sample (Fig. S2). In this latter case, accuracy was measured as the proportion of simulations that correctly identified the most proximate location to the source location among those sampled (Supp. Fig. 3). In the case of the simulations that include the source, the root mean squared error (RMSE) was also calculated. Additionally, we visualized the magnitude of change in the RI of each sampled location as a function of when each location was colonized.

As a point of comparison, we also used Peter and Slatkin’s (2013) approach to identify the expansion source on the same simulated datasets. Because Peter and Slatkin’s approach make use of interpolation, which reduces the chances of estimating the precise coordinates of the simulated source, we scored a prediction as accurate whenever Peter and Slatkin’s directionality index was able to locate the source within the perimeter defined by the 8neighbor demes around the actual source.

Empirical Application

To illustrate and demonstrate the utility of our approach, we applied it to two taxa of the holarctic butterfly genus Lycaeides that are distributed in western North America (Gompert et al. 2014b). These butterfly taxa are hostspecialists and relatively poor dispersers, with distributions that are tightly linked to their host plants (Forister et al. 2011 Gompert et al. 2014a). Their respective geographical ranges were likely impacted by Pleistocene climatic events (Thompson et al. 1993 Thompson & Anderson 2000 Pierce et al. 2004), which presumably involved range expansions from refugia since the Last Glacial Maximum (LGM). In particular, we focused on two highelevation undescribed taxa (Alpine and Jackson Lycaeides Gompert et al. 2014). These data are wellsuited for demonstrating our approach because they have a relatively wellcharacterized evolutionary history and spatial distribution (Gompert et al. 2006 Nice et al. 2013), as well as a thorough, spatially widespread genetic sampling (Gompert et al. 2014) in the western mountains of North America (Alpine Lycaeides: 10 localities and 8097 independent homologous SNPs Jackson Lycaeides: 11 localities and 9074 independent homologous SNPs).

To compare our geographic predictions of source locality with the plausible geographic origin of each taxon’s putative range expansion after the LGM, we used ecological niche models (ENMs), projected onto past climates, as independent hypotheses of colonization history under the assumption of Grinnellian niche conservatism. Briefly, we generated an ENM for each taxon in the R package dismo (Hijmans 2012) using a maximum entropy approach (Phillips et al. 2006) and 19 interpolated climatic surfaces at 30 arc-seconds resolution that summarize global patterns of temperature, precipitation, and seasonality (Hijmans 2012). We only included localities for which genetic data are available (Table 1, Gompert et al. 2014) given the pending taxonomic status of these taxa, which can lead to misidentification issues in available locality databases. To reduce possible sampling bias, for each taxon we filtered the localities based on a minimum distance required between localities. The exact distance used for each species was determined by a variogram approach that establishes the distance over which spatial autocorrelation in environmental conditions is minimal (Brown et al. 2016). In addition, to maximize the fit of the model and to avoid unnecessary complexity, we tuned our models using the R package ENMeval (Muscarella et al. 2014). The tuning procedure consisted of assessing optimal model parameters based on jackknife crossvalidation of the samples (Radosavljevic & Anderson 2014 Muscarella et al. 2014). We chose a jackknife approach given the small sample size of the filtered collection points (Galante et al. 2018). Model fit was evaluated under different combinations of model features and regularization multipliers (Table S1) using three sequential criteria: omission rate, AUC, and model feature class complexity. Tuning results are summarized in Supp. Table 1. We then hindcasted this ENM to the LGM using climatic CCSM (Community Climate System Model) estimations derived from the Paleoclimate Modelling Intercomparison Project Phase II (Braconnot et al. 2007). Finally, following Knowles and Alvarado-Serrano (2010), we identified the area(s) of contiguous high predicted suitability (i.e., refugial distribution) as potential expansion source(s), using as an arbitrary cutoff the top 10 ^th percentile of the predicted suitability scores. We then contrasted these hypothesized sources with those inferred by our approach.

Results

Simulation experiments

Our simulation study showed that the magnitude of the RI calculated from the elements of our GSSA vector was highly correlated with i) the distance from the location of the expansion source, and ii) how long, after the expansion started, a location was colonized (Fig. 3). The metric was able to accurately identify the geographical source of expansion given that one of the sampling localities was the source of expansion, or the first location colonized among those sampled (Fig. S3). Still, the degree of accuracy somewhat depended on the location of the source deme such that lower accuracy occurred when the source was located in a peripheral area (Fig. 4). Still, incorrect inferences tended to identify sampling localities proximate to the actual expansion locality, regardless of the location of the expansion source (Fig. 5).

Fig. 3. Association between time at which each sampled locality is colonized and the raggedness index calculated from the GSSA at each of these localities under all four simulated scenarios. Colonization time step denotes the time at which a locality was colonized (with the origin always being zero). The impact of each parameter on the ability to recover the simulated colonization history is evidenced by changes in the slope of the association. Different colors are used for each parameter value for increased visibility.

Fig. 4. Probability across 1000 simulation replicates of the the raggedness index calculated from the GSSA vector for identifying the source deme given the four serial range expansion scenarios and model parameter values. Higher probabilities of correct source identifications are a function of darker shadings. See Table 1 for model parameter definitions.

Fig. 5. Geographic position of the suggested sources of expansion according to the raggedness index calculated from the GSSA vector (left column) and directionality index (right column). A red circle denotes the true source used for each simulation scenario, whereas black circles indicate the locations sampled (shown only for the GSI-RI approach). Results presented are aggregated across 1000 simulations, with the relative frequency that each position was identified as the source of expansion indicated by the color hue (darker colors corresponding to greater frequencies). Note that in the vast majority of instances, the GSS-ARI approach either correctly infers the source or identifies a close neighbor location.

By summarizing the GSSA with Harpending’s (1994) RI, we were able to correctly identify the source population up to 70% of the time, with RMSE values that approached zero (Fig. 6). However, this accuracy declined with larger numbers of colonists (f) and time since all demes have been colonized (τ). Likewise, under an IBD model that arose when enough time had accrued for the signature of the expansion to have significantly diminished (Fig. 6), our metric was unable to accurately identify the source population. Our approach did strongly outperform Peter and Slatkin’s directionality index (ψ) by a margin of up to 60%, except in those aforementioned cases such as a high number of individuals colonizing demes (Fig. 6).

Fig. 6. Comparative accuracies of the raggedness index calculated from the GSSA and directionality indexin identifying the source deme for serial range expansions. The mean accuracy (a) per model parameter tested and corresponding root mean square error (RMSE) (b) obtained under both methods (GSSA-RI in blue, directionality index in red) is shown under different model parameter values.

Empirical application

The application of our coupled GSSARI approach to the two Lycaeides datasets suggests an expansion spatial dynamics that is well in line with species ranges at the LGM according to our ENM hindcasts (Fig. 7). For the Alpine Lycaeides distributed in northern Rocky mountain areas, the predicted source coincided with the southernmost samples, which are located in the largest LGM refugium predicted by the ENM hindcasts (Figure 7A). Although the hindcasted range at the LGM climate for this species covers a wide area that encompasses all of the sampled locations, the sampling location predicted by the GSSA coincides with an area of highest suitability. The RI calculated on the GSSA for Jackson Lycaeides samples suggested a source also within the ENM hindcast (which was more heterogeneous in comparison with Alpine Lycaeides hindcast), and close to one potential refugium (i.e., continuous area of high suitability) (Fig. 7B).

Fig. 7. Comparison of LGM hindcasts of range distributions obtained with Ecological Niche Models (ENMs). The grey circles indicate the sampling localities on which the inferences are based. The suggested sources of expansion according to the GSSA-RI approach are indicated by a green rhombus with a “G” on it. Warmer colors indicate greater potential suitability as estimated by the ENM. a) Model and inferences for Alpine Lycaeides b) model and inference for Jackson Lycaeides. Refugia, identified as the top 10th percentile most suitable areas, are denoted by black polygons.

Discussion

We introduce a new summary statistic vector, the geographic spectrum of shared alleles (GSSA), which makes joint use of geographic and genetic information. We demonstrate that it can be used to help infer the geographic dynamics of a range expansion. We use Harpending’s (1994) RI calculated on the GSSA elements of each sampled genotype to summarize spatial gradients in the distribution of minor alleles that are indicative of the relative position of each genotype along an expansion axis. However, we envision the elements of the GSSA vector may be directly incorporated within a supervised machine learning or ABC inferential framework to obtain parameter estimates associated with source locality likelihoods, and to test competing expansion hypotheses (He et al. 2017b Fraimout et al. 2017 Schrider & Kern 2018).

However, there are some important assumptions to first examine before deployment of the GSSA. First, as with many other population genomic approaches, it should be used after population structure is explored (Patterson et al. 2006 Frichot & François 2015 Petkova et al. 2016). Specifically, the GSSA is applied to groups of samples suggested to derive from single sources by various other exploratory methods applied cautiously (House & Hahn 2017 Elleouet & Aitken 2018). For example, as late Pleistocene range expansions and recent species invasions can both have multiple origins (Miraldo et al. 2011 RuizCooley et al. 2013 Cristescu 2015), care needs to be taken to detect the possibility of admixture from multiple origins a priori. In this case, if exploratory methods suggest a species is heavily structured, with a history of expansions from more than one refugium, researchers could apply the GSSA to each subsample independently. This may, however, prove a challenging task in certain circumstances because smaller sample sizes (resulting from splitting up the range) may lead to reduced accuracy. Difficulties may also arise if more than two sources are close together geographically, multiple expansions occurred, or if gene flow directions are biased given complex heterogeneous geography (Branco et al. 2018 Lundgren & Ralph 2018). However, one strength of the approach is that the assumption of panmixia within each sub-sample can be relaxed, as our simulated conditions contain stepping stone relationships among demes such that some isolationbydistance relationships within genetic clusters is accommodated, as long as there is a single source.

A second assumption underlying our metric is that a single spatial expansion had actually occurred within an identifiable timeframe. As demonstrated by our simulations with the reduced accuracy as the time since all demes have been colonized (τ) increases, eventually reaching the stationarity conditions of isolationbydistance, the historical signal of expansion will be erased over long intervals. It is therefore recommended that spatial expansion be tested before attempting to infer the geographic dynamics of an expansion (Excoffier 2004 Wegmann et al. 2006 Elleouet & Aitken 2018).

Key advantages of the GSSA are that no outgroup is required, and that it can be deployed on sparse datasets that are often collected from a variety of nonmodel species. Notably, correlation to the source of expansion was in general robust to different acrosspopulation migration rates (m) and deme growth rates—as conditioned by the time allowed for demes to reach their carrying capacity (τ_r). However, accuracy did somewhat decrease with greater migration levels, as one would expect. Also expectedly, the accuracy in identifying the source location was greatly diminished by larger founder sizes (f), and the amount of time passed after colonization ended (τ). In the former case, larger f will deflate the founder effect, whereas in the latter case the signal of the founder effect will become erased over time (Nei et al. 1975).

Regardless of parameter setting, the GSSA approach consistently outperformed the directionality index of Peter and Slatkin’s (2013) in all simulations. While such a finding is not surprising given that we violated the latter method’s assumption of allelic polarity and used smaller sample sizes than recommended, it highlights the gap filled by our new statistic, which accommodates data typical of nonmodel organisms. As demonstrated by Peter and Slatkin’s (2013) empirical example, in which they found support for the outofAfrica hypothesis (Li et al. 2008 DeGiorgio et al. 2009) using a comprehensive empirical dataset consisting of over half a million SNPs in more than 1,500 individuals distributed in 55 human populations (Fumagalli et al. 2011), their statistic is a powerful tool for model organisms. Ours on the other hand, may be best suited for smaller datasets. Furthermore, it is important to note that the two metrics are not precisely inferring the same entity even if the overall objective is to uncover the spatial dynamics of an expansion. Specifically, the smallest RI of the GSSAs is assumed to be associated with the sampling location closest to the actual origin (i.e., first colonized location among those sampled). In contrast, Peter and Slatkin’s ψ aims to infer the actual geographical point of expansion, which can be located beyond the sampling localities. This methodological difference reflects our goal of limiting the amount of statistical uncertainty at the expense of decreasing potential inferential capabilities (i.e., limiting the noise introduced by spatial interpolation). Nevertheless, part of our simulation study used sampling locations that included the source, thereby making this comparison between both approaches informative.

The utility of our new statistic for nonmodel organisms is also apparent in the empirical analysis of Lycaeides (Fig. 7). These taxa had 8097 and 9074 independent SNPs (i.e., 1 SNP per tag for Alpine Lycaeides and Jackson Lycaeides respectively), from which we randomly selected a set of 5000 SNPs for consistency with our simulations. Our approach was able to point, as sources, geographical regions of relatively high predicted suitability according to the ENMs. For Alpine Lycaeides, this included a location at the southern end of the range. In contrast, for Jackson Lycaeides, the lowest RI suggested an area of marginal LGM suitability (albeit close to regions predicted to be suitable Fig. 7). That being said, because the inferred entity refers to the sampling location that is closest to the true origin (i.e., first colonized location among those sampled), a perfect match between the ENMs’ prediction and the GSSA inference is not expected. Together with the simulation data, these analyses demonstrate that under circumstances of more constrained information and limited sampling relative to those common to model organisms, our approach may be a useful complement to existing methods to infer the geographic history underlying range expansions (Ramachandran et al. 2005 François et al. 2010 Peter & Slatkin 2013). Our results also highlight how independent lines of evidence can be combined for more robust demographic inference. In contrast to previous attempts to reciprocally validate ENMs and genetic data without spatially explicit inference (Alvarado-Serrano & Knowles 2014), our method allows one to make more explicit comparisons between the hypothesized locations of refugia derived from ENMs and the inferred range expansion dynamics based on genetics. This capability is of particular interest when considering the uncertainty associated with hindcasting using ENMs, as these tools are capable only of generating hypotheses about potential distribution based on abiotic correlates under the assumption of niche conservatism and analogous climate—which is likely to overpredict the real former distribution of species (Soberón 2007 Peterson et al. 2010 Alvarado-Serrano & Knowles 2014). By combining these approaches, the confidence in these estimates increases, giving scientists the ability to refine a set of plausible historical scenarios under consideration.

Potential Pitfalls

To help improve modelbased inferences, several aspects of our approach should be taken into consideration. Besides accurate georeferencing of all samples, the number and spatial distribution of samples should be carefully considered. As it is true for previous methods (Ramachandran et al. 2005 Peter & Slatkin 2013), inferences based on the GSSA will be most useful with sampling that maximizes geographical space at the cost of multiple samples per location. Samples clustered in a particular area, and representing only a small portion of the distribution of the taxon of interest, would probably carry limited information and lead to biased estimates. Similarly, our results indicate that precaution should be taken with peripheral localities as they may carry a smaller signal due to noise introduced by boundary effects. In this regard, the advantage of allowing only a single individual per locality should help researchers to more efficiently design their sampling scheme without excessive increases in overall costs, assuming that accessing most of the range of a species is not prohibitive. Furthermore, larger sampling of geographic space could potentially better enable the implementation of a spatial interpolation to identify major colonization routes in more detail (Li & Heap 2011).

Like any method using genomewide SNP data, another relevant consideration is the potential for confounded inference if the sampled SNPs are impacted from parts of the genome under strong natural selection (Sokal et al. 1989 Lotterhos & Whitlock 2014), even if they are not linked (Allman & Weissman 2018). For this reason, it would be advisable to test for selective neutrality a priori, and to remove loci potentially under selection. An additional consideration is that care should be taken to ensure that inference is not confounded by multiple population histories that involve different sources of expansion. Although use of the GSSA does not explicitly require clustering individuals into populations a pirori (Frichot & François 2015; Petkova et al. 2016), the conduction of exploratory analyses to obtain some information about population structure may help identify cases of admixture and more than one source for expansion.

As a stand-alone summary statistic, the GSSA is itself an exploratory tool to be used with other spatial genomic methods such as EEMS (Petkova et al. 2016), MAPS (Al-Asadi et al. 2018), or SpaceMix (Bradburd et al. 2016): it can investigate one’s data and help formulate model-based hypotheses about population history (House & Hahn 2017). Ideally, the GSSA should be incorporated into an inferential model that allows for testing historical hypotheses and for estimating relevant demographic parameters (He et al. 2017a).

Future Prospects

Further evaluations are needed to assess whether our approach can also detect multiple sources of expansion (a bimodal distribution of raggedness indices could potentially suggest two colonization sources), and if it may allow us to estimate the relative timing of expansions (for instance, by using the slope of the spatial clines in the raggedness of the GSSAs; Fig. 3). Additionally, the GSSA might be used more generally to test for spatial expansion in the first place by using null simulations to determine the significance of spatial clines in the raggedness of GSSAs (Fig. 3). Specifically, the GSSA shape quantified by Harpending’s (1994) RI could be used to test for expansion in the context of a null distribution expected under an ahistorical equilibrium between distance and genetic differences (i.e. isolation by distance; IBD). Under the latter scenario, the allelic similarity is expected to be inversely correlated with distance (Wright 1943), and hence the spatial cline of raggedness indexes should not be significant. On the contrary, under a range expansion this slope is expected to be significantly different from zero. Finally, the GSSA could also be a useful way to explore a broader set of models such as a source/sink population scenarios (MartinezSolano & Gonzalez 2008), cyclical histories of admixture (Frantz et al. 2013 Alvarado-Serrano & Hickerson 2015), or heterogeneous population densities (Excoffier et al. 2008). However, the potential for these future directions remains speculative at this point and further work is needed to assess these possibilities.

Our approach may also be expanded if there is interest in more precisely identifying the geographic origin of an expansion when the source area is suspected to be missing from the sample. That would require interpolating the expected raggedness index at un-sampled localities to identify the region with the smallest indexes. Implementing an interpolation would also allow us to potentially identify spatially isolated areas of contiguous high raggedness indices, which could serve to uncover instances of expansion from more than one source using an Euclidean allocation algorithm. However, as interpolation is strongly impacted by the number and distribution of observed samples (Stein 2012), its implementation may be not possible when a limited number of locations are available since interpolation accuracy drastically decreases for small sample sizes. Alternatively, a time-difference of arrival approach (TDOA; (Gustafsson et al. 1994)) could be implemented, as done by Peter and Slatkin’s (2013) ψ. Commonly used for localization, the TDOA is a triangulation approach that takes advantage of the change in the strength of a signal as distance from the source increases (Gustafsson et al. 1994 Drake & Dogancay 2004). Specifically in our case, the pairwise difference in raggedness index between location pairs may be used as the signal change for triangulation to identify the geographic coordinates of the hypothesized source (see eq. 5 in Peter and Slatkin 2013).

For those interested in community ecology and species interactions, the GSSA may be used in aggregated geo-referenced population genomic datasets of co-distributed taxa to test for concerted spatio-temporal dynamics between two interacting species (Perkins & Swayne 2001 Wicker et al. 2012), the sources of multiple invading species (Sax et al. 2007 Johnson et al. 2009), and the geographic origins of multiple historic domestications (Kanginakudru et al. 2008 He et al. 2011). Likewise, it may be used to help identify shared regions of secondary contact and hybridization between longisolated co-distributed pairs of taxa (Remington 1968 Moritz et al. 2009), or to understand the assembly of whole communities across geographic barriers or trajectories of expansion after global shifts in climate (Avise et al. 1987 Ibrahim et al. 1996 Avise 2000 Hewitt 2000 Burbrink et al. 2016). Additionally, if one were interested in incorporating a resistance surface to correct for effective distances that emerge from landscape features (Spear et al. 2010), the GSSA may easily accommodate alternative distance matrices to allow the integration of realistic landscape scenarios.

Conclusion

The GSSA statistic we present here builds on emerging efforts to make phylogeography and population genetics more spatially explicit (Alvarado-Serrano & Hickerson 2015 House & Hahn 2017 Ashander et al. 2018 Bradburd et al. 2018). By doing so, it improves our ability to estimate the geographical source region (or closest areas) as well as the general direction of range expansions. The use of single samples per location, and un-polarized SNPs, makes this metric well-suited for a wide range of non-model organisms. The GSSA offers not only the capability to serve as an additional spatially explicit summary of genetic patterns for modelbased inference when likelihoodbased methods are intractable (Beaumont 2010 Pudlo et al. 2015), but, most importantly, presents a tool for guiding the development of spatially-explicit demographic hypotheses by better accounting for the spatial component of species histories. Further developments of this statistic, including the refinement of an interpolation procedure for geographically sparse samples, should lead to easier incorporation of this tool into simulationbased inferential approaches (Currat et al. 2004 Leblois et al. 2009 He et al. 2017a). We expect that the incorporation of expansion surfaces will enable more complex scenarios that include environmental heterogeneity and explicit geographic barriers to be modeled into these simulation approaches (Ray & Excoffier 2010 Joseph et al. 2016). Likewise, we expect clines in GSSA estimates to be able to help identify relevant spatial features, such as shared contact zones between populations that have been colonized from different expansion clusters (Swenson 2010), or consistent geographic barriers that have maintained isolation of populations during expansions and hence have promoted differentiation (Carstens et al. 2005 Potter et al. 2017). The GSSA offers a useful and flexible tool that complements existing methods for improved understanding of the processes governing population differentiation and spatial patterns of genomic diversity.

Software

A program to calculate the GSSA and Harpending’s (1944) raggedness index from georeferenced genotypes and associated user-defined geographic distances among sampled locations is available in https://bitbucket.org/diegofalvarados/GSSA-v0.0. This software also contains the pipeline needed to recreate the simulations we used to test the accuracy of the GSSA and Peter and Slatkin’s ψ. A full step-by-step implementation of our method, including comments and visualization of the intermediate outputs, is available at the following jupyter notebook: https://mybinder.org/v2/gh/ftempo/GSSA/master.

Data Accessibility

The whole pipeline and associated files needed to recreate the simulations used is available at the following bitbucket repository: https://bitbucket.org/diegofalvarado-s/GSSA-v0.0

Author Contributions

DFA and MJH conceived the GSSA statistic and DFA led the development of it. DFA coded the simulations, analyses and produced the figures. DFA and MJH wrote the manuscript.

Acknowledgements

We would like to thank Benjamin Peter for assistance in the implementation of the directionality method and Zach Gompert for generously providing the genomewide georeferenced SNP data, as well as Gideon Bradburd, Ana Carnaval, Marcelo Gehara, Isaac Overcast, and John Robinson for insightful comments on previous versions of this manuscript. All authors reviewed the manuscript. Funding was provided by grants from FAPESP (BIOTA, 2013/50297-0 to MJH and AC Carnaval), NASA through the Dimensions of Biodiversity Program (DOB 1343578 to MJH), and the National Science Foundation (DEB-1253710 to MJH). This work would not have been possible without help from the City University of New York High Performance Computing Center, with support from the National Science Foundation (CNS-0855217 and CNS-0958379).

References

↵
AlAsadi, H., Petkova, D., Stephens, M., & Novembre, J. (2018). Estimating recent migration and population size surfaces. bioRxiv, doi: 10.1101/365536.
OpenUrl Abstract/FREE Full Text
↵
Allman, B.E., & Weissman, D.B. (2018). Hitchhiking in space: Ancestry in adapting, spatially extended populations. Evolution, 72, 722–734.
OpenUrl
↵
Alvarado-Serrano, D.F., & Hickerson, M.J. (2015). Spatially explicit summary statistics for historical population genetic inference. Methods in Ecology and Evolution, 7, 418–427.
OpenUrl
↵
Alvarado-Serrano, D.F., & Knowles., L.L. (2014). Ecological niche models in phylogeographic studies: Applications, advances and precautions. Molecular Ecology Resources, 14, 233–248.
OpenUrl
↵
Ashander, J., Ralph, P., McCartneyMelstad, E., & Shaffer, H.B. (2018), Demographic inference 611 in a spatiallyexplicit ecological model from genomic data: A proof of concept for the 612 Mojave Desert Tortoise. bioRxiv. doi: 10.1101/354530.
OpenUrl Abstract/FREE Full Text
↵
Avise, J.C. (2000). Phylogeography: The history and formation of species. Harvard University Press. Cambridge.
↵
Avise, J.C., Arnold, J., Ball, R.M., Bermingham, E., Lamb, T., Neigel, J.A., …, Saunders, N.C. (1987). Intraspecific phylogeography: The mitochondrial DNA bridge between population genetics and systematics. Annual Review of Ecology and Systematics, 18, 489–522.
OpenUrl CrossRef Web of Science
↵
Beaumont, M. (2010). Approximate Bayesian computation in evolution and ecology. Annual Review of Ecology, Evolution, and Systematics, 41, 379–406.
OpenUrl CrossRef Web of Science
↵
Braconnot, P., OttoBliesner, B., Harrison, S., Joussaume, S., Peterchmitt, J.Y., AbeOuchi, A., …, Zhao, Y. (2007). Results of PMIP2 coupled simulations of the MidHolocene and Last Glacial MaximumPart 1: experiments and largescale features. Climate of the Past, 3, 261–277.
OpenUrl CrossRef Web of Science
↵
Bradburd, G.S., Coop, G.M., & Ralph, P.L. (2018). Inferring continuous and discrete population genetic structure across space. Genetics, 210, 33–52.
OpenUrl Abstract/FREE Full Text
↵
Bradburd, G.S., Ralph, P.L., & Coop, G.M. (2016). A spatial framework for understanding population structure and admixture. PLoS genetics, 12, e1005703.
OpenUrl
↵
Branco, C., Velasco, M., Benguigui, M., Currat, M., Ray, N., & Arenas, M. (2018). Consequences of diverse evolutionary processes on american genetic gradients of modern humans. Heredity, doi: 10.1038/s414370180122x.
OpenUrl CrossRef
↵
Brown, J.L., Weber, J.J., AlvaradoSerrano, D.F., Hickerson, M.J., Franks, S.J., & Carnaval, A.C. (2016). Predicting the genetic consequences of future climate change: The power of coupling spatial demography, the coalescent, and historical landscape changes. American Journal of Botany, 103, 153–163.
OpenUrl Abstract/FREE Full Text
↵
Burbrink, F.T., Chan, Y.L., Myers, E.A., Ruane, S., Smith, B.T., & Hickerson, M.J. (2016). Asynchronous demographic responses to Pleistocene climate change in Eastern Nearctic vertebrates. Ecology Letters, 19, 1457–1467.
OpenUrl
↵
Cardozo, A.L.P., Farias, E.G.G., RodriguesFilho, J.L., Moteiro, I.B., Scandolo, T.M. & Dantas, T.V. (2018). Feeding ecology and ingestion of plastic fragments by Priacanthus arenatus : What’s the fisheries contribution to the problem? Marine Pollution Bulletin, 130, 19–27.
OpenUrl
↵
Carstens, B.C., Brunsfeld, S.J., Demboski, J.R., Good, J.D., & Sullivan, J. (2005). Investigating the evolutionary history of the Pacific Northwest mesic forest ecosystem: Hypothesis testing within a comparative phylogeographic framework. Evolution, 59, 1639–1652.
OpenUrl PubMed Web of Science
↵
Charles H., & Dukes, J.S. (2008). Impacts of invasive species on ecosystem services. In: Biological Invasions Ecological Studies, pp. 217–237. Springer, Heidelberg.
↵
Cristescu, M.E. (2015). Genetic reconstructions of invasion history. Molecular Ecology, 24, 2212–2225.
OpenUrl CrossRef
↵
Currat, M., Ray, N., & Excoffier, L. (2004). Splatche: a program to simulate genetic diversity taking into account environmental heterogeneity. Molecular Ecology Notes, 4, 139–142.
OpenUrl CrossRef
↵
Davis, C.D., Epps, C.W., Flitcroft, R.L., & Bank,s, M.A. (2018). Refining and defining riverscape genetics: How rivers influence population genetic structure. Wiley Interdisciplinary Reviews: Water, 5, e1269.
OpenUrl
↵
DeGiorgio, M., Jakobsson, M., & Rosenberg, N.A. (2009). Explaining worldwide patterns of human genetic variation using a coalescentbased serial founder model of migration outward from Africa. Proceedings of the National Academy of Sciences, 106, 16057–16062.
OpenUrl Abstract/FREE Full Text
↵
Drake, S.R., & Dogancay, K. (2004). Geolocation by time difference of arrival using hyperbolic asymptotes. In: 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. 361–364.
↵
Elleouet, J.S., & Aitken, S.N. (2018). Exploring Approximate Bayesian Computation for inferring recent demographic history with genomic markers in nonmodel species. Molecular Ecology Resources, doi: 10.1111/17550998.12758.
OpenUrl CrossRef
↵
Excoffier, L. (2004). Patterns of DNA sequence diversity and genetic structure after a range expansion: Lessons from the infiniteisland model. Molecular Ecology, 13, 853–864.
OpenUrl CrossRef PubMed Web of Science
↵
Excoffier, L., Dupanloup, I., HuertaSánchez, E., Sousa, V.C., & Foll, M. (2013). Robust demographic inference from genomic and SNP data. PLoS genetics, 9, e1003905.
OpenUrl
↵
Excoffier, L., & Foll, M. (2011). Fastsimcoal: A continuoustime coalescent simulator of genomic diversity under arbitrarily complex evolutionary scenarios. Bioinformatics, 27, 1332–1334.
OpenUrl CrossRef PubMed Web of Science
↵
Excoffier, L., Foll, M., & Petit, R.J. (2008). Genetic consequences of range expansions. Annual Review of Ecology, Evolution, and Systematics, 40, 481–501.
OpenUrl
↵
Excoffier, L., Hofer, T., & Foll, M. (2009). Detecting loci under selection in a hierarchically structured population. Heredity, 103, 285–298.
OpenUrl CrossRef PubMed Web of Science
↵
Fagua, J.C., & Gonzalez, V.H. (2007). Growth rates, reproductive phenology, and pollination ecology of Espeletia grandiflora (Asteraceae), a giant Andean caulescent rosette. Plant Biology, 9, 127–135.
OpenUrl CrossRef PubMed
↵
Fordham, D.A., Brook, B.W., Moritz, C., & NoguésBravo, D. (2014). Better forecasts of range dynamics using genetic data. Trends in Ecology & Evolution, 29, 436–443.
OpenUrl
↵
Forister, M.L., Gompert, Z., Fordyce, J.A., & Nice, C.C. (2011). After 60 years, an answer to the question: What is the Karner blue butterfly? Biology Letters, 7, 399–402.
OpenUrl
↵
Fraimout, A., Debat, V., Fellous, S., Hufbauer, R.A., Foucaud, J., Pudlo, P., …, Estoup, A. (2017). Deciphering the routes of invasion of Drosophila suzukii by means of ABC Random Forest. Molecular Biology and Evolution, 34, 980–996.
OpenUrl
↵
François., O., Currat, M, Ray, N., Han, E., Excoffier, L. & Novembre, J. (2010). Principal component analysis under population genetic models of range expansion and admixture. Molecular Biology and Evolution, 27, 1257–1268.
OpenUrl CrossRef PubMed Web of Science
↵
Frantz, L.A.F., Schraiber, J.G., Madsen, O., Megens, H.J., Bosse, M., Paudel, Y., …, Groenen, M.A. (2013). Genome sequencing reveals fine scale diversification and reticulation history during speciation in Sus. sus Genome Biology, 14, R107.
OpenUrl
↵
Frichot, E., & François, O. (2015). LEA: An R package for landscape and ecological association studies. Methods in Ecology and Evolution, 6, 925–929.
OpenUrl
↵
Fumagalli, M., Sironi, M., Pozzoli, U., FerrerAdmetlla, A., Pattini, L. & Nielsen, R. (2011). Signatures of environmental genetic adaptation pinpoint pathogens as the main selective pressure through human evolution. PLoS genetics, 7, e1002355.
OpenUrl
↵
Galante, P.J., Alade, B., Muscarella, R., Jansa, S.A., Goodman, S.M., & Anderson, R.P. (2018). The challenge of modeling niches and distributions for datapoor species: a comprehensive approach to model complexity. Ecography, 41, 726–736.
OpenUrl
↵
Gaston KJ (2003) The Structure and Dynamics of Geographic Ranges. Oxford University Press. Oxford.
↵
Gompert, Z., Comeault, A.A., Farkas, T.E., Feder, J.L., Parchman, T.L., Buerkle, C.A., & Nosil, P. (2014a). Experimental evidence for ecological selection on genome variation in the wild. Ecology Letters, 17, 369–379.
OpenUrl CrossRef PubMed
↵
Gompert, Z., Fordyce, J.A., Forister, M.L., Shapiro, A.M., & Nice, C.C. (2006). Homoploid hybrid speciation in an extreme habitat. Science, 314, 1923–1925.
OpenUrl Abstract/FREE Full Text
↵
Gompert, Z., Forister, M.I., Fordyce, J.A., Nice, C.C., Williamson, R.J., & Buerkle, C.A. (2010). Bayesian analysis of molecular variance in pyrosequences quantifies population genetic structure across the genome of Lycaeides butterflies. Molecular Ecology, 19, 2455–2473.
OpenUrl PubMed Web of Science
↵
Gompert, Z., Lucas, L.K., Buerkle, C.A., Forister, M.L., Fordyce, J.A., & Nice, C.C. (2014b). Admixture and the organization of genetic diversity in a butterfly species complex revealed through common and rare genetic variants. Molecular Ecology, 23, 4555–4573.
OpenUrl
↵
Gustafsson, F., Gunnarsson, S., & Ljung, L. (1994). On timefrequency resolution of signal properties using parametric techniques. In: 1994 Proceedings of 33rd IEEE Conference on Decision and Control, pp. 2259-2264.
↵
Gutenkunst, R.N., Hernandez, R.D., Williamson, S.H., Bustamante, C.D. (2009). Inferring the joint demographic history of multiple populations from multidimensional SNP frequency data. PLoS genetics, 5, e1000695.
OpenUrl
↵
Hallatschek, O., Hersen, P., Ramanathan, S., & Nelson, D.R. (2007) Genetic drift at expanding frontiers promotes gene segregation. Proceedings of the National Academy of Sciences of the United States of America, 104, 19926–19930.
OpenUrl Abstract/FREE Full Text
↵
Hallatschek, O., & Nelson DR (2008) Gene surfing in expanding populations. Theoretical Population Biology, 73, 158–170.
OpenUrl CrossRef PubMed Web of Science
↵
Harpending, H.C. (1994). Signature of ancient population growth in a lowresolution mitochondrial DNA mismatch distribution. Human Biology, 66, 591–600.
OpenUrl PubMed Web of Science
↵
Henn, B.M., CavalliSforza, L.L., & Feldman, M.W. (2012). The great human expansion. Proceedings of the National Academy of Sciences of the United States of America, 109, 17758–17764.
OpenUrl Abstract/FREE Full Text
↵
He, Q., Prado, J.R., & Knowles, L.L. (2017). Inferring the geographic origin of a range expansion: Latitudinal and longitudinal coordinates inferred from genomic data in an ABC framework with the program xorigin. Molecular Ecology, 26, 6908–6920.
OpenUrl
↵
Hewitt, G. (2000). The genetic legacy of the Quaternary ice ages. Nature, 405, 907–913.
OpenUrl CrossRef GeoRef PubMed Web of Science
↵
He, Z., Zhai, W., Wen, H., Tang, T., Wang, Y., Lu, X., …, Shi, S. (2011). Two evolutionary histories in the genome of rice: The roles of domestication genes. PLoS genetics, 7, e1002100.
OpenUrl
↵
Hijmans, R.J. (2012). Crossvalidation of species distribution models: Removing spatial sorting bias and calibration with a null model. Ecology, 93, 679–688.
OpenUrl CrossRef PubMed Web of Science
House, G.L., & Hahn, M.W. (2018). Evaluating methods to visualize patterns of genetic differentiation on a landscape. Molecular Ecology Resources, 18, 448–460.
OpenUrl
↵
Ibrahim, K.M., Nichols, R.A., & Hewitt, G.M. (1996). Spatial patterns of genetic variation generated by different forms of dispersal during range expansion. Heredity, 77, 282–291.
OpenUrl CrossRef Web of Science
↵
Johnson, P.T.J., Olden, J.D., Solomon, C.T., Vander Zanden, M.J. (2009). Interactions among invaders: Community and ecosystem effects of multiple invasive species in an experimental aquatic system. Oecologia, 159, 161–170.
OpenUrl CrossRef PubMed Web of Science
↵
Jones, E., Oliphant, T., & Peterson, P. (2001). SciPy: Open source scientific tools for Python, doi: 10.1234/12345678.
OpenUrl CrossRef
↵
Joseph, T.A., Hickerson, M.J., & AlvaradoSerrano, D.F. (2016). Demographic inference under a spatially continuous coalescent model. Heredity, 117, 94–99.
OpenUrl
↵
Kanginakudru, S., Metta, M., Jakati, R.D., & Nagaraju, J. (2008). Genetic evidence from Indian red jungle fowl corroborates multiple domestication of modern day chicken. BMC Evolutionary Biology, 8, 174.
OpenUrl
↵
Kimura, M., & Weiss, G.H. (1964). The stepping stone model of population structure and the decrease of genetic correlation with distance. Genetics, 49, 561–576.
OpenUrl FREE Full Text
↵
Kirkpatrick M., & Barton, N.H. (1997). Evolution of a species’ range. The American naturalist, 150, 1–23.
OpenUrl CrossRef PubMed Web of Science
↵
Knowles, L.L., & AlvaradoSerrano, D.F. (2010). Exploring the population genetic consequences of the colonization process with spatiotemporally explicit models: Insights from coupled ecological, demographic and genetic models in montane grasshoppers. Molecular Ecology, 19, 3727–3745.
OpenUrl CrossRef PubMed Web of Science
↵
Konečný, A., Estoup, A., Duplantier, J.M., Bryja, J., Ba, K., Galan, M., …, Cosson, J.F. (2013). Invasion genetics of the introduced black rat (Rattus rattus) in Senegal, West Africa. Molecular Ecology, 22, 286–300.
OpenUrl CrossRef Web of Science
↵
Leblois, R., Estoup, A., Rousset, F. (2009). IBDSim: A computer program to simulate genotypic data under isolation by distance. Molecular Ecology Resources, 9, 107–109.
OpenUrl
↵
Legendre P., & Legendre, L.F.J. (2012). Numerical Ecology. Elsevier. Philadelphia.
↵
Li, J.Z., Absher, D.M., Tang, H., Southwick, A.M., Casto, A.M., Ramachandran, S., … Myers, R.M. (2008). Worldwide human relationships inferred from genomewide patterns of variation. Science, 319, 1100–1104.
OpenUrl Abstract/FREE Full Text
↵
Li J., & Heap, A.D. (2011). A review of comparative studies of spatial interpolation methods in environmental sciences: Performance and impact factors. Ecological Informatics, 6, 228–241.
OpenUrl CrossRef Web of Science
↵
Lotterhos, K.E., & Whitlock, M.C. (2014). Evaluation of demographic history and neutral parameterization on the performance of F_ST outlier tests. Molecular Ecology, 23, 2178–2192.
OpenUrl CrossRef PubMed
↵
Lundgren, E., & Ralph, P.L. (2018). Are populations like a circuit? The relationship between isolation by distance and isolation by resistance. bioRxiv, doi: 10.1101/451328.
OpenUrl Abstract/FREE Full Text
↵
MartinezSolano, I., Gonzalez, E.G. (2008). Patterns of gene flow and sourcesink dynamics in high altitude populations of the common toad Bufo bufo (Anura: Bufonidae). Biological Journal of the Linnean Society, 95, 824–839.
OpenUrl CrossRef Web of Science
↵
Menozzi, P., Piazza, A., & CavalliSforza, L. (1978). Synthetic maps of human gene frequencies in Europeans. Science, 201, 786–792.
OpenUrl Abstract/FREE Full Text
↵
Miraldo, A., Hewitt, G.M., Paulo, O.S., & Emerson, B.C. (2011). Phylogeography and demographic history of Lacerta lepida in the Iberian Peninsula: Multiple refugia, range expansions and secondary contact zones. BMC Evolutionary Biology, 11, 170.
OpenUrl
↵
Moritz, C., Hoskin, C.J., MacKenzie, J.B., Phillips, B.L., Tonione, M., Silva, N., …, Graham, C.H. (2009). Identification and dynamics of a cryptic suture zone in tropical rainforest. Proceedings of the Royal Society B: Biological Sciences, 276, 1235–1244.
OpenUrl PubMed
↵
Muscarella, R., Galante, P.J., SoleyGuardia, M., Boria, R.A., Kass, J.M., Uriarte, M., & Anderson, R.P. (2014). ENMeval: An R package for conducting spatially independent evaluations and estimating optimal model complexity for Maxent ecological niche models. Methods in Ecology and Evolution, 5, 1198–1205.
OpenUrl
↵
Nei, M., Maruyama, T., & Chakraborty, R. (1975). The bottleneck effect and genetic variability in populations. Evolution, 29, 1–10.
OpenUrl CrossRef Web of Science
↵
Nice, C.C., Gompert, Z., Fordyce, J.A., Forister, M.L., Lucas, L.K., & Buerkle, C.A. (2013). Hybrid speciation and independent evolution in lineages of alpine butterflies. Evolution, 67, 1055–1068.
OpenUrl CrossRef PubMed
↵
Novembre, J., & Stephens, M. (2008). Interpreting principal component analyses of spatial population genetic variation. Nature Genetics, 40, 646–649.
OpenUrl CrossRef PubMed Web of Science
↵
Patterson, N., Price, A.L., & Reich, D. (2006). Population structure and eigenanalysis. PLoS Genetics, 2, e190.
OpenUrl CrossRef
↵
Peischl, S., Dupanloup, I., Kirkpatrick, M., & Excoffier, L. (2013). On the accumulation of deleterious mutations during range expansions. Molecular Ecology, 22, 5972–5982.
OpenUrl CrossRef
↵
Pemberton, T.J., DeGiorgio, M., & Rosenberg, N.A. (2013). Population structure in a comprehensive genomic data set on human microsatellite variation. G3: Genes, Genomes, Genetics, 3, 891–907.
OpenUrl
↵
Perkins, L.E., & Swayne, D.E. (2001). Pathobiology of A/chicken/Hong Kong/220/97 (H5N1) avian influenza virus in seven gallinaceous species. Veterinary Pathology, 38, 149–164.
OpenUrl CrossRef PubMed Web of Science
↵
Peter, B.M., & Slatkin, M. (2013). Detecting range expansions from genetic data. Evolution, 67, 3274–3289.
OpenUrl CrossRef PubMed Web of Science
↵
Peter, B.M., & Slatkin, M. (2015). The effective founder effect in a spatially expanding population. Evolution, 69, 721–734.
OpenUrl CrossRef PubMed
↵
Peterson, K.R., Pfister, D.H., & Bell, C.D. (2010). Cophylogeny and biogeography of the fungal parasite Cyttaria and its host Nothofagus, southern beech. Mycologia, 102, 1417–1425.
OpenUrl CrossRef PubMed
↵
Petkova, D., Novembre, J., & Stephens, M. (2016). Visualizing spatial population structure with estimated effective migration surfaces. Nature Genetics, 48, 94–100.
OpenUrl CrossRef PubMed
↵
Pharo, E.J., & Zartman, C.E. (2007). Bryophytes in a changing landscape: The hierarchical effects of habitat fragmentation on ecological and evolutionary processes. Biological Conservation, 135, 315–325.
OpenUrl CrossRef Web of Science
↵
Phillips, S.J., Anderson, R.P., & Schapire, R.E., (2006). Maximum entropy modeling of species geographic distributions. Ecological Modelling, 190, 231–259.
OpenUrl CrossRef Web of Science
↵
Phillips, B.L., Kelehear, C., Pizzatto, L., Brown, G.P., Barton, D., & Shine, R. (2010). Parasites and pathogens lag behind their host during periods of host range advance. Ecology, 91, 872–881.
OpenUrl CrossRef PubMed Web of Science
↵
Pierce, J.L., Meyer, G.A., & Jull, A.J.T. (2004). Fireinduced erosion and millennialscale climate change in northern ponderosa pine forests. Nature, 432, 87–90.
OpenUrl CrossRef GeoRef PubMed Web of Science
↵
Pires, T.H.S., Farago, T.B., Campos, D.F., Cardoso, G.M., & Zuanon, J. (2016). Traits of a lineage with extraordinary geographical range: Ecology, behavior and lifehistory of the sailfin tetra Crenuchus spilurus. Environmental Biology of Fishes, 99, 925–937.
OpenUrl
↵
Potter, S., Xue, A.T., Bragg, J.G., Rosauer, D.F., Roycroft, E.J., & Moritz, C. (2017). Pleistocene climatic changes drive diversification across a tropical savanna. Molecular Ecology, 27, 520–532.
OpenUrl
↵
Pudlo, P., Marin, J.M., Estoup, A., Cornuet, J.M., Gautier, M., Robert, C.P. (2015). Reliable ABC model choice via random forests. Bioinformatics 32, 859–866.
OpenUrl
↵
Radosavljevic, A., & Anderson, R.P. (2014) Making better Maxent models of species distributions: complexity, overfitting and evaluation. Journal of Biogeography, 41, 629–643.
OpenUrl CrossRef
↵
Ramachandran, S., Deshpande, O., Roseman, C.C., Rosenberg, N.A., Feldman, M.W., CavalliSforza, L. (2005). Support from the relationship of genetic and geographic distance in human populations for a serial founder effect originating in Africa. Proceedings of the National Academy of Sciences of the United States of America, 102, 15942–15947.
OpenUrl Abstract/FREE Full Text
↵
Ramırez-Garcıa, P., LópezBlanco J., & Ocaña, D. (1998). Mangrove vegetation assessment in the Santiago River Mouth, Mexico, by means of supervised classification using LandsatTM imagery. Forest Ecology and Management, 105, 217–229.
OpenUrl
↵
Ray, N., & Excoffier, L. (2010). A first step towards inferring levels of longdistance dispersal during past expansions. Molecular Ecology Resources, 10, 902–914.
OpenUrl
↵
Remington, C.L. (1968). Suturezones of hybrid interaction between recently joined biotas. In: Evolutionary Biology, pp. 321–428. Springer, Boston.
↵
Roberts, D.R., & Hamann, A. (2015). Glacial refugia and modern genetic diversity of 22 westernNorth American tree species. Proceedings of the Royal Society B: Biological Sciences, 282, 20142903.
OpenUrl CrossRef PubMed
↵
Ruiz-Cooley, R.I., Ballance, L.T., & McCarthy, M.D. (2013). Range expansion of the jumbosquid in the NE Pacific: δ15N decrypts multiple origins, migration and habitat use. PloSone, 8, e59651.
OpenUrl
↵
Sax, D.F., Stachowicz, J.J., Brown, J.H., Bruno, J.F., Dawson, M.N, Gaines, S.D., …, Rice, W.R. (2007). Ecological and evolutionary insights from species invasions. Trends in Ecology & Evolution, 22, 465–471.
OpenUrl
↵
Schrider, D.R., & Kern, A.D. (2018). Supervised machine learning for population genetics: Anew paradigm. Trends in Genetics, 34, 301312.
OpenUrl
↵
Scott, D.W. (1979). On optimal and databased histograms. Biometrika, 66, 605.
OpenUrl CrossRef Web of Science
↵
Shirk, A.J., & Cushman, S.A. (2014). Spatiallyexplicit estimation of Wright’s neighborhood size in continuous populations. Frontiers in Ecology and Evolution, 2, 177.
OpenUrl
↵
Slatkin, M. (1987) Gene flow and the geographic structure of natural populations. Science, 236, 787–792.
OpenUrl Abstract/FREE Full Text
↵
Slatkin, M., & Excoffier, L. (2012). Serial founder effects during range expansion: A spatialanalog of genetic drift. Genetics, 191, 171–181.
OpenUrl Abstract/FREE Full Text
↵
Smouse, P.E., & Peakall, R. (1999). Spatial autocorrelation analysis of individual multiallele andmultilocus genetic structure. Heredity, 82, 561–573.
OpenUrl CrossRef PubMed Web of Science
↵
Soberón, J. (2007) Grinnellian and Eltonian niches and geographic distributions of species. Ecology Letters, 10, 1115–1123.
OpenUrl CrossRef PubMed Web of Science
↵
Sokal, R.R., Harding, R.M., & Oden, N.L. (1989). Spatial patterns of human gene frequencies in Europe. American Journal of Physical Anthropology, 80, 267–294.
OpenUrl CrossRef PubMed Web of Science
↵
Spear, S.F., Balkenhol, N., Fortin, M.J., McRae, B.H., & Scribner, K. (2010). Use of resistance surfaces for landscape genetic studies: considerations for parameterization and analysis. Molecular Ecology, 19, 3576–3591.
OpenUrl CrossRef PubMed Web of Science
↵
Stein, M.L. (2012). Interpolation of Spatial Data: Some Theory for Kriging. Springer. New York.
↵
Sturges, H.A. (1926). The choice of a class interval. Journal of the American Statistical Association, 21, 65–66.
OpenUrl CrossRef Web of Science
↵
Swenson, N.G. (2010). Mapping the suturing of a continental biota. Molecular Ecology, 19, 5324–5327.
OpenUrl CrossRef PubMed
↵
Thompson, R.S., & Anderson, K.H. (2000). Biomes of western North America at 18,000, 6000 and 0 14C yr bp reconstructed from pollen and packrat midden data. Journal of Biogeography, 27, 555–584.
OpenUrl CrossRef Web of Science
↵
Thompson, L.G., MosleyThompson, E., Davis, M. Lin, P.N., Yao, T., Dyurgerov, M., & Dai, J. (1993). “Recent warming”: ice core evidence from tropical ice cores with emphasis on Central Asia. Global and Planetary Change, 7, 145–156.
OpenUrl CrossRef GeoRef Web of Science
↵
Thuiller, W., Albert, C., Araújo, M.B., Berry, P.M., Cabeza, M., Guisan, A., …, Zimmermann, E. (2008). Predicting global change impacts on plant species’ distributions: future challenges. Perspectives in Plant Ecology, Evolution and Systematics, 9, 137–152.
OpenUrl CrossRef
↵
Wegmann, D., Currat, M., & Excoffier, L. (2006). Molecular diversity after a range expansion in heterogeneous environments. Genetics, 174, 2009–2020.
OpenUrl Abstract/FREE Full Text
↵
Wicker, E., Lefeuvre, P., de Cambiaire, J.C., Lemaire, C., Poussier, S., & Prior, P. (2012). Contrasting recombination patterns and demographic histories of the plant pathogenRalstonia solanacearum inferred from MLSA. The ISME Journal, 6, 961–974.
OpenUrl
↵
Wright, S. (1943). Isolation by distance. Genetics, 28, 114–138.
OpenUrl FREE Full Text

View the discussion thread.

Posted October 30, 2018.

Download PDF

Citation Tools

Subject Area

Evolutionary Biology

Subject Areas

All Articles

Animal Behavior and Cognition (5201)
Biochemistry (11718)
Bioengineering (8724)
Bioinformatics (29132)
Biophysics (14936)
Cancer Biology (12051)
Cell Biology (17360)
Clinical Trials (138)
Developmental Biology (9406)
Ecology (14146)
Epidemiology (2067)
Evolutionary Biology (18269)
Genetics (12223)
Genomics (16768)
Immunology (11844)
Microbiology (28016)
Molecular Biology (11560)
Neuroscience (60822)
Paleontology (450)
Pathology (1864)
Pharmacology and Toxicology (3231)
Physiology (4940)
Plant Biology (10401)
Scientific Communication and Education (1680)
Synthetic Biology (2878)
Systems Biology (7333)
Zoology (1642)

[1] ↵
AlAsadi, H., Petkova, D., Stephens, M., & Novembre, J. (2018). Estimating recent migration and population size surfaces. bioRxiv, doi: 10.1101/365536.
OpenUrl Abstract/FREE Full Text

[2] ↵
Allman, B.E., & Weissman, D.B. (2018). Hitchhiking in space: Ancestry in adapting, spatially extended populations. Evolution, 72, 722–734.
OpenUrl

[3] ↵
Alvarado-Serrano, D.F., & Hickerson, M.J. (2015). Spatially explicit summary statistics for historical population genetic inference. Methods in Ecology and Evolution, 7, 418–427.
OpenUrl

[4] ↵
Alvarado-Serrano, D.F., & Knowles., L.L. (2014). Ecological niche models in phylogeographic studies: Applications, advances and precautions. Molecular Ecology Resources, 14, 233–248.
OpenUrl

[5] ↵
Ashander, J., Ralph, P., McCartneyMelstad, E., & Shaffer, H.B. (2018), Demographic inference 611 in a spatiallyexplicit ecological model from genomic data: A proof of concept for the 612 Mojave Desert Tortoise. bioRxiv. doi: 10.1101/354530.
OpenUrl Abstract/FREE Full Text

[6] ↵
Avise, J.C. (2000). Phylogeography: The history and formation of species. Harvard University Press. Cambridge.

[7] ↵
Avise, J.C., Arnold, J., Ball, R.M., Bermingham, E., Lamb, T., Neigel, J.A., …, Saunders, N.C. (1987). Intraspecific phylogeography: The mitochondrial DNA bridge between population genetics and systematics. Annual Review of Ecology and Systematics, 18, 489–522.
OpenUrl CrossRef Web of Science

[8] ↵
Beaumont, M. (2010). Approximate Bayesian computation in evolution and ecology. Annual Review of Ecology, Evolution, and Systematics, 41, 379–406.
OpenUrl CrossRef Web of Science

[9] ↵
Braconnot, P., OttoBliesner, B., Harrison, S., Joussaume, S., Peterchmitt, J.Y., AbeOuchi, A., …, Zhao, Y. (2007). Results of PMIP2 coupled simulations of the MidHolocene and Last Glacial MaximumPart 1: experiments and largescale features. Climate of the Past, 3, 261–277.
OpenUrl CrossRef Web of Science

[10] ↵
Bradburd, G.S., Coop, G.M., & Ralph, P.L. (2018). Inferring continuous and discrete population genetic structure across space. Genetics, 210, 33–52.
OpenUrl Abstract/FREE Full Text

[11] ↵
Bradburd, G.S., Ralph, P.L., & Coop, G.M. (2016). A spatial framework for understanding population structure and admixture. PLoS genetics, 12, e1005703.
OpenUrl

[12] ↵
Branco, C., Velasco, M., Benguigui, M., Currat, M., Ray, N., & Arenas, M. (2018). Consequences of diverse evolutionary processes on american genetic gradients of modern humans. Heredity, doi: 10.1038/s414370180122x.
OpenUrl CrossRef

[13] ↵
Brown, J.L., Weber, J.J., AlvaradoSerrano, D.F., Hickerson, M.J., Franks, S.J., & Carnaval, A.C. (2016). Predicting the genetic consequences of future climate change: The power of coupling spatial demography, the coalescent, and historical landscape changes. American Journal of Botany, 103, 153–163.
OpenUrl Abstract/FREE Full Text

[14] ↵
Burbrink, F.T., Chan, Y.L., Myers, E.A., Ruane, S., Smith, B.T., & Hickerson, M.J. (2016). Asynchronous demographic responses to Pleistocene climate change in Eastern Nearctic vertebrates. Ecology Letters, 19, 1457–1467.
OpenUrl

[15] ↵
Cardozo, A.L.P., Farias, E.G.G., RodriguesFilho, J.L., Moteiro, I.B., Scandolo, T.M. & Dantas, T.V. (2018). Feeding ecology and ingestion of plastic fragments by Priacanthus arenatus : What’s the fisheries contribution to the problem? Marine Pollution Bulletin, 130, 19–27.
OpenUrl

[16] ↵
Carstens, B.C., Brunsfeld, S.J., Demboski, J.R., Good, J.D., & Sullivan, J. (2005). Investigating the evolutionary history of the Pacific Northwest mesic forest ecosystem: Hypothesis testing within a comparative phylogeographic framework. Evolution, 59, 1639–1652.
OpenUrl PubMed Web of Science

[17] ↵
Charles H., & Dukes, J.S. (2008). Impacts of invasive species on ecosystem services. In: Biological Invasions Ecological Studies, pp. 217–237. Springer, Heidelberg.

[18] ↵
Cristescu, M.E. (2015). Genetic reconstructions of invasion history. Molecular Ecology, 24, 2212–2225.
OpenUrl CrossRef

[19] ↵
Currat, M., Ray, N., & Excoffier, L. (2004). Splatche: a program to simulate genetic diversity taking into account environmental heterogeneity. Molecular Ecology Notes, 4, 139–142.
OpenUrl CrossRef

[20] ↵
Davis, C.D., Epps, C.W., Flitcroft, R.L., & Bank,s, M.A. (2018). Refining and defining riverscape genetics: How rivers influence population genetic structure. Wiley Interdisciplinary Reviews: Water, 5, e1269.
OpenUrl

[21] ↵
DeGiorgio, M., Jakobsson, M., & Rosenberg, N.A. (2009). Explaining worldwide patterns of human genetic variation using a coalescentbased serial founder model of migration outward from Africa. Proceedings of the National Academy of Sciences, 106, 16057–16062.
OpenUrl Abstract/FREE Full Text

[22] ↵
Drake, S.R., & Dogancay, K. (2004). Geolocation by time difference of arrival using hyperbolic asymptotes. In: 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. 361–364.

[23] ↵
Elleouet, J.S., & Aitken, S.N. (2018). Exploring Approximate Bayesian Computation for inferring recent demographic history with genomic markers in nonmodel species. Molecular Ecology Resources, doi: 10.1111/17550998.12758.
OpenUrl CrossRef

[24] ↵
Excoffier, L. (2004). Patterns of DNA sequence diversity and genetic structure after a range expansion: Lessons from the infiniteisland model. Molecular Ecology, 13, 853–864.
OpenUrl CrossRef PubMed Web of Science

[25] ↵
Excoffier, L., Dupanloup, I., HuertaSánchez, E., Sousa, V.C., & Foll, M. (2013). Robust demographic inference from genomic and SNP data. PLoS genetics, 9, e1003905.
OpenUrl

[26] ↵
Excoffier, L., & Foll, M. (2011). Fastsimcoal: A continuoustime coalescent simulator of genomic diversity under arbitrarily complex evolutionary scenarios. Bioinformatics, 27, 1332–1334.
OpenUrl CrossRef PubMed Web of Science

[27] ↵
Excoffier, L., Foll, M., & Petit, R.J. (2008). Genetic consequences of range expansions. Annual Review of Ecology, Evolution, and Systematics, 40, 481–501.
OpenUrl

[28] ↵
Excoffier, L., Hofer, T., & Foll, M. (2009). Detecting loci under selection in a hierarchically structured population. Heredity, 103, 285–298.
OpenUrl CrossRef PubMed Web of Science

[29] ↵
Fagua, J.C., & Gonzalez, V.H. (2007). Growth rates, reproductive phenology, and pollination ecology of Espeletia grandiflora (Asteraceae), a giant Andean caulescent rosette. Plant Biology, 9, 127–135.
OpenUrl CrossRef PubMed

[30] ↵
Fordham, D.A., Brook, B.W., Moritz, C., & NoguésBravo, D. (2014). Better forecasts of range dynamics using genetic data. Trends in Ecology & Evolution, 29, 436–443.
OpenUrl

[31] ↵
Forister, M.L., Gompert, Z., Fordyce, J.A., & Nice, C.C. (2011). After 60 years, an answer to the question: What is the Karner blue butterfly? Biology Letters, 7, 399–402.
OpenUrl

[32] ↵
Fraimout, A., Debat, V., Fellous, S., Hufbauer, R.A., Foucaud, J., Pudlo, P., …, Estoup, A. (2017). Deciphering the routes of invasion of Drosophila suzukii by means of ABC Random Forest. Molecular Biology and Evolution, 34, 980–996.
OpenUrl

[33] ↵
François., O., Currat, M, Ray, N., Han, E., Excoffier, L. & Novembre, J. (2010). Principal component analysis under population genetic models of range expansion and admixture. Molecular Biology and Evolution, 27, 1257–1268.
OpenUrl CrossRef PubMed Web of Science

[34] ↵
Frantz, L.A.F., Schraiber, J.G., Madsen, O., Megens, H.J., Bosse, M., Paudel, Y., …, Groenen, M.A. (2013). Genome sequencing reveals fine scale diversification and reticulation history during speciation in Sus. sus Genome Biology, 14, R107.
OpenUrl

[35] ↵
Frichot, E., & François, O. (2015). LEA: An R package for landscape and ecological association studies. Methods in Ecology and Evolution, 6, 925–929.
OpenUrl

[36] ↵
Fumagalli, M., Sironi, M., Pozzoli, U., FerrerAdmetlla, A., Pattini, L. & Nielsen, R. (2011). Signatures of environmental genetic adaptation pinpoint pathogens as the main selective pressure through human evolution. PLoS genetics, 7, e1002355.
OpenUrl

[37] ↵
Galante, P.J., Alade, B., Muscarella, R., Jansa, S.A., Goodman, S.M., & Anderson, R.P. (2018). The challenge of modeling niches and distributions for datapoor species: a comprehensive approach to model complexity. Ecography, 41, 726–736.
OpenUrl

[38] ↵
Gaston KJ (2003) The Structure and Dynamics of Geographic Ranges. Oxford University Press. Oxford.

[39] ↵
Gompert, Z., Comeault, A.A., Farkas, T.E., Feder, J.L., Parchman, T.L., Buerkle, C.A., & Nosil, P. (2014a). Experimental evidence for ecological selection on genome variation in the wild. Ecology Letters, 17, 369–379.
OpenUrl CrossRef PubMed

[40] ↵
Gompert, Z., Fordyce, J.A., Forister, M.L., Shapiro, A.M., & Nice, C.C. (2006). Homoploid hybrid speciation in an extreme habitat. Science, 314, 1923–1925.
OpenUrl Abstract/FREE Full Text

[41] ↵
Gompert, Z., Forister, M.I., Fordyce, J.A., Nice, C.C., Williamson, R.J., & Buerkle, C.A. (2010). Bayesian analysis of molecular variance in pyrosequences quantifies population genetic structure across the genome of Lycaeides butterflies. Molecular Ecology, 19, 2455–2473.
OpenUrl PubMed Web of Science

[42] ↵
Gompert, Z., Lucas, L.K., Buerkle, C.A., Forister, M.L., Fordyce, J.A., & Nice, C.C. (2014b). Admixture and the organization of genetic diversity in a butterfly species complex revealed through common and rare genetic variants. Molecular Ecology, 23, 4555–4573.
OpenUrl

[43] ↵
Gustafsson, F., Gunnarsson, S., & Ljung, L. (1994). On timefrequency resolution of signal properties using parametric techniques. In: 1994 Proceedings of 33rd IEEE Conference on Decision and Control, pp. 2259-2264.

[44] ↵
Gutenkunst, R.N., Hernandez, R.D., Williamson, S.H., Bustamante, C.D. (2009). Inferring the joint demographic history of multiple populations from multidimensional SNP frequency data. PLoS genetics, 5, e1000695.
OpenUrl

[45] ↵
Hallatschek, O., Hersen, P., Ramanathan, S., & Nelson, D.R. (2007) Genetic drift at expanding frontiers promotes gene segregation. Proceedings of the National Academy of Sciences of the United States of America, 104, 19926–19930.
OpenUrl Abstract/FREE Full Text

[46] ↵
Hallatschek, O., & Nelson DR (2008) Gene surfing in expanding populations. Theoretical Population Biology, 73, 158–170.
OpenUrl CrossRef PubMed Web of Science

[47] ↵
Harpending, H.C. (1994). Signature of ancient population growth in a lowresolution mitochondrial DNA mismatch distribution. Human Biology, 66, 591–600.
OpenUrl PubMed Web of Science

[48] ↵
Henn, B.M., CavalliSforza, L.L., & Feldman, M.W. (2012). The great human expansion. Proceedings of the National Academy of Sciences of the United States of America, 109, 17758–17764.
OpenUrl Abstract/FREE Full Text

[49] ↵
He, Q., Prado, J.R., & Knowles, L.L. (2017). Inferring the geographic origin of a range expansion: Latitudinal and longitudinal coordinates inferred from genomic data in an ABC framework with the program xorigin. Molecular Ecology, 26, 6908–6920.
OpenUrl

[50] ↵
Hewitt, G. (2000). The genetic legacy of the Quaternary ice ages. Nature, 405, 907–913.
OpenUrl CrossRef GeoRef PubMed Web of Science

[51] ↵
He, Z., Zhai, W., Wen, H., Tang, T., Wang, Y., Lu, X., …, Shi, S. (2011). Two evolutionary histories in the genome of rice: The roles of domestication genes. PLoS genetics, 7, e1002100.
OpenUrl

[52] ↵
Hijmans, R.J. (2012). Crossvalidation of species distribution models: Removing spatial sorting bias and calibration with a null model. Ecology, 93, 679–688.
OpenUrl CrossRef PubMed Web of Science

[53] House, G.L., & Hahn, M.W. (2018). Evaluating methods to visualize patterns of genetic differentiation on a landscape. Molecular Ecology Resources, 18, 448–460.
OpenUrl

[54] ↵
Ibrahim, K.M., Nichols, R.A., & Hewitt, G.M. (1996). Spatial patterns of genetic variation generated by different forms of dispersal during range expansion. Heredity, 77, 282–291.
OpenUrl CrossRef Web of Science

[55] ↵
Johnson, P.T.J., Olden, J.D., Solomon, C.T., Vander Zanden, M.J. (2009). Interactions among invaders: Community and ecosystem effects of multiple invasive species in an experimental aquatic system. Oecologia, 159, 161–170.
OpenUrl CrossRef PubMed Web of Science

[56] ↵
Jones, E., Oliphant, T., & Peterson, P. (2001). SciPy: Open source scientific tools for Python, doi: 10.1234/12345678.
OpenUrl CrossRef

[57] ↵
Joseph, T.A., Hickerson, M.J., & AlvaradoSerrano, D.F. (2016). Demographic inference under a spatially continuous coalescent model. Heredity, 117, 94–99.
OpenUrl

[58] ↵
Kanginakudru, S., Metta, M., Jakati, R.D., & Nagaraju, J. (2008). Genetic evidence from Indian red jungle fowl corroborates multiple domestication of modern day chicken. BMC Evolutionary Biology, 8, 174.
OpenUrl

[59] ↵
Kimura, M., & Weiss, G.H. (1964). The stepping stone model of population structure and the decrease of genetic correlation with distance. Genetics, 49, 561–576.
OpenUrl FREE Full Text

[60] ↵
Kirkpatrick M., & Barton, N.H. (1997). Evolution of a species’ range. The American naturalist, 150, 1–23.
OpenUrl CrossRef PubMed Web of Science

[61] ↵
Knowles, L.L., & AlvaradoSerrano, D.F. (2010). Exploring the population genetic consequences of the colonization process with spatiotemporally explicit models: Insights from coupled ecological, demographic and genetic models in montane grasshoppers. Molecular Ecology, 19, 3727–3745.
OpenUrl CrossRef PubMed Web of Science

[62] ↵
Konečný, A., Estoup, A., Duplantier, J.M., Bryja, J., Ba, K., Galan, M., …, Cosson, J.F. (2013). Invasion genetics of the introduced black rat (Rattus rattus) in Senegal, West Africa. Molecular Ecology, 22, 286–300.
OpenUrl CrossRef Web of Science

[63] ↵
Leblois, R., Estoup, A., Rousset, F. (2009). IBDSim: A computer program to simulate genotypic data under isolation by distance. Molecular Ecology Resources, 9, 107–109.
OpenUrl

[64] ↵
Legendre P., & Legendre, L.F.J. (2012). Numerical Ecology. Elsevier. Philadelphia.

[65] ↵
Li, J.Z., Absher, D.M., Tang, H., Southwick, A.M., Casto, A.M., Ramachandran, S., … Myers, R.M. (2008). Worldwide human relationships inferred from genomewide patterns of variation. Science, 319, 1100–1104.
OpenUrl Abstract/FREE Full Text

[66] ↵
Li J., & Heap, A.D. (2011). A review of comparative studies of spatial interpolation methods in environmental sciences: Performance and impact factors. Ecological Informatics, 6, 228–241.
OpenUrl CrossRef Web of Science

[67] ↵
Lotterhos, K.E., & Whitlock, M.C. (2014). Evaluation of demographic history and neutral parameterization on the performance of F_ST outlier tests. Molecular Ecology, 23, 2178–2192.
OpenUrl CrossRef PubMed

[68] ↵
Lundgren, E., & Ralph, P.L. (2018). Are populations like a circuit? The relationship between isolation by distance and isolation by resistance. bioRxiv, doi: 10.1101/451328.
OpenUrl Abstract/FREE Full Text

[69] ↵
MartinezSolano, I., Gonzalez, E.G. (2008). Patterns of gene flow and sourcesink dynamics in high altitude populations of the common toad Bufo bufo (Anura: Bufonidae). Biological Journal of the Linnean Society, 95, 824–839.
OpenUrl CrossRef Web of Science

[70] ↵
Menozzi, P., Piazza, A., & CavalliSforza, L. (1978). Synthetic maps of human gene frequencies in Europeans. Science, 201, 786–792.
OpenUrl Abstract/FREE Full Text

[71] ↵
Miraldo, A., Hewitt, G.M., Paulo, O.S., & Emerson, B.C. (2011). Phylogeography and demographic history of Lacerta lepida in the Iberian Peninsula: Multiple refugia, range expansions and secondary contact zones. BMC Evolutionary Biology, 11, 170.
OpenUrl

[72] ↵
Moritz, C., Hoskin, C.J., MacKenzie, J.B., Phillips, B.L., Tonione, M., Silva, N., …, Graham, C.H. (2009). Identification and dynamics of a cryptic suture zone in tropical rainforest. Proceedings of the Royal Society B: Biological Sciences, 276, 1235–1244.
OpenUrl PubMed

[73] ↵
Muscarella, R., Galante, P.J., SoleyGuardia, M., Boria, R.A., Kass, J.M., Uriarte, M., & Anderson, R.P. (2014). ENMeval: An R package for conducting spatially independent evaluations and estimating optimal model complexity for Maxent ecological niche models. Methods in Ecology and Evolution, 5, 1198–1205.
OpenUrl

[74] ↵
Nei, M., Maruyama, T., & Chakraborty, R. (1975). The bottleneck effect and genetic variability in populations. Evolution, 29, 1–10.
OpenUrl CrossRef Web of Science

[75] ↵
Nice, C.C., Gompert, Z., Fordyce, J.A., Forister, M.L., Lucas, L.K., & Buerkle, C.A. (2013). Hybrid speciation and independent evolution in lineages of alpine butterflies. Evolution, 67, 1055–1068.
OpenUrl CrossRef PubMed

[76] ↵
Novembre, J., & Stephens, M. (2008). Interpreting principal component analyses of spatial population genetic variation. Nature Genetics, 40, 646–649.
OpenUrl CrossRef PubMed Web of Science

[77] ↵
Patterson, N., Price, A.L., & Reich, D. (2006). Population structure and eigenanalysis. PLoS Genetics, 2, e190.
OpenUrl CrossRef

[78] ↵
Peischl, S., Dupanloup, I., Kirkpatrick, M., & Excoffier, L. (2013). On the accumulation of deleterious mutations during range expansions. Molecular Ecology, 22, 5972–5982.
OpenUrl CrossRef

[79] ↵
Pemberton, T.J., DeGiorgio, M., & Rosenberg, N.A. (2013). Population structure in a comprehensive genomic data set on human microsatellite variation. G3: Genes, Genomes, Genetics, 3, 891–907.
OpenUrl

[80] ↵
Perkins, L.E., & Swayne, D.E. (2001). Pathobiology of A/chicken/Hong Kong/220/97 (H5N1) avian influenza virus in seven gallinaceous species. Veterinary Pathology, 38, 149–164.
OpenUrl CrossRef PubMed Web of Science

[81] ↵
Peter, B.M., & Slatkin, M. (2013). Detecting range expansions from genetic data. Evolution, 67, 3274–3289.
OpenUrl CrossRef PubMed Web of Science

[82] ↵
Peter, B.M., & Slatkin, M. (2015). The effective founder effect in a spatially expanding population. Evolution, 69, 721–734.
OpenUrl CrossRef PubMed

[83] ↵
Peterson, K.R., Pfister, D.H., & Bell, C.D. (2010). Cophylogeny and biogeography of the fungal parasite Cyttaria and its host Nothofagus, southern beech. Mycologia, 102, 1417–1425.
OpenUrl CrossRef PubMed

[84] ↵
Petkova, D., Novembre, J., & Stephens, M. (2016). Visualizing spatial population structure with estimated effective migration surfaces. Nature Genetics, 48, 94–100.
OpenUrl CrossRef PubMed

[85] ↵
Pharo, E.J., & Zartman, C.E. (2007). Bryophytes in a changing landscape: The hierarchical effects of habitat fragmentation on ecological and evolutionary processes. Biological Conservation, 135, 315–325.
OpenUrl CrossRef Web of Science

[86] ↵
Phillips, S.J., Anderson, R.P., & Schapire, R.E., (2006). Maximum entropy modeling of species geographic distributions. Ecological Modelling, 190, 231–259.
OpenUrl CrossRef Web of Science

[87] ↵
Phillips, B.L., Kelehear, C., Pizzatto, L., Brown, G.P., Barton, D., & Shine, R. (2010). Parasites and pathogens lag behind their host during periods of host range advance. Ecology, 91, 872–881.
OpenUrl CrossRef PubMed Web of Science

[88] ↵
Pierce, J.L., Meyer, G.A., & Jull, A.J.T. (2004). Fireinduced erosion and millennialscale climate change in northern ponderosa pine forests. Nature, 432, 87–90.
OpenUrl CrossRef GeoRef PubMed Web of Science

[89] ↵
Pires, T.H.S., Farago, T.B., Campos, D.F., Cardoso, G.M., & Zuanon, J. (2016). Traits of a lineage with extraordinary geographical range: Ecology, behavior and lifehistory of the sailfin tetra Crenuchus spilurus. Environmental Biology of Fishes, 99, 925–937.
OpenUrl

[90] ↵
Potter, S., Xue, A.T., Bragg, J.G., Rosauer, D.F., Roycroft, E.J., & Moritz, C. (2017). Pleistocene climatic changes drive diversification across a tropical savanna. Molecular Ecology, 27, 520–532.
OpenUrl

[91] ↵
Pudlo, P., Marin, J.M., Estoup, A., Cornuet, J.M., Gautier, M., Robert, C.P. (2015). Reliable ABC model choice via random forests. Bioinformatics 32, 859–866.
OpenUrl

[92] ↵
Radosavljevic, A., & Anderson, R.P. (2014) Making better Maxent models of species distributions: complexity, overfitting and evaluation. Journal of Biogeography, 41, 629–643.
OpenUrl CrossRef

[93] ↵
Ramachandran, S., Deshpande, O., Roseman, C.C., Rosenberg, N.A., Feldman, M.W., CavalliSforza, L. (2005). Support from the relationship of genetic and geographic distance in human populations for a serial founder effect originating in Africa. Proceedings of the National Academy of Sciences of the United States of America, 102, 15942–15947.
OpenUrl Abstract/FREE Full Text

[94] ↵
Ramırez-Garcıa, P., LópezBlanco J., & Ocaña, D. (1998). Mangrove vegetation assessment in the Santiago River Mouth, Mexico, by means of supervised classification using LandsatTM imagery. Forest Ecology and Management, 105, 217–229.
OpenUrl

[95] ↵
Ray, N., & Excoffier, L. (2010). A first step towards inferring levels of longdistance dispersal during past expansions. Molecular Ecology Resources, 10, 902–914.
OpenUrl

[96] ↵
Remington, C.L. (1968). Suturezones of hybrid interaction between recently joined biotas. In: Evolutionary Biology, pp. 321–428. Springer, Boston.

[97] ↵
Roberts, D.R., & Hamann, A. (2015). Glacial refugia and modern genetic diversity of 22 westernNorth American tree species. Proceedings of the Royal Society B: Biological Sciences, 282, 20142903.
OpenUrl CrossRef PubMed

[98] ↵
Ruiz-Cooley, R.I., Ballance, L.T., & McCarthy, M.D. (2013). Range expansion of the jumbosquid in the NE Pacific: δ15N decrypts multiple origins, migration and habitat use. PloSone, 8, e59651.
OpenUrl

[99] ↵
Sax, D.F., Stachowicz, J.J., Brown, J.H., Bruno, J.F., Dawson, M.N, Gaines, S.D., …, Rice, W.R. (2007). Ecological and evolutionary insights from species invasions. Trends in Ecology & Evolution, 22, 465–471.
OpenUrl

[100] ↵
Schrider, D.R., & Kern, A.D. (2018). Supervised machine learning for population genetics: Anew paradigm. Trends in Genetics, 34, 301312.
OpenUrl

[101] ↵
Scott, D.W. (1979). On optimal and databased histograms. Biometrika, 66, 605.
OpenUrl CrossRef Web of Science

[102] ↵
Shirk, A.J., & Cushman, S.A. (2014). Spatiallyexplicit estimation of Wright’s neighborhood size in continuous populations. Frontiers in Ecology and Evolution, 2, 177.
OpenUrl

[103] ↵
Slatkin, M. (1987) Gene flow and the geographic structure of natural populations. Science, 236, 787–792.
OpenUrl Abstract/FREE Full Text

[104] ↵
Slatkin, M., & Excoffier, L. (2012). Serial founder effects during range expansion: A spatialanalog of genetic drift. Genetics, 191, 171–181.
OpenUrl Abstract/FREE Full Text

[105] ↵
Smouse, P.E., & Peakall, R. (1999). Spatial autocorrelation analysis of individual multiallele andmultilocus genetic structure. Heredity, 82, 561–573.
OpenUrl CrossRef PubMed Web of Science

[106] ↵
Soberón, J. (2007) Grinnellian and Eltonian niches and geographic distributions of species. Ecology Letters, 10, 1115–1123.
OpenUrl CrossRef PubMed Web of Science

[107] ↵
Sokal, R.R., Harding, R.M., & Oden, N.L. (1989). Spatial patterns of human gene frequencies in Europe. American Journal of Physical Anthropology, 80, 267–294.
OpenUrl CrossRef PubMed Web of Science

[108] ↵
Spear, S.F., Balkenhol, N., Fortin, M.J., McRae, B.H., & Scribner, K. (2010). Use of resistance surfaces for landscape genetic studies: considerations for parameterization and analysis. Molecular Ecology, 19, 3576–3591.
OpenUrl CrossRef PubMed Web of Science

[109] ↵
Stein, M.L. (2012). Interpolation of Spatial Data: Some Theory for Kriging. Springer. New York.

[110] ↵
Sturges, H.A. (1926). The choice of a class interval. Journal of the American Statistical Association, 21, 65–66.
OpenUrl CrossRef Web of Science

[111] ↵
Swenson, N.G. (2010). Mapping the suturing of a continental biota. Molecular Ecology, 19, 5324–5327.
OpenUrl CrossRef PubMed

[112] ↵
Thompson, R.S., & Anderson, K.H. (2000). Biomes of western North America at 18,000, 6000 and 0 14C yr bp reconstructed from pollen and packrat midden data. Journal of Biogeography, 27, 555–584.
OpenUrl CrossRef Web of Science

[113] ↵
Thompson, L.G., MosleyThompson, E., Davis, M. Lin, P.N., Yao, T., Dyurgerov, M., & Dai, J. (1993). “Recent warming”: ice core evidence from tropical ice cores with emphasis on Central Asia. Global and Planetary Change, 7, 145–156.
OpenUrl CrossRef GeoRef Web of Science

[114] ↵
Thuiller, W., Albert, C., Araújo, M.B., Berry, P.M., Cabeza, M., Guisan, A., …, Zimmermann, E. (2008). Predicting global change impacts on plant species’ distributions: future challenges. Perspectives in Plant Ecology, Evolution and Systematics, 9, 137–152.
OpenUrl CrossRef

[115] ↵
Wegmann, D., Currat, M., & Excoffier, L. (2006). Molecular diversity after a range expansion in heterogeneous environments. Genetics, 174, 2009–2020.
OpenUrl Abstract/FREE Full Text

[116] ↵
Wicker, E., Lefeuvre, P., de Cambiaire, J.C., Lemaire, C., Poussier, S., & Prior, P. (2012). Contrasting recombination patterns and demographic histories of the plant pathogenRalstonia solanacearum inferred from MLSA. The ISME Journal, 6, 961–974.
OpenUrl

[117] ↵
Wright, S. (1943). Isolation by distance. Genetics, 28, 114–138.
OpenUrl FREE Full Text

Detecting spatial dynamics of range expansions with geo-referenced genomewide SNP data and the geographic spectrum of shared alleles

Abstract

Introduction

Materials and Methods

The geographic spectrum of shared alleles (GSSA)

GSSA overview

GSSA calculation

GSSA Implementation

Raggedness index

Simulation experiments

Empirical Application

Results

Simulation experiments

Empirical application

Discussion

Potential Pitfalls

Future Prospects

Conclusion

Software

Data Accessibility

Author Contributions

Acknowledgements

References

Citation Manager Formats

Subject Area