Polygenic Adaptation has Impacted Multiple Anthropometric Traits

Jeremy J. Berg; Xinjun Zhang; Graham Coop

doi:10.1101/167551

Abstract

Our understanding of the genetic basis of human adaptation is biased toward loci of large pheno-typic effect. Genome wide association studies (GWAS) now enable the study of genetic adaptation in polygenic phenotypes. We test for polygenic adaptation among 187 world-wide human populations using polygenic scores constructed from GWAS of 34 complex traits. We identify signals of polygenic adaptation for anthropometric traits including height, infant head circumference (IHC), hip circumference and waist-to-hip ratio (WHR). Analysis of ancient DNA samples indicates that a north-south cline of height within Europe and and a west-east cline across Eurasia can be traced to selection for increased height in two late Pleistocene hunter gatherer populations living in western and west-central Eurasia. Our observation that IHC and WHR follow a latitudinal cline in Western Eurasia support the role of natural selection driving Bergmann’s Rule in humans, consistent with thermoregulatory adaptation in response to latitudinal temperature variation.

Author’s Note on Failure to Replicate After this preprint was posted, the UK Biobank dataset was released, providing a new and open GWAS resource. When attempting to replicate the height selection results from this preprint using GWAS data from the UK Biobank, we discovered that we could not. In subsequent analyses, we determined that both the GIANT consortium height GWAS data, as well as another dataset that was used for replication, were impacted by stratification issues that created or at a minimum substantially inflated the height selection signals reported here. The results of this second investigation, written together with additional coauthors, have now been published (https://elifesciences.org/articles/39725 along with another paper by a separate group of authors, showing similar issues https://elifesciences.org/articles/39702). A preliminary investigation shows that the other non-height based results may suffer from similar issues. We stand by the theory and statistical methods reported in this paper, and the paper can be cited for these results. However, we have shown that the data on which the major empirical results were based are not sound, and so should be treated with caution until replicated.

Main Text

Decades of research in anthropology have identified anthropometric traits that show evidence of biological adaptation to climatic conditions as humans spread around the world over the past hundred thousand years.^{1, 2, 3} However, it can be challenging to rule out non-heritable environmental factors,^{4, 5} as opposed to genetic variation, as the primary cause of these phenotypic differences.⁶ Even for phenotypes where there is some confidence that some of the phenotypic differences among populations are due in part to genetic differences, it is often hard to rule out genetic drift as an alterative explanation to selection.^{7, 8, 9} The development of population-genetic methods and genomic data resources during the last few decades has enabled the interrogation of adaptive hypotheses and has produced an expanding list of examples of plausible human adaptations.^{10, 11} However, such approaches are often inherently limited to detecting adaptation in genetically simple traits via large allele frequency changes at a small number of loci, whereas many adaptations likely involve highly polygenic traits and so are undetectable by most approaches.^{12, 13} Genome-wide association studies (GWAS) have now identified thousands of loci underlying the genetic basis of many complex traits.^{14, 15, 16} These studies oer an unprecedented opportunity to identify adaptation in recent human evolution by detecting subtle shifts in allele frequencies compounded over many GWAS loci.^{17, 18, 19, 20, 21, 22, 23}

We conducted a broad screen for evidence of directional selection on variants that contribute to 34 polygenic traits by studying the distribution of their allele frequencies in a dataset of 187 human populations (2158 individuals across 161 populations from the Human Origins Panel²⁴ and 2504 individuals across 26 populations of the 1000 Genomes phase 3 panel²⁵), making use of prior large-scale GWAS for these traits (see Table S1). We divided the genome into 1700 non-overlapping and approximately independent linkage blocks²⁶ and choose the SNP with the highest posterior probability of association within the block.^{27, 28} For each trait, we calculate a polygenic score for each population as a weighted sum of allele frequencies at each of these 1700 SNPs, with the GWAS effect sizes taken as the weights. Figure 1 shows the distribution of these scores for height across our population samples.

Figure 1: Polygenic Height Scores for 187 population samples (combined Human origin panel and 1000 genomes datasets), plotted on geographic coordinates.

Blue corresponds to populations with the “tallest” polygenic height scores, and yellow the “shortest”.

These polygenic scores should not be viewed as phenotypic predictions across populations. For example, the Maasai and Biaka pygmy populations have similar polygenic scores despite having dramatic differences in height.²⁹ Discrepancies between polygenic scores and actual phenotypes may be expected to occur either because of purely environmental influences on phenotype, or due to gene-by-gene and gene-by-environment interactions. We also expect that the accuracy of these scores when viewed as predictions should decay with genetic distance from Europe (where the GWAS were carried out), due to changes in the structure of linkage disequilibrium (LD) between causal variants and tag SNPs picked up in GWAS, and because GWAS are biased toward discovering intermediate frequency variants, which will explain more variance in the region they are mapped in than outside of it. These caveats notwithstanding, the distribution of polygenic scores across populations is informative about the history of natural selection on a given phenotype,¹⁸ and a number of striking patterns are visible in their distribution. For example, there is a strong gradient in polygenic height scores running from east to west across Eurasia (Figure 1)

To explore whether patterns observed in the polygenic scores were caused by natural selection, we tested whether the observed distribution of polygenic scores across populations could plausibly have been generated under a neutral model of genetic drift. To understand this null model, consider that a neutrally evolving allele has the same expected frequency across a set of independently evolving sub-populations. However, due to genetic drift, individual sub-populations will deviate from this expected frequency, with the variance of the sub-population frequencies given by F_ST p (1 − p), where p is the ancestral allele frequency, and F_ST is Wright’s “fixation index,”30 which can be measured from genome-wide data.^{17, 31} Our polygenic scores sum the contributions of a large number of effectively unlinked loci, which under our null model will experience genetic drift independently. It follows that under a model of genetic drift, the polygenic score of each of a set of independent sub-populations will be normally distributed, with variance of V AFST, where VA is the additive genetic variance of polygenic scores the ancestral population. Our test is based on a generalization of this simple relation in which we account for both variance and covariance among multiple populations that are non-independent due to common descent, migration, and admixture over the history of human evolution. Specifically, we model the joint distribution of polygenic scores as multivariate normal and use a generalized variance statistic (Q_X) to measure the over-dispersion of polygenic scores relative to the neutral prediction, which is taken as evidence in favor of natural selection driving dierence among populations in polygenic scores (see Methods and our previous study¹⁸ for details). Our approach is similar to classic tests of adaptation on phenotypes measured in common gardens, which rely on comparisons of the within and among-population additive genetic variance for phenotypes and neutral markers, i.e. Q_ST/F_ST comparisons.32, 33, 34 Importantly, the neutral distribution we derive holds independent of whether the loci truly influence the trait in an additive manner (with respect to each other or the environment), and whether the GWAS loci are truly causal or merely imperfect tags. However, population structure in the original GWAS panels can confound signals of polygenic adaptation.^{18, 20} Modern methods are generally considered to be effective at controlling for the effects of population structure,³⁵ and we proceed assuming that it has been adequately accounted for in the original GWAS panels.

We applied our test to each of the 34 traits across all populations, as well as within nine restricted regional groupings (Figure 2 and Table S3). Using our test across all populations as a general test for the impact of selection anywhere in the dataset, we find 5 signals of selection after controlling for multiple testing (p < 0.05/34). In each case of significant over-dispersion, the signal represents a small but systematic shift in allele frequency of a few percent across many loci, which would be undetectable by standard population-genetic tests for selection (see Table S6), such that the majority of the variance in polygenic scores is within populations as opposed to among populations (see Table S4). The traits involved include height, infant head circumference (IHC), hip circumference, waist-hip ratio (WHR), and type 2 diabetes (T2D). Although the sixth-strongest signal, waist circumference, failed to meet the multiple-testing correction, we include it in subsequent analyses due to its obvious relationship to WHR. We also found signals of selection on polygenic scores constructed for waist and hip circumference and waist-hip ratio when adjusted for BMI (Table S3), but we focus on the unadjusted versions for ease of interpretation. We do not replicate a previously reported signal of selection on BMI within Europe, but also note that the previous study used many more SNPs than we have in constructing polygenic scores, which likely explains the difference.²⁰

The predominantly European ascertainment of GWAS loci can lead to apparent deviations from neutrality. Therefore all p values in Figure 2 and throughout the paper are derived from comparing test statistics against frequency-matched empirical controls, unless otherwise stated (see Text S1.3). This empirical matching is an important control. For example, the distribution of polygenic scores for Schizophrenia show a signal of over-dispersion under the naive null hypothesis, but not after controlling for the effects of ascertainment. More generally, the ascertainment and selection against disease phenotypes pose difficulties for the interpretation of tests of dierentiation. Thus, although we see a signal of selection for decreased T2D polygenic scores in Europe, the interpretation of this signal likely requires the development of more explicit models of selection on disease traits (section S1.4).

The Geography of Selection on Height

In addition to the known gradient of increased polygenic height scores in northern Europeans relative to southern Europeans (latitude correlation within Europe p = 6.3 × 10−⁶, see S2 and Methods for statistical details),^{17, 18, 19, 20, 36} we also find evidence that that natural selection has impacted polygenic height scores well outside of modern Europe. Polygenic scores decline sharply from west to east across Eurasia in a way that cannot be predicted by a neutral model (longitude correlation across Eurasia, p = 4.46 × 10−¹⁵; Figure 1), and they are overdispersed within each of our four population clusters (north, south/central, east, and west) across Asia, as well among Native Americans (Figure 2). Does this broadly Eurasian signal represents multiple independent episodes of selection on the genetic basis of height, or can it be explained by ancient selection on one or just a few populations, with modern signals reflecting variation in the extent to which modern populations derive ancestry from these ancient populations? For example, the signal of selection on height in East Asia is driven entirely by the Tu population sample, who have the highest polygenic height score among East Asian samples (p = 0.4329 for height in East Asia after the Tu are removed). Does this unusually high polygenic score reflect recent selection, or the fact that the Tu derive a proportion of their ancestry from an ∼800-year-old admixture event involving a population resembling modern Europeans³⁷?

To test whether the height signal within Asia is due to a selective event shared with Europeans, we predicted the polygenic height scores across Asia given the deviation of European populations from the Asian mean, and each of the Asian sample’s genome-wide relationship to the European samples (see Figure 3, and Methods for details). We find that this prediction conditioned on Europeans are suffcient to explain most the divergence between the Tu and the other East Asian populations in our dataset (see sky blue dots in Figure 3), and eliminate the signal of selection among East Asian populations (p = 0.099 after conditioning). In fact, all signals of dierential selection on height across Asia can be eliminated using these conditional predictions (p = 0.2019 after conditioning). This suggests that most of the selected divergence in our polygenic height scores across Eurasia can be attributed either to events which are predominantly ancestral to modern Europeans (but which have impacted other regions via admixture), or which lie along an early lineage which has contributed ancestry broadly across Eurasia.

Figure 2: A heatmap showing the log10 p-values for the Q_X test statistic for over-dispersion of the polygenic scores for a trait among population samples.

The ‘All’ column gives the p-value in the combined Human Origin and 1000 Genomes dataset. See S2 and S1 for the definitions of the regional groupings. Each subsequent column gives the score in each geographic subregion. MCV: Mean red blood cell volume; MCHC: Mean cell hemoglobin concentration; LSBMD: Lumbar spine bone mineral density; FNBMD: Femoral neck bone mineral density; PCV: Packed red blood cell volume; MPV: Mean platelet volume.

Figure 3: Polygenic height scores in Asia are well-predicted by a model conditioned on European height scores, consistent with selection occurring in a shared ancestral population.

An individual population sample’s position along the x axis gives the genetic height score predicted on the basis of scores observed in Europe and their relatedness to the European samples, whereas their position along the y axis gives the true polygenic height score (see Methods for statistical details). The dashed line gives the one-to-one line along which all populations would fall if the predictions were perfectly accurate, whereas the vertical gray lines give population-specific 95% confidence intervals under genetic drift.

To gain further clarity about the history of selection on height, we examined polygenic height scores in a set of ancient DNA samples from Western Eurasia.^{19, 38, 39} In Figure 4A we plot estimates of the polygenic score through time for ancient and modern samples, and in Figure 4B a heatmap of signed p-values from our test of selection applied to pairs of populations (for more detail see Text S1.5). The earliest unambiguous signal of selection for increased height is found approximately 15,000 years ago in the Villabruna cluster of hunter-gatherers, who have significantly increased polygenic scores relative to earlier pleistocene hunter-gatherers (e.g. Villabruna vs Ust’-Ishim p = 0.0015, Villabruna vs Kostenki14 p = 0.0244, Villabruna vs Vestonice p = 0.003). The Mal’ta sample also appears to have an elevated polygenic score, on par with modern Europeans, but it is not significantly different from the earlier pleistocene hunter-gatherers in pairwise tests. Moving into the Holocene, the western, Scandanavian, and Caucasus hunter-gatherers (WHG, SHG, and CHG respectively) all have signficiantly increased polygenic height scores when compared to any of the early pleistocene hunter-gatherers. While WHG and SHG share a significant amount of ancestry with the Villabruna cluster, CHG do not, having separated approximately 46kya (along with Mal’ta and the Eastern hunter-gatherers: EHG) from the lineage leading to Villabruna/WHG.^{40, 38} Many ancient samples have ancestry nested within this split between Villabruna/WHG and CHG, but seemingly do not inherit a signal of selection for increased height (including pleistocene hunter-gatherers Kostenki14 and Vestonice^{41, 38}). It is therefore unlikely that the signals we observe can be traced to a single selective event common to Villabruna/WHG/SHG and to CHG. Instead, our results are potentially consistent with at least two independent episodes of selection for increased height among pleistocene and/or holocene hunter-gatherers: at least one in the west, affecting Villabruna, WHG, and SHG, and one in the east, affecting CHG (and potentially Mal’ta).

Figure 4: A) Polygenic height scores for ancient and the modern 1000 genomes population samples.

Each dot show the mean polygenic score for the labeled sample, and the error bars give the 95% confidence interval. The x coordinate of each sample is positioned at the mean of the calBP dates for the samples, plotted using a square root transfor to help visualize the spread of ancient populations. AEN, Anatolian Neolithic; WHG, Western hunter-gatherer; CEM, central European Early and Middle Neolithic; INC, Iberian Neolithic and Chalcolithic; CLB, central European Late Neolithic and Bronze Age; STP, steppe. B) A heat map of log10 p-values for pairwise Q_X tests, the p-values are signed by the difference in polygenic score (shades of red denotes the row sample having higher polygenic score than the column sample, and blue the converse)

The Yamnaya-related steppe samples (STP) also show a signal of selection for increased polygenic height scores (e.g. STP–Ust’-Ishim p = 0.001, STP–Vestonice p = 0.004).^{19, 42} This signal is likely due to the fact that they draw ∼45% of their ancestry from a population related to the CHG,¹⁹ who they are not significantly different from (STP–CHG p = 0.62). In turn, the central European Late Neolithic and Bronze Age samples (CLB, including the Corded Ware and Bell Beaker culture) share the high polygenic height signal, and draw much of their ancestry from the expansion of the Yamnaya Steppe people.^{43, 44} In contrast, many of the European and Near East early Neolithic samples show little dierence in scores relative to the early pleistocene hunter-gatherers and have significantly lower polygenic height scores than Villabruna/WHG/SHG and CHG samples and the populations with Yamnaya ancestry (e.g. Levant–SHG p = 0.001, Levant–CHG p = 0.01, Levant– STP p = 0.014). We do not find support for Mathieson and colleagues’¹⁹ suggestion of selection for reduced height in Iberian Neolithic samples relative to Anatolian Neolithic (p = 0.90, see also⁴²).

Taken together, our results suggest that much of the variation we observe among modern Eurasian populations for polygenic height scores can be traced to variation in the amount of the WHG and Yamnaya/CHG ancestry they have inherited. For example, modern Europeans can be described approximately as a mixture between WHG, Yamnaya, and early Neolithic farmers from Anatolia,⁴³ and the variation in the relative proportion of ancestry derived from these three sources explains a substantial amount of the variation in polygenic height scores (see Figure S10).^{19, 42} Similarly, Yamanaya/CHG ancestry decays from west to east across both northern and southern Asia,^{40, 44} consistent with the cline of decreasing polygenic height scores moving from west to east across the continent.

Finally, we note that we can reject neutrality in pairwise comparisons between modern East Asian populations and certain ancient samples that do not appear to be involved in the signal of selection for increased height in the west (e.g. CHB–EHG p = 0.004, CHB–Levant p = 0.014, Mal’ta–CHB p = 0.006). As these ancient populations are distantly related to one another, and show no other signals of selection on height, this may indicate that selection drove polygenic height scores down somewhere in the history of East Asians. However, the intepretation of this signal is complicated by the fact that we cannot completely exclude that polygenic height scores were selected up in these ancient populations. Clarifying this signal will likely require investigation via more explict models of human demographic history²³ as well as the incorporation of height GWAS from East Asia.

Selection on Body Shape Polygenic Scores

As four out of the next five strongest signals beyond height also represent anthropometric traits, we focus the remainder of our efforts on these phenotypes. Due to genetic correlations between traits, it is possible that signals of selection on two (or more) distinct phenotypes actually represent only a single episode of selection, where one trait responds indirectly to selection on the correlated trait. Because the genetic correlation with height varies among these phenotypes (hip circumference: r = 0.39, IHC: r = 0.268, waist circumference: r = 0.22, and WHR: r = −0.08),^{45, 46} we expect a priori that signals for more tightly correlated phenotypes are more likely due to a correlated response to selection on height, whereas for example the WHR signal is more likely to be independent.

To test whether the new signals we observe represent selective events distinguishable from the height signal, we developed a multi-trait extension to our null model based on the quantitativegenetic multivariate-selection model of Lande and Arnold⁴⁷ (see Methods and Supplementary Text Section S1.6). We condition on the observed polygenic height scores, and test whether the signal of selection on a second trait is still significant after accounting for a genetic correlation with height (a non-significant p-value is consistent with a correlated response to selection on height). Applying this test to our entire panel of populations, we find that conditioning on height ablates much of the signal for hip circumference (p = 0.0186 compared to p = 1.12 × 10−⁴ when not conditioning on height), whereas signals in IHC (p = 1.11 × 10−⁵ vs p = 5.37 × 10−⁸) and WHR (p = 3.57 × 10−⁸ vs p = 3.38 × 10−⁷) are less aected. Restricting to European populations only, height is better able to explain hip circumference (p = 0.1152 vs p = 3.4 × 10−3), waist circumference (p = 0.0104 vs p = 2.63 × 10−³), and IHC (p = 5.1 × 10−³ vs p = 1.41 × 10−⁸) signals, while the signal of selection on WHR again remains strong even after conditioning on height (p = 1.92 × 10−⁸ vs p = 6.03 × 10−¹⁰). WHR is genetically correlated within populations with hip (r = 0.316) and waist circumference (r = 0.729), but not with IHC (r = 0.01).^{45, 46} Conditioning on WHR is suffcient to explain waist circumference (global p = 0.1523 vs p = 3 × 10−³, Europe p = 0.5178 vs p = 2.6 × 10−³), but signals in HIP, IHC, and height are all independent of WHR (see Table S4). Together, these results suggest that we can distinguish the action of natural selection along a minimum of two phenotypic dimensions (i.e. height and WHR, or unmeasured phenotypes closely correlated to them). The signal of selection observed for hip circumference is likely due at least in part to selection on height, and the waist circumference signal is probably due to selection on a combination of height and WHR (or closely correlated phenotypes; we provide additional evidence for this claim in supplement section S1.6.2). Whereas IHC shows some evidence of being influenced by selection on height, a correlated response to height seems not to fully explain this signal.

Signals of divergence for both IHC and WHR polygenic scores are confined mostly to Europe and West Asia. For both traits the null model gives a significantly improved fit to the data when conditioned on Europe to explain West Asia and similar when conditioning on West Asia to explain Europe (Table S5). This suggests that, as is the case for Eurasian height scores, a substantial fraction of the divergence in IHC and WHR polygenic scores among modern populations across western Eurasia reflects divergence among ancient populations and subsequent mixture rather than recent selection.

Bergmann’s Rule and Thermoregulatory Adaptation

For both IHC and WHR, the selective signal in Western Eurasia can be captured in large part by strong, positive latitudinal clines (p = 3.16 × 10−¹⁵ for IHC and p = 3.16 × 10−⁷ for WHR; Figure 6). These clines in polygenic scores support independent phenotypic evidence for larger and wider bodies and rounder skulls at high latitudes,^{48, 1, 49, 2, 50, 51, 3} consistent with Bergmann’s Rule,^{52, 53} and add genetic support for a thermoregulatory hypothesis for morphological adaptation, whereby individuals in colder environments are thought to have adapted to improve heat conservation by decreasing their surface area to volume ratio.

A broad range of selective mechanisms have been proposed to act on height variation.⁵⁴ Because we do not detect any signal of selection on age at menarche, we think it unlikely that the height signal represents a correlated response due to life-history mediated selection on age at reproductive maturity.⁵⁵ It has also been suggested that selection on height may be explained as a thermoregulatory adaptation.⁵⁴ However, because the surface area to volume ratio is approximately independent of height,^{56, 2} the effect of height SNPs on this ratio is mediated almost entirely through their effect on circumference (hip and/or waist; see section S1.8). Because the signal of selection on height cannot be explained by conditioning on hip and waist circumference, it seems that the thermoregulation hypothesis cannot fully explain the signal of selection on height.

A second eco-geographic rule relevant to height is Allen’s rule,⁵⁷ which predicts relatively shorter limbs in colder environments, again consistent with adaptation on the basis of thermoregulation. In support of this, human populations in colder environments are observed to have proportionally shorter legs, compared to those in warmer environments.^{49, 58} However, we detect no signal of selection on polygenic scores for the ratio of sitting to standing height (SHR); a measure of leg length relative to total body height.⁵⁹ Indeed, by combining our height SNPs with their effect on SHR, we find a strong signal that both increases in leg length and torso length underlie the selective signal on height from North to South within Europe, and from East to West across Eurasia (see S1.9). This again suggests that thermoregulatory concerns are unlikely to fully explain signals of selection on height.

Discussion

The study of polygenic adaptation provides new avenues for the study of human evolution, and promises a new synthesis of physical anthropology and human genetics. Here, we undertake a broad scan for evidence of polygenic adaptive divergence among modern human populations, with body size and shape phenotypes providing most of our strongest signals. We show for the first time that it is possible to reject a neutral model of evolution at height associated loci in comparissons between populations outside of Europe. Using ancient DNA, we show that patterns seen across modern populations are consistent with two independent episodes of selection for increased height in pleistocene hunter-gatherer populations that lived in western and west-central Eurasia during or shortly after the last glacial maximum, and then distributed ancestry widely across the continent. We also provide evidence for adaptive divergence of IHC and WHR in western Eurasia, independent of selection on height, and show that signals of selection on hip and waist circumference can likely be explained as correlated responses to selection on height and WHR (or some other closely correlated phenotypes).

It is conspicuous that the signals of adaptive divergence that we detect are mostly localized to western Eurasia, even in cases where it seems implausible that observed phenotypic differences could have been generated under neutrality (e.g. Maasai vs Biaka pygmy). However, the fact that we do not detect departures from neutrality in such cases should not necessarily be taken as evidence against selection. We should expect to be better-powered to detect selective events in populations more closely related to Europeans for two reasons. First, changes in the structure of linkage disequilibrium (LD) across populations should lead GWAS variants to tag causal variation best in populations genetically close to the European-ancestry GWAS panels.⁶⁰ Second, gene-by-environment and gene-by-gene interactions can lead to changes in the additive effects of individual loci among populations,⁶¹ and therefore in the way that they respond to selection on the phenotype. We expect that these difficulties can be overcome or mitigated in the future through a combination of well-powered GWAS in multiple populations of non-European ancestry, access to a wider array of ancient DNA samples, and improved frameworks for the interpretation of signals of polygenic adaptation.²³

The existence of latitudinal trends in the polygenic scores for WHR and IHC support the notion that some of the clinal phenotypic variation in body shape typically thought to represent thermoregulatory adaptation can be attributed to genetic variation driven by selection, while the ability of simple models to unify signals across broad geographic regions again suggests that these patterns could have been generated by a limited number of selective events. Evidence for adaptation on the basis of specific environmental pressures is most convincing when multiple populations independently converge on the same phenotype in the face of the same environmental pressure, a pattern for which we currently lack evidence. Therefore, while our evidence is consistent with adaptation to temperature environments, alternative explanations (e.g. adaptation to diet) are plausible.

1 Methods

1.1 Population Genetics Datasets

We downloaded the 1000 genomes phase 3 release data from the 1000 genomes ftp portal.²⁵ We also used data from the Human Origins fully public panel²⁴ which was imputed from the 1000 Genomes phase 3 as reference, using the Michigan imputation server,⁶² and restricting to SNPs with an imputation quality score (in terms of predicted r²) of 0.8 or greater (pers. comm. Joe Pickrell). The original genotype data can be downloaded from the Reich lab website (https://reich.hms.harvard.edu/datasets).

This combined dataset represent samples from 2504 people from 26 populations in the 1000 Genomes dataset and 2158 people across 161 populations from the Human Origins dataset, for a total of 4662 samples from 187 populations (S2). For global analyses we include all 187 populations. In regional analyses we exclude populations with a significant recent (i.e. < 500 years) African/non-African admixture to avoid confounding admixture with signals of recent selection within regions (see S2 and S1 for the regions).

1.2 Selection of GWAS SNPs

We took public GWAS results for a set of traits²⁸ and combined them with additional anthropometric traits from the GIANT consortium and a subset of Early Growth phenotypes contributed by EGG Consortium. Table S1 gives a full list of the traits included in this study and the relevant references. For each trait we selected a set of SNPs with which to construct our polygenic scores as follows. For each SNP, we calculated an approximate Bayes factor summarizing the evidence for association at that SNP via the method of Wakefield,⁶³ following Pickrell et al (2016)²⁸ (see their supplementary note section 1.2.1). We then used a published set of 1700 non-overlapping linkage disequilibrium blocks²⁶ to divide the genome, after which we selected the single SNP with the strongest approximate Bayes factor in favor of association within each block to carry forward for analyses.

1.3 Polygenic Scores and Null Model

Given a set of L SNPs associated with a trait (L ≈ 1700), we construct the vector of polygenic scores across all M = 187 populations by taking the sum of allele frequencies across the L sites (the vector at site ℓ), weighting each allele’s frequency by its effect on the trait (αℓ) to give

For each trait, we construct a null model for the joint distribution of polygenic scores across populations, assuming where . Here population samples (weighting all population samples equally), and F is the M × M population-level genetic covariance matrix.¹⁸ All polygenic scores are plotted in centered standardized form . We use the Mahalanobis distance of from its distribution under the null as a natural test statistic to assess the ability of the null model to explain the data (see Berg and Coop (2014)¹⁸ for an extended discussion). This test statistic should be X² with M − 1 degrees of freedom under neutrality. However, in practice we are concerned that the ascertainment of GWAS loci may invalidate our null model, so we compare the test statistic to an empirical null (see Section S1.3)

1.4 Latitudinal and Longitudinal Correlations

We also test for selection-driven correlations between geographic variables (e.g. latitude) and a subset of our polygenic scores (see Berg and Coop (2014)¹⁸ and Section S1.1 for more details of the test). We take the standardized geographic variable and polygenic scores, and then rotate these vectors by the inverse Cholesky decomposition of the relatedness matrix F. These rotated vectors are in a reference frame where the populations represent independent contrasts under the neutral model. We take as our test statistic the covariance of these rotated vectors. We calculate the significance of the statistic by comparing to a null distribution generated by calculating null sets of polygenic scores assembled from resampled SNPs with derived frequency matched to the CEU population sample so as to mimic the effects of the GWAS ascertainment.

1.5 Analysis of Ancient DNA

We included a combined dataset of 63 Ancient Eurasian human population samples with date estimates from 45kya-2.5kya,^{19, 38, 39} combining these samples into pre-specified analysis clusters we took a set of 19 populations that had < 10% of height SNPs missing (see Table S7 for a list of ancient populations included). We compare these to the modern population samples from 1000 genome consortium data. We then took the subset of 724 of our 1700 height associated GWAS SNPs with low levels of missing data in these 19 ancient populations (6.2% averaged over populations).

Polygenic height scores were calculated as in Eq. (1), for loci with no counts in an ancient population we set to the frequency in the combined rest of sample. We construct the 95% credible intervals show in Figure 4A, by assuming that the the posterior of the underlying population frequency is independent across loci and populations and follows a beta distribution, with a uniform prior distribution, which is updated by our binomial sample of ancient counts. Using the variance of the posterior distribution at each locus, we then calculated the variance of the polygenic score (V_Z), which follows from Eq. (1). The 95% credible-interval error bars in Figure 4A were then calculated as for each population.

For calculating Q_X(eqn (3)) for pairs of population samples, we restricted the SNP set to the loci that had counts in both samples. Our p-values are calculated assuming that the pairwise Q_X statistic has a χ² distribution, with one degree of freedom. We also constructed a null by flipping the signs of the GWAS effect of the loci at random, and found the χ² p-values to be well callibrated.

1.6 Two-Trait Conditional Tests

Because some of the traits we examine are genetically correlated with one another, we were concerned that signals of selection observed for one trait might reflect a response to selection on another correlated trait. To determine whether genetic correlations might be responsible for some of our signals, we developed a multitrait extension to our neutral model that accounts for genetic covariance among traits. The extension is on the framework of Lande and Arnold.⁴⁷

If are vectors of polygenic scores for two different traits constructed according to equation (1), and the matrix contains these vectors as columns, then under neutrality the distribution of Z is approximately matrix normal where the matrix µ contains the trait-specific means, F gives the population covariance structure among rows as in the single trait model, and G is the among trait additive genetic covariance matrix, the “G matrix” of multivariate quantitative genetics,⁴⁷ estimated for a population ancestral to all populations in the sample. The diagonal elements of the 2 × 2 G matrix are given by the V_A parameters from above in the single trait model and the o-diagonal element (C_A,12) corresponds to the additive genetic covariance between the two traits. Given this null model for the joint distribution of the two traits, we can construct a conditional model for the distribution of polygenic scores for trait 1, given the polygenic score observed for trait 2, as

Given a value of C_A,12 we can then use these conditional means and variances in equation (3) to form a conditional Q_X statistic and compare it to its null distribution. We take the failure to reject neutrality on the basis of the conditional Q_X statistic as consistent with the hypothesis that any response to selection observed for trait 1 is a result of selection on trait 2. Some of the traits we study have non-linear allometric relationships with each other, but because our polygenic scores are linear by construction our tests are robust to this non-linearity (see S1.7).

We experimented with estimating C_A,12 on the basis of SNPs that overlap between the two traits in each genomic block. However, we were concerned about this approach to estimating genetic correlations not being a suffcient joint model for cases in which different SNPs within a block affected the two traits but were in linkage disequilibrium with one another, and therefore do not drift independently. To deal with this issue, we represent the genetic covariance among populations as where ρ represents the genetic correlation between the two sets of polygenic scores. We pursued a conservative strategy, testing a range of values for ρ along a dense grid from −1 to 1 to ask whether any assumed genetic correlation between polygenic scores could plausibly allow one trait to be explained as a correlated response to another. As a further conservative measure, we allowed the genetic correlation used to calculate the conditional variance (Eq (7)) to be equal to zero, while allowing the ρ used to compute the conditional mean (Eq (6)) was not. This is a conservative approach, as it fits our conditional prediction to the mean, but allows the variance of the null model to remain as large as the unconditional model. The conditional two-trait p-values we present in the text, and the CI shown in two-trait Figure 5 and in the supplement, use this conservative approach. In practice our values of ρ are consistent with estimates of genetic correlations obtained from the LDscore approach,^{45, 46} given that our polygenic scores capture only a fraction of the total genetic variance for each trait.

Figure 5: The overdispersion of genetic HIP scores among populations can be explained as a correlated response to selection on height, but such an effect cannot explain the signal of selection on the WHR polygenic scores.

A) The observed polygenic HIP score (y axis) plotted against the height polygenic scores (x axis). We show only Western Eurasian population samples (blue dots: Europe; green dots: West Asia), as it is these samples which drive the majority of the signal. The line gives the best prediction for each sample’s polygenic HIP score according to the model of a correlated response to selection on height. Vertical lines give the 95% confidence interval of this prediction for each sample under this model. Most populations’ polygenic HIP scores lie within their confidence intervals, consistent with our failure to reject this conditional null model (main text). B) The same as A but now giving polygenic WHR scores rather than HIP. Note that for many populations the WHR scores lie outside of their 95% CI predictions based on genetic drift and correlated selection on height alone, consistent with the inability of this model to fully capture variation in polygenic WHR scores (main text)

Figure 6:

Genetic IHC, WC, and WHR score plotted against Latitude for the Western Eurasian population samples. The points are colored East to West (blue to yellow).

1.7 Single Trait Conditional Null Model

We also developed an extension of the null model for a single trait to test whether two (or more) signals of selection detected in different geographic regions might reflect a single ancestral event that occurred in an ancient population that has contributed ancestry broadly to modern populations.

Assume for example that we have detected a signal of selection among the population samples from region A (e.g. Europe) and among the population samples from (e.g. Asia), and we would like to test whether the signal detected in region B is due to a selective event that is also responsible for generating a signal of selection in region A. We first reorganize our samples into two blocks for the two regions

Where µ_B is the mean polygenic score in the set of populations being tested, the F•,•s refer to the sub-matrices of the relatedness matrix F, and F itself has been recentered at the mean of the test set (i.e. region B). Then the conditional distribution of polygenic scores in region B given the polygenic scores observed in region A is

The conditional mean, reflects the best predictions of population means in region B given the values observed in region A, whereas the conditional covariance matrix F_B|A reflects the scale and form of the variance around this expectation that arises from drift that is independent of drift in the ancestry of populations in region A.

We can then test for over-dispersion of polygenic score in region B given the observed polygenic scores in region A by using in (3) to construct a conditional Q_X score. We judge the statistical significance of this conditional Q_X score by comparing it to a frequency matched dataset, as with the standard test. We interpret a non-significant conditional Q_X score for region B as evidence that any selective signal of overdispersion in B is well explained by genome-wide allele-sharing with A. We view this as evidence that the selection signal in B overlaps that in A, due to selection in shared ancestral populations and admixture.

In Figure 3 we plot the observed polygenic scores for Asia against the predicted polygenic scores for Asia (B), conditional on the Europe population sample polygenic scores (A). The error bars are 95% CIs for each population sample, obtained from the variances on the diagonal of V_AF_B|A.

Acknowledgements

We thank the Coop Lab and Doc Edge, Iain Mathieson, Emily Josephs, Joe Pickrell, Molly Przeworski, David Reich, Je Ross-Ibarra, Guy Sella, and Tim Weaver for helpful discussions and feedback on earlier drafts. The work was supported in part by an NSF GRFP (to JJB), the UC Davis Anthropology department (XZ), and NIGMS-NIH RO1 grants GM108779 to GC. JJB was also supported in part by R01 grants GM115889 to Guy Sella and GM121372 to Molly Przeworski.

References

↵
Roberts, D. F. Body weight, race and climate. American Journal of Physical Anthropology 11, 533–558 (1953).
OpenUrl CrossRef PubMed
↵
Ruff, C. B. Morphological adaptation to climate in modern and fossil hominids. Am. J. Phys. Anthropol. (1994).
↵
Savell, K. R. R., Auerbach, B. M. & Roseman, C. C. Constraint, natural selection, and the evolution of human body form. Proc. Natl. Acad. Sci. U.S.A. 113, 9492–9497 (2016).
OpenUrl Abstract/FREE Full Text
↵
Bogin, B., Smith, P., Orden, A. B., Varela Silva, M. I. & Loucky, J. Rapid change in height and body proportions of Maya American children. Am. J. Hum. Biol. 14, 753–761 (2002).
OpenUrl CrossRef PubMed Web of Science
↵
Serrat, M. A., King, D. & Lovejoy, C. O. Temperature regulates limb length in homeotherms by directly modulating cartilage growth. Proc. Natl. Acad. Sci. U.S.A. 105, 19348–19353 (2008).
OpenUrl Abstract/FREE Full Text
↵
Pujol, B., Wilson, A., Ross, R. & Pannell, J. Are QST –FST comparisons for natural populations meaningful? Molecular Ecology 17, 4782–4785 (2008).
OpenUrl CrossRef PubMed Web of Science
↵
Rogers, A. R. & Harpending, H. C. Population structure and quantitative characters. Genetics 105, 985–1002 (1983).
OpenUrl Abstract/FREE Full Text
↵
Relethford, J. H. Craniometric variation among modern human populations. American Journal of Physical Anthropology 95, 53–62 (1994).
OpenUrl CrossRef PubMed Web of Science
↵
Relethford, J. H. Apportionment of global human genetic diversity based on craniometrics and skin color. American Journal of Physical Anthropology 118, 393–398 (2002).
OpenUrl CrossRef PubMed Web of Science
↵
Tishkoff, S. Strength in small numbers. Science (2015).
↵
Fan, S., Hansen, M. E. B., Lo, Y. & Tishkoff, S. A. Going global by adapting local: A review of recent human adaptation. Science 354, 54–59 (2016).
OpenUrl Abstract/FREE Full Text
↵
Pritchard, J. K. & Di Rienzo, A. Adaptation–not by sweeps alone. Nat Rev Genet (2010).
↵
Pritchard, J. K., Pickrell, J. K. & Coop, G. The genetics of human adaptation: hard sweeps, soft sweeps, and polygenic adaptation. Current biology (2010).
↵
Visscher, P. M., Brown, M. A. & McCarthy, M. I. Five years of GWAS discovery. Am. J. Hum. Genet. (2012).
↵
Price, A. L., Spencer, C. C. A. & Donnelly, P. Progress and promise in understanding the genetic basis of common diseases. Proc. R. Soc. B 282, 20151684–10 (2015).
OpenUrl CrossRef PubMed
↵
Boyle, E. A., Li, Y. I. & Pritchard, J. K. An expanded view of complex traits: from polygenic to omnigenic. Cell (2017).
↵
Turchin, M. C., Chiang, C. & Palmer, C. D. Evidence of widespread selection on standing variation in Europe at height-associated SNPs. Nature (2012).
↵
Berg, J. J. & Coop, G. A population genetic signal of polygenic adaptation. PLoS Genet (2014).
↵
Mathieson, I. et al. Genome-wide patterns of selection in 230 ancient Eurasians. Nat. Gen. 528, 499–503 (2015).
OpenUrl
↵
Robinson, M. R., Hemani, G. & Medina-Gomez, C. Population genetic differentiation of height and body mass index across Europe. Nature (2015).
↵
Hansen, M. E. B. et al. Shorter telomere length in Europeans than in Africans due to polygenetic adaptation. Hum. Mol. Genet. 25, 2324–2330 (2016).
OpenUrl CrossRef PubMed
↵
Field, Y. et al. Detection of human adaptation during the past 2000) years. Science 354, 760–764 (2016).
OpenUrl Abstract/FREE Full Text
↵
Racimo, F., Berg, J. J. & Pickrell, J. K. Detecting polygenic adaptation in admixture graphs. bioRxiv 146043 (2017).
↵
Lazaridis, I. et al. Ancient human genomes suggest three ancestral populations for present-day Europeans. Nat. Gen. 513, 409–413 (2014).
OpenUrl
↵
1000 Genomes Project Consortium. A global reference for human genetic variation. Nature 526, 68 (2015).
OpenUrl CrossRef PubMed
↵
Berisa, T. & Pickrell, J. K. Approximately independent linkage disequilibrium blocks in human populations. Bioinformatics (2016).
↵
Pickrell, J. K. Joint analysis of functional genomic data and genome-wide association studies of 18 human traits. Am. J. Hum. Genet. (2014).
↵
Pickrell, J. K., Berisa, T., Liu, J. Z., Ségurel, L. & Tung, J. Y. Detection and interpretation of shared genetic influences on 42 human traits. Nature (2016).
↵
Martin, A. R. et al. Human Demographic History Impacts Genetic Risk Prediction across Diverse Populations. Am. J. Hum. Genet. 100, 635–649 (2017).
OpenUrl CrossRef
Wright, S. The genetical structure of populations. Ann Eugen 15, 323–354 (1951).
OpenUrl PubMed Web of Science
↵
Nicholson, G. et al. Assessing population differentiation and isolation from single-nucleotide polymorphism data. J. R. Stat. Soc. 64, 695–715 (2002).
OpenUrl
Prout, T. & Barker, J. S. F statistics in Drosophila buzzatii: selection, population size and inbreeding. Genetics 134, 369–375 (1993).
OpenUrl Abstract/FREE Full Text
Spitze, K. Population structure in Daphnia obtusa: quantitative genetic and allozymic variation. Genetics 135, 367–374 (1993).
OpenUrl Abstract/FREE Full Text
Ovaskainen, O., Karhunen, M., Zheng, C., Arias, J. M. C. & Merila, J. A New Method to Uncover Signatures of Divergent and Stabilizing Selection in Quantitative Traits. Genetics 189, 621–632 (2011).
OpenUrl Abstract/FREE Full Text
↵
Bulik-Sullivan, B. K. et al. LD Score regression distinguishes confounding from polygenicity in genome-wide association studies. Nat. Gen. 47, 291–295 (2015).
OpenUrl CrossRef PubMed
↵
Zoledziewska, M., Sidore, C., Chiang, C. & Sanna, S. Height-reducing variants and selection for short stature in Sardinia. Nature (2015).
↵
Hellenthal, G. et al. A genetic atlas of human admixture history. Science 343, 747–751 (2014).
OpenUrl Abstract/FREE Full Text
↵
Fu, Q. et al. The genetic history of Ice Age Europe. Nature (2016).
↵
Lazaridis, I. et al. Genomic insights into the origin of farming in the ancient near east. Nature 536, 419–424 (2016).
OpenUrl CrossRef PubMed
↵
Jones, E. R. et al. Upper Palaeolithic genomes reveal deep roots of modern Eurasians. Nature Communications 6, 1–8 (2015).
OpenUrl
↵
Seguin-Orlando, A. et al. Paleogenomics. Genomic structure in Europeans dating back at least 36,200 years. Science 346, 1113–1118 (2014).
OpenUrl Abstract/FREE Full Text
↵
Martiniano, R. et al. The population genomics of archaeological transition in west iberia. bioRxiv (2017).
↵
Haak, W. et al. Massive migration from the steppe was a source for Indo-European languages in Europe. Nature 522, 207–211 (2015).
OpenUrl CrossRef PubMed
↵
Allentoft, M. E. et al. Population genomics of Bronze Age Eurasia. Nature 522, 167–172 (2015).
OpenUrl CrossRef PubMed
↵
Bulik-Sullivan, B., Finucane, H. K., Anttila, V. & Gusev, A. An atlas of genetic correlations across human diseases and traits. Nature (2015).
↵
Zheng, J. et al. LD Hub: a centralized database and web interface to perform LD score regression that maximizes the potential of summary level GWAS data for SNP heritability and genetic correlation analysis. Bioinformatics 33, 272–279 (2017).
OpenUrl CrossRef PubMed
↵
Lande, R. & Arnold, S. J. The measurement of selection on correlated characters. Evolution 37, 1210–1226 (1983).
OpenUrl CrossRef PubMed Web of Science
↵
Schreider, E. Geographical distribution of the body-weight/body-surface ratio. Nature 165, 286 (1950).
OpenUrl PubMed
↵
Roberts, D. Climate and human variability (Addison-Wesley, 1973).
↵
Ruff, C. Variation in Human Body Size and Shape. Annu. Rev. Anthropol. 31, 211–232 (2002).
OpenUrl CrossRef Web of Science
↵
Katz, D. C., Grote, M. N. & Weaver, T. D. A mixed model for the relationship between climate and human cranial form. Am. J. Phys. Anthropol. 160, 593–603 (2015).
OpenUrl
↵
Bergmann, C. Über die Verhältnisse der Wärmeökonomie der Thiere zu ihrer Grösse. Göttinger Studien 3, 595–708 (1847).
OpenUrl
↵
Mayr, E. Geographical character gradients and climatic adaptation. Evolution 10, 105–108 (1956). URL http://www.jstor.org/stable/2406103.
OpenUrl CrossRef Web of Science
↵
Stulp, G. & Barrett, L. Evolutionary perspectives on human height variation. Biol Rev 91, 206–234 (2014).
OpenUrl
↵
Stearns, S. C., Govindaraju, D. R., Ewbank, D. & Byars, S. G. Constraints on the coevolution of contemporary human males and females. Proceedings of the Royal Society of London B: Biological Sciences 279, 4836–4844 (2012). URL http://rspb.royalsocietypublishing.org/content/279/1748/4836. http://rspb.royalsocietypublishing.org/content/279/1748/4836.full.pdf.
OpenUrl CrossRef PubMed
↵
Ruff, C. B. Climate and body shape in hominid evolution. Journal of Human Evolution 21, 81–105 (1991).
OpenUrl CrossRef Web of Science
↵
Allen, J. A. The Influence of Physical Conditions in the Genesis of Species. Radical Review 1, 108–140 (1877).
OpenUrl
↵
Katzmarzyk, P. T. & Leonard, W. R. Climatic influences on human body size and proportions: ecological adaptations and secular trends. Am. J. Phys. Anthropol. 106, 483–503 (1998).
OpenUrl CrossRef PubMed Web of Science
↵
Chan, Y. et al. Genome-wide Analysis of Body Proportion Classifies Height-Associated Variants by Mechanism of Action and Implicates Genes Important for Skeletal Development. Am. J. Hum. Genet. 96, 695–708 (2015).
OpenUrl
↵
Palmer, C. & Pe’er, I. Statistical correction of the winner’s curse explains replication variability in quantitative trait genome-wide association studies. bioRxiv (2017).
↵
Brown, B. C. et al. Transethnic genetic-correlation estimates from summary statistics. The American Journal of Human Genetics 99, 76–88 (2016).
OpenUrl CrossRef PubMed
↵
Das, S. et al. Next-generation genotype imputation service and methods. Nature genetics 48, 1284–1287 (2016).
OpenUrl CrossRef PubMed
↵
Wakefield, J. Bayes factors for genome-wide association studies: comparison with P-values. Genet. Epidemiol. 33, 79–86 (2009).
OpenUrl CrossRef PubMed Web of Science
Perry, J. R. B. et al. Parent-of-origin-specific allelic associations among 106 genomic loci for age at menarche. Nature 514, 92–97 (2014).
OpenUrl CrossRef PubMed Web of Science
Lambert, J. C., Ibrahim-Verbaas, C. A., Harold, D. & Naj, A. C. Meta-analysis of 74,046 individuals identifies 11 new susceptibility loci for Alzheimer’s disease. Nature (2013).
van der Valk, R. J. P. et al. A novel common variant in DCST2 is associated with length in early life and height in adulthood. Hum. Mol. Genet. 24, 1155–1168 (2014).
OpenUrl PubMed
Horikoshi, M., Yaghootkar, H. & Mook-Kanamori, D. O. New loci associated with birth weight identify genetic links between intrauterine growth and adult height and metabolism. Nature (2013).
Locke, A. E. et al. Genetic studies of body mass index yield new insights for obesity biology. Nature 518, 197–206 (2015).
OpenUrl CrossRef PubMed
Schunkert, H., König, I. R., Kathiresan, S. & Reilly, M. P. Large-scale association analysis identifies 13 new susceptibility loci for coronary artery disease. Nature (2011).
Jostins, L. et al. Host-microbe interactions have shaped the genetic architecture of inflammatory bowel disease. Nature 491, 119–124 (2012).
OpenUrl CrossRef PubMed Web of Science
Manning, A. K., Hivert, M. F., Scott, R. A. & Grimsby, J. L. A genome-wide approach accounting for body mass index identifies genetic variants influencing fasting glycemic traits and insulin resistance. Nature (2012).
Estrada, K., Styrkarsdottir, U., Evangelou, E. & Hsu, Y. H. Genome-wide meta-analysis identifies 56 bone mineral density loci and reveals 14 loci associated with risk of fracture. Nature (2012).
van der Harst, P. et al. Seventy-five genetic loci influencing the human red blood cell. Nature 492, 369–375 (2012).
OpenUrl CrossRef PubMed Web of Science
Teslovich, T. M. et al. Biological, clinical and population relevance of 95 loci for blood lipids. Nature 466, 707–713 (2010).
OpenUrl CrossRef PubMed Web of Science
Wood, A. R., Esko, T., Yang, J., Vedantam, S. & Pers, T. H. Defining the role of common variation in the genomic and biological architecture of adult human height. Nature (2014).
Cousminer, D. L. et al. Genome-wide association and longitudinal analyses reveal genetic loci linking pubertal height growth, pubertal timing and childhood adiposity. Hum. Mol. Genet. 22, 2735–2747 (2013).
OpenUrl CrossRef PubMed Web of Science
Shungin, D. et al. New genetic loci link adipose and insulin biology to body fat distribution. Nature 518, 187–196 (2015).
OpenUrl CrossRef PubMed Web of Science
Taal, H. R. et al. Common variants at 12q15 and 12q24 are associated with infant head circumference. Nat Genet 44, 532–538 (2012).
OpenUrl CrossRef PubMed
Gieger, C. et al. New gene functions in megakaryopoiesis and platelet formation. Nature 480, 201–208 (2011).
OpenUrl CrossRef PubMed Web of Science
Okada, Y. et al. Genetics of rheumatoid arthritis contributes to biology and drug discovery. Nature 506, 376–381 (2014).
OpenUrl CrossRef PubMed Web of Science
Ripke, S. et al. Biological insights from 108 schizophrenia-associated genetic loci. Nature 511, 421–427 (2014).
OpenUrl CrossRef PubMed Web of Science
Chan, Y., Salem, R. M., Hsu, Y. & McMahon, G. Genome-wide analysis of body proportion classifies height-associated variants by mechanism of action and implicates genes important for skeletal Am. J. Hum. Genet. (2015).
Morris, A. P., Voight, B. F., Teslovich, T. M. & Ferreira, T. Large-scale association analysis provides insights into the genetic architecture and pathophysiology of type 2 diabetes. Nature (2012).
↵
Pickrell, J. K. & Pritchard, J. K. Inference of population splits and mixtures from genome-wide allele frequency data. PLoS Genet (2012).
↵
Zhao, L., Lascoux, M., Overall, A. & Waxman, D. The characteristic trajectory of a fixing allele: A consequence of fictitious selection that arises from conditioning. Genetics (2013).
↵
Kremer, A. & Le Corre, V. Decoupling of differentiation between traits and their underlying genes in response to divergent selection. Heredity (Edinb) 108, 375–385 (2012).
OpenUrl
↵
Le Corre, V. & Kremer, A. The genetic differentiation at quantitative trait loci under local adaptation. Mol. Ecol. (2012).
↵
Chan, Y., Lim, E. T., Sandholm, N. & Wang, S. R. An excess of risk-increasing low-frequency variants can be a signal of polygenic inheritance in complex diseases. Am. J. Hum. Genet. (2014).
↵
Mathieson, I. Selection on height in europe. http://mathii.github.io/review/2015/10/21/selection-on-height-in-europe (2015).
↵
Huxley, J. Problems of Relative Growth (Methuen, London, 1932).
↵
Huxley, J. S. & Teissier, G. Terminology of relative growth. Nature 137, 780–781 (1936).
OpenUrl CrossRef
↵
Cheverud, J. M. Relationships among ontogenetic, static, and evolutionary allometry. American Journal of Physical Anthropology 59, 139–149 (1982). URL http://dx.doi.org/10.1002/ajpa.1330590204.
OpenUrl CrossRef PubMed Web of Science
↵
Lande, R. Quantitative genetic analysis of multivariate evolution, applied to brain: body size allometry. Evolution 402–416 (1979).
↵
Rice, S. H. The evolution of canalization and the breaking of von baer’s laws: Modeling the evolution of development with epistasis. Evolution 52, 647–656 (1998). URL http://www.jstor.org/stable/2411260.
OpenUrl CrossRef Web of Science
↵
Nieuwboer, H. A., Pool, R., Dolan, C. V., Boomsma, D. I. & Nivard, M. G. GWIS: Genome-Wide Inferred Statistics for Functions of Multiple Phenotypes. Am. J. Hum. Genet. 99, 917–927 (2016). URL http://www.sciencedirect.com/science/article/pii/S0002929716303214.
OpenUrl

View the discussion thread.

Posted April 18, 2019.

Download PDF

Supplementary Material

Citation Tools

Subject Area

Evolutionary Biology

Subject Areas

All Articles

Animal Behavior and Cognition (5201)
Biochemistry (11718)
Bioengineering (8724)
Bioinformatics (29132)
Biophysics (14936)
Cancer Biology (12051)
Cell Biology (17360)
Clinical Trials (138)
Developmental Biology (9406)
Ecology (14146)
Epidemiology (2067)
Evolutionary Biology (18269)
Genetics (12223)
Genomics (16768)
Immunology (11844)
Microbiology (28016)
Molecular Biology (11560)
Neuroscience (60822)
Paleontology (450)
Pathology (1864)
Pharmacology and Toxicology (3231)
Physiology (4940)
Plant Biology (10401)
Scientific Communication and Education (1680)
Synthetic Biology (2878)
Systems Biology (7333)
Zoology (1642)

[1] ↵
Roberts, D. F. Body weight, race and climate. American Journal of Physical Anthropology 11, 533–558 (1953).
OpenUrl CrossRef PubMed

[2] ↵
Ruff, C. B. Morphological adaptation to climate in modern and fossil hominids. Am. J. Phys. Anthropol. (1994).

[3] ↵
Savell, K. R. R., Auerbach, B. M. & Roseman, C. C. Constraint, natural selection, and the evolution of human body form. Proc. Natl. Acad. Sci. U.S.A. 113, 9492–9497 (2016).
OpenUrl Abstract/FREE Full Text

[4] ↵
Bogin, B., Smith, P., Orden, A. B., Varela Silva, M. I. & Loucky, J. Rapid change in height and body proportions of Maya American children. Am. J. Hum. Biol. 14, 753–761 (2002).
OpenUrl CrossRef PubMed Web of Science

[5] ↵
Serrat, M. A., King, D. & Lovejoy, C. O. Temperature regulates limb length in homeotherms by directly modulating cartilage growth. Proc. Natl. Acad. Sci. U.S.A. 105, 19348–19353 (2008).
OpenUrl Abstract/FREE Full Text

[6] ↵
Pujol, B., Wilson, A., Ross, R. & Pannell, J. Are QST –FST comparisons for natural populations meaningful? Molecular Ecology 17, 4782–4785 (2008).
OpenUrl CrossRef PubMed Web of Science

[7] ↵
Rogers, A. R. & Harpending, H. C. Population structure and quantitative characters. Genetics 105, 985–1002 (1983).
OpenUrl Abstract/FREE Full Text

[8] ↵
Relethford, J. H. Craniometric variation among modern human populations. American Journal of Physical Anthropology 95, 53–62 (1994).
OpenUrl CrossRef PubMed Web of Science

[9] ↵
Relethford, J. H. Apportionment of global human genetic diversity based on craniometrics and skin color. American Journal of Physical Anthropology 118, 393–398 (2002).
OpenUrl CrossRef PubMed Web of Science

[10] ↵
Tishkoff, S. Strength in small numbers. Science (2015).

[11] ↵
Fan, S., Hansen, M. E. B., Lo, Y. & Tishkoff, S. A. Going global by adapting local: A review of recent human adaptation. Science 354, 54–59 (2016).
OpenUrl Abstract/FREE Full Text

[12] ↵
Pritchard, J. K. & Di Rienzo, A. Adaptation–not by sweeps alone. Nat Rev Genet (2010).

[13] ↵
Pritchard, J. K., Pickrell, J. K. & Coop, G. The genetics of human adaptation: hard sweeps, soft sweeps, and polygenic adaptation. Current biology (2010).

[14] ↵
Visscher, P. M., Brown, M. A. & McCarthy, M. I. Five years of GWAS discovery. Am. J. Hum. Genet. (2012).

[15] ↵
Price, A. L., Spencer, C. C. A. & Donnelly, P. Progress and promise in understanding the genetic basis of common diseases. Proc. R. Soc. B 282, 20151684–10 (2015).
OpenUrl CrossRef PubMed

[16] ↵
Boyle, E. A., Li, Y. I. & Pritchard, J. K. An expanded view of complex traits: from polygenic to omnigenic. Cell (2017).

[17] ↵
Turchin, M. C., Chiang, C. & Palmer, C. D. Evidence of widespread selection on standing variation in Europe at height-associated SNPs. Nature (2012).

[18] ↵
Berg, J. J. & Coop, G. A population genetic signal of polygenic adaptation. PLoS Genet (2014).

[19] ↵
Mathieson, I. et al. Genome-wide patterns of selection in 230 ancient Eurasians. Nat. Gen. 528, 499–503 (2015).
OpenUrl

[20] ↵
Robinson, M. R., Hemani, G. & Medina-Gomez, C. Population genetic differentiation of height and body mass index across Europe. Nature (2015).

[21] ↵
Hansen, M. E. B. et al. Shorter telomere length in Europeans than in Africans due to polygenetic adaptation. Hum. Mol. Genet. 25, 2324–2330 (2016).
OpenUrl CrossRef PubMed

[22] ↵
Field, Y. et al. Detection of human adaptation during the past 2000) years. Science 354, 760–764 (2016).
OpenUrl Abstract/FREE Full Text

[23] ↵
Racimo, F., Berg, J. J. & Pickrell, J. K. Detecting polygenic adaptation in admixture graphs. bioRxiv 146043 (2017).

[24] ↵
Lazaridis, I. et al. Ancient human genomes suggest three ancestral populations for present-day Europeans. Nat. Gen. 513, 409–413 (2014).
OpenUrl

[25] ↵
1000 Genomes Project Consortium. A global reference for human genetic variation. Nature 526, 68 (2015).
OpenUrl CrossRef PubMed

[26] ↵
Berisa, T. & Pickrell, J. K. Approximately independent linkage disequilibrium blocks in human populations. Bioinformatics (2016).

[27] ↵
Pickrell, J. K. Joint analysis of functional genomic data and genome-wide association studies of 18 human traits. Am. J. Hum. Genet. (2014).

[28] ↵
Pickrell, J. K., Berisa, T., Liu, J. Z., Ségurel, L. & Tung, J. Y. Detection and interpretation of shared genetic influences on 42 human traits. Nature (2016).

[29] ↵
Martin, A. R. et al. Human Demographic History Impacts Genetic Risk Prediction across Diverse Populations. Am. J. Hum. Genet. 100, 635–649 (2017).
OpenUrl CrossRef

[30] Wright, S. The genetical structure of populations. Ann Eugen 15, 323–354 (1951).
OpenUrl PubMed Web of Science

[31] ↵
Nicholson, G. et al. Assessing population differentiation and isolation from single-nucleotide polymorphism data. J. R. Stat. Soc. 64, 695–715 (2002).
OpenUrl

[32] Prout, T. & Barker, J. S. F statistics in Drosophila buzzatii: selection, population size and inbreeding. Genetics 134, 369–375 (1993).
OpenUrl Abstract/FREE Full Text

[33] Spitze, K. Population structure in Daphnia obtusa: quantitative genetic and allozymic variation. Genetics 135, 367–374 (1993).
OpenUrl Abstract/FREE Full Text

[34] Ovaskainen, O., Karhunen, M., Zheng, C., Arias, J. M. C. & Merila, J. A New Method to Uncover Signatures of Divergent and Stabilizing Selection in Quantitative Traits. Genetics 189, 621–632 (2011).
OpenUrl Abstract/FREE Full Text

[35] ↵
Bulik-Sullivan, B. K. et al. LD Score regression distinguishes confounding from polygenicity in genome-wide association studies. Nat. Gen. 47, 291–295 (2015).
OpenUrl CrossRef PubMed

[36] ↵
Zoledziewska, M., Sidore, C., Chiang, C. & Sanna, S. Height-reducing variants and selection for short stature in Sardinia. Nature (2015).

[37] ↵
Hellenthal, G. et al. A genetic atlas of human admixture history. Science 343, 747–751 (2014).
OpenUrl Abstract/FREE Full Text

[38] ↵
Fu, Q. et al. The genetic history of Ice Age Europe. Nature (2016).

[39] ↵
Lazaridis, I. et al. Genomic insights into the origin of farming in the ancient near east. Nature 536, 419–424 (2016).
OpenUrl CrossRef PubMed

[40] ↵
Jones, E. R. et al. Upper Palaeolithic genomes reveal deep roots of modern Eurasians. Nature Communications 6, 1–8 (2015).
OpenUrl

[41] ↵
Seguin-Orlando, A. et al. Paleogenomics. Genomic structure in Europeans dating back at least 36,200 years. Science 346, 1113–1118 (2014).
OpenUrl Abstract/FREE Full Text

[42] ↵
Martiniano, R. et al. The population genomics of archaeological transition in west iberia. bioRxiv (2017).

[43] ↵
Haak, W. et al. Massive migration from the steppe was a source for Indo-European languages in Europe. Nature 522, 207–211 (2015).
OpenUrl CrossRef PubMed

[44] ↵
Allentoft, M. E. et al. Population genomics of Bronze Age Eurasia. Nature 522, 167–172 (2015).
OpenUrl CrossRef PubMed

[45] ↵
Bulik-Sullivan, B., Finucane, H. K., Anttila, V. & Gusev, A. An atlas of genetic correlations across human diseases and traits. Nature (2015).

[46] ↵
Zheng, J. et al. LD Hub: a centralized database and web interface to perform LD score regression that maximizes the potential of summary level GWAS data for SNP heritability and genetic correlation analysis. Bioinformatics 33, 272–279 (2017).
OpenUrl CrossRef PubMed

[47] ↵
Lande, R. & Arnold, S. J. The measurement of selection on correlated characters. Evolution 37, 1210–1226 (1983).
OpenUrl CrossRef PubMed Web of Science

[48] ↵
Schreider, E. Geographical distribution of the body-weight/body-surface ratio. Nature 165, 286 (1950).
OpenUrl PubMed

[49] ↵
Roberts, D. Climate and human variability (Addison-Wesley, 1973).

[50] ↵
Ruff, C. Variation in Human Body Size and Shape. Annu. Rev. Anthropol. 31, 211–232 (2002).
OpenUrl CrossRef Web of Science

[51] ↵
Katz, D. C., Grote, M. N. & Weaver, T. D. A mixed model for the relationship between climate and human cranial form. Am. J. Phys. Anthropol. 160, 593–603 (2015).
OpenUrl

[52] ↵
Bergmann, C. Über die Verhältnisse der Wärmeökonomie der Thiere zu ihrer Grösse. Göttinger Studien 3, 595–708 (1847).
OpenUrl

[53] ↵
Mayr, E. Geographical character gradients and climatic adaptation. Evolution 10, 105–108 (1956). URL http://www.jstor.org/stable/2406103.
OpenUrl CrossRef Web of Science

[54] ↵
Stulp, G. & Barrett, L. Evolutionary perspectives on human height variation. Biol Rev 91, 206–234 (2014).
OpenUrl

[55] ↵
Stearns, S. C., Govindaraju, D. R., Ewbank, D. & Byars, S. G. Constraints on the coevolution of contemporary human males and females. Proceedings of the Royal Society of London B: Biological Sciences 279, 4836–4844 (2012). URL http://rspb.royalsocietypublishing.org/content/279/1748/4836. http://rspb.royalsocietypublishing.org/content/279/1748/4836.full.pdf.
OpenUrl CrossRef PubMed

[56] ↵
Ruff, C. B. Climate and body shape in hominid evolution. Journal of Human Evolution 21, 81–105 (1991).
OpenUrl CrossRef Web of Science

[57] ↵
Allen, J. A. The Influence of Physical Conditions in the Genesis of Species. Radical Review 1, 108–140 (1877).
OpenUrl

[58] ↵
Katzmarzyk, P. T. & Leonard, W. R. Climatic influences on human body size and proportions: ecological adaptations and secular trends. Am. J. Phys. Anthropol. 106, 483–503 (1998).
OpenUrl CrossRef PubMed Web of Science

[59] ↵
Chan, Y. et al. Genome-wide Analysis of Body Proportion Classifies Height-Associated Variants by Mechanism of Action and Implicates Genes Important for Skeletal Development. Am. J. Hum. Genet. 96, 695–708 (2015).
OpenUrl

[60] ↵
Palmer, C. & Pe’er, I. Statistical correction of the winner’s curse explains replication variability in quantitative trait genome-wide association studies. bioRxiv (2017).

[61] ↵
Brown, B. C. et al. Transethnic genetic-correlation estimates from summary statistics. The American Journal of Human Genetics 99, 76–88 (2016).
OpenUrl CrossRef PubMed

[62] ↵
Das, S. et al. Next-generation genotype imputation service and methods. Nature genetics 48, 1284–1287 (2016).
OpenUrl CrossRef PubMed

[63] ↵
Wakefield, J. Bayes factors for genome-wide association studies: comparison with P-values. Genet. Epidemiol. 33, 79–86 (2009).
OpenUrl CrossRef PubMed Web of Science

[64] Perry, J. R. B. et al. Parent-of-origin-specific allelic associations among 106 genomic loci for age at menarche. Nature 514, 92–97 (2014).
OpenUrl CrossRef PubMed Web of Science

[65] Lambert, J. C., Ibrahim-Verbaas, C. A., Harold, D. & Naj, A. C. Meta-analysis of 74,046 individuals identifies 11 new susceptibility loci for Alzheimer’s disease. Nature (2013).

[66] van der Valk, R. J. P. et al. A novel common variant in DCST2 is associated with length in early life and height in adulthood. Hum. Mol. Genet. 24, 1155–1168 (2014).
OpenUrl PubMed

[67] Horikoshi, M., Yaghootkar, H. & Mook-Kanamori, D. O. New loci associated with birth weight identify genetic links between intrauterine growth and adult height and metabolism. Nature (2013).

[68] Locke, A. E. et al. Genetic studies of body mass index yield new insights for obesity biology. Nature 518, 197–206 (2015).
OpenUrl CrossRef PubMed

[69] Schunkert, H., König, I. R., Kathiresan, S. & Reilly, M. P. Large-scale association analysis identifies 13 new susceptibility loci for coronary artery disease. Nature (2011).

[70] Jostins, L. et al. Host-microbe interactions have shaped the genetic architecture of inflammatory bowel disease. Nature 491, 119–124 (2012).
OpenUrl CrossRef PubMed Web of Science

[71] Manning, A. K., Hivert, M. F., Scott, R. A. & Grimsby, J. L. A genome-wide approach accounting for body mass index identifies genetic variants influencing fasting glycemic traits and insulin resistance. Nature (2012).

[72] Estrada, K., Styrkarsdottir, U., Evangelou, E. & Hsu, Y. H. Genome-wide meta-analysis identifies 56 bone mineral density loci and reveals 14 loci associated with risk of fracture. Nature (2012).

[73] van der Harst, P. et al. Seventy-five genetic loci influencing the human red blood cell. Nature 492, 369–375 (2012).
OpenUrl CrossRef PubMed Web of Science

[74] Teslovich, T. M. et al. Biological, clinical and population relevance of 95 loci for blood lipids. Nature 466, 707–713 (2010).
OpenUrl CrossRef PubMed Web of Science

[75] Wood, A. R., Esko, T., Yang, J., Vedantam, S. & Pers, T. H. Defining the role of common variation in the genomic and biological architecture of adult human height. Nature (2014).

[76] Cousminer, D. L. et al. Genome-wide association and longitudinal analyses reveal genetic loci linking pubertal height growth, pubertal timing and childhood adiposity. Hum. Mol. Genet. 22, 2735–2747 (2013).
OpenUrl CrossRef PubMed Web of Science

[77] Shungin, D. et al. New genetic loci link adipose and insulin biology to body fat distribution. Nature 518, 187–196 (2015).
OpenUrl CrossRef PubMed Web of Science

[78] Taal, H. R. et al. Common variants at 12q15 and 12q24 are associated with infant head circumference. Nat Genet 44, 532–538 (2012).
OpenUrl CrossRef PubMed

[79] Gieger, C. et al. New gene functions in megakaryopoiesis and platelet formation. Nature 480, 201–208 (2011).
OpenUrl CrossRef PubMed Web of Science

[80] Okada, Y. et al. Genetics of rheumatoid arthritis contributes to biology and drug discovery. Nature 506, 376–381 (2014).
OpenUrl CrossRef PubMed Web of Science

[81] Ripke, S. et al. Biological insights from 108 schizophrenia-associated genetic loci. Nature 511, 421–427 (2014).
OpenUrl CrossRef PubMed Web of Science

[82] Chan, Y., Salem, R. M., Hsu, Y. & McMahon, G. Genome-wide analysis of body proportion classifies height-associated variants by mechanism of action and implicates genes important for skeletal Am. J. Hum. Genet. (2015).

[83] Morris, A. P., Voight, B. F., Teslovich, T. M. & Ferreira, T. Large-scale association analysis provides insights into the genetic architecture and pathophysiology of type 2 diabetes. Nature (2012).

[84] ↵
Pickrell, J. K. & Pritchard, J. K. Inference of population splits and mixtures from genome-wide allele frequency data. PLoS Genet (2012).

[85] ↵
Zhao, L., Lascoux, M., Overall, A. & Waxman, D. The characteristic trajectory of a fixing allele: A consequence of fictitious selection that arises from conditioning. Genetics (2013).

[86] ↵
Kremer, A. & Le Corre, V. Decoupling of differentiation between traits and their underlying genes in response to divergent selection. Heredity (Edinb) 108, 375–385 (2012).
OpenUrl

[87] ↵
Le Corre, V. & Kremer, A. The genetic differentiation at quantitative trait loci under local adaptation. Mol. Ecol. (2012).

[88] ↵
Chan, Y., Lim, E. T., Sandholm, N. & Wang, S. R. An excess of risk-increasing low-frequency variants can be a signal of polygenic inheritance in complex diseases. Am. J. Hum. Genet. (2014).

[89] ↵
Mathieson, I. Selection on height in europe. http://mathii.github.io/review/2015/10/21/selection-on-height-in-europe (2015).

[90] ↵
Huxley, J. Problems of Relative Growth (Methuen, London, 1932).

[91] ↵
Huxley, J. S. & Teissier, G. Terminology of relative growth. Nature 137, 780–781 (1936).
OpenUrl CrossRef

[92] ↵
Cheverud, J. M. Relationships among ontogenetic, static, and evolutionary allometry. American Journal of Physical Anthropology 59, 139–149 (1982). URL http://dx.doi.org/10.1002/ajpa.1330590204.
OpenUrl CrossRef PubMed Web of Science

[93] ↵
Lande, R. Quantitative genetic analysis of multivariate evolution, applied to brain: body size allometry. Evolution 402–416 (1979).

[94] ↵
Rice, S. H. The evolution of canalization and the breaking of von baer’s laws: Modeling the evolution of development with epistasis. Evolution 52, 647–656 (1998). URL http://www.jstor.org/stable/2411260.
OpenUrl CrossRef Web of Science

[95] ↵
Nieuwboer, H. A., Pool, R., Dolan, C. V., Boomsma, D. I. & Nivard, M. G. GWIS: Genome-Wide Inferred Statistics for Functions of Multiple Phenotypes. Am. J. Hum. Genet. 99, 917–927 (2016). URL http://www.sciencedirect.com/science/article/pii/S0002929716303214.
OpenUrl