Genetic Consequences of Social Stratification in Great Britain

Abdel Abdellaoui; David Hugh-Jones; Kathryn E. Kemper; Yan Holtz; Michel G. Nivard; Laura Veul; Loic Yengo; Brendan P. Zietsch; Timothy M. Frayling; Naomi Wray; Jian Yang; Karin J.H. Verweij; Peter M. Visscher

doi:10.1101/457515

Abstract

Human DNA varies across geographic regions, with most variation observed so far reflecting distant ancestry differences. Here, we investigate the geographic clustering of genetic variants that influence complex traits and disease risk in a sample of ~450,000 individuals from Great Britain. Out of 30 traits analyzed, 16 show significant geographic clustering at the genetic level after controlling for ancestry, likely reflecting recent migration driven by socio-economic status (SES). Alleles associated with educational attainment (EA) show most clustering, with EA-decreasing alleles clustering in lower SES areas such as coal mining areas. Individuals that leave coal mining areas carry more EA-increasing alleles on average than the rest of Great Britain. In addition, we leveraged the geographic clustering of complex trait variation to further disentangle regional differences in socio-economic and cultural outcomes through genome-wide association studies on publicly available regional measures, namely coal mining, religiousness, 1970/2015 general election outcomes, and Brexit referendum results.

Introduction

The first law of geography states that “everything is related to everything else, but near things are more related than distant things”.¹ Humans living near each other tend to share more ancestry with each other than with humans that live further away, which is reflected in genome-wide patterns of genetic variation on a global scale² and on finer scales.^3–5 Regional differences in allele frequencies are driven by genetic drift (i.e., the random fluctuations of allele frequency each generation), natural selection pressures, migrations, or admixture (i.e., two previously isolated populations interbreeding). Out of these four mechanisms, genetic drift is the only mechanism not expected to disproportionately affect genetic variants that are associated with heritable human traits. Natural selection targets heritable traits over extended periods of time, thereby affecting allele frequencies of the genetic variants that are associated with the traits under selection. Earlier studies have identified natural selection pressures on many trait-associated variants by looking for extreme allele frequency differences between different ancestries.^3,6,7 Migration is behavior, and since most behavioral traits have heritable components,⁸ migration is likely to be associated with genetic variants that influence behavior. Long-distance migratory events may in turn result in admixture. Internal migrations (i.e., migrations within countries) may lead to geographic clustering of trait-associated genetic variants beyond the clustering of ancestry and may occur for a variety of reasons. They may be driven by the search for specific neighborhood, housing, and inhabitant characteristics, and/or socio-economic factors (e.g., education or job-related considerations),⁹ such as the mass migrations from rural to industrial areas during the industrialization.¹⁰ These geographic movements may coincide with regional clustering of heritable social outcomes such as socio-economic status and major group ideologies (e.g., religion¹¹ and political preference¹²).

Understanding what drives the geographic distribution of genome-wide complex trait variation is important for a variety of reasons. Studying regional differences of genetic variants associated with complex traits that reflect education, wealth, growth, health, and disease, may help understand why those traits are unevenly distributed across Great Britain. Besides the known regional differences in income and SES, significant regional differences have been reported for mental¹³ and physical¹⁴ health problems. Regional differences in wealth and health are likely linked to each other,^15–17 and have been shown to be partly driven by migration.^14,18 If genome-wide complex trait variation is geographically clustered, this should also be taken into account in certain genetically-informative study designs. Mendelian randomization for example uses genetic variants as instrumental variables to identify causality, under the assumption that the genetic instrument is not associated with confounders that influence the two traits under investigation.¹⁹ Geographic clustering of genetic complex trait variation could introduce gene-environment correlations that violate this assumption.²⁰ Such gene-environment correlations could also introduce bias in heritability estimates in twin and family studies,²¹ and could affect signals from genome-wide association studies (GWASs). Furthermore, studying the genetics of migration and geographically clustered cultural phenomena that are related to how society is organized, such as SES, political preference, and religiosity, may help us to further understand regional differences beyond what can be learned from standard observational data. For example, as we will show in this study using a novel regional GWAS approach, we can compute genetic correlations between these clustered social phenomena and a wide range of other traits for which GWASs have been conducted through their GWAS summary statistics.²² This can teach us about how these regional differences are related to traits that have not been measured in the same dataset.

In this study, we first investigate whether genome-wide complex trait variation is geographically clustered after accounting for ancestry differences; if so, this may reflect the genetic consequences of more recent (internal) migration events. In addition, we investigate whether genome-wide complex trait variation is sufficiently clustered to capture the heritability of regional cultural outcomes such as coal mining, religiousness, and political preference by conducting GWASs on publicly available regional measures. We will then utilize the genetic signals from these GWASs to estimate genetic correlations between the regional measures and a wide range of complex traits.

Data and Analysis

We investigated the geographic clustering of ancestry and complex trait variation using genome-wide single-nucleotide polymorphism (SNP) data from ~450,000 British individuals of European ancestry from the UK Biobank project.²³ Ancestry within Great Britain was captured by conducting a principal component analysis (PCA)²⁴ on genome-wide SNPs, a method that has been shown to successfully capture ancestry differences within relatively homogeneous populations.³ Genome-wide complex trait variation was captured by polygenic scores, which are created by weighting an individual’s alleles by the estimated allelic effects on the trait of interest and then summing the weights, resulting in predictive scores for each individual. We built polygenic scores for 456,426 individuals from 1,312,100 autosomal SNPs using effect estimates from 30 published GWASs on traits related to psychiatric disease, substance use, personality, body composition, cardiovascular disease, diabetes, reproduction, and educational attainment (see Supplementary Table 1). Importantly, the 30 GWASs that produced the effect estimates did not include UK Biobank participants.²⁵ Geographic clustering of genetic variation was then investigated using 320,940 unrelated individuals and their birthplace by testing whether the spatial autocorrelation (Moran’s I) is significantly greater than zero for ancestry-informative principal components (PCs), polygenic scores, and the residuals of polygenic scores after regressing out the first 100 PCs. The spatial autocorrelation (Moran’s I) is the correlation in a measure among nearby locations in space, and its values range between −1 (dispersed) to 0 (spatially random) to 1 (spatially clustered).²⁶ Supplementary Figure 1 shows geographic locations of UK Biobank participants. Furthermore, we test whether polygenic scores that showed significant geographic clustering were associated with an index of economic deprivation of the neighborhood (the Townsend index) and migration into or out of the most economically deprived regions (coal mining areas), while accounting for ancestry differences (100 PCs).

We subsequently investigate whether geographic clustering of genome-wide complex trait variation is associated with regional cultural outcomes by running genome-wide association analyses on coal mining, regional estimates of the proportion of religious vs non-religious inhabitants, election outcomes of the Brexit referendum and of the 1970 & 2015 general elections. We estimate the degree to which these regional differences share genetic influences with a range of traits related to cognitive ability, socio-economic status (SES), personality, behavior, substance use, mental and physical health, well-being, reproduction, and body composition.

For more detailed descriptions of the data and analyses, see Online Methods.

Geographic Clustering of Genome-Wide Ancestry and Complex Trait Variation

In line with earlier studies,⁵ British ancestry showed significant geographic clustering: the first 100 genetic PCs all show Moran’s I statistics that are greater than 0, with 72 PCs showing an empirical p-value < .0005, the Bonferroni corrected threshold, and 95 PCs showing an empirical p-value < .05. Many PCs roughly capture the differentiation between Scotland, England, and Wales (see Figure 1 for the first 5 PCs; see https://holtzyan.shinyapps.io/UKB_geo/ for maps of all 100 PCs). The geographic distributions of the ancestry differences captured by the PCs are likely to reflect consequences of historical demographic events.⁵ These include old population movements and settlements, followed by generations of relatively isolated (sub)populations that went through genome-wide allele frequency differentiation through genetic drift and, perhaps, differential natural selection pressures.

Figure 1:

The geographic distributions (birthplace) of the first five PCs, Moran’s I and empirical p-values for Moran’s I. P-values denoted in green are significant after Bonferroni correction.

Without controlling for ancestry, 27 out of the 30 polygenic scores tested showed a Moran’s I significantly greater than 0, indicating significant geographic clustering (Figures 2 & 3, see https://holtzyan.shinyapps.io/UKB_geo/ for maps of all polygenic scores). Only age at menarche, agreeableness, and caffeine consumption were not significantly geographically clustered. Many clustered polygenic scores showed geographic distributions that were similar to the ancestry differences captured by the PCs. After regressing out the 100 ancestry-informative PCs, 16 polygenic scores remained significantly geographically clustered with FDR correction, with educational attainment (EA) showing the highest Moran’s I (before PC correction: Moran’s I = .57, empirical p < 10⁻⁴; after PC correction: Moran’s I = .51, empirical p < 10⁻⁴; see Figures 3 & 4).

Figure 2:

Geographic distribution (birthplace) and Moran’s I values for polygenic scores of four major psychiatric disorders (based on GWASs from the Psychiatric Genomics Consortium (PGC): schizophrenia²⁸, bipolar²⁹, MDD³⁰, and ADHD³¹) and alcohol use³² before (top row) and after (bottom row) regressing out 100 ancestry-informative PCs. Green p-values are significant after FDR correction.

Figure 3:

Moran’s I of 30 SBLUP polygenic scores computed using the average polygenic score per region in 378 local authority regions. A shows the Moran’s I of the polygenic scores unadjusted for PCs (red) and adjusted for 100 PCs (green), where orange means a significant FDR corrected p-value < .05 (corrected for 30 tests). B shows the distribution of significant Moran’s I statistics from 10,000 permutations that were conducted to obtain an empirical p-value for Moran’s I. The vertical line to the right of the permutation distribution shows the observed Moran’s I of the actual data.

Figure 4:

Geographic distribution (birthplace) of the educational attainment (EA) polygenic scores before and after regressing out 100 PCs, and the geographic distribution of Townsend indices from 1971 and 2011. The black lines indicate coal mining areas.

It has been argued that geographic clustering of complex trait genetic variation in UK Biobank is due to (subtle) ancestry differences or ascertainment bias.²⁷ We discuss in more detail in the Supplementary Material why these are unlikely to be the sole explanations of our observations (paragraph: Population Stratification and Ascertainment Bias). In the next paragraph, we explore the more likely explanation, namely recent internal SES-related migrations.

Consequences of SES-Related Migration

The geographic clustering of genome-wide trait-associated alleles after correcting for 100 PCs possibly reflects migration events that occurred more recently than the pre-modern demographic events that drove the regional ancestry differences captured by the PCs. Given the exceptionally strong geographic clustering of the EA score, we investigate here whether it reflects relatively recent internal migrations due to SES-related factors, which are known to especially motivate longer distance moves.³³ Two types of migration flows may have affected the geographic clustering of SES-related alleles: 1) laborers and farmers leaving the country-side during the Industrial Revolution to work in the geographically clustered industrial jobs,¹⁰ and 2) more recent migration of higher-educated people, or people seeking a higher education, out of the more economically deprived industrial regions.

Much of the energy necessary for the mass-production that characterized the birth of the Industrial Revolution came from coal mines. The presence of coal and iron ore attracted large numbers of manual laborers. The Industrial Revolution and the later deindustrialization had a great impact on the economy of the coal mining areas.³⁴ The decline of the British coal industry began in the 1920s, and nearly the whole industry has closed since the early 1980s, resulting in major job losses that still remain visible in unemployment rates decades later.³⁵ Economic deprivation is widespread in coal mining areas: 43% of neighborhoods from coal mining areas fall into the 30% most economically deprived.³⁴ In our analysis, coal mining areas show more economic deprivation than the rest of Great Britain from 1971 to 2011 as measured with the Townsend index³⁶ (higher Townsend = more economic deprivation; all FDR corrected p-values < 10⁻³²; see Figure 4 & Supplementary Figure 6). All regions have become less economically deprived over time, but the difference between coal mining areas and the rest remains highly significant.

After correcting for ancestry differences (100 PCs), the Townsend index is significantly associated with the 16 geographically clustered polygenic scores, with the strongest associations for EA (higher EA polygenic score = lower Townsend index; see Supplementary Figures 7 & 8). These 16 polygenic scores also all show significant differences between coal mining areas and the rest of the regions, both based on birth place and current address (Supplementary Figure 9), with EA showing the strongest differences (FDR corrected p-value < 10⁻²⁰⁰). We further compared ancestry-corrected polygenic scores between four groups of unrelated individuals: 1) people born in coal mining areas who moved out of coal mining areas (N=35,024), 2) people born outside of coal mining areas and still live outside of coal mining areas (N=129,298), 3) people born outside of coal mining areas who moved into coal mining areas (N=47,505), and 4) people born in coal mining areas who still live in coal mining areas (N=111,838). ANOVAs for all 16 geographically clustered polygenic scores show significant differences between the four groups (Figure 5), with EA showing the largest and most significant differences (F_3,323661 = 687.3, FDR corrected p-value < 10⁻²⁰⁰). The largest differences were between people born in coal mining areas who moved away versus those who remained in the coal mining areas. The people that moved away have significantly higher EA polygenic scores than all other groups combined (t₄₃₉₂₃ = 19.8, p = 9 × 10⁻⁸⁷), while those that remained have significantly lower EA polygenic scores than all other groups combined (t₂₃₀₂₂₀ = 44.6, p < 10⁻²⁰⁰). The degree of geographic clustering of polygenic scores, as measured by Moran’s I, is significantly correlated with the strength of their associations with Townsend, coal mining areas, and migration groups; the strongest correlations were between Moran’s I and the F statistic of the migration group differences: r = .95, p = 2 × 10⁻⁸ including EA and r = .73, p = .002 excluding EA (Supplementary Figure 10).

Figure 5:

The average and standard errors of the 16 geographically clustered polygenic scores (ordered by Moran’s I) for four migration groups: born in coal field area and moved out, born in coal mining area and stayed, born outside of coal mining area and moved to coal mining area, born outside of coal mining area and stayed out. All polygenic scores shown are standardized residuals after regressing out 100 ancestry-informative PCs. ANOVAs were conducted for every polygenic score to test the presence of group differences, which were all significant with the least significant FDR corrected p-value of 1 × 10⁻⁴ for conscientiousness.

To get a better sense of the scale of regional differences in polygenic scores, and of how these change due to migration, we computed how much of the individual differences are explained by regional differences for both birthplace and current address (Supplementary Figure 15). The regional differences are greatest for the EA polygenic score, with about ~0.6-2.6% of individual differences being explained by regional differences, depending on how fine the regional scale is (the finer the scale, the more individual differences explained) and by whether the calculations are based on the birthplace or the current address. Regional differences are ~38-54% greater for the current address than for birthplace. The increase in variation explained by regional differences for the EA score (i.e., difference between birthplace and current address in % variance explained by region) is greater than the total variance explained by region for any other polygenic score. As would be expected from recent migration events, ancestry shows the opposite effect: comparing birthplace to current address, the variance explained by region has on average decreased by 37-73% for the first 30 PCs (Supplementary Figure 16).

Genome-Wide Association Studies on Regional Outcomes

The geographic clustering of socio-economic resources and associated genetic variants may coincide with a range of regional collective views and attitudes. We examined this by leveraging the geographic clustering of genome-wide complex trait variation with GWASs on regional socio-economic and cultural outcomes, whereby all participants from the same region were assigned the same regional value as a phenotype (from here-on referred to as regional GWASs). The regional GWASs were run on the >400,000 UK Biobank participants, corrected for relatedness, age, sex, and ancestry (100 PCs). We first verified whether the approach works by running a regional GWAS on a regional measure of educational attainment (EA), obtained from census data, which showed genetic signals almost identical to an individual-level EA GWAS that excluded UK Biobank³⁷ (see Supplementary Materials). We then ran regional GWASs on the presence of coal in an area and regional measures of major ideological factors known to cluster geographically, namely religiousness¹¹ and political preference¹². The regional socio-economic and cultural outcomes were defined as follows: whether the individual was born/lives in a coal mining area, the proportion of religious vs non-religious inhabitants in their region (based on current address), the proportion of “Leave” votes and non-voters in the 2016 Brexit referendum (current address), the proportion of non-voters and votes in the individuals’ constituency for three major UK parties in the UK 1970 general elections (based on birthplace) and the five major UK parties in the UK 2015 general elections (current address). We used the genome-wide summary statistics of the regional GWASs to estimate genetic correlations with a wide range of complex traits using LD score regression, a method that computes genetic correlations based on GWAS summary statistics without bias from sample overlap or ancestry differences.²² We summarize the main results on their (genetic) relationship with other complex traits below, and additional results in the Supplementary Materials.

The regional outcomes showed striking and often highly significant genetic correlations with a wide range of other traits (Figure 6). EA, IQ, and age at first birth showed significant genetic correlations with every regional outcome. Overall, the strongest genetic correlations were observed for cognition & SES-related traits (IQ, EA, income, and Townsend). These suggest that the election outcomes can be divided roughly into higher SES and lower SES regions, with Green Party, Liberal Democrats, and Conservative regions containing more alleles associated with higher SES trait values, and the Labour Party, UKIP, “Leave” votes for Brexit, and non-voters reflecting regions with more alleles associated with lower SES trait values. The election outcomes that are genetically associated with higher cognition and SES outcomes generally also show negative genetic correlations with disease risk outcomes (ADHD, MDD, smoking, alcohol dependence, heart disease, type-2 diabetes, BMI, longevity, and self-rated health), except for alcohol consumption, cannabis use, autism, and psychiatric disorders that are characterized by delusions (schizophrenia, bipolar, anorexia), for which the genetic association is the other way around (higher SES = higher risk). The genetic correlations were largely similar between election outcomes and the coal mining regions, likely due to the same systematic regional SES differences. A different pattern was observed for the proportion of religious inhabitants, which showed weaker genetic correlations with the SES related traits (cognition and health) and stronger associations with the two personality dimensions that showed significant geographic clustering in our previous analyses, openness and conscientiousness (more religious people = lower openness and higher conscientiousness). Risk taking, schizophrenia, and autism also show the highest genetic correlations with being religious (more religious people = lower genetic risk).

Figure 6:

Genetic correlations based on LD score regression. Colored is significant after FDR correction. The green numbers in the left part of the Figure below the diagonal of 1’s are the phenotypic correlations between the regional outcomes of coal mining, religiousness, and regional political preference. The blue stars next to the trait names indicate that UK Biobank was part of the GWAS of the trait. See Supplementary Figure 23 for the standard errors and Supplementary Table 4 for the list of GWASs that the summary statistics of the complex traits were derived from.

The signals were largely consistent within parties over time (1970 & 2015) with respect to SES, but UKIP and Green Party did not yet exist in 1970. The genetic correlations between regional religiousness and the 1970 & 2015 election outcomes suggest that UKIP regions include former Labour Party regions with a more religious genetic profile (lower openness, higher conscientiousness), while the Green Party regions include former Conservative regions with a more non-religious genetic profile (higher openness, lower conscientiousness).

The genetic correlations between the regional outcomes were much stronger than the phenotypic correlations, and in some instances in opposite directions: Labour Party 2015 & Green Party (negative genetic, positive phenotypic correlation), Conservative Party 2015 & Green Party (positive genetic, negative phenotypic correlation), Conservative Party 2015 & Brexit (negative genetic, positive phenotypic correlation). A possible explanation is that the part of the regional variation that is explained by genetic differences is mostly related to regional socio-economic status (lower SES associated alleles in Labour and Brexit areas, higher SES in Conservative and Green areas), while environmental factors, which are responsible for most of the regional variation, are more characterized by ideology (Labour and Green areas being more left-wing, Conservative and Brexit more right-wing).

Discussion

Understanding the consequences of DNA variation in human populations is of major importance for medical, biological, forensic, behavioral, and anthropological research. Since we have been able to measure DNA at a sequence level, studies have shown that the geographic distributions of alleles are not random and have mapped striking geographic patterns of ancestry.^2,3,5,38 Here, we investigated geographic patterns of genome-wide complex trait variation and show that there are additional levels of genetic geographic clustering beyond the geographic patterns that reflect older ancestry differences. We show that the geographic clustering of genome-wide trait-associated alleles is related to recent geographic movement of people and that the resulting regional genetic patterns are associated with regional socio-economic and cultural outcomes.

Without controlling for ancestry, almost all traits we examined showed significant geographic clustering, often resembling the geographic patterns of ancestry differences within Great Britain. This indicates that either 1) the allele frequencies were differentiated between the different ancestries due to genetic drift or natural selection, and/or 2) the GWASs that produced the SNP effect estimates did not sufficiently control for ancestry differences, resulting in SNP effect estimates that are biased towards certain ancestral backgrounds. When we control for ancestry, 16 polygenic scores remain significantly clustered by geography. The strongest clustering was observed for EA. Among the rest of the geographically clustered traits are body dimensions, personality dimensions, and physical and mental health traits, which may reflect independent influences of them on non-random migration, and/or clustering that is (partly) driven by a genetic overlap with EA. The geographic clustering of complex trait variation seems to have increased due to relatively recent migration which is disrupting the older geographic patterns of ancestry (Supplementary Figures 15 & 16).

The degree of geographic clustering of the polygenic scores is largely in line with the strength of their relationship with regional economic deprivation and migration out of economically deprived regions (Supplementary Figure 10). People are more likely to migrate to improve their skills or employment prospects than for other area characteristics.⁹ Many industrialized countries showed these types of migration flows during the late 19^th and early 20^th century, where poorer laborers and small farmers left the country-side to work in industrial jobs that were often highly clustered in geographic space (e.g., coal mining areas).¹² After the deindustrialization, the dense, durable, and affordable working-class houses and the public transportation networks from the industrial revolution remained in these neighborhoods and continued to attract poorer immigrants.¹² Our results show that people with a genetic predisposition for higher cognitive abilities are leaving these regions, likely attracted by better educational or occupational opportunities in other regions. In fact, the people who were born in coal mining areas and migrated to better neighborhoods have higher average EA polygenic scores than people born outside of these regions. The regional clustering of cognitive abilities that follows may further affect the economic development of neighborhoods. These demographic processes may influence GWAS signals as well, where alleles that increase the chances of living in the unhealthy circumstances of lower SES neighborhoods may become part of the signal of a GWAS for a trait like BMI or body fat. There are for example significantly more McDonald’s restaurants in lower SES neighborhoods in Great Britain.³⁹ This may be part of the explanation for why four out of the top five geographically clustered polygenic scores are related to body weight.

Selective migration has led to geographic clustering of social and economic needs, which can coincide with collective attitudes towards how communities should be organized and governed. We successfully captured heritability signals for regional religiousness and regional political attitudes, both of which have been shown to be partly heritable on an individual level^40–45 and to cluster geographically^11,12. From a regional genetic perspective, the election outcomes can be roughly divided into lower SES and higher SES electorates. Our findings suggest that the previously reported heritability estimates of these traits on an individual level may contain genetic effects on traits, such as EA, that influence which socio-economic strata and geographic regions people end up living in. Regional religiousness shows higher genetic correlations with personality (openness and conscientiousness) and less with the SES and health traits than the political parties do, which implies additional dimensions of geographic clustering beyond high versus low SES.

Our findings may largely reflect genetic consequences of social stratification, a key characteristic of human civilizations whereby society groups their people into strata based on SES. SES is generally based on occupation, income, and educational attainment, which are influenced by many environmental and genetic factors, and are associated with a wide range of physical and mental health outcomes. Socioeconomic status is not distributed randomly across geographic space, which leads to geographic clustering of alleles that are associated with SES-related traits such as educational attainment. Educational attainment is known for its high levels of assortative mating,^46,47 which may be further induced by geographic clustering. This may affect social inequalities across generations through expanding biological inequalities in cognitive abilities and susceptibility to disease. It is possible that the combination of recent increases in social mobility and an improved educational system accelerates this separation of higher and lower genetic predisposition for traits related to cognition, SES, and health. Even though the genetic effects we find explain only part of the observed regional differences, researchers and social policy makers should keep these effects in mind, as they seem to be growing due to migration and can lead to detectable regional differences in health and social and economic success. For example, the significant genetic correlations between educational attainment and traits related to disease risk or body composition may decrease in the presence of stronger social safety nets that are geared towards making inhabitants of lower SES regions live more economically prosperous and healthier lives. Social policies that increase the quality of life in lower SES regions may also help to decrease migration out of the currently more economically deprived regions by people with genetic predispositions for higher SES outcomes, and thereby possibly result in a less geographically stratified society.

Online Methods

Participants

The participants of this study come from UK Biobank (UKB),²³ which has received ethical approval from the National Health Service North West Centre for Research Ethics Committee (reference: 11/NW/0382). A total of 502,655 participants aged between 37 and 73 years old were recruited in the UK between 2006 and 2010. They underwent a wide range of cognitive, health, and lifestyle assessments, provided blood, urine, and saliva samples, and will have their health followed longitudinally.

Genotypes and QC

A total of 488,377 UKB participants had their genome-wide single nucleotide polymorphisms (SNPs) genotyped on either the UK BiLEVE array (N = 49,950) or the UK Biobank Axiom Array (N = 438,423). The genotypes were imputed using the Haplotype Reference Consortium (HRC) panel as a reference set (pre-imputation QC and imputation are described in more detail in Bycroft et al, 2018).⁴⁸ To create polygenic scores, we extracted a set of 1,312,100 autosomal HapMap 3 (HM3) SNPs with minor allele count (MAC) > 5, info score > 0.3, p_HWE < 10⁻⁶, and missingness < .05. For the genome-wide association study, we used 5.8 million SNPs that survived QC and have a MAF > .01.

Ancestry & Principal Component Analysis

To capture British ancestry, we first excluded individuals with non-European ancestry. Ancestry was determined using Principal Component Analysis (PCA) in GCTA⁴⁹. The UKB dataset was projected onto the first two principal components (PCs) from the 2,504 participants of the 1000 Genomes Project,⁵⁰ using HM3 SNP with minor allele frequency (MAF) > 0.01 in both datasets. Next, participants from UKB were assigned to one of five super-populations from the 1000 Genomes project: European, African, East-Asian, South-Asian, or Admixed. Assignments for European, African, East-Asian, and South-Asian ancestries were based on > 0.9 posterior-probability of belonging to the 1000 Genomes reference cluster, with the remaining participants classified as Admixed. Posterior-probabilities were calculated under a bivariate Gaussian distribution where this approach generalizes the k-means method to take account of the shape of the reference cluster. We used a uniform prior and calculated the vectors of means and 2×2 variance-covariance matrices for each super-population. A total of 456,426 subjects were identified to have a European ancestry.

A PCA was then conducted on individuals of European ancestry in order to capture ancestry differences within the British population. In order to capture ancestry differences in homogenous populations, genotypes should be pruned for LD and long-range LD regions removed.³ The LD pruned (r² < .1) UKB dataset without long-range LD regions consisted of 137,102 genotyped SNPs. The PCA to construct British ancestry-informative PCs was conducted on this SNP set for unrelated individuals using flashPCA v2.⁵¹ PC SNP loadings were used to project the complete set of European individuals onto the PCs.

Polygenic Scores

Polygenic scores, the genome-wide sum of alleles weighted by their estimated effect sizes, were computed for 30 traits. The effect size estimates were obtained from genome-wide association studies (GWASs) that were chosen to not have included the UKB dataset to avoid over-estimation of the genetic predisposition of a trait.²⁵ The polygenic scores were computed using the SBLUP approach,⁴⁶ which maximizes the predictive power by creating scores with best linear unbiased predictor (BLUP) properties that account for linkage disequilibrium (LD) between SNPs. As a reference sample for the LD, we used the a random sample of 10,000 unrelated individuals from UK Biobank that were imputed using the Haplotype Reference Consortium (HRC) reference panel.⁵² The traits included psychiatric disorders, substance use, anthropomorphic traits, personality dimensions, educational attainment, reproduction, cardiovascular disease, and type-2 diabetes. Supplementary Table 1 lists the 30 traits and the GWASs from which we obtained the genome-wide effect sizes.

To further investigate the robustness of our results, we also created polygenic scores using only independent SNPs that were associated with the trait with a p-value < .05. The SNPs were clumped using PLINK⁵³, using an r² threshold of 0.1 and a window of 1 Mb as the physical distance threshold for clumping.

In order to examine the geographic clustering of polygenic scores beyond the clustering of ancestry, we created additional sets of polygenic scores that had the first 100 British ancestry-informative PCs regressed out.

Spatial autocorrelations (Moran’s I)

The geographic clustering of ancestry and of genome-wide complex trait variation was investigated by testing whether the spatial autocorrelation (Moran’s I) is significantly greater than zero for ancestry-informative principal components (PCs), polygenic scores, and the residuals of polygenic scores after regressing out 100 ancestry-informative PCs. The spatial autocorrelation (Moran’s I) is the correlation in a measure among nearby locations in space, and its values range between −1 (dispersed) to 0 (spatially random) to 1 (spatially clustered).²⁶ Moran’s I’s were computed using the average PCs or polygenic scores per region based on the birthplace of the subjects (378 regions, see Figure 1), whereby the regions were defined according to the local authorities division as provided by the UK Data Service InFuse database.⁵⁴ The empirical p-values of Moran’s I statistics were derived with 10,000 permutations in which the average PCs or polygenic scores were permuted across regions (Figure 3B).

Regional genome-wide association studies (GWASs)

Genome-wide association studies (GWASs) were run on publicly available regional outcomes, whereby all subjects from the same regions had the same regional phenotypic value assigned. Supplementary Figure 16 & 21 show the distributions of all phenotypes analyzed, except for the coal mining phenotypes, which were binary traits (47% of the participants were born in a coal mining area, and 50% of the participants currently live in a coal mining area). The regional phenotypes were obtained from the following public resources:

The borders of a total of 208 coal mining regions were obtained from the Coal Authority: https://data.gov.uk/dataset/coal-mining-reporting-area
The regional educational attainment for 342 local districts and for 7,195 Middle Super Output Areas (MSOA) was measured as the 2011 estimates of the highest qualification of residents of England >16 years old (5 levels: level 1 qualifications, level 2 qualifications, apprenticeship, level 3 qualifications, level 4 qualifications) was obtained from the Nomis database of the Office of National Statistics: https://www.nomisweb.co.uk/
The proportion of religious vs non-religious inhabitants were obtained for 7,195 Middle Super Output Areas (MSOA) regions from the Nomis database of the Office of National Statistics: https://www.nomisweb.co.uk/
The 2016 Brexit referendum results were obtained for 405 Local Authority Districts from The Electoral Commision: https://www.electoralcommission.org.uk/find-information-by-subject/elections-and-referendums/past-elections-and-referendums/eu-referendum/electorate-and-count-information
The 1970 general election outcomes were obtained for 630 constituencies from Political Science Resources: http://www.politicsresources.net/area/uk/ge70/ge70index.htm
The 2015 general election outcomes were obtained for 633 constituencies from data.parliament.uk: http://www.data.parliament.uk/dataset/general-election-2015
All political parties were included that had a median proportion of votes > 0.

We ran linear mixed model (LMM) GWASs with BOLT-LMM⁵⁵ on participants with European ancestry, which controls for cryptic relatedness and population stratification by including a genetic relatedness matrix (GRM) in the model.⁵⁶ Sex and age were included as covariates, as were the first 100 PCs as an additional control for population stratification. The results revealed a considerable inflation of test statistics that was not due to polygenic effects (this was captured by the LD score intercepts⁵⁷ shown in Supplementary Table 2). This is likely due to the fact that participants that share regional environmental influences, because they come from the same region, are all assigned the same phenotypic value. We controlled for this inflation with an LD score intercept-based genomic control,⁵⁷ i.e., we adjusted the standard errors (SE) of the estimated effect sizes as follows: (see Supplementary Table 2).

LD Score Regression

We partitioned the polygenic contributions to the heritability across genomic regions associated with histone modifications specific to ten cell-type/tissue groups using stratified LD score regression⁵⁸ (Supplementary Figure 23). Genetic correlations were also computed using LD-score regression (Figure 6).⁵⁸ The genetic correlation between traits is based on the estimated slope from the regression of the product of z-scores from two GWASs on the LD score and represents the genetic covariation between two traits based on all polygenic effects captured by the included SNPs. The genome-wide LD information used by these methods were based on European populations from the HapMap 3 reference panel.^57,58 All LD score regression analyses included the 1,290,028 million genome-wide HapMap SNPs used in the original LD score regression studies.^57,58

Computing genetic correlations with LD score regression is robust to sample overlap, so we included summary statistics from GWAS studies that also included UK Biobank (denoted with a blue star in Figure 6). Where possible however, we decided to display results obtained from summary statistics without UK Biobank, even if the GWASs from the original studies included UK Biobank participants. This was the case for MDD³⁰ and educational attainment³⁷, for which we used the same summary statistics that we used for the polygenic scores, namely from the GWASs that were re-run excluding UK Biobank. The genetic correlations for MDD and educational attainment obtained with the summary statistics that did include UK Biobank however were almost identical.

Genome-Wide Association Studies on Regional Outcomes (Additional Results)

In order to empirically validate the approach, we first ran regional GWASs on regional measures of average EA outcomes as obtained from census data, namely the weighted average of 2011 estimates of the highest qualification of residents >16 years old, obtained from the Office for National Statistics (Supplementary Figure 16). This resulted in genetic signals that were very close to those of an individual-level EA GWAS that excluded UK Biobank.³⁷ Most significant SNPs in the regional GWAS were at least nominally significant in the individual level GWAS and their effect sizes correlate .93 (Supplementary Figure 18). The genetic correlation between the regional EA GWAS and the individual level EA GWAS was .90, and the genetic correlations with 64 other complex traits were almost identical between the regional and individual level EA GWASs (r = .99, Supplementary Figures 19 & 20).

For the regional GWASs conducted on the presence of coal in the area, religiousness, and political preference, there were a total of 12 independent SNPs with p < 5 × 10⁻⁸ and 5 independent SNPs with p < 1 × 10⁻⁸ (Supplementary Figure 22 & Supplementary Table 3). The variance that could be accounted for by all SNPs (i.e., SNP heritability) ranged from 0.3% to 2.4% (see Supplementary Table 2), with the highest (≥2%) observed for Brexit, Green Party, UKIP, and non-voters in 2015. The heritability signals were significantly enriched for genetic variants that are active in hormonal pathways for the Green Party, and in the central nervous system for the Green Party, UKIP, 2015 non-voters, and Brexit (Supplementary Figure 23).

Regions with more non-voters genetically show a lower SES profile (i.e., strong negative genetic correlations with cognition and SES-related traits) and the largest positive genetic correlations with regions with more Labour party votes, up to .96 between the 2015 non-voters and the 2015 Labour voters. The 2015 non-voters regional GWAS shows the highest SNP heritability of the non-voters GWASs (2.2%). The genetic correlations also imply that regions with more non-voters and Labour voters show more risk-increasing alleles for mood-related traits (i.e., more MDD, higher neuroticism, more loneliness, and lower wellbeing), and no significant genetic correlation with conscientiousness, as opposed the other lower SES regions with more votes for UKIP and “Leave” votes for Brexit, which show a significant positive genetic correlation with conscientiousness.

In order to further examine what differentiates the parties within the higher SES and lower SES clusters from each other, we repeated the regional GWASs for the proportion of votes among only the Green Party, Liberal Democrats, and Conservatives votes, and the proportion of votes among only Labour Party and UKIP votes. The correlations with religiousness were consistently higher and more often significant for the differences within the higher and lower SES voters than the differences between them (Supplementary Figure 24). The genetic signals that differentiate Green Party regions from the other higher SES votes show the highest genetic correlation with regional religiousness (more Green Party votes = less religious: r_g = -.82, SE = .06). What differentiates Liberal Democrats from the other higher SES parties still seems to be largely SES-related, given the high positive genetic correlation with EA and income (both .77). The lower SNP heritability estimates of the within SES differences (0.3% - 1.2%, compared to 1.1% - 2.4%, see Supplementary Table 2) suggest that the differences within the higher and lower SES voters are less influenced by regional genetics than the differences between them.

Population Stratification and Ascertainment Bias

Population Stratification

The largest patterns of genome-wide variation between and within human populations are due to differences in ancestry rather than trait variation. In genetic association studies, false positives due to population stratification occur when these systematic ancestry differences get mistaken for associations due to genetic variants that influence the trait that is being studied.⁵⁹ False positives due to population stratification can occur when trait differences are in line with ancestry differences, which could also occur due to non-genetic factors, such as regional differences in environmental exposures. Geographic location is known to strongly correlate with ancestry differences: the closer people live to each other, the more likely it is that they share more ancestors. The main focus of this study is the relationship between geographic location and genome-wide complex trait variation, which is why we had to be particularly rigorous in accounting for population stratification. We summarize below why it is unlikely that our observations are merely a result of ancestry differences or biased polygenic scores.

The most widely used approach to account for ancestry differences is to quantify ancestry differences with a principal component analysis (PCA) on genome-wide SNP data and then account for the resulting principal components (PCs).^24,59 Instead of using the standard 40 PCs provided by UK Biobank, which capture both non-European and European ancestry differences,⁴⁸ we re-computed PCs to more effectively capture population stratification within the more homogeneous group of British participants with European ancestry (see Online Methods). While genome-wide association studies (GWASs) usually control for 10 to 40 PCs, we controlled for the first 100 PCs in all our analyses.
We validated the effectiveness of the 100 PCs in accounting for geographic clustering due to population stratification using polygenic scores that reflect European ancestry differences as captured in an independent European-American dataset: the GERA cohort.⁶⁰ First, we conducted GWASs in GERA (N = 51,258) on the first 20 GERA PCs in order to get SNP effects that reflect genome-wide patterns of their European-American ancestry differences. We then used these SNP effects to build polygenic scores in UK Biobank. These all show significant geographic clustering as quantified with Moran’s I. After controlling for 100 PCs from UKB all Moran’s I’s dropped to being not significantly greater than 0 (see Online Methods and Supplementary Figures 3 and 4).
The polygenic scores we analyzed were constructed from 1,312,100 autosomal SNPs, regardless of how significantly associated they are with the trait. The ensemble of non-significant SNPs contain a substantial amount of signals due to true causal relationships, and thus meaningful effect sizes, which increase the predictive power of polygenic scores.⁶¹ Increasing the number of non-associated SNPs however may also increase the chances of including more stratified SNPs in the polygenic score. We therefore created a set of additional polygenic scores using only independent SNPs (i.e., clumped) that were at least nominally significantly associated with the trait at p < .05. This results in scores that are based on fewer SNPs that are more reliably associated with the trait, but also results in less predictive scores. With this approach, 8 out of 16 previously significant traits are significantly geographically clustered after FDR correction (see Supplementary Figure 2). These geographically clustered clumped scores also showed similar and significant associations with Townsend (Supplementary Figures 11 & 12), coal mining regions (Supplementary Figure 13) and migration out of coal fields (Supplementary Figures 14), with educational attainment showing the strongest effects.
We show that the geographically clustered polygenic scores are significantly associated with regional outcomes of economic deprivation and with migration out of the more economically deprived regions in the UK (coal mining regions). The strength of the geographic clustering is in line with the strength of the association with regional outcomes and, more importantly, migration (Supplementary Figure 10). In other words, 1) the traits that show significant geographic clustering are the traits that cluster in specific regions that are characterized by lower SES measures, and 2) we show that the processes that would result in these regional differences are measurable in the current dataset, namely migration out of the lower SES regions by individuals with a higher predisposition for SES-related traits such as higher educational attainment and lower body weight. Although these observations do not directly prove that ancestry differences cannot account for this geographic clustering, it does show that if subtle population stratification would be the cause of these regional differences (which is unlikely given our stringent control for ancestry differences), it would have to involve ancestry differences that are in line with genome-wide complex trait variation.
SNPs that are in LD with many SNPs are more likely to tag a causal SNP (i.e., be correlated with a causal SNP), and are thus more likely to have a higher test-statistic in a GWAS. The amount of SNPs that is tagged by a SNP is quantified by its LD Score. LD Score regression is an approach that leverages the relationship between the LD score of a SNP and the GWAS test statistic to distinguish inflation of genome-wide test statistics due to variants that influence the complex trait under study from inflation due to confounding bias such as population stratification.⁵⁷ LD Score regression analyses show that the results from our regional GWASs all show an inflation of test statistics that is partly due to confounding (likely shared environmental influences) but also contains a considerable inflation due to variants that are associated with complex trait variation that is being captured with the regional measures. LD Score regression was then used to compare the parts of the genetic signals that were due to causal variants between our regional GWASs and GWASs from a wide range of other complex traits. Importantly, LD Score regression showed that the signals from our regional GWAS on EA contained almost the same signals as an individual level GWAS on EA that was conducted on non-UK Biobank datasets, which is in line with the geographic clustering of genome-wide alleles that have a causal influence on EA.

Ascertainment Bias

The UK Biobank ascertainment strategy was designed to capture sufficient variation in socioeconomic, urban–rural, and ethnic background.²³ The participation rate however was 5.45% and was biased towards older, more healthy, and female residents.⁶² The UK Biobank sample does reflect nationally representative data sources to a significant degree, making it likely that our observations would generalize to the population at large. We tested at the MSOA level how EA measurements in UK Biobank compare to nationally representative census data (the same EA census measurements that we used for the regional EA GWAS). The average EA per MSOA region as measured in UK Biobank is strongly predictive of MSOA-EA as measured from the nationally representative census data (p < 10⁻¹⁶, R² = 40%; Supplementary Figure 5). The average polygenic scores per MSOA region, with 100 PCs regressed out, are also highly predictive of MSOA-EA according to nationally representative census data (p < 10⁻¹⁶, R² = 19%; Supplementary Figure 5). Since UK Biobank has sampled healthier individuals as well as fewer individuals from more economically deprived areas as compared to the British population as a whole,⁶² the regional differences that we report may turn out to be stronger in the real population than in the UK Biobank sample.

Acknowledgements

This research was supported by the Australian National Health and Medical Research Council (1107258, 1078901, 1078037, 1056929, 1048853, and 1113400), and the Sylvia & Charles Viertel Charitable Foundation (Senior Medical Research Fellowship). B.P.Z. received funding from The Australian Research Council (FT160100298). M.G.N. is supported by ZonMw grants 849200011 and 531003014 from The Netherlands Organisation for Health Research and Development. K.J.H.V is supported by the Foundation Volksbond Rotterdam. This study makes use of data from the UK Biobank Resource (Application Number: 12514) and dbGaP (Accession Number: phs000674).

References

↵
Tobler, W. R. A computer movie simulating urban growth in the Detroit region. Economic geography 46, 234–240 (1970).
OpenUrl CrossRef Web of Science
↵
Novembre, J. et al. Genes mirror geography within Europe. Nature 456, 98 (2008).
OpenUrl CrossRef PubMed Web of Science
↵
Abdellaoui, A. et al. Population structure, migration, and diversifying selection in the Netherlands. European journal of human genetics 21, 1277 (2013).
OpenUrl CrossRef PubMed
Kerminen, S. et al. Fine-Scale Genetic Structure in Finland. G3: Genes, Genomes, Genetics 7, 3459–3468 (2017).
OpenUrl
↵
Leslie, S. et al. The fine-scale genetic structure of the British population. Nature 519, 309–314 (2015).
OpenUrl CrossRef PubMed
↵
Zhang, G., Muglia, L. J., Chakraborty, R., Akey, J. M. & Williams, S. M. Signatures of natural selection on genetic variants affecting complex human traits. Applied & translational genomics 2, 78–94 (2013).
OpenUrl
↵
Berg, J. J. & Coop, G. A population genetic signal of polygenic adaptation. PLoS genetics 10, e1004412 (2014).
↵
Turkheimer, E. Three laws of behavior genetics and what they mean. Current directions in psychological science 9, 160–164 (2000).
OpenUrl CrossRef Web of Science
↵
Coulter, R. & Scott, J. What motivates residential mobility? Re‐examining self‐reported reasons for desiring and making residential moves. Population, Space and Place 21, 354–371 (2015).
OpenUrl
↵
Long, J. Rural-urban migration and socioeconomic mobility in Victorian Britain. The Journal of Economic History 65, 1–35 (2005).
OpenUrl
↵
Park, C. Sacred worlds: An introduction to geography and religion. (Routledge, 2002).
↵
Rodden, J. The geographic distribution of political preferences. Annual Review of Political Science 13, 321–340 (2010).
OpenUrl CrossRef Web of Science
↵
Lewis, G. & Booth, M. Regional differences in mental health in Great Britain. Journal of Epidemiology & Community Health 46, 608–611 (1992).
OpenUrl Abstract/FREE Full Text
↵
Boyle, P. Population geography: migration and inequalities in mortality and morbidity. Progress in Human Geography 28, 767–776 (2004).
OpenUrl CrossRef Web of Science
↵
Tyrrell, J. et al. Height, body mass index, and socioeconomic status: mendelian randomisation study in UK Biobank. bmj 352, i582 (2016).
OpenUrl Abstract/FREE Full Text
Marmot, M. The health gap: the challenge of an unequal world. The Lancet 386, 2442–2444 (2015).
OpenUrl
↵
Beard, E. et al. Healthier central England or North–South divide? Analysis of national survey data on smoking and high-risk drinking. BMJ open 7, e014210 (2017).
↵
Brimblecombe, N., Dorling, D. & Shaw, M. Migration and geographical inequalities in health in Britain. Social Science & Medicine 50, 861–878 (2000).
OpenUrl CrossRef PubMed
↵
Davey Smith, G. & Ebrahim, S. ‘Mendelian randomization’: can genetic epidemiology contribute to understanding environmental determinants of disease? International journal of epidemiology 32, 1–22 (2003).
OpenUrl CrossRef PubMed Web of Science
↵
Richards, J. B. & Evans, D. M. (British Medical Journal Publishing Group, 2017).
↵
Verweij, K. J., Mosing, M. A., Zietsch, B. P. & Medland, S. E. in Statistical Human Genetics 151–170 (Springer, 2012).
↵
Bulik-Sullivan, B. et al. An atlas of genetic correlations across human diseases and traits. Nature genetics 47, 1236–1241 (2015).
OpenUrl CrossRef PubMed
↵
Sudlow, C. et al. UK biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age. PLoS medicine 12, e1001779 (2015).
↵
Price, A. L. et al. Principal components analysis corrects for stratification in genome-wide association studies. Nature genetics 38, 904 (2006).
OpenUrl CrossRef PubMed Web of Science
↵
Wray, N. R. et al. Pitfalls of predicting complex traits from SNPs. Nature Reviews Genetics 14, 507–515 (2013).
OpenUrl CrossRef PubMed
↵
Moran, P. A. Notes on continuous stochastic phenomena. Biometrika 37, 17–23 (1950).
OpenUrl CrossRef PubMed Web of Science
↵
Haworth, S. et al. Common genetic variants and health outcomes appear geographically structured in the UK Biobank sample: Old concerns returning and their implications. bioRxiv, 294876 (2018).
↵
Ripke, S. et al. Biological insights from 108 schizophrenia-associated genetic loci. Nature 511, 421 (2014).
OpenUrl CrossRef PubMed Web of Science
↵
Ruderfer, D. M. et al. Polygenic dissection of diagnosis and clinical dimensions of bipolar disorder and schizophrenia. Molecular psychiatry 19, 1017 (2014).
OpenUrl CrossRef PubMed Web of Science
↵
Wray, N. R. et al. Genome-wide association analyses identify 44 risk variants and refine the genetic architecture of major depression. bioRxiv, 167577 (2017).
↵
Demontis, D. et al. Discovery of the first genome-wide significant risk loci for ADHD. bioRxiv, 145581 (2017).
↵
Schumann, G. et al. KLB is associated with alcohol drinking, and its gene product β-Klotho is necessary for FGF21 regulation of alcohol preference. Proceedings of the National Academy of Sciences 113, 14372–14377 (2016).
OpenUrl Abstract/FREE Full Text
↵
Niedomysl, T. How migration motives change over migration distance: evidence on variation across socio-economic and demographic groups. Regional Studies 45, 843–855 (2011).
OpenUrl CrossRef
↵
Foden, M., Fothergill, S. & Gore, T. The state of the coalfields: Economic and social conditions in the former mining communities of England, Scotland and Wales. Centre for Regional Economic and Social Research, Sheffield Hallam University (2014).
↵
Beatty, C., Fothergill, S. & Powell, R. Twenty years on: has the economy of the UK coalfields recovered? Environment and Planning A 39, 1654–1675 (2007).
OpenUrl CrossRef
↵
Townsend, P., Phillimore, P. & Beattie, A. Health and deprivation: inequality and the North. (Routledge, 1988).
↵
Okbay, A. et al. Genome-wide association study identifies 74 loci associated with educational attainment. Nature 533, 539 (2016).
OpenUrl CrossRef PubMed
↵
Menozzi, P., Piazza, A. & Cavalli-Sforza, L. Synthetic maps of human gene frequencies in Europeans. Science 201, 786–792 (1978).
OpenUrl Abstract/FREE Full Text
↵
Cummins, S. C., McKay, L. & MacIntyre, S. McDonald’s restaurants and neighborhood deprivation in Scotland and England. American journal of preventive medicine 29, 308–310 (2005).
OpenUrl CrossRef PubMed Web of Science
↵
Alford, J. R., Funk, C. L. & Hibbing, J. R. Are political orientations genetically transmitted? American political science review 99, 153–167 (2005).
OpenUrl CrossRef Web of Science
Benjamin, D. J. et al. The genetic architecture of economic and political preferences. Proceedings of the National Academy of Sciences 109, 8026–8031 (2012).
OpenUrl Abstract/FREE Full Text
Hatemi, P. K. & McDermott, R. The genetics of politics: discovery, challenges, and progress. Trends in Genetics 28, 525–533 (2012).
OpenUrl CrossRef PubMed Web of Science
Hatemi, P. K., Medland, S. E., Morley, K. I., Heath, A. C. & Martin, N. G. The genetics of voting: An Australian twin study. Behavior genetics 37, 435 (2007).
OpenUrl CrossRef PubMed Web of Science
Smith, K. et al. Biology, ideology, and epistemology: how do we know political attitudes are inherited and why should we care? American journal of political science 56, 17–33 (2012).
OpenUrl CrossRef PubMed
↵
Koenig, L. B., McGue, M., Krueger, R. F. & Bouchard, T. J. Genetic and environmental influences on religiousness: Findings for retrospective and current religiousness ratings. Journal of personality 73, 471–488 (2005).
OpenUrl CrossRef PubMed Web of Science
↵
Robinson, M. R. et al. Genetic evidence of assortative mating in humans. Nature Human Behaviour 1, 0016 (2017).
OpenUrl
↵
Hugh-Jones, D., Verweij, K. J., Pourcain, B. S. & Abdellaoui, A. Assortative mating on educational attainment leads to genetic spousal resemblance for polygenic scores. Intelligence 59, 103–108 (2016).
OpenUrl
↵
Bycroft, C. et al. Genome-wide genetic data on~ 500,000 UK Biobank participants. Nature 562, 203–209 (2018).
OpenUrl CrossRef
↵
Yang, J., Lee, S. H., Goddard, M. E. & Visscher, P. M. GCTA: a tool for genome-wide complex trait analysis. The American Journal of Human Genetics 88, 76–82 (2011).
OpenUrl CrossRef PubMed
↵
Consortium, G. P. A global reference for human genetic variation. Nature 526, 68–74 (2015).
OpenUrl CrossRef PubMed
↵
Abraham, G., Qiu, Y. & Inouye, M. FlashPCA2: principal component analysis of Biobank-scale genotype datasets. Bioinformatics 33, 2776–2778 (2017).
OpenUrl CrossRef PubMed
↵
McCarthy, S. et al. A reference panel of 64,976 haplotypes for genotype imputation. Nature genetics 48, 1279 (2016).
OpenUrl CrossRef PubMed
↵
Purcell, S. et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. The American Journal of Human Genetics 81, 559–575 (2007).
OpenUrl CrossRef PubMed
↵
Sweet, D. (2011).
↵
Loh, P.-R. et al. Efficient Bayesian mixed-model analysis increases association power in large cohorts. Nature genetics 47, 284–290 (2015).
OpenUrl CrossRef PubMed
↵
Yang, J., Zaitlen, N. A., Goddard, M. E., Visscher, P. M. & Price, A. L. Advantages and pitfalls in the application of mixed-model association methods. Nature genetics 46, 100–106 (2014).
OpenUrl CrossRef PubMed
↵
Bulik-Sullivan, B. K. et al. LD Score regression distinguishes confounding from polygenicity in genome-wide association studies. Nature genetics 47, 291–295 (2015).
OpenUrl CrossRef PubMed
↵
Finucane, H. K. et al. Partitioning heritability by functional annotation using genome-wide association summary statistics. Nature genetics 47, 1228–1235 (2015).
OpenUrl CrossRef PubMed
↵
Price, A. L., Zaitlen, N. A., Reich, D. & Patterson, N. New approaches to population stratification in genome-wide association studies. Nature Reviews Genetics 11, 459 (2010).
OpenUrl CrossRef PubMed Web of Science
↵
Kvale, M. N. et al. Genotyping informatics and quality control for 100,000 subjects in the Genetic Epidemiology Research on Adult Health and Aging (GERA) cohort. Genetics 200, 1051–1060 (2015).
OpenUrl Abstract/FREE Full Text
↵
Dudbridge, F. Power and predictive accuracy of polygenic risk scores. PLoS genetics 9, e1003348 (2013).
↵
Fry, A. et al. Comparison of sociodemographic and health-related characteristics of UK Biobank participants with those of the general population. American journal of epidemiology 186, 1026–1034 (2017).
OpenUrl CrossRef PubMed
De Moor, M. H. et al. Meta-analysis of genome-wide association studies for personality. Molecular psychiatry 17, 337 (2012).
OpenUrl CrossRef PubMed Web of Science
Lo, M.-T. et al. Genome-wide analyses for personality traits identify six genomic loci and show correlations with psychiatric disorders. Nature genetics 49, 152 (2017).
OpenUrl
Duncan, L. et al. Significant locus and metabolic genetic correlations revealed in genome-wide association study of anorexia nervosa. American Journal of Psychiatry 174, 850–858 (2017).
OpenUrl
Consortium, A. S. D. W. G. o. T. P. G. et al. Meta-analysis of GWAS of over 16,000 individuals with autism spectrum disorder highlights a novel locus at 10q24. 32 and a significant overlap with schizophrenia. Molecular autism 8, 1–17 (2017).
OpenUrl
Lambert, J.-C. et al. Meta-analysis of 74,046 individuals identifies 11 new susceptibility loci for Alzheimer’s disease. Nature genetics 45, 1452 (2013).
OpenUrl CrossRef PubMed
Furberg, H. et al. Genome-wide meta-analyses identify multiple loci associated with smoking behavior. Nature genetics 42, 441 (2010).
OpenUrl CrossRef PubMed Web of Science
Stringer, S. et al. Genome-wide association study of lifetime cannabis use based on a large meta-analytic sample of 32 330 subjects from the International Cannabis Consortium. Translational psychiatry 6, e769 (2017).
Cornelis, M. C. et al. Genome-wide meta-analysis identifies six novel loci associated with habitual coffee consumption. Molecular psychiatry 20, 647 (2015).
OpenUrl CrossRef PubMed
Nikpay, M. et al. A comprehensive 1000 Genomes–based genome-wide association meta-analysis of coronary artery disease. Nature genetics 47, 1121 (2015).
OpenUrl CrossRef PubMed
Morris, A. P. et al. Large-scale association analysis provides insights into the genetic architecture and pathophysiology of type 2 diabetes. Nature genetics 44, 981 (2012).
OpenUrl CrossRef PubMed
Elks, C. E. et al. Thirty new loci for age at menarche identified by a meta-analysis of genome-wide association studies. Nature genetics 42, 1077 (2010).
OpenUrl CrossRef PubMed
Day, F. R. et al. Large-scale genomic analyses link reproductive aging to hypothalamic signaling, breast cancer susceptibility and BRCA1-mediated DNA repair. Nature genetics 47, 1294 (2015).
OpenUrl CrossRef PubMed
Wood, A. R. et al. Defining the role of common variation in the genomic and biological architecture of adult human height. Nature genetics 46, 1173 (2014).
OpenUrl CrossRef PubMed
Locke, A. E. et al. Genetic studies of body mass index yield new insights for obesity biology. Nature 518, 197 (2015).
OpenUrl CrossRef PubMed
Lu, Y. et al. New loci for body fat percentage reveal link between adiposity and cardiometabolic disease risk. Nature communications 7, 10495 (2016).
OpenUrl
Shungin, D. et al. New genetic loci link adipose and insulin biology to body fat distribution. Nature 518, 187 (2015).
OpenUrl CrossRef PubMed Web of Science
Benyamin, B. et al. Childhood intelligence is heritable, highly polygenic and associated with FNBP1L. Molecular psychiatry 19, 253 (2014).
OpenUrl CrossRef PubMed Web of Science
Sniekers, S. et al. Genome-wide association meta-analysis of 78,308 individuals identifies new loci and genes influencing human intelligence. Nature Genetics (2017).
Hill, W. D. et al. Molecular genetic contributions to social deprivation and household income in UK Biobank. Current Biology 26, 3083–3089 (2016).
OpenUrl CrossRef PubMed
Okbay, A. et al. Genetic variants associated with subjective well-being, depressive symptoms, and neuroticism identified through genome-wide analyses. Nature genetics 48, 624 (2016).
OpenUrl CrossRef PubMed
Deary, V. et al. Genetic contributions to self-reported tiredness. Molecular psychiatry 23, 609 (2018).
OpenUrl
Churchhouse, C. & Neale, B. Rapid GWAS of Thousands of Phenotypes for 337,000 Samples in the UK Biobank. Neale Lab (2017).
Clarke, T.-K. et al. Genome-wide association study of alcohol consumption and genetic overlap with other health-related traits in UK Biobank (N= 112 117). Molecular psychiatry 22, 1376 (2017).
OpenUrl CrossRef PubMed
Walters, R. K. et al. Trans-ancestral GWAS of alcohol dependence reveals common genetic underpinnings with psychiatric disorders. bioRxiv, 257311 (2018).
Harris, S. E. et al. Molecular genetic contributions to self-rated health. International journal of epidemiology 46, 994–1009 (2016).
OpenUrl
Pilling, L. C. et al. Human longevity is influenced by many genetic variants: evidence from 75,000 UK Biobank participants. Aging (Albany NY) 8, 547 (2016).
OpenUrl
Barban, N. et al. Genome-wide analysis identifies 12 loci influencing human reproductive behavior. Nature genetics 48, 1462 (2016).
OpenUrl CrossRef PubMed

View the discussion thread.

Posted October 30, 2018.

Download PDF

Supplementary Material

Citation Tools

Subject Area

Genetics

Subject Areas

All Articles

Animal Behavior and Cognition (5204)
Biochemistry (11725)
Bioengineering (8728)
Bioinformatics (29135)
Biophysics (14940)
Cancer Biology (12052)
Cell Biology (17363)
Clinical Trials (138)
Developmental Biology (9408)
Ecology (14147)
Epidemiology (2067)
Evolutionary Biology (18272)
Genetics (12223)
Genomics (16773)
Immunology (11844)
Microbiology (28027)
Molecular Biology (11564)
Neuroscience (60841)
Paleontology (451)
Pathology (1864)
Pharmacology and Toxicology (3232)
Physiology (4940)
Plant Biology (10405)
Scientific Communication and Education (1681)
Synthetic Biology (2878)
Systems Biology (7335)
Zoology (1642)

[1] ↵
Tobler, W. R. A computer movie simulating urban growth in the Detroit region. Economic geography 46, 234–240 (1970).
OpenUrl CrossRef Web of Science

[2] ↵
Novembre, J. et al. Genes mirror geography within Europe. Nature 456, 98 (2008).
OpenUrl CrossRef PubMed Web of Science

[3] ↵
Abdellaoui, A. et al. Population structure, migration, and diversifying selection in the Netherlands. European journal of human genetics 21, 1277 (2013).
OpenUrl CrossRef PubMed

[4] Kerminen, S. et al. Fine-Scale Genetic Structure in Finland. G3: Genes, Genomes, Genetics 7, 3459–3468 (2017).
OpenUrl

[5] ↵
Leslie, S. et al. The fine-scale genetic structure of the British population. Nature 519, 309–314 (2015).
OpenUrl CrossRef PubMed

[6] ↵
Zhang, G., Muglia, L. J., Chakraborty, R., Akey, J. M. & Williams, S. M. Signatures of natural selection on genetic variants affecting complex human traits. Applied & translational genomics 2, 78–94 (2013).
OpenUrl

[7] ↵
Berg, J. J. & Coop, G. A population genetic signal of polygenic adaptation. PLoS genetics 10, e1004412 (2014).

[8] ↵
Turkheimer, E. Three laws of behavior genetics and what they mean. Current directions in psychological science 9, 160–164 (2000).
OpenUrl CrossRef Web of Science

[9] ↵
Coulter, R. & Scott, J. What motivates residential mobility? Re‐examining self‐reported reasons for desiring and making residential moves. Population, Space and Place 21, 354–371 (2015).
OpenUrl

[10] ↵
Long, J. Rural-urban migration and socioeconomic mobility in Victorian Britain. The Journal of Economic History 65, 1–35 (2005).
OpenUrl

[11] ↵
Park, C. Sacred worlds: An introduction to geography and religion. (Routledge, 2002).

[12] ↵
Rodden, J. The geographic distribution of political preferences. Annual Review of Political Science 13, 321–340 (2010).
OpenUrl CrossRef Web of Science

[13] ↵
Lewis, G. & Booth, M. Regional differences in mental health in Great Britain. Journal of Epidemiology & Community Health 46, 608–611 (1992).
OpenUrl Abstract/FREE Full Text

[14] ↵
Boyle, P. Population geography: migration and inequalities in mortality and morbidity. Progress in Human Geography 28, 767–776 (2004).
OpenUrl CrossRef Web of Science

[15] ↵
Tyrrell, J. et al. Height, body mass index, and socioeconomic status: mendelian randomisation study in UK Biobank. bmj 352, i582 (2016).
OpenUrl Abstract/FREE Full Text

[16] Marmot, M. The health gap: the challenge of an unequal world. The Lancet 386, 2442–2444 (2015).
OpenUrl

[17] ↵
Beard, E. et al. Healthier central England or North–South divide? Analysis of national survey data on smoking and high-risk drinking. BMJ open 7, e014210 (2017).

[18] ↵
Brimblecombe, N., Dorling, D. & Shaw, M. Migration and geographical inequalities in health in Britain. Social Science & Medicine 50, 861–878 (2000).
OpenUrl CrossRef PubMed

[19] ↵
Davey Smith, G. & Ebrahim, S. ‘Mendelian randomization’: can genetic epidemiology contribute to understanding environmental determinants of disease? International journal of epidemiology 32, 1–22 (2003).
OpenUrl CrossRef PubMed Web of Science

[20] ↵
Richards, J. B. & Evans, D. M. (British Medical Journal Publishing Group, 2017).

[21] ↵
Verweij, K. J., Mosing, M. A., Zietsch, B. P. & Medland, S. E. in Statistical Human Genetics 151–170 (Springer, 2012).

[22] ↵
Bulik-Sullivan, B. et al. An atlas of genetic correlations across human diseases and traits. Nature genetics 47, 1236–1241 (2015).
OpenUrl CrossRef PubMed

[23] ↵
Sudlow, C. et al. UK biobank: an open access resource for identifying the causes of a wide range of complex diseases of middle and old age. PLoS medicine 12, e1001779 (2015).

[24] ↵
Price, A. L. et al. Principal components analysis corrects for stratification in genome-wide association studies. Nature genetics 38, 904 (2006).
OpenUrl CrossRef PubMed Web of Science

[25] ↵
Wray, N. R. et al. Pitfalls of predicting complex traits from SNPs. Nature Reviews Genetics 14, 507–515 (2013).
OpenUrl CrossRef PubMed

[26] ↵
Moran, P. A. Notes on continuous stochastic phenomena. Biometrika 37, 17–23 (1950).
OpenUrl CrossRef PubMed Web of Science

[27] ↵
Haworth, S. et al. Common genetic variants and health outcomes appear geographically structured in the UK Biobank sample: Old concerns returning and their implications. bioRxiv, 294876 (2018).

[28] ↵
Ripke, S. et al. Biological insights from 108 schizophrenia-associated genetic loci. Nature 511, 421 (2014).
OpenUrl CrossRef PubMed Web of Science

[29] ↵
Ruderfer, D. M. et al. Polygenic dissection of diagnosis and clinical dimensions of bipolar disorder and schizophrenia. Molecular psychiatry 19, 1017 (2014).
OpenUrl CrossRef PubMed Web of Science

[30] ↵
Wray, N. R. et al. Genome-wide association analyses identify 44 risk variants and refine the genetic architecture of major depression. bioRxiv, 167577 (2017).

[31] ↵
Demontis, D. et al. Discovery of the first genome-wide significant risk loci for ADHD. bioRxiv, 145581 (2017).

[32] ↵
Schumann, G. et al. KLB is associated with alcohol drinking, and its gene product β-Klotho is necessary for FGF21 regulation of alcohol preference. Proceedings of the National Academy of Sciences 113, 14372–14377 (2016).
OpenUrl Abstract/FREE Full Text

[33] ↵
Niedomysl, T. How migration motives change over migration distance: evidence on variation across socio-economic and demographic groups. Regional Studies 45, 843–855 (2011).
OpenUrl CrossRef

[34] ↵
Foden, M., Fothergill, S. & Gore, T. The state of the coalfields: Economic and social conditions in the former mining communities of England, Scotland and Wales. Centre for Regional Economic and Social Research, Sheffield Hallam University (2014).

[35] ↵
Beatty, C., Fothergill, S. & Powell, R. Twenty years on: has the economy of the UK coalfields recovered? Environment and Planning A 39, 1654–1675 (2007).
OpenUrl CrossRef

[36] ↵
Townsend, P., Phillimore, P. & Beattie, A. Health and deprivation: inequality and the North. (Routledge, 1988).

[37] ↵
Okbay, A. et al. Genome-wide association study identifies 74 loci associated with educational attainment. Nature 533, 539 (2016).
OpenUrl CrossRef PubMed

[38] ↵
Menozzi, P., Piazza, A. & Cavalli-Sforza, L. Synthetic maps of human gene frequencies in Europeans. Science 201, 786–792 (1978).
OpenUrl Abstract/FREE Full Text

[39] ↵
Cummins, S. C., McKay, L. & MacIntyre, S. McDonald’s restaurants and neighborhood deprivation in Scotland and England. American journal of preventive medicine 29, 308–310 (2005).
OpenUrl CrossRef PubMed Web of Science

[40] ↵
Alford, J. R., Funk, C. L. & Hibbing, J. R. Are political orientations genetically transmitted? American political science review 99, 153–167 (2005).
OpenUrl CrossRef Web of Science

[41] Benjamin, D. J. et al. The genetic architecture of economic and political preferences. Proceedings of the National Academy of Sciences 109, 8026–8031 (2012).
OpenUrl Abstract/FREE Full Text

[42] Hatemi, P. K. & McDermott, R. The genetics of politics: discovery, challenges, and progress. Trends in Genetics 28, 525–533 (2012).
OpenUrl CrossRef PubMed Web of Science

[43] Hatemi, P. K., Medland, S. E., Morley, K. I., Heath, A. C. & Martin, N. G. The genetics of voting: An Australian twin study. Behavior genetics 37, 435 (2007).
OpenUrl CrossRef PubMed Web of Science

[44] Smith, K. et al. Biology, ideology, and epistemology: how do we know political attitudes are inherited and why should we care? American journal of political science 56, 17–33 (2012).
OpenUrl CrossRef PubMed

[45] ↵
Koenig, L. B., McGue, M., Krueger, R. F. & Bouchard, T. J. Genetic and environmental influences on religiousness: Findings for retrospective and current religiousness ratings. Journal of personality 73, 471–488 (2005).
OpenUrl CrossRef PubMed Web of Science

[46] ↵
Robinson, M. R. et al. Genetic evidence of assortative mating in humans. Nature Human Behaviour 1, 0016 (2017).
OpenUrl

[47] ↵
Hugh-Jones, D., Verweij, K. J., Pourcain, B. S. & Abdellaoui, A. Assortative mating on educational attainment leads to genetic spousal resemblance for polygenic scores. Intelligence 59, 103–108 (2016).
OpenUrl

[48] ↵
Bycroft, C. et al. Genome-wide genetic data on~ 500,000 UK Biobank participants. Nature 562, 203–209 (2018).
OpenUrl CrossRef

[49] ↵
Yang, J., Lee, S. H., Goddard, M. E. & Visscher, P. M. GCTA: a tool for genome-wide complex trait analysis. The American Journal of Human Genetics 88, 76–82 (2011).
OpenUrl CrossRef PubMed

[50] ↵
Consortium, G. P. A global reference for human genetic variation. Nature 526, 68–74 (2015).
OpenUrl CrossRef PubMed

[51] ↵
Abraham, G., Qiu, Y. & Inouye, M. FlashPCA2: principal component analysis of Biobank-scale genotype datasets. Bioinformatics 33, 2776–2778 (2017).
OpenUrl CrossRef PubMed

[52] ↵
McCarthy, S. et al. A reference panel of 64,976 haplotypes for genotype imputation. Nature genetics 48, 1279 (2016).
OpenUrl CrossRef PubMed

[53] ↵
Purcell, S. et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. The American Journal of Human Genetics 81, 559–575 (2007).
OpenUrl CrossRef PubMed

[54] ↵
Sweet, D. (2011).

[55] ↵
Loh, P.-R. et al. Efficient Bayesian mixed-model analysis increases association power in large cohorts. Nature genetics 47, 284–290 (2015).
OpenUrl CrossRef PubMed

[56] ↵
Yang, J., Zaitlen, N. A., Goddard, M. E., Visscher, P. M. & Price, A. L. Advantages and pitfalls in the application of mixed-model association methods. Nature genetics 46, 100–106 (2014).
OpenUrl CrossRef PubMed

[57] ↵
Bulik-Sullivan, B. K. et al. LD Score regression distinguishes confounding from polygenicity in genome-wide association studies. Nature genetics 47, 291–295 (2015).
OpenUrl CrossRef PubMed

[58] ↵
Finucane, H. K. et al. Partitioning heritability by functional annotation using genome-wide association summary statistics. Nature genetics 47, 1228–1235 (2015).
OpenUrl CrossRef PubMed

[59] ↵
Price, A. L., Zaitlen, N. A., Reich, D. & Patterson, N. New approaches to population stratification in genome-wide association studies. Nature Reviews Genetics 11, 459 (2010).
OpenUrl CrossRef PubMed Web of Science

[60] ↵
Kvale, M. N. et al. Genotyping informatics and quality control for 100,000 subjects in the Genetic Epidemiology Research on Adult Health and Aging (GERA) cohort. Genetics 200, 1051–1060 (2015).
OpenUrl Abstract/FREE Full Text

[61] ↵
Dudbridge, F. Power and predictive accuracy of polygenic risk scores. PLoS genetics 9, e1003348 (2013).

[62] ↵
Fry, A. et al. Comparison of sociodemographic and health-related characteristics of UK Biobank participants with those of the general population. American journal of epidemiology 186, 1026–1034 (2017).
OpenUrl CrossRef PubMed

[63] De Moor, M. H. et al. Meta-analysis of genome-wide association studies for personality. Molecular psychiatry 17, 337 (2012).
OpenUrl CrossRef PubMed Web of Science

[64] Lo, M.-T. et al. Genome-wide analyses for personality traits identify six genomic loci and show correlations with psychiatric disorders. Nature genetics 49, 152 (2017).
OpenUrl

[65] Duncan, L. et al. Significant locus and metabolic genetic correlations revealed in genome-wide association study of anorexia nervosa. American Journal of Psychiatry 174, 850–858 (2017).
OpenUrl

[66] Consortium, A. S. D. W. G. o. T. P. G. et al. Meta-analysis of GWAS of over 16,000 individuals with autism spectrum disorder highlights a novel locus at 10q24. 32 and a significant overlap with schizophrenia. Molecular autism 8, 1–17 (2017).
OpenUrl

[67] Lambert, J.-C. et al. Meta-analysis of 74,046 individuals identifies 11 new susceptibility loci for Alzheimer’s disease. Nature genetics 45, 1452 (2013).
OpenUrl CrossRef PubMed

[68] Furberg, H. et al. Genome-wide meta-analyses identify multiple loci associated with smoking behavior. Nature genetics 42, 441 (2010).
OpenUrl CrossRef PubMed Web of Science

[69] Stringer, S. et al. Genome-wide association study of lifetime cannabis use based on a large meta-analytic sample of 32 330 subjects from the International Cannabis Consortium. Translational psychiatry 6, e769 (2017).

[70] Cornelis, M. C. et al. Genome-wide meta-analysis identifies six novel loci associated with habitual coffee consumption. Molecular psychiatry 20, 647 (2015).
OpenUrl CrossRef PubMed

[71] Nikpay, M. et al. A comprehensive 1000 Genomes–based genome-wide association meta-analysis of coronary artery disease. Nature genetics 47, 1121 (2015).
OpenUrl CrossRef PubMed

[72] Morris, A. P. et al. Large-scale association analysis provides insights into the genetic architecture and pathophysiology of type 2 diabetes. Nature genetics 44, 981 (2012).
OpenUrl CrossRef PubMed

[73] Elks, C. E. et al. Thirty new loci for age at menarche identified by a meta-analysis of genome-wide association studies. Nature genetics 42, 1077 (2010).
OpenUrl CrossRef PubMed

[74] Day, F. R. et al. Large-scale genomic analyses link reproductive aging to hypothalamic signaling, breast cancer susceptibility and BRCA1-mediated DNA repair. Nature genetics 47, 1294 (2015).
OpenUrl CrossRef PubMed

[75] Wood, A. R. et al. Defining the role of common variation in the genomic and biological architecture of adult human height. Nature genetics 46, 1173 (2014).
OpenUrl CrossRef PubMed

[76] Locke, A. E. et al. Genetic studies of body mass index yield new insights for obesity biology. Nature 518, 197 (2015).
OpenUrl CrossRef PubMed

[77] Lu, Y. et al. New loci for body fat percentage reveal link between adiposity and cardiometabolic disease risk. Nature communications 7, 10495 (2016).
OpenUrl

[78] Shungin, D. et al. New genetic loci link adipose and insulin biology to body fat distribution. Nature 518, 187 (2015).
OpenUrl CrossRef PubMed Web of Science

[79] Benyamin, B. et al. Childhood intelligence is heritable, highly polygenic and associated with FNBP1L. Molecular psychiatry 19, 253 (2014).
OpenUrl CrossRef PubMed Web of Science

[80] Sniekers, S. et al. Genome-wide association meta-analysis of 78,308 individuals identifies new loci and genes influencing human intelligence. Nature Genetics (2017).

[81] Hill, W. D. et al. Molecular genetic contributions to social deprivation and household income in UK Biobank. Current Biology 26, 3083–3089 (2016).
OpenUrl CrossRef PubMed

[82] Okbay, A. et al. Genetic variants associated with subjective well-being, depressive symptoms, and neuroticism identified through genome-wide analyses. Nature genetics 48, 624 (2016).
OpenUrl CrossRef PubMed

[83] Deary, V. et al. Genetic contributions to self-reported tiredness. Molecular psychiatry 23, 609 (2018).
OpenUrl

[84] Churchhouse, C. & Neale, B. Rapid GWAS of Thousands of Phenotypes for 337,000 Samples in the UK Biobank. Neale Lab (2017).

[85] Clarke, T.-K. et al. Genome-wide association study of alcohol consumption and genetic overlap with other health-related traits in UK Biobank (N= 112 117). Molecular psychiatry 22, 1376 (2017).
OpenUrl CrossRef PubMed

[86] Walters, R. K. et al. Trans-ancestral GWAS of alcohol dependence reveals common genetic underpinnings with psychiatric disorders. bioRxiv, 257311 (2018).

[87] Harris, S. E. et al. Molecular genetic contributions to self-rated health. International journal of epidemiology 46, 994–1009 (2016).
OpenUrl

[88] Pilling, L. C. et al. Human longevity is influenced by many genetic variants: evidence from 75,000 UK Biobank participants. Aging (Albany NY) 8, 547 (2016).
OpenUrl

[89] Barban, N. et al. Genome-wide analysis identifies 12 loci influencing human reproductive behavior. Nature genetics 48, 1462 (2016).
OpenUrl CrossRef PubMed