Measuring Intolerance to Mutation in Human Genetics

Zachary L. Fuller; Jeremy J. Berg; Hakhamanesh Mostafavi; Guy Sella; Molly Przeworski

doi:10.1101/382481

Abstract

In numerous applications, from working with animal models to mapping the genetic basis of human disease susceptibility, it is useful to know whether a single disrupting mutation in a gene is likely to be deleterious^1–4. With this goal in mind, a number of measures have been developed to identify genes in which protein-truncating variants (PTVs), or other types of mutations, are absent or kept at very low frequency in large numbers of healthy individuals—genes that appear intolerant to mutation^3,5–9. One measure in particular, pLI, has been widely adopted⁷. By contrasting the observed versus expected numbers of PTVs, it aims to classify genes into three categories, labelled null, recessive and haploinsufficient⁷. Here we discuss how pLI and similar measures relate to population genetic parameters and why they reflect the strength of selection acting on heterozygotes, rather than dominance or haploinsufficiency.

Experimental biologists and human geneticists are often interested in whether a single disrupting mutation, be it a protein-truncating variant (PTV) or a missense mutation, is likely to have a phenotypic effect. A related question is whether a single disrupting mutation is likely to have a deleterious effect, that is whether it will lead to a reduction in fitness of its carrier. While the terms haploinsufficient and dominant are often used interchangeably, the relationship between effects on phenotypes and on fitness is not straight-forward. For instance, a single mutation could lead to a clinically important phenotype, indicating that the gene is haploinsufficient or that there is a gain of function, yet have small or negligible effects on fitness unless homozygous. Examples include ELN and BRCA2, genes in which a single PTV leads to a severe disease, but where the fitness effect on heterozygotes is likely quite small because the disease is late onset (while homozygote PTVs are lethal)^10–13. Conversely, a mutation in a highly pleiotropic gene can have very weak phenotypic effects, yet inflict a severe cost on fitness.

Following common practice in human genetics (e.g.,⁴), we refer to genes in which a single disrupting mutation has a discernable phenotypic effect in heterozygotes as haploinsufficient (at least with regard to that phenotype); we note, however, that a phenotypic effect of a single mutation could also be due to a gain of function. In turn, we describe genes in which a single disrupting mutation has a fitness effect in heterozygotes as at least partially dominant. More precisely, following the convention in population genetics, we denote the fitnesses 1, 1 – hs, and 1 – s as corresponding, respectively, to genotypes AA, AD, and DD, where D is the deleterious allele, h is the dominance coefficient, and s is the selection coefficient. A mutation is completely recessive if h is equal to 0 and at least partially dominant if h is not near 0. This definition of dominance differs from one often used in population genetics (where dominance is defined as h > 0.5), but has more direct relevance for the expected frequency of deleterious mutations¹⁴ (Box 1).

Estimating the strength of selection acting on a gene, as summarized by the selection coefficient (s) and dominance effects (h) of mutations, has a long tradition in population genetics^15–18. In model organisms, these efforts have taken the form of mutation accumulation experiments and assays of gene deletion libraries^15,19–21; in humans and other species, these parameters have been inferred from polymorphism data^22–26. The statistical inferences are based on the notion of a mutation-selection balance, namely that the frequencies of deleterious alleles reflect a balance between the rate at which they are purged from the population and the rate at which they are replenished by mutation. Mutations with larger hs are purged more effectively and hence are expected at lower frequencies in the population—or, equivalently, are more likely to be absent from large samples (Box 1). Therefore, one way to identify genes whose loss is likely to reduce fitness is to assess whether disrupting mutations are found at lower frequencies than expected under some sensible null model.

Deleterious alleles are introduced into the population by mutation, then change in frequency due to the combined effects of genetic drift, demography and natural selection. Unless a disease mutation confers an advantage in some environments (e.g., the sickle cell allele in populations inflicted by malaria²⁷), the frequency at which it will be found it in a population reflects a balance between the rate at which it is introduced by mutation and removed by drift and purifying selection^28–30.

This phenomenon is referred to as mutation-selection-drift balance and modeled as follows (e.g., see³¹). Let u be the mutation rate from the wild type allele A to deleterious allele D. This mutation rate can be defined per site or per gene, by summing the mutation rate to deleterious alleles across sites (this simple summing implicitly assumes that there is no complementation and compound heterozygotes for deleterious alleles have the same fitness effects as homozygotes³²). The fitnesses for diploid individuals carrying genes with wild-type (A) or deleterious (D) alleles are given by where s is the selection coefficient measuring the fitness of DD relative to AA and h is the dominance coefficient, such that hs is the reduction in fitness of AD relative to AA.

In the limit of an infinite, panmictic population (i.e., ignoring genetic drift and inbreeding), when h>0 (and hs >> u), the equilibrium frequency of the deleterious D allele, q, is approximately²⁹:

Notably, when h>0, the equilibrium frequency q is determined by the strength of selection in heterozygotes, i.e., the joint effect of hs, because the frequency of deleterious homozygotes is too low for selection on them to have an appreciable effect. Hence, in this approximation, for a given hs, different combinations of h and s will yield the same value of q.

For a completely recessive allele (h=0), in turn, q is well approximated by²⁹:

Here, the equilibrium frequency is necessarily determined by selection in homozygotes. In this limit of an infinite population size, the same frequency q of a recessive allele with s >0 can also arise from a dominant allele for some value of hs >0.

In a finite population, there is a distribution of deleterious allele frequencies rather than a single (deterministic) value for given values of h and s. This distribution was derived for a constant population size by Wright³⁰ and is again a function of hs jointly (assuming that 2N_ehs >> 1 and setting aside the case of sustained, high levels of inbreeding³³). The resulting distribution can be highly variable, reflecting both stochasticity in the mutation process and the variance due to the evolutionary process (i.e., due to genetic drift). Dramatic changes in population size, as experienced by human populations, can also have a marked effect on the distribution of deleterious alleles. Regardless of these complications, it remains the case that distinguishing complete recessivity (h = 0) from small hs may not be feasible and that, other than for complete recessivity, the expected allele frequency is a function of hs, not h and s separately¹⁴.

To our knowledge, this population genetic approach was introduced as a tool for prioritizing human disease genes by Petrovski et al.⁵, who ranked genes by comparing the observed number of common PTVs and missense mutations to the total number of observed variants. This statistic was then supplemented by a number of others^6,34–36, notably pLI, which is defined as the probability of being loss of function intolerant⁷. pLI is derived from a comparison of the observed number of PTVs in a sample to the number expected in the absence of fitness effects (i.e., under neutrality), given an estimated mutation rate for the gene. To build this score, Lek et al.⁷ assumed that the number of PTVs in a gene is Poisson distributed with mean λM, where M is the expected number of PTVs under neutrality (estimated for each gene based on a mutation model ⁶ and the observed synonymous polymorphism counts). The authors considered that a gene can be neutral with respect to fitness (with λ_Null = 1), reces-sive (λ_Rec = 0.463) or haploinsufficient (λ_H1 = 0.089). The fixed values of λ_Rec and λ_H1 were obtained from the average reduction in the number of observed PTVs relative to a neutral expectation in genes classified as recessive and haploinsufficient, respectively; the classification was based on the phenotypic effects of mutations in the ClinGen dosage sensitivity gene list and a hand curated gene set of Mendelian disorders³⁷. Using this model, Lek et al.⁷ first estimated the proportion of human genes in each of their three categories and then, for any given gene, obtained the maximum a posteriori probability that it belongs to each of the categories. Genes with high probability (set at > 0.9) of belonging to the set parameterized by λ_H1 were classified as extremely loss of function intolerant⁷.

The pLI measure has been broadly applied in human genetics to help identify genes in which a single disrupting mutation is likely of clinical significance^4,38–45. pLI is also increasingly used in clinical annotation and in databases of mouse models as indicative of haploinsufficiency and dosage sensitivity^46–50. In fact, however, pLI and related measures are not directly informative about dominance effects on fitness, let alone about the degree of haploinsufficiency with respect to a phenotype, and instead reflect only the strength of selection acting in heterozygotes.

The reason is that unless h is vanishingly small (or long-term inbreeding levels are very high), a reduction in the frequency of PTVs—and hence of PTV counts—is indicative of the strength of selection acting on heterozygotes, hs, and not of the two parameters h and s separately. This result derives from mutation-selection-drift balance theory developed by Haldane^28,29, Wright³⁰, and others⁵¹ (see Box 1). Intuitively, it reflects the fact that when there are fitness effects in heterozygotes, even subtle, deleterious alleles are kept at low frequency in the population, such that homozygotes for the deleterious allele are extremely rare; the efficiency with which the allele is purged then depends almost entirely on its effects in heterozygotes. Thus, the frequencies of PTVs—and therefore pLI and related measures—depend on the strength of selection acting on heterozygotes.

To illustrate this point, we modelled how the observed count of PTVs in a gene of typical length (and hence pLI) depends on h and s, under a constant size population (Fig 1A) as well as under a more realistic model for human demographic history⁵² (Fig 1B). As can be seen, markedly different combinations of h and s lead to indistinguishable distributions of PTV counts (and hence of pLI values), so long as hs is the same (Fig 1A, B). More generally, the probability of observing a specific PTV count is maximized along a ridge corresponding to combinations of h and s that result in a given hs value (Fig 1C). One implication is that pLI can be near 1 when the dominance coefficient h is small, provided s is sufficiently large—and more generally that pLI is not indicative of dominance or haploinsufficiency per se.

Although these considerations make clear that pLI should be thought of as reflecting hs, it was not designed to be an estimator of this parameter, and has several problematic features as such. First, for a given value of hs, the expected value of pLI depends on gene length (Fig 1D). Second, for a typical gene length and a wide range of realistic values of hs, the distribution of pLI is highly variable and bimodal, covering most of the range from 0 to 1 (Fig 1E). Consequently, two genes with the same hs can be assigned radically different pLI values and conversely, the same pLI value can reflect markedly different hs values (Fig 1E). Outside this range of hs values, pLI is almost uninformative about the underlying parameter: below, pLI is ~ 0 for any value of hs and above it, when hs is large (approximately > 10%), it is always ~ 1. This property of pLI taking values of either 0 or 1 is only worsened with increasing gene length (Fig 1D). Thus, if the goal is to learn about selective effects in heterozygotes, a direct estimate of hs under a plausible demographic model is preferable (e.g., ⁹), together with a measure of uncertainty.

Recasting pLI in a population genetic framework further helps to understand why the recessive assignments are less reliable⁷. Lek et al.⁷ aim to divide genes into three categories, two of which correspond to hs > 0 (pLI) and hs = 0, s = 0 (pNULL). Logically, the remaining category pREC should include the cases where hs = 0 but s > 0, i.e., complete reces-sivity, in which selection acts exclusively against homozygotes (Box 1). Regardless of the method used, however, it can be infeasible to distinguish this category from the hs > 0 case, because the same expected allele frequency (and hence PTV count) can arise in cases when h = 0 and when hs > 0 but small (see Box 1 and Fig 1F). As one example, ignoring genetic drift, for a typical mutation rate to disease alleles per gene of u = 10⁻⁶, the frequency of disease alleles would be 1% whether h = 0 and s – 10⁻² or h =1 and s = 10⁻⁴ (Box 1). In other words, strongly deleterious, completely recessive PTVs are hard to distinguish from those that are weakly selected and at least partially dominant.

Why then, in practice, do pLI and related measures appear to successfully distinguish genes classified by clinicians as recessive vs dominant based on Mendelian disease phenotypes^4,7,40? Mendelian disease genes consist mostly of cases in which mutations are known to cause a highly deleterious outcome, i.e., for which there is prior knowledge that s is likely to be large (even close to 1). When s is that large, a gene will be classified by pLI as haploinsufficient so long as h is not tiny, i.e., so long as fitness effects in heterozygotes are not small. For most genes, however, there is no prior knowledge about s, and in that case, pLI—or any measure based on the frequency of PTVs—cannot reliably distinguish recessivity from dominance, let alone identify haploin-sufficiency.

Figure 1: Properties of pLI.

(A & B) Different combinations of h and s with the same hs value yield highly similar distributions of pLI. We considered PTVs arising in a human gene of typical length, i.e., with 225 PTV mutational opportunities, for (A) a population of constant size and (B) a plausible model of historical changes in the effective population size of Europeans⁵². We assumed that mutations arise at rate u = 1.5 × 10⁻⁸ per mutational opportunity⁵³; while this value of u is only approximate, the qualitative conclusions do not depend on the precise value of u. Subsequent generations are formed by Wright-Fisher sampling with selection modeled by choosing parents for each generation according to their fitnesses. We assumed no intragenic recombination and that a PTV mutation can only occur on a background free of other PTV mutations (as is highly likely). For the demographic model⁵², each simulation begins with a constant population size N of 14,448 (the ancestral size inferred by⁵²) and a burn-in of > 10N generations, before the first population size change 55,940 generations ago (following⁵⁴). For the constant population model, we set N = 100, 000 (reflective of the more recent time period relevant to the dynamics of deleterious mutations⁵⁵) and ran each simulation for a period of 10N generations. PTV frequencies are estimated using a sample size of 33,370 individuals drawn at present, to match the number of non-Finnish Europeans in ExAC⁷. From these simulations, we obtained the mean number of PTVs under neutrality, i.e., for s = 0, by averaging over 10⁶ simulations. We then ran 10⁶ replicates for each combination of s and h, recording the distinct number of PTVs that are segregating at present. For each replicate, we calculated the ratio λ of observed counts of distinct segregating PTV variants to the expected number. Following the procedure detailed in⁷, we then calculated pLI using the observed λ and the estimates of the mixing weights for each set obtained from ExAC⁷ (π_Null = 0.208, ¶r_ćc = 0.489, and π_H1 = 0.304). The insets in each figure show the density of the distribution of pLI scores. We note that since we used the true expected number of PTVs under neutrality, rather than an estimate (as is the case in practice⁷), we are somewhat underestimating the variability in pLI scores.

(C) The probability of observing a specific PTV count is maximized for a given value of hs. The figure depicts the probability of observing the PTV count for a gene of typical length generated from a single simulation of 33,370 individuals, with parameters s = 0.10, h = 0.90, u =1.5 × 10⁻⁸ per mutational opportunity, and assuming a plausible model of population size changes ⁵² (see Fig 1A)—in this case, 3 PTVs. We estimated the likelihood of h and s, i.e., the probability of this “observed” value, for a grid of h and s values, using 10⁶ replicates for each parameter combination.

(D) Behavior of pLI as a function of hs. We simulated the counts of PTVs under a plausible model of population size changes in Europeans⁵² (explained in Fig 1A), for a range of hs values. For each run, we calculated pLI using the observed number of PTVs from each simulation and the expected number obtained from averaging over 10⁶ simulations with hs = 0. The gray circles depict the average pLI over 10⁶ simulations for each value of hs, shown on the x-axis (on a log₁₀ scale), in a human gene of typical length; the dark purple line is the loess smoothed curve over all simulations. The shaded area represents the 2.5^th and 97.5^th percentiles of pLI scores for each value of hs. The cyan and yellow lines are the loess smoothed curves for simulations in a human gene with half and twice the number of PTV mutational opportunities, respectively.

(E) For a given hs, pLI scores are highly variable. Considering s = 0.1, h = 0.5, and u = 1.5 × 10⁻⁸ per mutational opportunity, we generated the distribution of pLI scores in a gene of typical length, with the expected number of PTVs obtained by averaging over 10⁶ simulations with s = 0. The red curve depicts the pLI score as a function of the number of observed PTVs (calculated as in⁷. The histogram represents the distribution of simulated PTV counts under a plausible model of historical population size changes in Europeans⁵² (details described in Fig 1A), with darker shaded bars indicating pLI values that would be classified as “extremely loss-of-function intolerant” ⁷. The inset shows the density of pLI scores.

(F) Complete recessivity (h = 0) can lead to similar PTV counts as weak selection on heterozygotes (hs > 0). As in Fig 1B, we simulated the counts of PTVs in a typical human gene under a plausible model of population size changes in Europeans⁵², for different combinations of h and s and u = 1.5 × 10⁻⁸ per mutational opportunity. The distribution labeled as neutral depicts the counts of PTVs in simulations with h and s both equal to 0. Each distribution shows the results from 10⁶ simulations. Dashed lines indicate the mean for each distribution.

In summary, measures such as pLI and approaches based on related data summaries^{3,4,6,9,34,56,57} hold great promise for prioritizing genes in which mutations are likely to be harmful⁵ and learning about the fitness effects of mutations in heterozygotes⁹. Recasting these measures in terms of underlying population genetic parameters provides a natural framework for their interpretation and for the development of more reliable inferences.

Acknowledgments

We thank G. Coop, M.B. Eisen, M. Hurles, J.K. Pritchard, and Y. Shen for helpful discussions. This work was supported by GM128318 to ZF, GM126787 to JB, GM121372 to MP and GM115889 to GS. We acknowledge computing resources from Columbia University’s Shared Research Computing Facility project, which is supported by NIH Research Facility Improvement Grant 1G20RR030893-01, and associated funds from the New York State Empire State Development, Division of Science Technology and Innovation (NYS-TAR) Contract C090171.

References

[1].↵
J. A. Blake, C. J. Bult, J. T. Eppig, J. A. Kadin, and J. E. Richardson, The Mouse Genome Database genotypes::phenotypes, Nucleic Acids Research 37, D712 (2009), ISSN 0305-1048.
OpenUrl CrossRef PubMed Web of Science
[2].
N. Huang, I. Lee, E. M. Marcotte, and M. E. Hurles, Characterising and Predicting Haploinsuf-ficiency in the Human Genome, PLOS Genetics 6, e1001154 (2010), ISSN 1553-7404.
OpenUrl
[3].↵
K. Eilbeck, A. Quinlan, and M. Yandell, Settling the score: variant prioritization and Mendelian disease, Nature Reviews. Genetics 18, 599 (2017), ISSN 1471-0064.
OpenUrl CrossRef
[4].↵
I. Bartha, J. d. Iulio, J. C. Venter, and A. Telenti, Human gene essentiality, Nature Reviews Genetics 19, 51 (2018), ISSN 1471-0064.
OpenUrl CrossRef
[5].↵
S. Petrovski, Q. Wang, E. L. Heinzen, A. S. Allen, and D. B. Goldstein, Genic Intolerance to Functional Variation and the Interpretation of Personal Genomes, PLOS Genetics 9, e1003709 (2013), ISSN 1553-7404.
OpenUrl
[6].↵
K. E. Samocha, E. B. Robinson, S. J. Sanders, C. Stevens, A. Sabo, L. M. McGrath, J. A. Kos-micki, K. Rehnstrm, S. Mallick, A. Kirby, et al., A framework for the interpretation of de novo mutation in human disease, Nature Genetics 46, 944 (2014), ISSN 1061-4036.
OpenUrl CrossRef PubMed
[7].↵
M. Lek, K. J. Karczewski, E. V. Minikel, K. E. Samocha, E. Banks, T. Fennell, A. H. ODonnell-Luria, J. S. Ware, A. J. Hill, B. B. Cummings, et al., Analysis of protein-coding genetic variation in 60,706 humans, Nature 536, 285 (2016), ISSN 0028-0836.
OpenUrl CrossRef PubMed
[8].
Y.-F. Huang, B. Gulko, and A. Siepel, Fast, scalable prediction of deleterious noncoding variants from functional and population genomic data, Nature Genetics 49, 618 (2017), ISSN 1546-1718.
OpenUrl CrossRef PubMed
[9].↵
C. A. Cassa, D. Weghorn, D. J. Balick, D. M. Jordan, D. Nusinow, K. E. Samocha, A. O’Donnell-Luria, D. G. MacArthur, M. J. Daly, D. R. Beier, et al., Estimating the selective effects of heterozygous protein-truncating variants from human exome data, Nature Genetics 49, 806 (2017), ISSN 1061-4036.
OpenUrl
[10].↵
M. C. Raybould, A. J. Birley, and M. Hultn, Molecular variation of the human elastin (ELN) gene in a normal human population, Annals of Human Genetics 59, 149 (1995), ISSN 1469-1809.
OpenUrl PubMed
[11].
R. Wooster, G. Bignell, J. Lancaster, S. Swift, S. Seal, J. Mangion, N. Collins, S. Gregory, C. Gumbs, G. Micklem, et al., Identification of the breast cancer susceptibility gene BRCA2, Nature 378, 789 (1995), ISSN 1476-4687.
OpenUrl CrossRef PubMed Web of Science
[12].
J. E. Wagenseil, C. H. Ciliberto, R. H. Knutsen, M. A. Levy, A. Kovacs, and R. P. Mecham, The importance of elastin to aortic development in mice, American Journal of Physiology - Heart and Circulatory Physiology 299, H257 (2010), ISSN 0363-6135.
OpenUrl
[13].↵
R. Roy, J. Chun, and S. N. Powell, BRCA1 and BRCA2: different roles in a common pathway of genome protection, Nature reviews. Cancer 12, 68 (2011), ISSN 1474-175X.
OpenUrl CrossRef PubMed
[14].↵
Y. B. Simons, M. C. Turchin, J. K. Pritchard, and G. Sella, The deleterious mutation load is insensitive to recent population history, Nature Genetics 46, 220 (2014), ISSN 1061-4036.
OpenUrl CrossRef PubMed
[15].↵
M. J. Simmons and J. F. Crow, Mutations affecting fitness in Drosophila populations, Annual Review of Genetics 11, 49 (1977), ISSN 0066-4197.
OpenUrl CrossRef PubMed Web of Science
[16].
P. D. Keightley, The distribution of mutation effects on viability in Drosophila melanogaster, Genetics 138, 1315 (1994), ISSN 0016-6731.
OpenUrl Abstract/FREE Full Text
[17].
H. W. Deng and M. Lynch, Estimation of Deleterious-Mutation Parameters in Natural Populations, Genetics 144, 349 (1996), ISSN 0016-6731.
OpenUrl Abstract/FREE Full Text
[18].↵
H. A. Orr, Fitness and its role in evolutionary genetics, Nature reviews. Genetics 10, 531 (2009), ISSN 1471-0056.
OpenUrl CrossRef PubMed Web of Science
[19].↵
T. Mukai, S. I. Chigusa, L. E. Mettler, and J. F. Crow, Mutation Rate and Dominance of Genes Affecting Viability in DROSOPHILA MELANOGASTER, Genetics 72, 335 (1972), ISSN 0016-6731.
OpenUrl Abstract/FREE Full Text
[20].
N. Phadnis and J. D. Fry, Widespread Correlations Between Dominance and Homozygous Effects of Mutations: Implications for Theories of Dominance, Genetics 171, 385 (2005), ISSN 0016-6731.
OpenUrl Abstract/FREE Full Text
[21].↵
A. F. Agrawal and M. C. Whitlock, Inferences About the Distribution of Dominance Drawn From Yeast Gene Knockout Data, Genetics 187, 553 (2011), ISSN 0016-6731, 1943–2631.
OpenUrl
[22].↵
S. H. Williamson, R. Hernandez, A. Fledel-Alon, L. Zhu, R. Nielsen, and C. D. Bustamante, Simultaneous inference of selection and population growth from patterns of variation in the human genome, Proceedings of the National Academy of Sciences of the United States of America 102, 7882 (2005), ISSN 0027-8424.
OpenUrl Abstract/FREE Full Text
[23].
A. Eyre-Walker, M. Woolfit, and T. Phelps, The Distribution of Fitness Effects of New Deleterious Amino Acid Mutations in Humans, Genetics 173, 891 (2006), ISSN 0016-6731.
OpenUrl Abstract/FREE Full Text
[24].
A. R. Boyko, S. H. Williamson, A. R. Indap, J. D. Degenhardt, R. D. Hernandez, K. E. Lohmueller, M. D. Adams, S. Schmidt, J. J. Sninsky, S. R. Sunyaev, et al., Assessing the Evolutionary Impact of Amino Acid Mutations in the Human Genome, PLOS Genetics 4, e1000083 (2008), ISSN 1553-7404.
OpenUrl
[25].
F. Racimo and J. G. Schraiber, Approximation to the Distribution of Fitness Effects across Functional Categories in Human Segregating Polymorphisms, PLOS Genetics 10, e1004697 (2014), ISSN 1553-7404.
OpenUrl
[26].↵
B. Y. Kim, C. D. Huber, and K. E. Lohmueller, Inference of the Distribution of Selection Coefficients for New Nonsynonymous Mutations Using Large Samples, Genetics 206, 345 (2017), ISSN 1943-2631.
OpenUrl Abstract/FREE Full Text
[27].↵
F. B. Piel, A. P. Patil, R. E. Howes, O. A. Nyangiri, P. W. Gething, T. N. Williams, D. J. Weatherall, and S. I. Hay, Global distribution of the sickle cell gene and geographical confirmation of the malaria hypothesis, Nature Communications 1, 104 (2010), ISSN 2041-1723.
OpenUrl
[28].↵
J. B. S. Haldane, A Mathematical Theory of Natural and Artificial Selection, Part V: Selection and Mutation, Mathematical Proceedings of the Cambridge Philosophical Society 23, 838 (1927), ISSN 1469-8064, 0305–0041.
OpenUrl
[29].↵
J. B. S. Haldane, The Effect of Variation of Fitness, The American Naturalist 71, 337 (1937), ISSN 0003-0147.
OpenUrl CrossRef Web of Science
[30].↵
S. Wright, The Distribution of Gene Frequencies in Populations, Proceedings of the National Academy of Sciences of the United States of America 23, 307 (1937), ISSN 0027-8424.
OpenUrl FREE Full Text
[31].↵
J. H. Gillespie, Population Genetics: A Concise Guide (JHU Press, 2004), ISBN 978-0-8018-80087.
[32].↵
A. G. Clark, Mutation-selection balance with multiple alleles, Genetica 102–103, 41 (1998), ISSN 0016-6707.
[33].↵
1. W. H. Freeman
B. Charlesworth and D. Charlesworth, Elements of Evolutionary Genetics ( W. H. Freeman, Greenwood Village, Colo, 2010), 1st ed., ISBN 978-09815194-2-5.
[34].↵
I. Bartha, A. Rausell, P. J. McLaren, P. Mohammadi, M. Tardaguila, N. Chaturvedi, J. Fellay, and A. Telenti, The Characteristics of Heterozygous Protein Truncating Variants in the Human Genome, PLOS Computational Biology 11, e1004647 (2015), ISSN 1553-7358.
OpenUrl
[35].
J. Steinberg, F. Honti, S. Meader, and C. Webber, Haploinsufficiency predictions without study bias, Nucleic Acids Research 43, e101 (2015), ISSN 0305-1048.
OpenUrl CrossRef PubMed
[36].↵
J. Fadista, N. Oskolkov, O. Hansson, L. Groop, and J. Hancock, LoFtool: a gene intolerance score based on loss-of-function variants in 60 706 individuals, Bioinformatics 33, 471 (2017), ISSN 1367-4803.
OpenUrl
[37].↵
R. Blekhman, O. Man, L. Herrmann, A. R. Boyko, A. Indap, C. Kosiol, C. D. Bustamante, K. M. Teshima, and M. Przeworski, Natural selection on genes that underlie human disease susceptibility, Current biology: CB 18, 883 (2008), ISSN 0960-9822.
OpenUrl
[38].↵
S. H. Lelieveld, M. R. F. Reijnders, R. Pfundt, H. G. Yntema, E.-J. Kamsteeg, P. d. Vries, B. B. A. d. Vries, M. H. Willemsen, T. Kleefstra, K. Lhner, et al., Meta-analysis of 2,104 trios provides support for 10 new genes for intellectual disability, Nature Neuroscience 19, 1194 (2016), ISSN 1546-1726.
OpenUrl CrossRef
[39].
D. M. Ruderfer, T. Hamamsy, M. Lek, K. J. Karczewski, D. Kavanagh, K. E. Samocha, E. A. Consortium, M. J. Daly, D. G. MacArthur, M. Fromer, et al., Patterns of genic intolerance of rare copy number variation in 59,898 human exomes, Nature Genetics 48, 1107 (2016), ISSN 1546-1718.
OpenUrl CrossRef PubMed
[40].↵
J. A. Kosmicki, K. E. Samocha, D. P. Howrigan, S. J. Sanders, K. Slowikowski, M. Lek, K. J. Karczewski, D. J. Cutler, B. Devlin, K. Roeder, et al., Refining the role of de novo protein-truncating variants in neurodevelopmental disorders by using population reference samples, Nature Genetics 49, 504 (2017), ISSN 1546-1718.
OpenUrl CrossRef
[41].
C. M. Skraban, C. F. Wells, P. Markose, M. T. Cho, A. I. Nesbitt, P. Y. B. Au, A. Begtrup, J. A. Bernat, L. M. Bird, K. Cao, et al., WDR26 Haploinsufficiency Causes a Recognizable Syndrome of Intellectual Disability, Seizures, Abnormal Gait, and Distinctive Facial Features, The American Journal of Human Genetics 101, 139 (2017), ISSN 0002-9297.
OpenUrl
[42].
P. Stankiewicz, T. N. Khan, P. Szafranski, L. Slattery, H. Streff, F. Vetrini, J. A. Bernstein, C. W. Brown, J. A. Rosenfeld, S. Rednam, et al., Haploinsufficiency of the Chromatin Remodeler BPTF Causes Syndromic Developmental and Speech Delay, Postnatal Microcephaly, and Dysmorphic Features, The American Journal of Human Genetics 101, 503 (2017), ISSN 0002-9297.
OpenUrl CrossRef
[43].
H. T. Nguyen, J. Bryois, A. Kim, A. Dobbyn, L. M. Huckins, A. B. Munoz-Manchado, D. M. Ruderfer, G. Genovese, M. Fromer, X. Xu, et al., Integrated Bayesian analysis of rare exonic variants to identify risk genes for schizophrenia and neurodevelopmental disorders, Genome Medicine 9, 114 (2017), ISSN 1756-994X.
OpenUrl
[44].
M. Zarrei, D. L. Fehlings, K. Mawjee, L. Switzer, B. Thiruvahindrapuram, S. Walker, D. Merico, G. Casallo, M. Uddin, J. R. MacDonald, et al., De novo and rare inherited copy-number variations in the hemiplegic form of cerebral palsy, Genetics in Medicine 20, 172 (2018), ISSN 1530-0366.
OpenUrl
[45].↵
H. O. Heyne, T. Singh, H. Stamberger, R. A. Jamra, H. Caglayan, D. Craiu, P. D. Jonghe, R. Guerrini, K. L. Helbig, B. P. C. Koeleman, et al., De novo variants in neurodevelopmental disorders with epilepsy, Nature Genetics 1 (2018), ISSN 1546-1718.
[46].↵
M. Zech, S. Boesch, E. M. Maier, I. Borggraefe, K. Vill, F. Laccone, V. Pilshofer, A. Ceballos-Baumann, B. Alhaddad, R. Berutti, et al., Haploinsufficiency of KMT2b, Encoding the Lysine-Specific Histone Methyltransferase 2b, Results in Early-Onset Generalized Dystonia, American Journal of Human Genetics 99, 1377 (2016), ISSN 1537-6605.
OpenUrl CrossRef
[47].
M. Haller, Q. Mo, A. Imamoto, and D. J. Lamb, Murine model indicates 22q11.2 signaling adaptor CRKL is a dosage-sensitive regulator of genitourinary development, Proceedings of the National Academy of Sciences 114, 4981 (2017), ISSN 0027-8424, 1091–6490.
OpenUrl
[48].
J. Wang, R. Al-Ouran, Y. Hu, S.-Y. Kim, Y.-W. Wan, M. F. Wangler, S. Yamamoto, H.-T. Chao, A. Comjean, S. E. Mohr, et al., MARRVEL: Integration of Human and Model Organism Genetic Resources to Facilitate Functional Annotation of the Human Genome, The American Journal of Human Genetics 100, 843 (2017), ISSN 0002-9297, 1537-X6605.
OpenUrl CrossRef
[49].
B. Afzali, J. Grnholm, J. Vandrovcova, C. OBrien, H.-W. Sun, I. Vanderleyden, F. P. Davis, A. Khoder, Y. Zhang, A. N. Hegazy, et al., BACH2 immunodeficiency illustrates an association between super-enhancers and haploinsufficiency, Nature immunology 18, 813 (2017), ISSN 1529-2908.
OpenUrl CrossRef
[50].↵
N. Gosalia, A. N. Economides, F. E. Dewey, and S. Balasubramanian, MAPPIN: a method for annotating, predicting pathogenicity and mode of inheritance for nonsynonymous variants, Nucleic Acids Research 45, 10393 (2017), ISSN 0305-1048.
OpenUrl CrossRef
[51].↵
J. F. Crow and M. Kimura, An introduction to population genetics theory (Harper & Row, 1970), google-Books-ID: ytMPAQAAMAAJ.
[52].↵
S. Schiffels and R. Durbin, Inferring human population size and separation history from multiple genome sequences, Nature genetics 46, 919 (2014), ISSN 1061-4036.
OpenUrl CrossRef PubMed
[53].↵
B. M. Neale, Y. Kou, L. Liu, A. Ma’ayan, K. E. Samocha, A. Sabo, C.-F. Lin, C. Stevens, L.-S. Wang, V. Makarov, et al., Patterns and rates of exonic de novo mutations in autism spectrum disorders, Nature 485, 242 (2012), ISSN 0028-0836.
OpenUrl CrossRef PubMed Web of Science
[54].↵
Y. B. Simons, K. Bullaughey, R. R. Hudson, and G. Sella, A population genetic interpretation of GWAS findings for human quantitative traits, PLOS Biology 16, e2002985 (2018), ISSN 1545-7885.
OpenUrl CrossRef
[55].↵
C. E. G. Amorim, Z. Gao, Z. Baker, J. F. Diesel, Y. B. Simons, I. S. Haque, J. Pickrell, and M. Przeworski, The population genetics of human disease: The case of recessive, lethal mutations, PLOS Genetics 13, e1006915 (2017), ISSN 1553-7404.
OpenUrl
[56].↵
K. E. Samocha, J. A. Kosmicki, K. J. Karczewski, A. H. O’Donnell-Luria, E. Pierce-Hoffman, D. G. MacArthur, B. M. Neale, and M. J. Daly, Regional missense constraint improves variant deleteriousness prediction, bioRxiv 148353 (2017).
[57].↵
J. M. Havrilla, B. S. Pedersen, R. M. Layer, and A. R. Quinlan, A map of constrained coding regions in the human genome., bioRxiv 220814 (2017).

View the discussion thread.

Posted August 01, 2018.

Download PDF

Citation Tools

Subject Area

Genetics

Subject Areas

All Articles

Animal Behavior and Cognition (5215)
Biochemistry (11752)
Bioengineering (8752)
Bioinformatics (29200)
Biophysics (14974)
Cancer Biology (12096)
Cell Biology (17411)
Clinical Trials (138)
Developmental Biology (9421)
Ecology (14182)
Epidemiology (2067)
Evolutionary Biology (18308)
Genetics (12245)
Genomics (16803)
Immunology (11869)
Microbiology (28097)
Molecular Biology (11594)
Neuroscience (60969)
Paleontology (451)
Pathology (1871)
Pharmacology and Toxicology (3238)
Physiology (4959)
Plant Biology (10427)
Scientific Communication and Education (1683)
Synthetic Biology (2886)
Systems Biology (7340)
Zoology (1651)

[1] [1].↵
J. A. Blake, C. J. Bult, J. T. Eppig, J. A. Kadin, and J. E. Richardson, The Mouse Genome Database genotypes::phenotypes, Nucleic Acids Research 37, D712 (2009), ISSN 0305-1048.
OpenUrl CrossRef PubMed Web of Science

[2] [2].
N. Huang, I. Lee, E. M. Marcotte, and M. E. Hurles, Characterising and Predicting Haploinsuf-ficiency in the Human Genome, PLOS Genetics 6, e1001154 (2010), ISSN 1553-7404.
OpenUrl

[3] [3].↵
K. Eilbeck, A. Quinlan, and M. Yandell, Settling the score: variant prioritization and Mendelian disease, Nature Reviews. Genetics 18, 599 (2017), ISSN 1471-0064.
OpenUrl CrossRef

[4] [4].↵
I. Bartha, J. d. Iulio, J. C. Venter, and A. Telenti, Human gene essentiality, Nature Reviews Genetics 19, 51 (2018), ISSN 1471-0064.
OpenUrl CrossRef

[5] [5].↵
S. Petrovski, Q. Wang, E. L. Heinzen, A. S. Allen, and D. B. Goldstein, Genic Intolerance to Functional Variation and the Interpretation of Personal Genomes, PLOS Genetics 9, e1003709 (2013), ISSN 1553-7404.
OpenUrl

[6] [6].↵
K. E. Samocha, E. B. Robinson, S. J. Sanders, C. Stevens, A. Sabo, L. M. McGrath, J. A. Kos-micki, K. Rehnstrm, S. Mallick, A. Kirby, et al., A framework for the interpretation of de novo mutation in human disease, Nature Genetics 46, 944 (2014), ISSN 1061-4036.
OpenUrl CrossRef PubMed

[7] [7].↵
M. Lek, K. J. Karczewski, E. V. Minikel, K. E. Samocha, E. Banks, T. Fennell, A. H. ODonnell-Luria, J. S. Ware, A. J. Hill, B. B. Cummings, et al., Analysis of protein-coding genetic variation in 60,706 humans, Nature 536, 285 (2016), ISSN 0028-0836.
OpenUrl CrossRef PubMed

[8] [8].
Y.-F. Huang, B. Gulko, and A. Siepel, Fast, scalable prediction of deleterious noncoding variants from functional and population genomic data, Nature Genetics 49, 618 (2017), ISSN 1546-1718.
OpenUrl CrossRef PubMed

[9] [9].↵
C. A. Cassa, D. Weghorn, D. J. Balick, D. M. Jordan, D. Nusinow, K. E. Samocha, A. O’Donnell-Luria, D. G. MacArthur, M. J. Daly, D. R. Beier, et al., Estimating the selective effects of heterozygous protein-truncating variants from human exome data, Nature Genetics 49, 806 (2017), ISSN 1061-4036.
OpenUrl

[10] [10].↵
M. C. Raybould, A. J. Birley, and M. Hultn, Molecular variation of the human elastin (ELN) gene in a normal human population, Annals of Human Genetics 59, 149 (1995), ISSN 1469-1809.
OpenUrl PubMed

[11] [11].
R. Wooster, G. Bignell, J. Lancaster, S. Swift, S. Seal, J. Mangion, N. Collins, S. Gregory, C. Gumbs, G. Micklem, et al., Identification of the breast cancer susceptibility gene BRCA2, Nature 378, 789 (1995), ISSN 1476-4687.
OpenUrl CrossRef PubMed Web of Science

[12] [12].
J. E. Wagenseil, C. H. Ciliberto, R. H. Knutsen, M. A. Levy, A. Kovacs, and R. P. Mecham, The importance of elastin to aortic development in mice, American Journal of Physiology - Heart and Circulatory Physiology 299, H257 (2010), ISSN 0363-6135.
OpenUrl

[13] [13].↵
R. Roy, J. Chun, and S. N. Powell, BRCA1 and BRCA2: different roles in a common pathway of genome protection, Nature reviews. Cancer 12, 68 (2011), ISSN 1474-175X.
OpenUrl CrossRef PubMed

[14] [14].↵
Y. B. Simons, M. C. Turchin, J. K. Pritchard, and G. Sella, The deleterious mutation load is insensitive to recent population history, Nature Genetics 46, 220 (2014), ISSN 1061-4036.
OpenUrl CrossRef PubMed

[15] [15].↵
M. J. Simmons and J. F. Crow, Mutations affecting fitness in Drosophila populations, Annual Review of Genetics 11, 49 (1977), ISSN 0066-4197.
OpenUrl CrossRef PubMed Web of Science

[16] [16].
P. D. Keightley, The distribution of mutation effects on viability in Drosophila melanogaster, Genetics 138, 1315 (1994), ISSN 0016-6731.
OpenUrl Abstract/FREE Full Text

[17] [17].
H. W. Deng and M. Lynch, Estimation of Deleterious-Mutation Parameters in Natural Populations, Genetics 144, 349 (1996), ISSN 0016-6731.
OpenUrl Abstract/FREE Full Text

[18] [18].↵
H. A. Orr, Fitness and its role in evolutionary genetics, Nature reviews. Genetics 10, 531 (2009), ISSN 1471-0056.
OpenUrl CrossRef PubMed Web of Science

[19] [19].↵
T. Mukai, S. I. Chigusa, L. E. Mettler, and J. F. Crow, Mutation Rate and Dominance of Genes Affecting Viability in DROSOPHILA MELANOGASTER, Genetics 72, 335 (1972), ISSN 0016-6731.
OpenUrl Abstract/FREE Full Text

[20] [20].
N. Phadnis and J. D. Fry, Widespread Correlations Between Dominance and Homozygous Effects of Mutations: Implications for Theories of Dominance, Genetics 171, 385 (2005), ISSN 0016-6731.
OpenUrl Abstract/FREE Full Text

[21] [21].↵
A. F. Agrawal and M. C. Whitlock, Inferences About the Distribution of Dominance Drawn From Yeast Gene Knockout Data, Genetics 187, 553 (2011), ISSN 0016-6731, 1943–2631.
OpenUrl

[22] [22].↵
S. H. Williamson, R. Hernandez, A. Fledel-Alon, L. Zhu, R. Nielsen, and C. D. Bustamante, Simultaneous inference of selection and population growth from patterns of variation in the human genome, Proceedings of the National Academy of Sciences of the United States of America 102, 7882 (2005), ISSN 0027-8424.
OpenUrl Abstract/FREE Full Text

[23] [23].
A. Eyre-Walker, M. Woolfit, and T. Phelps, The Distribution of Fitness Effects of New Deleterious Amino Acid Mutations in Humans, Genetics 173, 891 (2006), ISSN 0016-6731.
OpenUrl Abstract/FREE Full Text

[24] [24].
A. R. Boyko, S. H. Williamson, A. R. Indap, J. D. Degenhardt, R. D. Hernandez, K. E. Lohmueller, M. D. Adams, S. Schmidt, J. J. Sninsky, S. R. Sunyaev, et al., Assessing the Evolutionary Impact of Amino Acid Mutations in the Human Genome, PLOS Genetics 4, e1000083 (2008), ISSN 1553-7404.
OpenUrl

[25] [25].
F. Racimo and J. G. Schraiber, Approximation to the Distribution of Fitness Effects across Functional Categories in Human Segregating Polymorphisms, PLOS Genetics 10, e1004697 (2014), ISSN 1553-7404.
OpenUrl

[26] [26].↵
B. Y. Kim, C. D. Huber, and K. E. Lohmueller, Inference of the Distribution of Selection Coefficients for New Nonsynonymous Mutations Using Large Samples, Genetics 206, 345 (2017), ISSN 1943-2631.
OpenUrl Abstract/FREE Full Text

[27] [27].↵
F. B. Piel, A. P. Patil, R. E. Howes, O. A. Nyangiri, P. W. Gething, T. N. Williams, D. J. Weatherall, and S. I. Hay, Global distribution of the sickle cell gene and geographical confirmation of the malaria hypothesis, Nature Communications 1, 104 (2010), ISSN 2041-1723.
OpenUrl

[28] [28].↵
J. B. S. Haldane, A Mathematical Theory of Natural and Artificial Selection, Part V: Selection and Mutation, Mathematical Proceedings of the Cambridge Philosophical Society 23, 838 (1927), ISSN 1469-8064, 0305–0041.
OpenUrl

[29] [29].↵
J. B. S. Haldane, The Effect of Variation of Fitness, The American Naturalist 71, 337 (1937), ISSN 0003-0147.
OpenUrl CrossRef Web of Science

[30] [30].↵
S. Wright, The Distribution of Gene Frequencies in Populations, Proceedings of the National Academy of Sciences of the United States of America 23, 307 (1937), ISSN 0027-8424.
OpenUrl FREE Full Text

[31] [31].↵
J. H. Gillespie, Population Genetics: A Concise Guide (JHU Press, 2004), ISBN 978-0-8018-80087.

[32] [32].↵
A. G. Clark, Mutation-selection balance with multiple alleles, Genetica 102–103, 41 (1998), ISSN 0016-6707.

[33] [33].↵
W. H. Freeman
B. Charlesworth and D. Charlesworth, Elements of Evolutionary Genetics ( W. H. Freeman, Greenwood Village, Colo, 2010), 1st ed., ISBN 978-09815194-2-5.

[34] W. H. Freeman

[35] [34].↵
I. Bartha, A. Rausell, P. J. McLaren, P. Mohammadi, M. Tardaguila, N. Chaturvedi, J. Fellay, and A. Telenti, The Characteristics of Heterozygous Protein Truncating Variants in the Human Genome, PLOS Computational Biology 11, e1004647 (2015), ISSN 1553-7358.
OpenUrl

[36] [35].
J. Steinberg, F. Honti, S. Meader, and C. Webber, Haploinsufficiency predictions without study bias, Nucleic Acids Research 43, e101 (2015), ISSN 0305-1048.
OpenUrl CrossRef PubMed

[37] [36].↵
J. Fadista, N. Oskolkov, O. Hansson, L. Groop, and J. Hancock, LoFtool: a gene intolerance score based on loss-of-function variants in 60 706 individuals, Bioinformatics 33, 471 (2017), ISSN 1367-4803.
OpenUrl

[38] [37].↵
R. Blekhman, O. Man, L. Herrmann, A. R. Boyko, A. Indap, C. Kosiol, C. D. Bustamante, K. M. Teshima, and M. Przeworski, Natural selection on genes that underlie human disease susceptibility, Current biology: CB 18, 883 (2008), ISSN 0960-9822.
OpenUrl

[39] [38].↵
S. H. Lelieveld, M. R. F. Reijnders, R. Pfundt, H. G. Yntema, E.-J. Kamsteeg, P. d. Vries, B. B. A. d. Vries, M. H. Willemsen, T. Kleefstra, K. Lhner, et al., Meta-analysis of 2,104 trios provides support for 10 new genes for intellectual disability, Nature Neuroscience 19, 1194 (2016), ISSN 1546-1726.
OpenUrl CrossRef

[40] [39].
D. M. Ruderfer, T. Hamamsy, M. Lek, K. J. Karczewski, D. Kavanagh, K. E. Samocha, E. A. Consortium, M. J. Daly, D. G. MacArthur, M. Fromer, et al., Patterns of genic intolerance of rare copy number variation in 59,898 human exomes, Nature Genetics 48, 1107 (2016), ISSN 1546-1718.
OpenUrl CrossRef PubMed

[41] [40].↵
J. A. Kosmicki, K. E. Samocha, D. P. Howrigan, S. J. Sanders, K. Slowikowski, M. Lek, K. J. Karczewski, D. J. Cutler, B. Devlin, K. Roeder, et al., Refining the role of de novo protein-truncating variants in neurodevelopmental disorders by using population reference samples, Nature Genetics 49, 504 (2017), ISSN 1546-1718.
OpenUrl CrossRef

[42] [41].
C. M. Skraban, C. F. Wells, P. Markose, M. T. Cho, A. I. Nesbitt, P. Y. B. Au, A. Begtrup, J. A. Bernat, L. M. Bird, K. Cao, et al., WDR26 Haploinsufficiency Causes a Recognizable Syndrome of Intellectual Disability, Seizures, Abnormal Gait, and Distinctive Facial Features, The American Journal of Human Genetics 101, 139 (2017), ISSN 0002-9297.
OpenUrl

[43] [42].
P. Stankiewicz, T. N. Khan, P. Szafranski, L. Slattery, H. Streff, F. Vetrini, J. A. Bernstein, C. W. Brown, J. A. Rosenfeld, S. Rednam, et al., Haploinsufficiency of the Chromatin Remodeler BPTF Causes Syndromic Developmental and Speech Delay, Postnatal Microcephaly, and Dysmorphic Features, The American Journal of Human Genetics 101, 503 (2017), ISSN 0002-9297.
OpenUrl CrossRef

[44] [43].
H. T. Nguyen, J. Bryois, A. Kim, A. Dobbyn, L. M. Huckins, A. B. Munoz-Manchado, D. M. Ruderfer, G. Genovese, M. Fromer, X. Xu, et al., Integrated Bayesian analysis of rare exonic variants to identify risk genes for schizophrenia and neurodevelopmental disorders, Genome Medicine 9, 114 (2017), ISSN 1756-994X.
OpenUrl

[45] [44].
M. Zarrei, D. L. Fehlings, K. Mawjee, L. Switzer, B. Thiruvahindrapuram, S. Walker, D. Merico, G. Casallo, M. Uddin, J. R. MacDonald, et al., De novo and rare inherited copy-number variations in the hemiplegic form of cerebral palsy, Genetics in Medicine 20, 172 (2018), ISSN 1530-0366.
OpenUrl

[46] [45].↵
H. O. Heyne, T. Singh, H. Stamberger, R. A. Jamra, H. Caglayan, D. Craiu, P. D. Jonghe, R. Guerrini, K. L. Helbig, B. P. C. Koeleman, et al., De novo variants in neurodevelopmental disorders with epilepsy, Nature Genetics 1 (2018), ISSN 1546-1718.

[47] [46].↵
M. Zech, S. Boesch, E. M. Maier, I. Borggraefe, K. Vill, F. Laccone, V. Pilshofer, A. Ceballos-Baumann, B. Alhaddad, R. Berutti, et al., Haploinsufficiency of KMT2b, Encoding the Lysine-Specific Histone Methyltransferase 2b, Results in Early-Onset Generalized Dystonia, American Journal of Human Genetics 99, 1377 (2016), ISSN 1537-6605.
OpenUrl CrossRef

[48] [47].
M. Haller, Q. Mo, A. Imamoto, and D. J. Lamb, Murine model indicates 22q11.2 signaling adaptor CRKL is a dosage-sensitive regulator of genitourinary development, Proceedings of the National Academy of Sciences 114, 4981 (2017), ISSN 0027-8424, 1091–6490.
OpenUrl

[49] [48].
J. Wang, R. Al-Ouran, Y. Hu, S.-Y. Kim, Y.-W. Wan, M. F. Wangler, S. Yamamoto, H.-T. Chao, A. Comjean, S. E. Mohr, et al., MARRVEL: Integration of Human and Model Organism Genetic Resources to Facilitate Functional Annotation of the Human Genome, The American Journal of Human Genetics 100, 843 (2017), ISSN 0002-9297, 1537-X6605.
OpenUrl CrossRef

[50] [49].
B. Afzali, J. Grnholm, J. Vandrovcova, C. OBrien, H.-W. Sun, I. Vanderleyden, F. P. Davis, A. Khoder, Y. Zhang, A. N. Hegazy, et al., BACH2 immunodeficiency illustrates an association between super-enhancers and haploinsufficiency, Nature immunology 18, 813 (2017), ISSN 1529-2908.
OpenUrl CrossRef

[51] [50].↵
N. Gosalia, A. N. Economides, F. E. Dewey, and S. Balasubramanian, MAPPIN: a method for annotating, predicting pathogenicity and mode of inheritance for nonsynonymous variants, Nucleic Acids Research 45, 10393 (2017), ISSN 0305-1048.
OpenUrl CrossRef

[52] [51].↵
J. F. Crow and M. Kimura, An introduction to population genetics theory (Harper & Row, 1970), google-Books-ID: ytMPAQAAMAAJ.

[53] [52].↵
S. Schiffels and R. Durbin, Inferring human population size and separation history from multiple genome sequences, Nature genetics 46, 919 (2014), ISSN 1061-4036.
OpenUrl CrossRef PubMed

[54] [53].↵
B. M. Neale, Y. Kou, L. Liu, A. Ma’ayan, K. E. Samocha, A. Sabo, C.-F. Lin, C. Stevens, L.-S. Wang, V. Makarov, et al., Patterns and rates of exonic de novo mutations in autism spectrum disorders, Nature 485, 242 (2012), ISSN 0028-0836.
OpenUrl CrossRef PubMed Web of Science

[55] [54].↵
Y. B. Simons, K. Bullaughey, R. R. Hudson, and G. Sella, A population genetic interpretation of GWAS findings for human quantitative traits, PLOS Biology 16, e2002985 (2018), ISSN 1545-7885.
OpenUrl CrossRef

[56] [55].↵
C. E. G. Amorim, Z. Gao, Z. Baker, J. F. Diesel, Y. B. Simons, I. S. Haque, J. Pickrell, and M. Przeworski, The population genetics of human disease: The case of recessive, lethal mutations, PLOS Genetics 13, e1006915 (2017), ISSN 1553-7404.
OpenUrl

[57] [56].↵
K. E. Samocha, J. A. Kosmicki, K. J. Karczewski, A. H. O’Donnell-Luria, E. Pierce-Hoffman, D. G. MacArthur, B. M. Neale, and M. J. Daly, Regional missense constraint improves variant deleteriousness prediction, bioRxiv 148353 (2017).

[58] [57].↵
J. M. Havrilla, B. S. Pedersen, R. M. Layer, and A. R. Quinlan, A map of constrained coding regions in the human genome., bioRxiv 220814 (2017).

Measuring Intolerance to Mutation in Human Genetics

Abstract

Acknowledgments

References

Citation Manager Formats

Subject Area