Abstract
Associating genetic variants with phenotypes is not only important to understand the underlying biology but also to identify potential drug targets for treating diseases. It is widely accepted that for most complex traits many associations remain to be discovered, the so-called “missing heritability.” Yet missing heritability can be estimated, it is a known unknown, and we argue is only a fraction of the unknowns in genetics. The majority of possible genetic variants in the genome space are either too rare to be detected or even entirely absent from populations, and therefore do not contribute to estimates of phenotypic or genetic variability. We call these unknown unknowns in genetics the “fog of genetics.” Using data from the 1000 Genomes Project we then show that larger genes with greater genetic diversity are more likely to be associated with human traits, demonstrating that genetic associations are biased towards particular types of genes and that the genetic information we are lacking about traits and diseases is potentially immense. Our results and model have multiple implications for how genetic variability is perceived to influence complex traits, provide insights on molecular mechanisms of disease and for drug discovery efforts based on genetic information.