TY  - JOUR
T1  - Sporadic, global linkage disequilibrium between unlinked segregating sites
JF  - bioRxiv
DO  - 10.1101/030247
SP  - 030247
AU  - Daniel A. Skelly
AU  - Paul M. Magwene
AU  - Eric A. Stone
Y1  - 2015/01/01
UR  - http://biorxiv.org/content/early/2015/10/30/030247.abstract
N2  - Demographic, genetic, or stochastic factors can lead to perfect linkage disequilibrium (LD) between alleles at two loci without respect to the extent of their physical distance, a phenomenon that Lawrence et al. (2005a) refer to as “genetic indistinguishability”. This phenomenon can complicate genotype-phenotype association testing by hindering the ability to localize causal alleles, but has not been thoroughly explored from a theoretical perspective or using large, dense whole-genome polymorphism datasets. We derive a simple theoretical model of the prevalence of genetic indistinguishability between unlinked loci, and verify its accuracy via simulation. We show that sample size and minor allele frequency are the major determinants of the prevalence of perfect LD between unlinked loci but that demographic factors, such as deviations from random mating, can produce significant effects as well. Finally, we quantify this phenomenon in three model organisms and find thousands of pairs of moderate-frequency (> 5%) genetically indistinguishable variants in relatively large datasets. These results clarify a previously underexplored population genetic phenomenon with important implications for association studies, and define conditions under which it is likely to manifest.
ER  - 