PT - JOURNAL ARTICLE AU - Daniel A. Skelly AU - Paul M. Magwene AU - Eric A. Stone TI - Sporadic, global linkage disequilibrium between unlinked segregating sites AID - 10.1101/030247 DP - 2015 Jan 01 TA - bioRxiv PG - 030247 4099 - http://biorxiv.org/content/early/2015/10/30/030247.short 4100 - http://biorxiv.org/content/early/2015/10/30/030247.full AB - Demographic, genetic, or stochastic factors can lead to perfect linkage disequilibrium (LD) between alleles at two loci without respect to the extent of their physical distance, a phenomenon that Lawrence et al. (2005a) refer to as “genetic indistinguishability”. This phenomenon can complicate genotype-phenotype association testing by hindering the ability to localize causal alleles, but has not been thoroughly explored from a theoretical perspective or using large, dense whole-genome polymorphism datasets. We derive a simple theoretical model of the prevalence of genetic indistinguishability between unlinked loci, and verify its accuracy via simulation. We show that sample size and minor allele frequency are the major determinants of the prevalence of perfect LD between unlinked loci but that demographic factors, such as deviations from random mating, can produce significant effects as well. Finally, we quantify this phenomenon in three model organisms and find thousands of pairs of moderate-frequency (> 5%) genetically indistinguishable variants in relatively large datasets. These results clarify a previously underexplored population genetic phenomenon with important implications for association studies, and define conditions under which it is likely to manifest.