PT - JOURNAL ARTICLE AU - Gregg W.C. Thomas AU - Matthew W. Hahn AU - Yoonsoo Hahn TI - The effects of increasing the number of taxa on inferences of molecular convergence AID - 10.1101/081612 DP - 2016 Jan 01 TA - bioRxiv PG - 081612 4099 - http://biorxiv.org/content/early/2016/10/17/081612.short 4100 - http://biorxiv.org/content/early/2016/10/17/081612.full AB - Convergent evolution provides insight into the link between phenotype and genotype. Recently, large-scale comparative studies of convergent evolution have become possible, but researchers are still trying to determine the best way to design these types of analyses. One aspect of molecular convergence studies that has not yet been investigated is how taxonomic sample size affects inferences of molecular convergence. Here we show that increased sample size decreases the amount of inferred molecular convergence associated with the three convergent transitions to a marine environment in mammals. The sampling of more taxa—both with and without the convergent phenotype—reveals that alleles associated only with marine mammals in small datasets are actually more widespread, or are not shared by all marine species. The sampling of more taxa also allows finer resolution of ancestral substitutions, revealing that they are not in fact on lineages leading to solely marine species. We revisit a previous study on marine mammals and find that only 7 of the reported 43 genes with convergent substitutions still show signs of convergence with a larger number of background species. However, 4 of those 7 genes also showed signs of positive selection in the original analysis and may still be good candidates for adaptive convergence. Though our study is framed around the convergence of marine mammals, we expect our conclusions on taxonomic sampling are generalizable to any study of molecular convergence.