RT Journal Article
SR Electronic
T1 Quantitative analysis of population-scale family trees using millions of relatives
JF bioRxiv
FD Cold Spring Harbor Laboratory
SP 106427
DO 10.1101/106427
A1 Joanna Kaplanis
A1 Assaf Gordon
A1 Mary Wahl
A1 Michael Gershovits
A1 Barak Markus
A1 Mona Sheikh
A1 Melissa Gymrek
A1 Gaurav Bhatia
A1 Daniel G. MacArthur
A1 Alkes L. Price
A1 Yaniv Erlich
YR 2017
UL http://biorxiv.org/content/early/2017/02/07/106427.abstract
AB Family trees have vast applications in multiple fields from genetics to anthropology and economics. However, the collection of extended family trees is tedious and usually relies on resources with limited geographical scope and complex data usage restrictions. Here, we collected 86 million profiles from publicly-available online data from genealogy enthusiasts. After extensive cleaning and validation, we obtained population-scale family trees, including a single pedigree of 13 million individuals. We leveraged the data to partition the genetic architecture of longevity by inspecting millions of relative pairs and to provide insights to population genetics theories on the dispersion of families. We also report a simple digital procedure to overlay other datasets with our resource in order to empower studies with population-scale genealogical data. One Sentence Summary: Using massive crowd-sourced genealogy data, we created a population-scale family tree resource for scientific studies.