TY - JOUR T1 - A performance assessment of relatedness inference methods using genome-wide data from thousands of relatives JF - bioRxiv DO - 10.1101/106013 SP - 106013 AU - Monica D. Ramstetter AU - Thomas D. Dyer AU - Donna M. Lehman AU - Joanne E. Curran AU - Ravindranath Duggirala AU - John Blangero AU - Jason G. Mezey AU - Amy L. Williams Y1 - 2017/01/01 UR - http://biorxiv.org/content/early/2017/02/04/106013.abstract N2 - Inferring relatedness from genomic data is an essential component of genetic association studies, population genetics, forensics, and genealogy. While numerous methods exist for inferring relatedness, thorough evaluation of these methods in real data has been lacking. Here, we report an assessment of 11 state-of-the-art relatedness inference methods using a dataset with 2,485 individuals contained in several large pedigrees that span up to six generations. We nd that all methods have high accuracy (~93% – 99%) when reporting first and second degree relationships, but their accuracy dwindles to less than 60% for fifth degree relationships. However, the inferred relationships were correct to within one relatedness degree at a rate of 83% – 99% across all methods and considered relationship degrees. Furthermore, most methods infer unrelated individuals correctly at a rate of ~99%, suggesting a low rate of false positives. Overall, the most accurate methods were ERSA 2.0 and approaches that classify relationships using the IBD segments inferred by Refined IBD and IBDseq. Combining results from the most accurate methods provides little accuracy improvement, indicating that novel approaches for relatedness inference may be needed to achieve a sizeable jump in performance. ER -