TY - JOUR T1 - Multi-ethnic polygenic risk scores improve risk prediction in diverse populations JF - bioRxiv DO - 10.1101/051458 SP - 051458 AU - Carla Márquez-Luna AU - The SIGMA Type 2 Diabetes Consortium AU - Alkes L. Price Y1 - 2016/01/01 UR - http://biorxiv.org/content/early/2016/05/02/051458.abstract N2 - Methods for genetic risk prediction have been widely investigated in recent years. However, most available training data involves European samples, and it is currently unclear how to accurately predict disease risk in other populations. Previous studies have used either training data from European samples in large sample size or training data from the target population in small sample size, but not both. Here, we introduce a multiethnic polygenic risk score approach, MultiPRS, that combines training data from European samples and training data from the target population. We applied MultiPRS to predict type 2 diabetes in a Latino cohort using both publicly available European summary statistics in large sample size and Latino training data in small sample size, and observed a >70% relative improvement in prediction accuracy compared to methods that use only one source of training data, consistent with large relative improvements observed in simulations. Notably, this improvement is contingent on the use of ancestry-adjusted coefficients in MultiPRS. Our work reduces the gap in risk prediction accuracy between European and non-European target populations. ER -