TY - JOUR T1 - A workflow for UPLC-MS non-targeted metabolomic profiling in large human population-based studies JF - bioRxiv DO - 10.1101/002782 SP - 002782 AU - Andrea Ganna AU - Tove Fall AU - Woojoo Lee AU - Corey D. Broeckling AU - Jitender Kumar AU - Sara Hägg AU - Patrik K. E. Magnusson AU - Jessica Prenni AU - Lars Lind AU - Yudi Pawitan AU - Erik Ingelsson Y1 - 2014/01/01 UR - http://biorxiv.org/content/early/2014/02/17/002782.abstract N2 - Metabolomic profiling is an emerging technique in life sciences. Human studies using these techniques have been performed in a small number of individuals or have been targeted at a restricted number of metabolites. In this article, we propose a data analysis workflow to perform non-targeted metabolomic profiling in large human population-based studies using ultra performance liquid chromatography-mass spectrometry (UPLC-MS). We describe challenges and propose solutions for quality control, statistical analysis and annotation of metabolic features. Using the data analysis workflow, we detected more than 8,000 metabolic features in serum samples from 2,489 fasting individuals. As an illustrative example, we performed a non-targeted metabolome-wide association analysis of high-sensitive C-reactive protein (hsCRP) and detected 407 metabolic features corresponding to 90 unique metabolites that could be replicated in an external population. Our results reveal unexpected biological associations, such as metabolites identified as monoacylphosphorylcholines (LysoPC) being negatively associated with hsCRP. R code and fragmentation spectra for all metabolites are made publically available. In conclusion, the results presented here illustrate the viability and potential of non-targeted metabolomic profiling in large population-based studies. ER -