RT Journal Article SR Electronic T1 AC-PCA adjusts for confounding variation in transcriptome data and recovers the anatomical structure of neocortex JF bioRxiv FD Cold Spring Harbor Laboratory SP 040485 DO 10.1101/040485 A1 Zhixiang Lin A1 Can Yang A1 Ying Zhu A1 John C. Duchi A1 Yao Fu A1 Yong Wang A1 Bai Jiang A1 Mahdi Zamanighomi A1 Xuming Xu A1 Mingfeng Li A1 Nenad Sestan A1 Hongyu Zhao A1 Wing Hung Wong YR 2016 UL http://biorxiv.org/content/early/2016/02/22/040485.abstract AB Microarray and RNA-sequencing technologies have enabled rapid quantification of the transcriptomes in a large number of samples. Although dimension reduction methods are commonly applied to transcriptome datasets for visualization and interpretation of the sample variations, the results can be hindered by confounding factors, either biological or technical. In this study, we propose a Principal Component Analysis-based approach to Adjust for Confounding variation (AC-PCA). We show that AC-PCA can adjust for variations across individual donors present in a human brain exon array dataset. Our approach is able to recover the anatomical structure of neocortex regions, including the frontal-temporal and dorsal-ventral axes, and reveal temporal dynamics of the interregional variation, mimicking the “hourglass” pattern of spatiotempo-ral dynamics. For gene selection purposes, we extend AC-PCA with sparsity constraints, and propose and implement an efficient algorithm. The top selected genes from this algorithm demonstrate frontal/temporal and dorsal/ventral expression gradients and strong functional conservation.