RT Journal Article
SR Electronic
T1 MetaGxData: Breast and Ovarian Clinically Annotated Transcriptomics Datasets
JF bioRxiv
FD Cold Spring Harbor Laboratory
SP 052910
DO 10.1101/052910
A1 Deena Mohamad Ameen Gendoo
A1 Natchar Ratanasirigulchai
A1 Gregory M Chen
A1 Levi Waldron
A1 Benjamin Haibe-Kains
YR 2016
UL http://biorxiv.org/content/early/2016/05/12/052910.abstract
AB A wealth of transcriptomic and clinical data on breast and ovarian cancers is underutilized because data storage and formats are not harmonized. We have developed the MetaGxData package compendium, which includes manually curated and standardized clinical, pathological, survival, and treatment metadata across both breast and ovarian cancer microarray data. MetaGxData is the largest compendium of breast and ovarian microarray data to date, spanning 65 datasets and encompassing 13,756 samples. Standardization of metadata across the two cancer types promotes the use of their expression datasets in a variety of cross-tumour analyses, including identifying common biomarkers, establishing common patterns of co-expression networks, assessing the validity of prognostic signatures, and identifying new consensus signatures that reflect common biological mechanisms. Here, we present our flexible framework and unified nomenclature, as well as applications that demonstrate the analytical power gained by combining breast and ovarian cancer datasets.