RT Journal Article SR Electronic T1 Systematic identification of significantly mutated regions reveals a rich landscape of functional molecular alterations across cancer genomes JF bioRxiv FD Cold Spring Harbor Laboratory SP 020875 DO 10.1101/020875 A1 Carlos L. Araya A1 Can Cenik A1 Jason A. Reuter A1 Gert Kiss A1 Vijay S. Pande A1 Michael P. Snyder A1 William J. Greenleaf YR 2015 UL http://biorxiv.org/content/early/2015/06/15/020875.abstract AB One Sentence Summary Identification of multi-scale mutational hotspots in cancer exomes facilitates understanding of mutations both within coding and non-coding elements.Cancer genome sequencing studies have identified cancer-driver genes from the increased accumulation of protein-altering mutations. However, the positional distributions of coding mutations, and the 79% of somatic variants in exome data that do not alter protein sequence or RNA splicing, remain largely unstudied. We employed density-based clustering methods on ∼4,700 exomes from 21 tumor types to detect variably-sized significantly mutated regions (SMRs). SMRs reveal recurrent alterations across a diverse spectrum of coding and non-coding elements, including microRNAs, transcription factor binding sites, and untranslated regions that are individually mutated in up to ∼15% of samples in specific cancer types. SMRs often associated with changes in gene expression and signalling. Mapping SMRs to protein structures revealed spatial clustering of somatic mutations at known and novel cancer-driver domains and molecular interfaces. Mutation frequencies in SMRs demonstrate that distinct protein regions are differentially mutated among tumor types. The functional diversity of SMRs underscores both the varied mechanisms of oncogenic misregulation and the advantage of unbiased driver identification.