TY - JOUR T1 - Systematic identification of significantly mutated regions reveals a rich landscape of functional molecular alterations across cancer genomes JF - bioRxiv DO - 10.1101/020875 SP - 020875 AU - Carlos L. Araya AU - Can Cenik AU - Jason A. Reuter AU - Gert Kiss AU - Vijay S. Pande AU - Michael P. Snyder AU - William J. Greenleaf Y1 - 2015/01/01 UR - http://biorxiv.org/content/early/2015/06/15/020875.abstract N2 - One Sentence Summary Identification of multi-scale mutational hotspots in cancer exomes facilitates understanding of mutations both within coding and non-coding elements.Cancer genome sequencing studies have identified cancer-driver genes from the increased accumulation of protein-altering mutations. However, the positional distributions of coding mutations, and the 79% of somatic variants in exome data that do not alter protein sequence or RNA splicing, remain largely unstudied. We employed density-based clustering methods on ∼4,700 exomes from 21 tumor types to detect variably-sized significantly mutated regions (SMRs). SMRs reveal recurrent alterations across a diverse spectrum of coding and non-coding elements, including microRNAs, transcription factor binding sites, and untranslated regions that are individually mutated in up to ∼15% of samples in specific cancer types. SMRs often associated with changes in gene expression and signalling. Mapping SMRs to protein structures revealed spatial clustering of somatic mutations at known and novel cancer-driver domains and molecular interfaces. Mutation frequencies in SMRs demonstrate that distinct protein regions are differentially mutated among tumor types. The functional diversity of SMRs underscores both the varied mechanisms of oncogenic misregulation and the advantage of unbiased driver identification. ER -