PT - JOURNAL ARTICLE AU - Mahfuza Sharmin AU - Héctor Corrada Bravo AU - Sridhar Hannenhalli TI - Distinct genomic and epigenomic features demarcate hypomethylated blocks in colon cancer AID - 10.1101/028803 DP - 2015 Jan 01 TA - bioRxiv PG - 028803 4099 - http://biorxiv.org/content/early/2015/10/09/028803.short 4100 - http://biorxiv.org/content/early/2015/10/09/028803.full AB - Background. Large mega base-pair genomic regions show robust alterations in DNA methylation levels in multiple cancers, a vast majority of which are hypo-methylated in cancers. These regions are generally bounded by CpG islands, overlap with Lamin Associated Domains and Large organized chromatin lysine modifications, and are associated with stochastic variability in gene expression. Given the size and consistency of hypo-methylated blocks (HMB) across cancer types, their immediate causes are likely to be encoded in the genomic region near HMB boundaries, in terms of specific genomic or epigenomic signatures. However, a detailed characterization of the HMB boundaries has not been reported.Method. Here, we focused on ~13k HMBs, encompassing approximately half the genome, identified in colon cancer. We analyzed a number of distinguishing features at the HMB boundaries including transcription factor (TF) binding motifs, various epigenomic marks, and chromatin structural features.Result. We found that the classical promoter epigenomic mark – H3K4me3, is highly enriched at HMB boundaries, as are CTCF bound sites. HMB boundaries harbor distinct combinations of TF motifs. Our Random Forest model based on TF motifs can accurately distinguish boundaries not only from regions inside and outside HMBs, but surprisingly, from active promoters as well. Interestingly, the distinguishing TFs and their interacting proteins are involved in chromatin modification. Finally, HMB boundaries significantly coincide with the boundaries of Topologically Associating Domains of the chromatin.Conclusion. Our analyses suggest that the overall architecture of HMBs is guided by pre-existing chromatin architecture, and are associated with aberrant activity of promoter-like sequences at the boundary.