Abstract
DNA methylation plays a crucial role in establishing tissue-specific gene expression. However, our incomplete understanding of the cis elements that regulate DNA methylation prevents us from interpreting the functional effects of non-coding variants. We present CpGenie (http://cpgenie.csail.mit.edu), a deep convolutional neural network that learns a regulatory sequence code of DNA methylation and enables allele-specific DNA methylation prediction with single-nucleotide sensitivity. Variant annotations from CpGenie accurately identify methylation quantitative trait loci (meQTL) and contribute to the prioritization of functional non-coding variants including expression quantitative trait loci (eQTL) and disease-associated mutations.
Copyright
The copyright holder for this preprint is the author/funder, who has granted bioRxiv a license to display the preprint in perpetuity. It is made available under a CC-BY-NC-ND 4.0 International license.