RT Journal Article SR Electronic T1 Predicting Transcriptional Regulatory Activities with Deep Convolutional Networks JF bioRxiv FD Cold Spring Harbor Laboratory SP 099879 DO 10.1101/099879 A1 Joe Paggi A1 Andrew Lamb A1 Kevin Tian A1 Irving Hsu A1 Pierre-Louis Cedoz A1 Prasad Kawthekar YR 2017 UL http://biorxiv.org/content/early/2017/01/12/099879.abstract AB Massively parallel reporter assays (MPRAs) are a method to probe the effects of short sequences on transcriptional regulation activity. In a MPRA, short sequences are extracted from suspected regulatory regions, inserted into reporter plasmids, transfected into cell-types of interest, and the transcriptional activity of each reporter is assayed. Recently, Ernst et al. presented MPRA data covering 15750 putative regulatory regions. We trained a multitask convolutional neural network architecture using these sequence expression readouts which predicts as output the expression level outputs across four combinations of cell types and promoters. The model allows for the assigning of importance scores to each base through in silico mutagenesis, and the resulting importance scores correlated well with regions enriched for conservation and transcription factor binding.