RT Journal Article SR Electronic T1 Accurate Identification of Active Transcriptional Regulatory Elements from Global Run-On and Sequencing Data JF bioRxiv FD Cold Spring Harbor Laboratory SP 011353 DO 10.1101/011353 A1 Charles G. Danko A1 Stephanie L. Hyland A1 Leighton J. Core A1 Andre L. Martins A1 Colin T. Waters A1 Hyung Won Lee A1 Vivian G. Cheung A1 W. Lee Kraus A1 John T. Lis A1 Adam Siepel YR 2014 UL http://biorxiv.org/content/early/2014/11/12/011353.abstract AB Identification of the genomic regions that regulate transcription remains an important open problem. We have recently shown that global run-on and sequencing (GRO-seq) with enrichment for 5'-capped RNAs reveals patterns of divergent transcription that accurately mark active transcriptional regulatory elements (TREs), including enhancers and promoters. Here, we demonstrate that active TREs can be identified with comparable accuracy by applying sensitive machine-learning methods to standard GRO-seq and PRO-seq data, allowing TREs to be assayed together with transcription levels, elongation rates, and other transcriptional features, in a single experiment. Our method, called discriminative Regulatory Element detection from GRO-seq (dREG), summarizes GRO-seq read counts at multiple scales and uses support vector regression to predict active TREs. The predicted TREs are strongly enriched for marks associated with functional elements, including H3K27ac, transcription factor binding sites, eQTLs, and GWAS-associated SNPs. Using dREG, we survey TREs in eight cell types and provide new insights into global patterns of TRE assembly and function.