PT - JOURNAL ARTICLE AU - Leonardo Collado-Torres AU - Abhinav Nellore AU - Kai Kammers AU - Shannon E. Ellis AU - Margaret A. Taub AU - Kasper D. Hansen AU - Andrew E. Jaffe AU - Ben Langmead AU - Jeffrey T. Leek TI - <kbd>recount</kbd>: A large-scale resource of analysis-ready RNA-seq expression data AID - 10.1101/068478 DP - 2016 Jan 01 TA - bioRxiv PG - 068478 4099 - http://biorxiv.org/content/early/2016/08/08/068478.short 4100 - http://biorxiv.org/content/early/2016/08/08/068478.full AB - recount is a resource of processed and summarized expression data spanning nearly 60,000 human RNA-seq samples from the Sequence Read Archive (SRA). The associated recount Bio-conductor package provides a convenient API for querying, downloading, and analyzing the data. Each processed study consists of meta/phenotype data, the expression levels of genes and their underlying exons and splice junctions, and corresponding genomic annotation. We also provide data summarization types for quantifying novel transcribed sequence including base-resolution coverage and potentially unannotated splice junctions. We present workflows illustrating how to use recount to perform differential expression analysis including meta-analysis, annotation-free base-level analysis, and replication of smaller studies using data from larger studies. recount provides a valuable and user-friendly resource of processed RNA-seq datasets to draw additional biological insights from existing public data. The resource is available at https://jhubiostatistics.shinyapps.io/recount/.