PT  - JOURNAL ARTICLE
AU  - Yuguang Xiong
AU  - Magali Soumillon
AU  - Jie Wu
AU  - Jens Hansen
AU  - Bin Hu
AU  - Johan G.C. van Hasselt
AU  - Gomathi Jayaraman
AU  - Ryan Lim
AU  - Mehdi Bouhaddou
AU  - Loren Ornelas
AU  - Jim Bochicchio
AU  - Lindsay Lenaeus
AU  - Jennifer Stocksdale
AU  - Jaehee Shim
AU  - Emilda Gomez
AU  - Dhruv Sareen
AU  - Clive Svendsen
AU  - Leslie M. Thompson
AU  - Milind Mahajan
AU  - Ravi Iyengar
AU  - Eric A. Sobie
AU  - Evren U. Azeloglu
AU  - Marc R. Birtwistle
TI  - A Comparison of mRNA Sequencing with Random Primed and 3’-Directed Libraries
AID  - 10.1101/098905
DP  - 2017 Jan 01
TA  - bioRxiv
PG  - 098905
4099  - http://biorxiv.org/content/early/2017/01/06/098905.short
4100  - http://biorxiv.org/content/early/2017/01/06/098905.full
AB  - Deep mRNA sequencing (mRNAseq) is the state-of-the-art for whole transcriptome measurements. A key step is creating a library of cDNA sequencing fragments from RNA. This is generally done by random priming, creating multiple sequencing fragments along the length of each transcript. A 3’ end-focused library approach cannot detect differential splicing, but has potentially higher throughput at lower cost (~10-fold lower), along with the ability to improve quantification by using transcript molecule counting with unique molecular identifiers (UMI) to correct for PCR bias. Here, we compare implementation of such a 3’-digital gene expression (3’-DGE) approach with “conventional” random primed mRNAseq, which has not yet been done. We find that while conventional mRNAseq detects ~15% more genes, the resulting lists of differentially expressed genes and therefore biological conclusions and gene signatures are highly concordant between the two techniques. We also find good quantitative agreement on the level of individual genes between the two techniques in terms of both read counts and fold change between two conditions. We conclude that for high-throughput applications, the potential cost savings associated with the 3’-DGE approach are a very reasonable tradeoff for modest reduction in sensitivity and inability to observe alternative splicing, and should enable much larger scale studies focused on not only differential expression analysis, but also quantitative transcriptome profiling. The computational scripts and programs, along with experimental standard operating procedures used in our pipeline presented here, are freely available on our website (www.dtoxs.org).