PT - JOURNAL ARTICLE AU - Bruce Moran AU - Stephen T. Butler AU - Christopher J. Creevey TI - Comparison and Characterisation of Mutation Calling from Whole Exome and RNA Sequencing Data for Liver and Muscle Tissue in Lactating Holstein Cows Divergent for Fertility AID - 10.1101/101733 DP - 2017 Jan 01 TA - bioRxiv PG - 101733 4099 - http://biorxiv.org/content/early/2017/01/20/101733.short 4100 - http://biorxiv.org/content/early/2017/01/20/101733.full AB - Whole exome sequencing has had low uptake in livestock species, despite allowing accurate analysis of single nucleotide variant (SNV) mutations. Transcriptomic data in the form of RNA sequencing has been generated for many livestock species and also represents a source of mutational information. However, there is little information on the accuracy of using this data for the identification of SNVs. We generated a bovine exome capture design and used it to sequence and call mutations from a lactating dairy cow model genetically divergent for fertility (Fert+, n=8; Fert-, n=8). We compared mutations called from liver and muscle transcriptomes from the same animals. Our exome capture demonstrated 99.1% coverage of the exome design of 56.7MB, whereas transcriptomes covered 55 and 46.5% of the exome, or 24.4 and 20.7MB, in liver and muscle respectively after filtering. We found that specificity of SNVs in the transcriptome data is approximately 75% following basic hard-filtering, and could be increased to above 80% by increasing the minimum threshold of reads covering SNVs, but this effect was negated in more highly covered SNVs. RNA-DNA differences, SNVs found in transcriptome but not exome, were discovered and shown to have significantly increased levels of transition mutations in both tissues. Functional annotation of non-synonymous SNVs specific to the high and low fertility phenotypes identified immune response-related genes, supporting previous work that has identified differential expression in the same genes. Publically available RNAseq data may be analysed in a similar way to further increase the utility of this resource.Summary The exome and transcriptome both relate to the same protein-coding regions of the genome. There has been sparse research on characterising mutations in RNA and DNA within the same individuals. Here we characterise the similarities in our Holstein dairy cow animal model. We offer practical and biological results indicating that RNA sequencing is a useful proxy of exome sequencing, itself shown to be applicable to this livestock species using a previously untested commercial application. This potentially unlocks public RNA sequencing data for further analysis, also indicating that RNA-DNA differences may associate with transcriptomic divergence.