TY - JOUR T1 - Non Hybrid Long Read Consensus Using Local De Bruijn Graph Assembly JF - bioRxiv DO - 10.1101/106252 SP - 106252 AU - German Tischler AU - Eugene W. Myers Y1 - 2017/01/01 UR - http://biorxiv.org/content/early/2017/02/06/106252.abstract N2 - While second generation sequencing led to a vast increase in sequenced data, the shorter reads which came with it made assembly a much harder task and for some regions impossible with only short read data. This changed again with the advent of third generation long read sequencers. The length of the long reads allows a much better resolution of repetitive regions, their high error rate however is a major challenge. Using the data successfully requires to remove most of the sequencing errors. The first hybrid correction methods used low noise second generation data to correct third generation data, but this approach has issues when it is unclear where to place the short reads due to repeats and also because second generation sequencers fail to sequence some regions which third generation sequencers work on. Later non hybrid methods appeared. We present a new method for non hybrid long read error correction based on De Bruijn graph assembly of short windows of long reads with subsequent combination of these correct windows to corrected long reads. Our experiments show that this method yields a better correction than other state of the art non hybrid correction approaches. ER -