Rxivist combines preprints from bioRxiv with data from Twitter to help you find the papers being discussed in your field. Currently indexing 57,294 bioRxiv papers from 263,837 authors.

Non Hybrid Long Read Consensus Using Local De Bruijn Graph Assembly

By German Tischler, Eugene W. Myers

Posted 06 Feb 2017
bioRxiv DOI: 10.1101/106252

While second-generation sequencing led to a vast increase in sequenced data, the shorter reads that came with it made assembly a much harder task, and for some regions an impossible one with short-read data alone. This changed again with the advent of third-generation long-read sequencers. The length of the long reads allows a much better resolution of repetitive regions; their high error rate, however, is a major challenge. Using the data successfully requires removing most of the sequencing errors. The first hybrid correction methods used low-noise second-generation data to correct third-generation data, but this approach has issues when repeats make it unclear where to place the short reads, and also because second-generation sequencers fail to sequence some regions that third-generation sequencers can handle. Non-hybrid methods appeared later. We present a new method for non-hybrid long-read error correction based on De Bruijn graph assembly of short windows of long reads, followed by the combination of these corrected windows into corrected long reads. Our experiments show that this method yields a better correction than other state-of-the-art non-hybrid correction approaches.
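The abstract does not describe the algorithm in detail, so the following is only a toy sketch of the general idea of building a consensus for one short window from a small De Bruijn graph. It is not the authors' implementation. The function names, the parameters `k` and `min_count`, and the greedy walk through the graph are all assumptions made for illustration; a real corrector would need a more careful traversal and a step that stitches windows back into full reads.

```python
from collections import Counter


def kmer_counts(fragments, k):
    """Count every k-mer occurring in the window fragments."""
    counts = Counter()
    for frag in fragments:
        for i in range(len(frag) - k + 1):
            counts[frag[i:i + k]] += 1
    return counts


def window_consensus(fragments, k=4, min_count=2):
    """Greedy walk over the De Bruijn graph of frequent k-mers.

    Hypothetical helper: k-mers seen fewer than `min_count` times are
    treated as sequencing errors and dropped from the graph.
    """
    counts = kmer_counts(fragments, k)
    solid = {kmer for kmer, c in counts.items() if c >= min_count}

    # Start from the most common leading k-mer that survived filtering.
    starts = Counter(f[:k] for f in fragments if f[:k] in solid)
    if not starts:
        return ""
    current = starts.most_common(1)[0][0]

    path = current
    seen = {current}
    while True:
        # Successors overlap the current k-mer by k-1 bases.
        candidates = [
            current[1:] + base
            for base in "ACGT"
            if current[1:] + base in solid and current[1:] + base not in seen
        ]
        if not candidates:
            break
        # Follow the best-supported edge; `seen` prevents cycling.
        current = max(candidates, key=lambda kmer: counts[kmer])
        seen.add(current)
        path += current[-1]
    return path


# Four noisy copies of the same window; the third has one substitution.
window = ["ACGTTGCA", "ACGTTGCA", "ACGATGCA", "ACGTTGCA"]
print(window_consensus(window, k=4, min_count=2))  # ACGTTGCA
```

In this toy input, the k-mers introduced by the single substitution occur only once, fall below `min_count`, and so never enter the graph. The walk therefore recovers the majority sequence.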

Download data

  • Downloaded 1,439 times
  • Download rankings, all-time:
    • Site-wide: 3,550 out of 57,294
    • In bioinformatics: 710 out of 5,853
  • Year to date:
    • Site-wide: 19,177 out of 57,294
  • Since beginning of last month:
    • Site-wide: 15,234 out of 57,294
