Rxivist logo

Rxivist combines preprints from bioRxiv with data from Twitter to help you find the papers being discussed in your field. Currently indexing 67,258 bioRxiv papers from 296,039 authors.

Non Hybrid Long Read Consensus Using Local De Bruijn Graph Assembly

By German Tischler, Eugene W. Myers

Posted 06 Feb 2017
bioRxiv DOI: 10.1101/106252

While second generation sequencing led to a vast increase in sequenced data, the shorter reads which came with it made assembly a much harder task and for some regions impossible with only short read data. This changed again with the advent of third generation long read sequencers. The length of the long reads allows a much better resolution of repetitive regions, their high error rate however is a major challenge. Using the data successfully requires to remove most of the sequencing errors. The first hybrid correction methods used low noise second generation data to correct third generation data, but this approach has issues when it is unclear where to place the short reads due to repeats and also because second generation sequencers fail to sequence some regions which third generation sequencers work on. Later non hybrid methods appeared. We present a new method for non hybrid long read error correction based on De Bruijn graph assembly of short windows of long reads with subsequent combination of these correct windows to corrected long reads. Our experiments show that this method yields a better correction than other state of the art non hybrid correction approaches.

Download data

  • Downloaded 1,468 times
  • Download rankings, all-time:
    • Site-wide: 4,120 out of 67,258
    • In bioinformatics: 814 out of 6,627
  • Year to date:
    • Site-wide: 26,873 out of 67,258
  • Since beginning of last month:
    • Site-wide: 39,533 out of 67,258

Altmetric data

Downloads over time

Distribution of downloads per paper, site-wide

Sign up for the Rxivist weekly newsletter! (Click here for more details.)