Modular and efficient pre-processing of single-cell RNA-seq

By Pall Melsted, A. Sina Booeshaghi, Fan Gao, Eduardo da Veiga Beltrame, Lambda Moses, Kristjan Eldjarn Hjorleifsson, Jase Gehring, Lior Pachter

Posted 17 Jun 2019
bioRxiv DOI: 10.1101/673285

Analysis of single-cell RNA-seq data begins with pre-processing of sequencing reads to generate count matrices. We investigate algorithm choices for the challenges of pre-processing, and describe a workflow that balances efficiency and accuracy. Our workflow is based on the kallisto (<https://pachterlab.github.io/kallisto/>) and bustools (<https://bustools.github.io/>) programs, and is near-optimal in speed and memory. The workflow is modular, and we demonstrate its flexibility by showing how it can be used for RNA velocity analyses. Documentation and tutorials for using the kallisto | bus workflow are available at <https://www.kallistobus.tools/>.

Download data

  • Downloaded 8,522 times
  • Download rankings, all-time:
    • Site-wide: 1,183
    • In bioinformatics: 65
  • Year to date:
    • Site-wide: 2,847
  • Since beginning of last month:
    • Site-wide: 1,757
