
Modular and efficient pre-processing of single-cell RNA-seq

By Páll Melsted, A. Sina Booeshaghi, Fan Gao, Eduardo da Veiga Beltrame, Lambda Lu, Kristján Eldjárn Hjorleifsson, Jase Gehring, Lior Pachter

Posted 17 Jun 2019
bioRxiv DOI: 10.1101/673285

Analysis of single-cell RNA-seq data begins with pre-processing of sequencing reads to generate count matrices. We investigate algorithm choices for the challenges of pre-processing, and describe a workflow that balances efficiency and accuracy. Our workflow is based on the kallisto (<https://pachterlab.github.io/kallisto/>) and bustools (<https://bustools.github.io/>) programs, and is near-optimal in speed and memory. The workflow is modular, and we demonstrate its flexibility by showing how it can be used for RNA velocity analyses. Documentation and tutorials for using the kallisto | bus workflow are available at <https://www.kallistobus.tools/>.

Download data

  • Downloaded 6,266 times
  • Download rankings, all-time:
    • Site-wide: 477 out of 85,151
    • In bioinformatics: 74 out of 8,142
  • Year to date:
    • Site-wide: 746 out of 85,151
  • Since beginning of last month:
    • Site-wide: 1,072 out of 85,151


