Rxivist logo

Rxivist combines preprints from bioRxiv with data from Twitter to help you find the papers being discussed in your field. Currently indexing 62,502 bioRxiv papers from 277,505 authors.

Modular and efficient pre-processing of single-cell RNA-seq

By Páll Melsted, A. Sina Booeshaghi, Fan Gao, Eduardo da Veiga Beltrame, Lambda Lu, Kristján Eldjárn Hjorleifsson, Jase Gehring, Lior Pachter

Posted 17 Jun 2019
bioRxiv DOI: 10.1101/673285

Analysis of single-cell RNA-seq data begins with pre-processing of sequencing reads to generate count matrices. We investigate algorithm choices for the challenges of pre-processing, and describe a workflow that balances efficiency and accuracy. Our workflow is based on the kallisto (<https://pachterlab.github.io/kallisto/>) and bustools (<https://bustools.github.io/>) programs, and is near-optimal in speed and memory. The workflow is modular, and we demonstrate its flexibility by showing how it can be used for RNA velocity analyses. Documentation and tutorials for using the kallisto | bus workflow are available at <https://www.kallistobus.tools/>.

Download data

  • Downloaded 4,121 times
  • Download rankings, all-time:
    • Site-wide: 594 out of 62,502
    • In bioinformatics: 118 out of 6,231
  • Year to date:
    • Site-wide: 99 out of 62,502
  • Since beginning of last month:
    • Site-wide: 54 out of 62,502

Altmetric data


Downloads over time

Distribution of downloads per paper, site-wide


Sign up for the Rxivist weekly newsletter! (Click here for more details.)


News