Rxivist logo

Rxivist combines preprints from bioRxiv with data from Twitter to help you find the papers being discussed in your field. Currently indexing 54,968 bioRxiv papers from 253,677 authors.

Scalable preprocessing for sparse scRNA-seq data exploiting prior knowledge

By Sumit Mukherjee, Yue Zhang, Joshua Fan, Sreeram Kannan, Georg Seelig

Posted 25 May 2017
bioRxiv DOI: 10.1101/142398 (published DOI: 10.1093/bioinformatics/bty293)

Motivation: Single cell RNA-seq (scRNA-seq) data contains a wealth of information which has to be inferred computationally from the observed sequencing reads. As the ability to sequence more cells improves rapidly, existing computational tools suffer from three problems. (1) The decreased reads per-cell implies a highly sparse sample of the true cellular transcriptome. (2) Many tools simply cannot handle the size of the resulting datasets. (3) Prior biological knowledge such as bulk RNA-seq information of certain cell types or qualitative marker information is not taken into account. Here we present UNCURL, a preprocessing framework based on non-negative matrix factorization for scRNA-seq data, that is able to handle varying sampling distributions, scales to very large cell numbers and can incorporate prior knowledge. Results: We find that preprocessing using UNCURL consistently improves performance of commonly used scRNA-seq tools for clustering, visualization, and lineage estimation, both in the absence and presence of prior knowledge. Finally we demonstrate that UNCURL is extremely scalable and parallelizable, and runs faster than other methods on a scRNA-seq dataset containing 1.3 million cells.

Download data

  • Downloaded 1,577 times
  • Download rankings, all-time:
    • Site-wide: 2,805 out of 54,968
    • In bioinformatics: 567 out of 5,689
  • Year to date:
    • Site-wide: 18,243 out of 54,968
  • Since beginning of last month:
    • Site-wide: 21,965 out of 54,968

Altmetric data

Downloads over time

Distribution of downloads per paper, site-wide

Sign up for the Rxivist weekly newsletter! (Click here for more details.)