Rxivist logo

Non-linear Archetypal Analysis of Single-cell RNA-seq Data by Deep Autoencoders

By Yuge Wang, Hongyu Zhao

Posted 20 Sep 2021
bioRxiv DOI: 10.1101/2021.09.17.460824

Advances in single-cell RNA sequencing (scRNA-seq) have led to successes in discovering novel cell types and understanding cellular heterogeneity among complex cell populations through cluster analysis. However, cluster analysis is not able to reveal continuous spectrum of states and underlying gene expression programs (GEPs) shared across cell types. We introduce scAAnet, an autoencoder for single-cell non-linear archetypal analysis, to identify GEPs and infer the relative activity of each GEP across cells. We use a count distribution-based loss term to account for the sparsity and overdispersion of the raw count data and add an archetypal constraint to the loss function of scAAnet. We first show that scAAnet outperforms existing methods for archetypal analysis across different metrics through simulations. We then demonstrate the ability of scAAnet to extract biologically meaningful GEPs using publicly available scRNA-seq datasets including a pancreatic islet dataset, a lung idiopathic pulmonary fibrosis dataset and a prefrontal cortex dataset.

Download data

  • Downloaded 407 times
  • Download rankings, all-time:
    • Site-wide: 94,573
    • In bioinformatics: 8,239
  • Year to date:
    • Site-wide: 52,110
  • Since beginning of last month:
    • Site-wide: 56,145

Altmetric data

Downloads over time

Distribution of downloads per paper, site-wide