Rxivist logo

SuperCT: A supervised-learning-framework to enhance the characterization of single-cell transcriptomic profiles

By Peng Xie, Mingxuan Gao, Chunming Wang, Pawan Noel, Chaoyong Yang, Daniel Von Hoff, Haiyong Han, Michael Q. Zhang, Wei Lin

Posted 16 Sep 2018
bioRxiv DOI: 10.1101/416719 (published DOI: 10.1093/nar/gkz116)

Characterization of individual cell types is fundamental to the study of multicellular samples such as tumor tissues. Single-cell RNAseq techniques, which allow high-throughput expression profiling of individual cells, have significantly advanced our ability of this task. Currently, most of the scRNA-seq data analyses are commenced with unsupervised clustering of cells followed by visualization of clusters in a low-dimensional space. Clusters are often assigned to different cell types based on canonical markers. However, the efficiency of characterizing the known cell types in this way is low and limited by the investigator[s] knowledge. In this study, we present a technical framework of training the expandable supervised-classifier in order to reveal the single-cell identities based on their RNA expression profiles. Using multiple scRNA-seq datasets we demonstrate the superior accuracy, robustness, compatibility and expandability of this new solution compared to the traditional methods. We use two examples of model upgrade to demonstrate how the projected evolution of the cell-type classifier is realized.

Download data

  • Downloaded 606 times
  • Download rankings, all-time:
    • Site-wide: 24,188 out of 88,415
    • In bioinformatics: 3,381 out of 8,368
  • Year to date:
    • Site-wide: 55,335 out of 88,415
  • Since beginning of last month:
    • Site-wide: 65,917 out of 88,415

Altmetric data

Downloads over time

Distribution of downloads per paper, site-wide


Sign up for the Rxivist weekly newsletter! (Click here for more details.)