Rxivist logo

Modeling transcriptional profiles of gene perturbation with deep neural network

By Wenke Liu, Xuya Wang, D R Mani, David Fenyƶ

Posted 16 Jul 2021
bioRxiv DOI: 10.1101/2021.07.15.452534

Cell line perturbation data could be utilized as a reference for inferring underlying molecular processes in new gene expression profiles. It is important to develop accurate and computationally efficient algorithms to exploit biological knowledge in the growing compendium of existing perturbation data and harness these for new predictions. We reframed the problem of inferring possible gene perturbation based on a reference perturbation database into a classification task and evaluated the application of deep neural network models to address this problem. Our results showed that a fully-connected multi-layer neural network was able to achieve up to 74.9% accuracy in a holdout test set, but the model generalizability was limited by consistency between training and testing data. Capacity and flexibility enables neural network models to efficiently represent transcriptomic features associated with single gene knockdown perturbations. With consistent signals between training and testing sets, neural networks may be trained to classify new samples to experimentally confirmed molecular phenotypes.

Download data

  • Downloaded 199 times
  • Download rankings, all-time:
    • Site-wide: 147,902
    • In bioinformatics: 11,404
  • Year to date:
    • Site-wide: None
  • Since beginning of last month:
    • Site-wide: 87,408

Altmetric data

Downloads over time

Distribution of downloads per paper, site-wide