Rxivist logo

Classification and Mutation Prediction from Non-Small Cell Lung Cancer Histopathology Images using Deep Learning

By Nicolas Coudray, Andre L Moreira, Theodore Sakellaropoulos, David Fenyƶ, Narges Razavian, Aristotelis Tsirigos

Posted 03 Oct 2017
bioRxiv DOI: 10.1101/197574 (published DOI: 10.1038/s41591-018-0177-5)

Visual analysis of histopathology slides of lung cell tissues is one of the main methods used by pathologists to assess the stage, types and sub-types of lung cancers. Adenocarcinoma and squamous cell carcinoma are two most prevalent sub-types of lung cancer, but their distinction can be challenging and time-consuming even for the expert eye. In this study, we trained a deep learning convolutional neural network (CNN) model (inception v3) on histopathology images obtained from The Cancer Genome Atlas (TCGA) to accurately classify whole-slide pathology images into adenocarcinoma, squamous cell carcinoma or normal lung tissue. Our method slightly outperforms a human pathologist, achieving better sensitivity and specificity, with ~0.97 average Area Under the Curve (AUC) on a held-out population of whole-slide scans. Furthermore, we trained the neural network to predict the ten most commonly mutated genes in lung adenocarcinoma. We found that six of these genes - STK11, EGFR, FAT1, SETBP1, KRAS and TP53 - can be predicted from pathology images with an accuracy ranging from 0.733 to 0.856, as measured by the AUC on the held-out population. These findings suggest that deep learning models can offer both specialists and patients a fast, accurate and inexpensive detection

Download data

  • Downloaded 9,336 times
  • Download rankings, all-time:
    • Site-wide: 284 out of 93,433
    • In cancer biology: 6 out of 3,302
  • Year to date:
    • Site-wide: 750 out of 93,433
  • Since beginning of last month:
    • Site-wide: 1,588 out of 93,433

Altmetric data

Downloads over time

Distribution of downloads per paper, site-wide


Sign up for the Rxivist weekly newsletter! (Click here for more details.)