Sparse Deep Neural Networks on Imaging Genetics for Schizophrenia Case-Control Classification
Jessica A. Turner,
Theo G. M. van Erp,
Lars T. Westlye,
Daniel H. Mathalon,
Daniel S. O'Leary,
Posted 12 Jun 2020
medRxiv DOI: 10.1101/2020.06.11.20128975
Posted 12 Jun 2020
Machine learning approaches hold potential for deconstructing complex psychiatric traits and yielding biomarkers which have a large potential for clinical application. Particularly, the advancement in deep learning methods has promoted them as highly promising tools for this purpose due to their capability to handle high-dimensional data and automatically extract high-level latent features. However, current proposed approaches for psychiatric classification or prediction using biological data do not allow direct interpretation of original features, which hinders insights into the biological underpinnings and development of biomarkers. In the present study, we introduce a sparse deep neural network (DNN) approach to identify sparse and interpretable features for schizophrenia (SZ) case-control classification. An L0-norm regularization is implemented on the input layer of the network for sparse feature selection, which can later be interpreted based on importance weights. We applied the proposed approach on a large multi-study cohort (N = 1,684) with brain structural MRI (gray matter volume (GMV)) and genetic (single nucleotide polymorphism (SNP)) data for discrimination of patients with SZ vs. controls. A total of 634 individuals served as training samples, and the resulting classification model was evaluated for generalizability on three independent data sets collected at different sites with different scanning protocols (n = 635, 255 and 160, respectively). We examined the classification power of pure GMV features, as well as combined GMV and SNP features. The performance of the proposed approach was compared with that yielded by an independent component analysis + support vector machine (ICA+SVM) framework. Empirical experiments demonstrated that sparse DNN slightly outperformed ICA+SVM and more effectively fused GMV and SNP features for SZ discrimination. With combined GMV and SNP features, sparse DNN yielded an average classification error rate of 28.98% on external data. The importance weights suggested that the DNN model prioritized to select frontal and superior temporal gyrus for SZ classification when a high sparsity was enforced, and parietal regions were further included with a lower sparsity setting, which strongly echoed previous literature. This is the first attempt to apply an interpretable sparse DNN model to imaging and genetic features for SZ classification with generalizability assessed in a large and multi-study cohort. The results validate the application of the proposed approach to SZ classification, and promise extended utility on other data modalities (e.g. functional and diffusion images) and traits (e.g. continuous scores) which ultimately may result in clinically useful tools.
- Downloaded 264 times
- Download rankings, all-time:
- Site-wide: 103,452
- In psychiatry and clinical psychology: 430
- Year to date:
- Site-wide: 79,517
- Since beginning of last month:
- Site-wide: 99,046
Downloads over time
Distribution of downloads per paper, site-wide
- 27 Nov 2020: The website and API now include results pulled from medRxiv as well as bioRxiv.
- 18 Dec 2019: We're pleased to announce PanLingua, a new tool that enables you to search for machine-translated bioRxiv preprints using more than 100 different languages.
- 21 May 2019: PLOS Biology has published a community page about Rxivist.org and its design.
- 10 May 2019: The paper analyzing the Rxivist dataset has been published at eLife.
- 1 Mar 2019: We now have summary statistics about bioRxiv downloads and submissions.
- 8 Feb 2019: Data from Altmetric is now available on the Rxivist details page for every preprint. Look for the "donut" under the download metrics.
- 30 Jan 2019: preLights has featured the Rxivist preprint and written about our findings.
- 22 Jan 2019: Nature just published an article about Rxivist and our data.
- 13 Jan 2019: The Rxivist preprint is live!