Rxivist logo

Predicting Alzheimer's disease progression using deep recurrent neural networks

By Minh Nguyen, Tong He, Lijun An, Daniel C Alexander, Jiashi Feng, B.T. Thomas Yeo, for the Alzheimer’s Disease Neuroimaging Initiative

Posted 04 Sep 2019
bioRxiv DOI: 10.1101/755058 (published DOI: 10.1016/j.neuroimage.2020.117203)

Early identification of individuals at risk of developing Alzheimer's disease (AD) dementia is important for developing disease-modifying therapies. In this study, given multimodal AD markers and clinical diagnosis of an individual from one or more timepoints, we seek to predict the clinical diagnosis, cognition and ventricular volume of the individual for every month (indefinitely) into the future. We proposed and applied a minimal recurrent neural network (minimalRNN) model to data from The Alzheimer's Disease Prediction Of Longitudinal Evolution (TADPOLE) challenge, comprising longitudinal data of 1677 participants (Marinescu et al. 2018) from the Alzheimer's Disease Neuroimaging Initiative (ADNI). We compared the performance of the minimalRNN model and four baseline algorithms up to 6 years into the future. Most previous work on predicting AD progression ignore the issue of missing data, which is a prevalent issue in longitudinal data. Here, we explored three different strategies to handle missing data. Two of the strategies treated the missing data as a "preprocessing" issue, by imputing the missing data using the previous timepoint ("forward filling") or linear interpolation ("linear filling). The third strategy utilized the minimalRNN model itself to fill in the missing data both during training and testing ("model filling"). Our analyses suggest that the minimalRNN with "model filling" compared favorably with baseline algorithms, including support vector machine/regression, linear state space (LSS) model, and long short-term memory (LSTM) model. Importantly, although the training procedure utilized longitudinal data, we found that the trained minimalRNN model exhibited similar performance, when using only 1 input timepoint or 4 input timepoints, suggesting that our approach might work well with just cross-sectional data. An earlier version of our approach was ranked 5th (out of 53 entries) in the TADPOLE challenge in 2019. The current approach is ranked 2nd out of 63 entries as of June 3rd, 2020. ### Competing Interest Statement The authors have declared no competing interest.

Download data

  • Downloaded 1,822 times
  • Download rankings, all-time:
    • Site-wide: 13,578
    • In neuroscience: 1,406
  • Year to date:
    • Site-wide: 35,774
  • Since beginning of last month:
    • Site-wide: 24,412

Altmetric data

Downloads over time

Distribution of downloads per paper, site-wide