Rxivist logo

A fast and robust Bayesian nonparametric method for prediction of complex traits using summary statistics

By Geyu Zhou, Hongyu Zhao

Posted 02 Dec 2020
bioRxiv DOI: 10.1101/2020.11.30.405241

Genetic prediction of complex traits has great promise for disease prevention, monitoring, and treatment. The development of accurate risk prediction models is hindered by the wide diversity of genetic architecture across different traits, limited access to individual level data for training and parameter tuning, and the demand for computational resources. To overcome the limitations of the most existing methods that make explicit assumptions on the underlying genetic architecture and need a separate validation data set for parameter tuning, we develop a summary statistics-based nonparametric method that does not rely on validation datasets to tune parameters. In our implementation, we refine the commonly used likelihood assumption to deal with the discrepancy between summary statistics and external reference panel. We also leverage the block structure of the reference linkage disequilibrium matrix for implementation of a parallel algorithm. Through simulations and applications to twelve traits, we show that our method is adaptive to different genetic architectures, statistically robust, and computationally efficient. Our method is available at https://github.com/eldronzhou/SDPR.

Download data

  • Downloaded 748 times
  • Download rankings, all-time:
    • Site-wide: 45,474
    • In bioinformatics: 4,689
  • Year to date:
    • Site-wide: 45,491
  • Since beginning of last month:
    • Site-wide: 155,098

Altmetric data

Downloads over time

Distribution of downloads per paper, site-wide