Rxivist logo

Assessing computational variant effect predictors with a large prospective cohort

By Da Kuang, Roujia Li, Yingzhou Wu, Jochen Weile, Robert A. Hegele, Frederick P. Roth

Posted 20 Sep 2021
bioRxiv DOI: 10.1101/2021.09.20.459182

Computational predictors can help interpret pathogenicity of human genetic variants, especially for the majority of variants where no experimental data are available. However, because we lack a high-quality unbiased test set, identifying the best-performing predictors remains a challenge. To address this issue, we evaluated missense variant effect predictors using genotypes and traits from a prospective cohort. We considered 139 gene-trait combinations with rare-variant burden association based on at least one of four systematic studies using phenotypes and whole-exome sequences from ~200K UK Biobank participants. Using an evaluation set of 35,525 rare missense variants and the relevant associated traits, we assessed the correlation of participants' traits with scores derived from 20 computational variant effect predictors. We found that two predictors--VARITY and REVEL--outperformed all others according to multiple performance measures. We expect that this study will help in selecting variant effect predictors, for both research and clinical purposes, while providing an unbiased benchmarking strategy that can be applied to additional cohorts and predictors.

Download data

  • Downloaded 286 times
  • Download rankings, all-time:
    • Site-wide: 124,751
    • In systems biology: 2,669
  • Year to date:
    • Site-wide: 13,338
  • Since beginning of last month:
    • Site-wide: 42,118

Altmetric data

Downloads over time

Distribution of downloads per paper, site-wide