Rxivist logo

Improving on a modal-based estimation method: model averaging for consistent and efficient estimation in Mendelian randomization when a plurality of candidate instruments are valid

By Stephen Burgess, Verena Zuber, Apostolos Gkatzionis, Jessica MB Rees, Christopher N Foley

Posted 11 Aug 2017
bioRxiv DOI: 10.1101/175372

Background: A robust method for Mendelian randomization does not require all genetic variants to be valid instruments to give consistent estimates of a causal parameter. Several such methods have been developed, including a mode-based estimation method giving consistent estimates if a plurality of genetic variants are valid instruments; that is, there is no larger subset of invalid instruments estimating the same causal parameter than the subset of valid instruments. Methods: We here develop a model averaging method that gives consistent estimates under the same 'plurality of valid instruments' assumption. The method considers a mixture distribution of estimates derived from each subset of genetic variants. The estimates are weighted such that subsets with more genetic variants receive more weight, unless variants in the subset have heterogeneous causal estimates, in which case that subset is severely downweighted. The mode of this mixture distribution is the causal estimate. This heterogeneity-penalized model averaging method has several technical advantages over the previously proposed mode-based estimation method. Results: The heterogeneity-penalized model averaging method outperformed the mode-based estimation in terms of efficiency and outperformed other robust methods in terms of Type 1 error rate in an extensive simulation analysis. The proposed method suggests two distinct mechanisms by which inflammation affects coronary heart disease risk, with subsets of variants suggesting both positive and negative causal effects. Conclusions: The heterogeneity-penalized model averaging method is an additional robust method for Mendelian randomization with excellent theoretical and practical properties, and can reveal features in the data such as the presence of multiple causal mechanisms.

Download data

  • Downloaded 515 times
  • Download rankings, all-time:
    • Site-wide: 57,637
    • In genetics: 2,572
  • Year to date:
    • Site-wide: 109,073
  • Since beginning of last month:
    • Site-wide: 111,615

Altmetric data

Downloads over time

Distribution of downloads per paper, site-wide