Rxivist logo

Reverse GWAS: Using Genetics to Identify and Model Phenotypic Subtypes

By Andrew Dahl, Na Cai, Arthur Ko, Markku Laakso, Päivi Pajukanta, Jonathan Flint, Noah Zaitlen

Posted 17 Oct 2018
bioRxiv DOI: 10.1101/446492 (published DOI: 10.1371/journal.pgen.1008009)

Recent and classical work has revealed biologically and medically significant subtypes in complex diseases and traits. However, relevant subtypes are often unknown, unmeasured, or actively debated, making automatic statistical approaches to subtype definition particularly valuable. We propose reverse GWAS (RGWAS) to identify and validate subtypes using genetics and multiple traits: while GWAS seeks the genetic basis of a given trait, RGWAS seeks to define trait subtypes with distinct genetic bases. Unlike existing approaches relying on off-the-shelf clustering methods, RGWAS uses a bespoke decomposition, MFMR, to model covariates, binary traits, and population structure. We use extensive simulations to show these features can be crucial for power and calibration. We validate RGWAS in practice by recovering known stress subtypes in major depressive disorder. We then show the utility of RGWAS by identifying three novel subtypes of metabolic traits. We biologically validate these metabolic subtypes with SNP-level tests and a novel polygenic test: the former recover known metabolic GxE SNPs; the latter suggests genetic heterogeneity may explain substantial missing heritability. Crucially, statins, which are widely prescribed and theorized to increase diabetes risk, have opposing effects on blood glucose across metabolic subtypes, suggesting potential have potential translational value.

Download data

  • Downloaded 1,751 times
  • Download rankings, all-time:
    • Site-wide: 16,494
    • In genetics: 661
  • Year to date:
    • Site-wide: 136,319
  • Since beginning of last month:
    • Site-wide: 140,983

Altmetric data

Downloads over time

Distribution of downloads per paper, site-wide