Rxivist logo

Rxivist combines preprints from bioRxiv with data from Twitter to help you find the papers being discussed in your field. Currently indexing 64,753 bioRxiv papers from 287,057 authors.

A powerful subset-based gene-set analysis method identifies novel associations and improves interpretation in UK Biobank

By Diptavo Dutta, Peter VandeHaar, Laura Scott, Michael Boehnke, Seunggeun Lee

Posted 10 Oct 2019
bioRxiv DOI: 10.1101/799791

A test of association between the phenotype and a set of genes within a biological pathway can be complementary to single variant or single gene association analysis and provide further insights into the genetic architecture of complex phenotypes. Although multiple methods exist to perform such a gene-set analysis, most have low statistical power when only a small fraction of the genes are associated with the phenotype. Further, since existing methods cannot identify possible genes driving association signals, interpreting results of such association in terms of the underlying genetic mechanism is challenging. Here, we introduce Gene-set analysis Association Using Sparse Signals (GAUSS), a method for gene-set association analysis with GWAS summary statistics. In addition to providing a p-value for association, GAUSS identifies the subset of genes that have the maximal evidence of association and appears to drive the association. Using pre-computed correlation structure among test statistics from a reference panel, the p-value calculation is substantially faster compared to other permutation or simulation-based approaches. Our numerical experiments show that GAUSS can increase power over several existing methods while controlling type-I error under a variety of association models. Through the analysis of summary statistics from the UK Biobank data for 1,403 phenotypes, we show that GAUSS is scalable and can identify associations across many phenotypes and gene-sets.

Download data

  • Downloaded 165 times
  • Download rankings, all-time:
    • Site-wide: 50,650 out of 64,753
    • In genetics: 3,000 out of 3,675
  • Year to date:
    • Site-wide: 28,005 out of 64,753
  • Since beginning of last month:
    • Site-wide: 1,348 out of 64,753

Altmetric data

Downloads over time

Distribution of downloads per paper, site-wide

Sign up for the Rxivist weekly newsletter! (Click here for more details.)