An expanded analysis framework for multivariate GWAS connects inflammatory biomarkers to functional variants and disease
Sanni E Ruotsalainen,
Mary Pat Reeve,
Jukka T. Koskela
Posted 06 Dec 2019
bioRxiv DOI: 10.1101/867267 (published DOI: 10.1038/s41431-020-00730-8)
Posted 06 Dec 2019
Multivariate methods are known to increase the statistical power of association detection, but they have lacked essential follow-up analysis tools necessary for understanding the biology underlying these associations. We developed a novel computational workflow for multivariate GWAS follow-up analyses, including fine-mapping and identification of the subset of traits driving associations (driver traits). Many follow-up tools require univariate regression coefficients which are lacking from multivariate results. Our method overcomes this problem by using Canonical Correlation Analysis to turn each multivariate association into its optimal univariate Linear Combination Phenotype (LCP). This enables an LCP-GWAS, which in turn generates the statistics required for follow-up analyses. We implemented our method on 12 highly correlated inflammatory biomarkers in a Finnish population-based study. Altogether, we identified 11 associations, four of which (F5, ABO, C1orf140 and PDGFRB) were not detected by biomarker-specific analyses. Fine-mapping identified 19 signals within the 11 loci and driver trait analysis determined the traits contributing to the associations. A phenome-wide association study on the 19 putative causal variants from the signals in 176,899 individuals from the FinnGen study revealed 53 disease associations (p < 1x10-4). Several reported pQTLs in the 11 loci provided orthogonal evidence for the biologically relevant functions of the putative causal variants. Our novel multivariate analysis workflow provides a powerful addition to standard univariate GWAS analyses by enabling multivariate GWAS follow-up and thus promoting the advancement of powerful multivariate methods in genomics.
- Downloaded 695 times
- Download rankings, all-time:
- Site-wide: 36,504
- In genetics: 1,723
- Year to date:
- Site-wide: 21,746
- Since beginning of last month:
- Site-wide: 30,131
Downloads over time
Distribution of downloads per paper, site-wide
- 27 Nov 2020: The website and API now include results pulled from medRxiv as well as bioRxiv.
- 18 Dec 2019: We're pleased to announce PanLingua, a new tool that enables you to search for machine-translated bioRxiv preprints using more than 100 different languages.
- 21 May 2019: PLOS Biology has published a community page about Rxivist.org and its design.
- 10 May 2019: The paper analyzing the Rxivist dataset has been published at eLife.
- 1 Mar 2019: We now have summary statistics about bioRxiv downloads and submissions.
- 8 Feb 2019: Data from Altmetric is now available on the Rxivist details page for every preprint. Look for the "donut" under the download metrics.
- 30 Jan 2019: preLights has featured the Rxivist preprint and written about our findings.
- 22 Jan 2019: Nature just published an article about Rxivist and our data.
- 13 Jan 2019: The Rxivist preprint is live!