Quantifying the extent to which index event biases influence large genetic association studies
Michael P. Bancks,
Sam E. Jones,
Andrew R. Wood,
Lynne J. Hocking,
Marcus A. Tuke,
Katherine S. Ruth,
Ewan R Pearson,
Rachel M Freathy,
Patricia B Munroe,
Michael N Weedon,
James S Pankow,
Timothy M. Frayling
Posted 12 Sep 2016
bioRxiv DOI: 10.1101/074781 (published DOI: 10.1093/hmg/ddw433)
As genetic association studies grow to hundreds of thousands of individuals, subtle biases may influence conclusions. One possible bias is "index event bias" (IEB), also called "collider bias", which is caused by stratifying by, or enriching for, disease status when testing associations between gene variants and a disease-associated trait. We first provided a statistical framework for quantifying IEB and then identified real examples of IEB across a range of study and analytical designs. We observed evidence of biased associations for some disease alleles and genetic risk scores, even in population-based studies. For example, a genetic risk score consisting of type 2 diabetes variants was associated with lower BMI in 113,203 type 2 diabetes controls from the population-based UK Biobank study (-0.010 SDs BMI per allele, P=5E-4), an association entirely driven by IEB. Three of 11 individual type 2 diabetes risk alleles and 10 of 25 hypertension alleles were associated with lower BMI at P<0.05 in UK Biobank when analyzing disease-free individuals only, of which six hypertension alleles remained associated at P<0.05 after correction for IEB. Our analysis suggested that the associations between CCND2 and TCF7L2 diabetes risk alleles and BMI could be explained, at least partially, by IEB. Variants remaining associated after correction may be pleiotropic and include those in CYP17A1 (allele associated with hypertension risk and lower BMI). In conclusion, IEB may result in false positive or false negative associations in very large studies stratified or strongly enriched for/against disease cases.
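The mechanism described in the abstract can be illustrated with a small simulation. The sketch below is not the authors' statistical framework. It is a toy liability-threshold model in which every name and parameter value (allele frequency, effect sizes, prevalence, sample size) is an assumption chosen for illustration. A risk allele and a standardized trait (standing in for BMI) are generated independently, both raise disease liability, and the allele-trait slope is then estimated in the full sample and within disease strata.

```python
import numpy as np


def simulate_index_event_bias(n=200_000, allele_freq=0.3, beta_g=0.5,
                              beta_x=0.5, prevalence=0.3, seed=0):
    """Toy simulation of index event (collider) bias.

    All parameter values are illustrative assumptions, not estimates
    taken from the paper.
    """
    rng = np.random.default_rng(seed)

    # Risk-allele count (0, 1, or 2 copies) and a standardized trait,
    # drawn independently: there is no true allele-trait association.
    g = rng.binomial(2, allele_freq, size=n)
    bmi = rng.standard_normal(n)

    # Both the allele and the trait raise disease liability.
    liability = beta_g * g + beta_x * bmi + rng.standard_normal(n)

    # The top `prevalence` fraction of liabilities are cases.
    threshold = np.quantile(liability, 1.0 - prevalence)
    case = liability > threshold

    def slope(x, y):
        # Least-squares slope of y on x (trait SDs per allele).
        return np.polyfit(x, y, 1)[0]

    return {
        "all": slope(g, bmi),
        "controls": slope(g[~case], bmi[~case]),
        "cases": slope(g[case], bmi[case]),
    }


if __name__ == "__main__":
    for stratum, beta in simulate_index_event_bias().items():
        print(f"{stratum:>8}: {beta:+.3f} SD per allele")
```

Under these assumptions, the slope in the full sample should be close to zero, while the slopes within controls and within cases should be negative. Disease-free carriers of the risk allele tend to have lower trait values, because otherwise they would more often have crossed the liability threshold. This is the direction of the spurious association with lower BMI that the abstract reports among type 2 diabetes controls.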