Rxivist logo

FIQT: a simple, powerful method to accurately estimate effect sizes in genome scans

By Tim Bigdeli, Donghyung Lee, Brien Riley, Vladimir Vladimirov, Ayman H Fanous, Kenneth Kendler, Silviu-Alin Bacanu

Posted 13 May 2015
bioRxiv DOI: 10.1101/019299

Genome scans, including both genome-wide association studies and deep sequencing, continue to discover a growing number of significant association signals for various traits. However, often variants meeting genome-wide significance criteria explain far less of the overall trait variance than “sub-threshold” association signals. To extract these sub-threshold signals, there is a need for methods which accurately estimate the mean of all (normally-distributed) test-statistics from a genome scan (i.e., Z-scores). This is currently achieved by the difficult procedures of adjusting all Z-score (χ_1^2) statistics for “winner’s curse” (multiple testing). Given that multiple testing adjustments are much simpler for p-values, we propose a method for estimating Z-scores means by i) first adjusting their p-values for multiple testing and then ii) transforming the adjusted p-values to upper tail Z-scores with the sign of the original statistics. Because a False Discovery Rate (FDR) procedure is used for multiple testing adjustment, we denote this method FDR Inverse Quantile Transformation (FIQT). When compared to competitors, e.g. Empirical Bayes (including proposed improvements), FIQT is more i) accurate and ii) computationally efficient by orders of magnitude. Its accuracy advantage is substantial at larger sample sizes and/or moderate numbers of association signals. Practical application of FIQT to Z-scores from the first Psychiatric Genetic Consortium (PGC) schizophrenia predicts a non-trivial fraction of the significant signal regions from the subsequent published PGC schizophrenia studies. Finally, we suggest that FIQT might be i) used to improve subject level risk prediction and ii) further improved by modelling the noncentrality of χ_1^2 statistics.

Download data

  • Downloaded 519 times
  • Download rankings, all-time:
    • Site-wide: 72,801
    • In genetics: 3,047
  • Year to date:
    • Site-wide: 100,538
  • Since beginning of last month:
    • Site-wide: 70,026

Altmetric data

Downloads over time

Distribution of downloads per paper, site-wide