
Expectation of the intercept from bivariate LD score regression in the presence of population stratification

By Loic Yengo, Jian Yang, Peter M. Visscher

Posted 28 Apr 2018
bioRxiv DOI: 10.1101/310565

Linkage disequilibrium (LD) score regression is an increasingly popular method used to quantify the level of confounding in genome-wide association studies (GWAS) or to estimate heritability and genetic correlation between traits. When applied to a pair of GWAS, the LD score regression (LDSC) methodology produces a statistic, referred to as the bivariate LDSC intercept, whose deviation from 0 is classically interpreted as an indication of sample overlap between the two GWAS. Here we propose an extension of the theory underlying the bivariate LDSC methodology, which accounts for population stratification within and between GWAS. Our extended theory predicts an inflation of the bivariate LDSC intercept when sample sizes and heritability are large, even in the absence of sample overlap. We illustrate our theoretical results with simulations based on actual SNP genotypes and we propose a re-interpretation of previously published results in light of our extended theory.
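
For orientation, the classical bivariate LDSC model (without population stratification) is commonly written as below. The notation is an assumption introduced here for illustration and is not taken from this abstract: z_{1j} and z_{2j} are the association z-scores of SNP j in the two GWAS, N_1 and N_2 the sample sizes, N_s the number of overlapping samples, M the number of SNPs, \ell_j the LD score of SNP j, \rho_g the genetic covariance, and \rho the phenotypic correlation among overlapping samples.

```latex
\mathbb{E}\left[ z_{1j}\, z_{2j} \right]
  = \underbrace{\frac{\sqrt{N_1 N_2}\,\rho_g}{M}}_{\text{slope}}\;\ell_j
  \;+\; \underbrace{\frac{\rho\, N_s}{\sqrt{N_1 N_2}}}_{\text{intercept}}
```

Under this classical form the intercept equals 0 when N_s = 0, which is why a non-zero intercept is usually read as evidence of sample overlap. The extended theory summarized above concerns additional contributions to the intercept from population stratification; their exact form is not given in this abstract.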

Download data

  • Downloaded 1,972 times
  • Download rankings, all-time:
    • Site-wide: 13,745
    • In genetics: 532
  • Year to date:
    • Site-wide: 63,167
  • Since beginning of last month:
    • Site-wide: 38,013
