Rxivist logo

Quantification of frequency-dependent genetic architectures and action of negative selection in 25 UK Biobank traits

By Armin P Schoech, Daniel Jordan, Po-Ru Loh, Steven Gazal, Luke O’Connor, Daniel J. Balick, Pier F. Palamara, Hilary K Finucane, Shamil R. Sunyaev, Alkes L. Price

Posted 13 Sep 2017
bioRxiv DOI: 10.1101/188086 (published DOI: 10.1038/s41467-019-08424-6)

Understanding the role of rare variants is important in elucidating the genetic basis of human diseases and complex traits. It is widely believed that negative selection can cause rare variants to have larger per-allele effect sizes than common variants. Here, we develop a method to estimate the minor allele frequency (MAF) dependence of SNP effect sizes. We use a model in which per-allele effect sizes have variance proportional to [p(1-p)]α, where p is the MAF and negative values of α imply larger effect sizes for rare variants. We estimate α by maximizing its profile likelihood in a linear mixed model framework using imputed genotypes, including rare variants (MAF >0.07%). We applied this method to 25 UK Biobank diseases and complex traits (N=113,851). All traits produced negative α estimates with 20 significantly negative, implying larger rare variant effect sizes. The inferred best-fit distribution of true α values across traits had mean -0.38 (s.e. 0.02) and standard deviation 0.08 (s.e. 0.03), with statistically significant heterogeneity across traits (P=0.0014). Despite larger rare variant effect sizes, we show that for most traits analyzed, rare variants (MAF <1%) explain less than 10% of total SNP-heritability. Using evolutionary modeling and forward simulations, we validated the α model of MAF-dependent trait effects and estimated the level of coupling between fitness effects and trait effects. Based on this analysis an average genome-wide negative selection coefficient on the order of 10-4 or stronger is necessary to explain the α values that we inferred.

Download data

  • Downloaded 859 times
  • Download rankings, all-time:
    • Site-wide: 29,539
    • In genetics: 1,376
  • Year to date:
    • Site-wide: 103,705
  • Since beginning of last month:
    • Site-wide: 102,402

Altmetric data

Downloads over time

Distribution of downloads per paper, site-wide