Bayesian Estimation of Population Size Changes by Sampling Tajima’s Trees

By Julia A Palacios, Amandine Véber, Lorenzo Cappello, Zhangyuan Wang, John Wakeley, Sohini Ramachandran

Posted 11 Apr 2019
bioRxiv DOI: 10.1101/605352 (published DOI: 10.1534/genetics.119.302373)

The large state space of gene genealogies is a major hurdle for inference methods based on Kingman’s coalescent. Here, we present a new Bayesian approach for inferring past population sizes, which relies on a lower-resolution coalescent process that we refer to as “Tajima’s coalescent”. Tajima’s coalescent has a drastically smaller state space than the standard Kingman coalescent and is therefore a computationally more efficient model. We provide a new algorithm for efficient and exact likelihood calculations for data without recombination, which exploits a directed acyclic graph and a correspondingly tailored Markov chain Monte Carlo method. We compare the performance of our Bayesian Estimation of population size changes by Sampling Tajima’s Trees (BESTT) with that of a popular implementation of coalescent-based inference in BEAST, using simulated and human data. We empirically demonstrate that BESTT can accurately infer effective population sizes and that it provides an efficient alternative to Kingman’s coalescent. The algorithms described here are implemented in the R package phylodyn, which is available for download at <https://github.com/JuliaPalacios/phylodyn>.
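To give a sense of scale for the state-space claim above, the following sketch counts tree topologies on n samples under two resolutions. It assumes the usual description of the two processes, which the abstract itself does not spell out: Kingman's coalescent tracks ranked, labeled binary trees, of which there are n!(n-1)!/2^(n-1), while Tajima's coalescent tracks ranked, unlabeled tree shapes, counted by the Euler zigzag number with index n-1. The function names are illustrative only and are not part of phylodyn.

```python
from math import factorial


def kingman_tree_count(n):
    """Ranked, labeled binary trees on n leaves: n! (n-1)! / 2^(n-1)."""
    return factorial(n) * factorial(n - 1) // 2 ** (n - 1)


def euler_zigzag(m):
    """m-th Euler zigzag number, computed with the Entringer recurrence
    E(i, k) = E(i, k-1) + E(i-1, i-k), with E(0, 0) = 1 and E(i, 0) = 0."""
    row = [1]  # row 0 of the Entringer triangle
    for i in range(1, m + 1):
        new = [0]
        for k in range(1, i + 1):
            new.append(new[k - 1] + row[i - k])
        row = new
    return row[-1]


def tajima_tree_count(n):
    """Ranked, unlabeled binary tree shapes on n leaves (assumed state
    space of Tajima's coalescent): Euler zigzag number with index n-1."""
    return euler_zigzag(n - 1)


if __name__ == "__main__":
    for n in (4, 5, 10, 20):
        print(n, kingman_tree_count(n), tajima_tree_count(n))
```

Under these assumptions, 10 samples give 2,571,912,000 ranked labeled trees but only 7,936 ranked unlabeled shapes. These are counts of topologies only; the number of trees actually visited by the sampler also depends on the data, which this sketch does not model.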

Download data

  • Downloaded 696 times
  • Download rankings, all-time:
    • Site-wide: 44,216
    • In genetics: 1,948
  • Year to date:
    • Site-wide: 134,622
  • Since beginning of last month:
    • Site-wide: 147,452
