

Dating genomic variants and shared ancestry in population-scale sequencing data

By Patrick K. Albers, Gil McVean

Posted 13 Sep 2018
bioRxiv DOI: 10.1101/416610

The origin and fate of new mutations within species is the fundamental process underlying evolution. However, while previous efforts have been focused on characterizing the presence, frequency, and phenotypic impact of genetic variation, the evolutionary histories of most variants are largely unexplored. We have developed a non-parametric approach for estimating the date of origin of genetic variants that can be applied to large-scale genomic variation data sets. We demonstrate the accuracy and robustness of the approach through simulation and apply it to over 16 million single nucleotide polymorphisms (SNPs) from two publicly available human genomic diversity resources. We characterize the differential relationship between variant frequency and age in different geographical regions and demonstrate the value of allele age in interpreting variants of known functional and selective importance. Finally, we use allele age estimates to power a rapid approach for inferring the genealogical history of a single genome or a group of individuals.

Download data

  • Downloaded 1,864 times
  • Download rankings, all-time:
    • Site-wide: 2,126 out of 55,165
    • In genomics: 460 out of 3,907
  • Year to date:
    • Site-wide: 2,753 out of 55,165
  • Since beginning of last month:
    • Site-wide: 32,155 out of 55,165

