Rxivist logo

Rxivist combines preprints from bioRxiv with data from Twitter to help you find the papers being discussed in your field. Currently indexing 62,102 bioRxiv papers from 275,834 authors.

Fine-scale Inference of Ancestry Segments without Prior Knowledge of Admixing Groups

By Michael Salter-Townshend, Simon Myers

Posted 25 Jul 2018
bioRxiv DOI: 10.1101/376137 (published DOI: 10.1534/genetics.119.302139)

We present an algorithm for inferring ancestry segments and characterizing admixture events, which involve an arbitrary number of genetically differentiated groups coming together. This allows inference of the demographic history of the species, properties of admixing groups, identification of signatures of natural selection, and may aid disease gene mapping. The algorithm employs nested hidden Markov models to obtain local ancestry estimation along the genome for each admixed individual. In a range of simulations, the accuracy of these estimates equals or exceeds leading existing methods that return local ancestry. Moreover, and unlike these approaches, we do not require any prior knowledge of the relationship between sub-groups of donor reference haplotypes and the unseen mixing ancestral populations. Instead, our approach infers these in terms of conditional "copying probabilities". In application to the Human Genome Diversity Panel we corroborate many previously inferred admixture events (e.g. an ancient admixture event in the Kalash). We further identify novel events such as complex 4-way admixture in San-Khomani individuals, and show that Eastern European populations possess 1-5% ancestry from a group resembling modern-day central Asians. We also identify evidence of recent natural selection favouring sub-Saharan ancestry at the HLA region, across North African individuals. We make available an R and C ++ software library, which we term MOSAIC (which stands for MOSAIC Organises Segments of Ancestry In Chromosomes).

Download data

  • Downloaded 1,440 times
  • Download rankings, all-time:
    • Site-wide: 3,820 out of 62,102
    • In genomics: 711 out of 4,270
  • Year to date:
    • Site-wide: 7,803 out of 62,102
  • Since beginning of last month:
    • Site-wide: 11,576 out of 62,102

Altmetric data

Downloads over time

Distribution of downloads per paper, site-wide

Sign up for the Rxivist weekly newsletter! (Click here for more details.)