Rxivist combines preprints from bioRxiv with data from Twitter to help you find the papers being discussed in your field. Currently indexing 65,227 bioRxiv papers from 288,972 authors.

Population size history from short genomic scaffolds: how short is too short?

By Graham Gower, Jono Tuke, AB Rohrlach, Julien Soubrier, Bastien Llamas, Nigel Bean, Alan Cooper

Posted 03 Aug 2018
bioRxiv DOI: 10.1101/382036

The Pairwise Sequentially Markov Coalescent (PSMC), and its extension PSMC', model past population sizes from a single diploid genome. Both models have been widely applied, even to organisms with scaffold-level genome reference assemblies of limited contiguity. However, it is unclear how PSMC and PSMC' perform on short scaffolds. We evaluated psmc and msmc, implementations of the PSMC and PSMC' models respectively, on simulated genomes with low contiguity, and compared results to those from fully contiguous data. Simulations with scaffold lengths from 100 Mb down to 10 kb revealed that psmc maintains high accuracy down to lengths of 100 kb, while msmc is accurate only down to 1 Mb. The discrepancy is not due to differing models, but stems from an implementation detail of msmc: homozygous tracts at the ends of scaffolds are discarded, making msmc unreliable for low-contiguity genomes. We recommend excluding data that are aligned to shorter scaffolds when undertaking demographic inference.
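As a rough illustration of the recommendation above, the following Python sketch selects scaffolds that meet a minimum length before demographic inference. It is not taken from the paper. The helper names (`parse_fai`, `filter_scaffolds`), the scaffold names, and the use of a samtools-style `.fai` index (name and length in the first two tab-separated columns) are assumptions made for illustration. The two thresholds simply restate the lengths reported in the abstract (100 kb for psmc, 1 Mb for msmc) and may need adjusting for real data.

```python
# Minimal sketch: keep only scaffolds long enough for demographic inference.
# Thresholds restate the lengths reported in the abstract; treat them as
# starting points rather than fixed rules.
PSMC_MIN_LENGTH = 100_000      # psmc reported accurate down to 100 kb
MSMC_MIN_LENGTH = 1_000_000    # msmc reported accurate down to 1 Mb


def parse_fai(lines):
    """Read scaffold names and lengths from .fai-style lines.

    Assumes tab-separated lines whose first two columns are the
    scaffold name and its length in base pairs.
    """
    lengths = {}
    for line in lines:
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 2:
            continue  # skip blank or malformed lines
        lengths[fields[0]] = int(fields[1])
    return lengths


def filter_scaffolds(lengths, min_length):
    """Return names of scaffolds at least min_length bp long, in input order."""
    return [name for name, length in lengths.items() if length >= min_length]


# Hypothetical index lines for three scaffolds of different lengths.
example_fai = [
    "scaffold_1\t2500000\t12\t60\t61",
    "scaffold_2\t400000\t2541690\t60\t61",
    "scaffold_3\t50000\t2948369\t60\t61",
]

lengths = parse_fai(example_fai)
keep_for_psmc = filter_scaffolds(lengths, PSMC_MIN_LENGTH)
keep_for_msmc = filter_scaffolds(lengths, MSMC_MIN_LENGTH)
print(keep_for_psmc)
print(keep_for_msmc)
```

The retained scaffold names could then be used to restrict alignments or variant calls before preparing input for either tool; how that restriction is applied depends on the pipeline in use.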

Download data

  • Downloaded 377 times
  • Download rankings, all-time:
    • Site-wide: 27,949 out of 65,227
    • In genetics: 1,800 out of 3,692
  • Year to date:
    • Site-wide: 22,473 out of 65,227
  • Since beginning of last month:
    • Site-wide: 20,358 out of 65,227


