Rxivist logo

Rxivist combines preprints from bioRxiv with data from Twitter to help you find the papers being discussed in your field. Currently indexing 67,014 bioRxiv papers from 294,953 authors.

New synthetic-diploid benchmark for accurate variant calling evaluation

By Heng Li, Jonathan M Bloom, Yossi Farjoun, Mark Fleharty, Laura Gauthier, Benjamin Neale, Daniel MacArthur

Posted 22 Nov 2017
bioRxiv DOI: 10.1101/223297 (published DOI: 10.1038/s41592-018-0054-7)

Constructed from the consensus of multiple variant callers based on short-read data, existing benchmark datasets for evaluating variant calling accuracy are biased toward easy regions accessible by known algorithms. We derived a new benchmark dataset from the de novo PacBio assemblies of two human cell lines that are homozygous across the whole genome. This benchmark provides a more accurate and less biased estimate of the error rate of small variant calls in a realistic context.

Download data

  • Downloaded 2,870 times
  • Download rankings, all-time:
    • Site-wide: 1,276 out of 67,027
    • In bioinformatics: 256 out of 6,609
  • Year to date:
    • Site-wide: 7,976 out of 67,027
  • Since beginning of last month:
    • Site-wide: 27,909 out of 67,027

Altmetric data


Downloads over time

Distribution of downloads per paper, site-wide


Sign up for the Rxivist weekly newsletter! (Click here for more details.)


News