Rxivist logo

Rxivist combines preprints from bioRxiv with data from Twitter to help you find the papers being discussed in your field. Currently indexing 57,553 bioRxiv papers from 264,961 authors.

New synthetic-diploid benchmark for accurate variant calling evaluation

By Heng Li, Jonathan M Bloom, Yossi Farjoun, Mark Fleharty, Laura Gauthier, Benjamin Neale, Daniel MacArthur

Posted 22 Nov 2017
bioRxiv DOI: 10.1101/223297 (published DOI: 10.1038/s41592-018-0054-7)

Constructed from the consensus of multiple variant callers based on short-read data, existing benchmark datasets for evaluating variant calling accuracy are biased toward easy regions accessible by known algorithms. We derived a new benchmark dataset from the de novo PacBio assemblies of two human cell lines that are homozygous across the whole genome. This benchmark provides a more accurate and less biased estimate of the error rate of small variant calls in a realistic context.

Download data

  • Downloaded 2,773 times
  • Download rankings, all-time:
    • Site-wide: 1,155 out of 57,611
    • In bioinformatics: 243 out of 5,869
  • Year to date:
    • Site-wide: 6,863 out of 57,611
  • Since beginning of last month:
    • Site-wide: 11,932 out of 57,611

Altmetric data


Downloads over time

Distribution of downloads per paper, site-wide


Sign up for the Rxivist weekly newsletter! (Click here for more details.)


News