Rxivist combines preprints from bioRxiv with data from Twitter to help you find the papers being discussed in your field. Currently indexing 54,615 bioRxiv papers from 252,242 authors.

Integrative haplotype estimation with sub-linear complexity

By Olivier Delaneau, Jean-Francois Zagury, Matthew R Robinson, Jonathan Marchini, Emmanouil Dermitzakis

Posted 13 Dec 2018
bioRxiv DOI: 10.1101/493403

The number of human genomes being genotyped or sequenced is increasing exponentially, and efficient haplotype estimation methods able to handle this amount of data are now required. Here, we present a new method, SHAPEIT4, which substantially improves upon other methods for processing large genotype and high-coverage sequencing datasets. It notably exhibits sub-linear scaling with sample size, provides highly accurate haplotypes, and allows integrating external phasing information such as large reference panels of haplotypes, collections of pre-phased variants, and long sequencing reads. We provide SHAPEIT4 as open source software at https://odelaneau.github.io/shapeit4/ and demonstrate its accuracy and running times on two gold standard datasets: the UK Biobank data and Genome in a Bottle.

Download data

  • Downloaded 447 times
  • Download rankings, all-time:
    • Site-wide: 18,136 out of 54,615
    • In bioinformatics: 2,701 out of 5,669
  • Year to date:
    • Site-wide: 7,059 out of 54,615
  • Since beginning of last month:
    • Site-wide: 8,825 out of 54,615

