Rxivist logo

Population Genotype Calling from Low-coverage Sequencing Data

By Lin Huang, Petr Danecek, Sivan Bercovici, Serafim Batzoglou

Posted 05 Nov 2016
bioRxiv DOI: 10.1101/085936

In recent years, several large-scale whole-genome projects sequencing tens of thousands of individuals were completed, with larger studies are underway. These projects aim to provide high-quality genotypes for a large number of whole genomes in a cost-efficient manner, by sequencing each genome at low coverage and subsequently identifying alleles jointly in the entire cohort. Here we present Ref-Reveel, a novel method for large-scale population genotyping. We show that Ref-Reveel provides genotyping at a higher accuracy and higher efficiency in comparison to existing methods by applying our method to one of the largest whole-genome sequencing datasets presently available to the public. We further show that utilizing the resulting genotype panel as references, through the Ref-Reveel framework, greatly improves the ability to call genotypes accurately on newly sequenced genomes. In addition, we present a Ref-Reveel pipeline that is applicable for genotyping of very small datasets. In summary, Ref-Reveel is an accurate, scalable and applicable method for a wide range of genotyping scenarios, and will greatly improves the quality of calling genomic alterations in current and future large-scale sequencing projects.

Download data

  • Downloaded 1,198 times
  • Download rankings, all-time:
    • Site-wide: 15,038
    • In bioinformatics: 1,843
  • Year to date:
    • Site-wide: 71,639
  • Since beginning of last month:
    • Site-wide: 62,089

Altmetric data

Downloads over time

Distribution of downloads per paper, site-wide


Sign up for the Rxivist weekly newsletter! (Click here for more details.)