Advanced Whole Genome Sequencing Using a Complete PCR-free Massively Parallel Sequencing (MPS) Workflow
Posted 23 Dec 2019
bioRxiv DOI: 10.1101/2019.12.20.885517
Posted 23 Dec 2019
Systematic errors could be introduced by amplification during MPS library preparation and cluster/array formation. Polymerase Chain Reaction (PCR)-free library preparation methods have previously demonstrated improved sequencing quality with PCR-amplified read-clusters, however we hypothesized that some some InDel errors are still introduced by the remaining PCR step. Here we sequenced PCR-free libraries on MGI's PCR-free DNBSEQTM arrays to obtain for the first time a true PCR-free WGS (Whole Genome Sequencing). We used MGI's PCR-free WGS library preparation kits as recommended or with some modifications to make several NA12878 libraries. Reproducibly high quality libraries where obtained with low bias and less than 1% read duplication for both ultrasonic and enzymatic DNA fragmenting. In a triplicate analysis, over 96% SNPs and about 89% InDels in each library were found in at least one of the other two libraries. Using machine learning (ML) methods (DeepVariant or DNAscope), variant calling performance (SNPs F-measure>99.94%, InDels F-measure>99.6%) exceeded the widely accepted standards. The F-measure of 15X PCR-free ML-WGS was comparable to or even better than 30X PCR WGS analyzed with GATK. Furthermore, PCR-free WGS libraries sequenced on PCR-free DNBSEQTM platform have up to 50% less InDel errors compared to NovaSeq platform confirming that DNA clusters have PCR-generated errors.Enabled by the new PCR-free library kits, super high-thoughput sequencer and ML-based variant calling, DNBSEQ TM true PCR-free WGS provides a powerful solution to improve accuracy while reducing cost and analysis time to facilitate future precision medicine,cohort studies and large population genome project.
- Downloaded 1,539 times
- Download rankings, all-time:
- Site-wide: 6,571 out of 101,478
- In genomics: 1,016 out of 6,285
- Year to date:
- Site-wide: 1,625 out of 101,478
- Since beginning of last month:
- Site-wide: 5,179 out of 101,478
Downloads over time
Distribution of downloads per paper, site-wide
- 20 Oct 2020: Support for sorting preprints using Twitter activity has been removed, at least temporarily, until a new source of social media activity data becomes available.
- 18 Dec 2019: We're pleased to announce PanLingua, a new tool that enables you to search for machine-translated bioRxiv preprints using more than 100 different languages.
- 21 May 2019: PLOS Biology has published a community page about Rxivist.org and its design.
- 10 May 2019: The paper analyzing the Rxivist dataset has been published at eLife.
- 1 Mar 2019: We now have summary statistics about bioRxiv downloads and submissions.
- 8 Feb 2019: Data from Altmetric is now available on the Rxivist details page for every preprint. Look for the "donut" under the download metrics.
- 30 Jan 2019: preLights has featured the Rxivist preprint and written about our findings.
- 22 Jan 2019: Nature just published an article about Rxivist and our data.
- 13 Jan 2019: The Rxivist preprint is live!