A high-throughput multiplexing and selection strategy to complete bacterial genomes
Anna K. Pöntinen,
Rebecca A. Gladstone,
Anita C. Schürch,
Pål J. Johnsen,
Posted 14 Jun 2021
bioRxiv DOI: 10.1101/2021.06.14.448320
Posted 14 Jun 2021
Background: Bacterial whole-genome sequencing based on short-read sequencing data often results in a draft assembly formed by contiguous sequences. The introduction of long-read sequencing technologies permits to unambiguously bridge those contiguous sequences into complete genomes. However, the elevated costs associated with long-read sequencing frequently limit the number of bacterial isolates that can be long-read sequenced. Here we evaluated the recently released 96 barcoding kit from Oxford Nanopore Technologies (ONT) to generate complete genomes on a high-throughput basis. In addition, we propose a long-read isolate selection strategy that optimizes a representative selection of isolates from large-scale bacterial collections. Results: Despite an uneven distribution of long-reads per barcode, near-complete chromosomal sequences (assembly contiguity = 0.89) were generated for 96 Escherichia coli isolates with associated short-read sequencing data. The assembly contiguity of the plasmid replicons was even higher (0.98) which indicated the suitability of the multiplexing strategy for studies focused on resolving plasmid sequences. We benchmarked hybrid and ONT-only assemblies and showed that the combination of ONT sequencing data with short-read sequencing data is still highly desirable: (i) to perform an unbiased selection of isolates for long-read sequencing, (ii) to achieve an optimal genome accuracy and completeness, and (iii) to include small plasmids underrepresented in the ONT library. Conclusions: The proposed long-read isolate selection ensures completing bacterial genomes of isolates that span the genome diversity inherent in large collections of bacterial isolates. We show the potential of using this multiplexing approach to close bacterial genomes on a high-throughput basis.
- Downloaded 673 times
- Download rankings, all-time:
- Site-wide: 52,525
- In microbiology: 3,191
- Year to date:
- Site-wide: None
- Since beginning of last month:
- Site-wide: 23,441
Downloads over time
Distribution of downloads per paper, site-wide
- 27 Nov 2020: The website and API now include results pulled from medRxiv as well as bioRxiv.
- 18 Dec 2019: We're pleased to announce PanLingua, a new tool that enables you to search for machine-translated bioRxiv preprints using more than 100 different languages.
- 21 May 2019: PLOS Biology has published a community page about Rxivist.org and its design.
- 10 May 2019: The paper analyzing the Rxivist dataset has been published at eLife.
- 1 Mar 2019: We now have summary statistics about bioRxiv downloads and submissions.
- 8 Feb 2019: Data from Altmetric is now available on the Rxivist details page for every preprint. Look for the "donut" under the download metrics.
- 30 Jan 2019: preLights has featured the Rxivist preprint and written about our findings.
- 22 Jan 2019: Nature just published an article about Rxivist and our data.
- 13 Jan 2019: The Rxivist preprint is live!