Critical Assessment of Metagenome Interpretation − a benchmark of computational metagenomics software
Tue Sparholt Jørgensen,
Philip D Blood,
Matthew Z. DeMaere,
Lars Hestbjerg Hansen,
Søren J. Sørensen,
Burton K H Chia,
Jeff L Froula,
Dongwan Don Kang,
Jeffrey J Cook,
Steven W. Singer,
Genivaldo Gueiros Z. Silva,
Daniel A. Cuevas,
Robert A. Edwards,
Vitor C. Piro,
Bernhard Y. Renard,
Nikos C. Kyrpides,
Julia A. Vorholt,
Edward M Rubin,
Aaron E. Darling,
Alice C McHardy
Posted 09 Jan 2017
bioRxiv DOI: 10.1101/099127 (published DOI: 10.1038/nmeth.4458)
Posted 09 Jan 2017
In metagenome analysis, computational methods for assembly, taxonomic profiling and binning are key components facilitating downstream biological data interpretation. However, a lack of consensus about benchmarking datasets and evaluation metrics complicates proper performance assessment. The Critical Assessment of Metagenome Interpretation (CAMI) challenge has engaged the global developer community to benchmark their programs on datasets of unprecedented complexity and realism. Benchmark metagenomes were generated from ~700 newly sequenced microorganisms and ~600 novel viruses and plasmids, including genomes with varying degrees of relatedness to each other and to publicly available ones and representing common experimental setups. Across all datasets, assembly and genome binning programs performed well for species represented by individual genomes, while performance was substantially affected by the presence of related strains. Taxonomic profiling and binning programs were proficient at high taxonomic ranks, with a notable performance decrease below the family level. Parameter settings substantially impacted performances, underscoring the importance of program reproducibility. While highlighting current challenges in computational metagenomics, the CAMI results provide a roadmap for software selection to answer specific research questions.
- Downloaded 6,783 times
- Download rankings, all-time:
- Site-wide: 466 out of 93,204
- In bioinformatics: 66 out of 8,734
- Year to date:
- Site-wide: 16,528 out of 93,204
- Since beginning of last month:
- Site-wide: 20,318 out of 93,204
Downloads over time
Distribution of downloads per paper, site-wide
- 18 Dec 2019: We're pleased to announce PanLingua, a new tool that enables you to search for machine-translated bioRxiv preprints using more than 100 different languages.
- 21 May 2019: PLOS Biology has published a community page about Rxivist.org and its design.
- 10 May 2019: The paper analyzing the Rxivist dataset has been published at eLife.
- 1 Mar 2019: We now have summary statistics about bioRxiv downloads and submissions.
- 8 Feb 2019: Data from Altmetric is now available on the Rxivist details page for every preprint. Look for the "donut" under the download metrics.
- 30 Jan 2019: preLights has featured the Rxivist preprint and written about our findings.
- 22 Jan 2019: Nature just published an article about Rxivist and our data.
- 13 Jan 2019: The Rxivist preprint is live!