Rxivist logo

Accessible, curated metagenomic data through ExperimentHub

By Edoardo Pasolli, Lucas Schiffer, Paolo Manghi, Audrey Renson, Valerie Obenchain, Duy Tin Truong, Francesco Beghini, Faizan Malik, Marcel Ramos, Jennifer B. Dowd, Curtis Huttenhower, Martin Morgan, Nicola Segata, Levi Waldron

Posted 27 Jan 2017
bioRxiv DOI: 10.1101/103085 (published DOI: 10.1038/nmeth.4468)

We present curatedMetagenomicData, a Bioconductor and command-line interface to thousands of metagenomic profiles from the Human Microbiome Project and other publicly available datasets, and ExperimentHub, a platform for convenient cloud-based distribution of data to the R desktop. The resource provides standardized per-participant metadata linked to bacterial, fungal, archaeal, and viral taxonomic abundances, as well as quantitative metabolic functional profiles. The datasets can be immediately analyzed in R or other software with a minimum of bioinformatic expertise and no preprocessing of data. We demonstrate identification of taxonomic/functional correlations, an investigation of gut "enterotypes", and a comparison of the accuracy of disease classification from different data types. These documented analyses can be reproduced efficiently on a laptop, without the barriers of working with large-scale, raw sequencing data. The building and expansion of curatedMetagenomicData is based entirely on open source software and pipelines, to facilitate the addition of new microbiome datasets and methods.

Download data

  • Downloaded 2,757 times
  • Download rankings, all-time:
    • Site-wide: 8,024
    • In bioinformatics: 797
  • Year to date:
    • Site-wide: 50,888
  • Since beginning of last month:
    • Site-wide: 16,276

Altmetric data

Downloads over time

Distribution of downloads per paper, site-wide