Rxivist combines preprints from bioRxiv with data from Twitter to help you find the papers being discussed in your field. Currently indexing 66,879 bioRxiv papers from 294,441 authors.
A unified sequence catalogue of over 280,000 genomes obtained from the human gut microbiome
By
Alexandre Almeida,
Stephen Nayfach,
Miguel Boland,
Francesco Strozzi,
Martin Beracochea,
Zhou Jason Shi,
Katherine S Pollard,
Donovan H Parks,
Philip Hugenholtz,
Nicola Segata,
Nikos Kyrpides,
Robert D. Finn
Posted 19 Sep 2019
bioRxiv DOI: 10.1101/762682
Comprehensive reference data is essential for accurate taxonomic and functional characterization of the human gut microbiome. Here we present the Unified Human Gastrointestinal Genome (UHGG) collection, a resource combining 286,997 genomes representing 4,644 prokaryotic species from the human gut. These genomes contain over 625 million protein sequences used to generate the Unified Human Gastrointestinal Protein (UHGP) catalogue, a collection that more than doubles the number of gut protein clusters over the Integrated Gene Catalogue. We find that a large portion of the human gut microbiome remains to be fully explored, with over 70% of the UHGG species lacking cultured representatives, and 40% of the UHGP missing meaningful functional annotations. Intra-species genomic variation analyses revealed a large reservoir of accessory genes and single-nucleotide variants, many of which were specific to individual human populations. These freely available genomic resources should greatly facilitate investigations into the human gut microbiome.
Download data
- Downloaded 1,681 times
- Download rankings, all-time:
- Site-wide: 3,241 out of 66,879
- In microbiology: 93 out of 5,381
- Year to date:
- Site-wide: 742 out of 66,879
- Since beginning of last month:
- Site-wide: 418 out of 66,879
Altmetric data
Downloads over time
Distribution of downloads per paper, site-wide
- Home
- Top preprints of 2018
- Paper search
- Author leaderboards
- Overall metrics
- The API
- Email newsletter
- About
News
- 21 May 2019: PLOS Biology has published a community page about Rxivist.org and its design.
- 10 May 2019: The paper analyzing the Rxivist dataset has been published at eLife.
- 1 Mar 2019: We now have summary statistics about bioRxiv downloads and submissions.
- 8 Feb 2019: Data from Altmetric is now available on the Rxivist details page for every preprint. Look for the "donut" under the download metrics.
- 30 Jan 2019: preLights has featured the Rxivist preprint and written about our findings.
- 22 Jan 2019: Nature just published an article about Rxivist and our data.
- 13 Jan 2019: The Rxivist preprint is live!