Ultra-rapid metagenotyping of the human gut microbiome

By Zhou Jason Shi, Boris Dimitrov, Chunyu Zhao, Stephen Nayfach, Katherine Pollard

Posted 13 Jun 2020
bioRxiv DOI: 10.1101/2020.06.12.149336

Sequence variation is used to quantify population structure and identify genetic determinants of phenotypes that vary within species. In the human microbiome and other environments, single nucleotide polymorphisms (SNPs) are frequently detected by aligning metagenomic sequencing reads to catalogs of genes or genomes. But this requires high-performance computing and enough read coverage to distinguish SNPs from sequencing errors. We solved these problems by developing the GenoTyper for Prokaryotes (GT-Pro), a suite of novel methods to catalog SNPs from genomes and use exact k-mer matches to perform ultra-fast reference-based SNP calling from metagenomes. Compared to read alignment, GT-Pro is more accurate and two orders of magnitude faster. We discovered 104 million SNPs in 909 human gut species, characterized their global population structure, and tracked pathogenic strains. GT-Pro democratizes strain-level microbiome analysis by making it possible to genotype hundreds of metagenomes on a personal computer.

Software availability: GT-Pro is available at https://github.com/zjshi/gt-pro.

Competing interest statement: The authors have declared no competing interest.
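
The abstract's core idea, calling SNPs from exact k-mer matches rather than read alignment, can be illustrated with a toy sketch. This is not GT-Pro's implementation or its data structures: the function names, the k-mer length of 5, the reference string, and the reads are all hypothetical, chosen only to show the general technique. The sketch builds a lookup table of k-mers that cover a known SNP site for each allele, then counts reads that contain an exact match.

```python
from collections import Counter

K = 5  # toy k-mer length (assumption; real tools use much longer k-mers)


def build_snp_kmer_index(ref, snps, k=K):
    """Map each k-mer that covers a SNP site to its (snp_id, allele).

    `snps` maps snp_id -> (0-based position, tuple of alleles).
    K-mers that would point to more than one (snp_id, allele) are dropped.
    Simplification: uniqueness is checked only among SNP-covering k-mers,
    not against the rest of the reference.
    """
    index = {}
    ambiguous = set()
    for snp_id, (pos, alleles) in snps.items():
        for allele in alleles:
            seq = ref[:pos] + allele + ref[pos + 1:]
            first = max(0, pos - k + 1)
            last = min(pos, len(seq) - k)
            for start in range(first, last + 1):
                kmer = seq[start:start + k]
                key = (snp_id, allele)
                if kmer in index and index[kmer] != key:
                    ambiguous.add(kmer)
                index[kmer] = key
    for kmer in ambiguous:
        del index[kmer]
    return index


def genotype_reads(reads, index, k=K):
    """Count reads supporting each (snp_id, allele) via exact k-mer lookup."""
    counts = Counter()
    for read in reads:
        hits = set()
        for i in range(len(read) - k + 1):
            hit = index.get(read[i:i + k])
            if hit is not None:
                hits.add(hit)
        counts.update(hits)  # each read counts at most once per allele
    return counts


# Hypothetical toy data: one biallelic SNP (G/T) at position 9.
ref = "ACGTTGCAAGCTTAGGCTA"
snps = {"snp1": (9, ("G", "T"))}
reads = ["TGCAAGCTT", "CAATCTTAG", "GCAATCT", "ACGTTGC"]

index = build_snp_kmer_index(ref, snps)
allele_counts = genotype_reads(reads, index)
print(dict(allele_counts))
```

Because each read needs only dictionary lookups and no alignment, the per-read cost in this sketch depends on read length alone, which gives some intuition for why an exact-match approach can be fast.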

Download data

  • Downloaded 1,079 times
  • Download rankings, all-time:
    • Site-wide: 30,313
    • In bioinformatics: 3,205
  • Year to date:
    • Site-wide: 56,836
  • Since beginning of last month:
    • Site-wide: 101,747
