Rxivist logo

Identifying Streptococcus pneumoniae genes associated with invasive disease using pangenome-based whole genome sequence typing

By Uri Obolski, Andrea Gori, Jose Lourenco, Craig Thompson, Robin Thompson, Neil French, Robert Heyderman, Sunetra Gupta

Posted 04 May 2018
bioRxiv DOI: 10.1101/314666

Streptococcus pneumoniae is a normal commensal of the upper respiratory tract but can also invade the bloodstream or CSF (cerebrospinal fluid), causing invasive pneumococcal disease (IPD). In this study, we attempt to identify genes associated with IPD by applying a random forest machine-learning algorithm to whole genome sequence (WGS) data. We find 43 genes consistently associated with IPD across three geographically distinct WGS data sets of pneumococcal carriage isolates. Of these genes, 23 genes have previously shown to be directly relevant to IPD, while the other 18 are uncharacterized.

Download data

  • Downloaded 1,159 times
  • Download rankings, all-time:
    • Site-wide: 19,653
    • In bioinformatics: 2,287
  • Year to date:
    • Site-wide: 110,940
  • Since beginning of last month:
    • Site-wide: 86,277

Altmetric data

Downloads over time

Distribution of downloads per paper, site-wide