Phylogenetic factorization of mammalian viruses complements trait-based analyses and guides surveillance efforts
Predicting which novel microorganisms may spill over from animals to humans has become a major priority in infectious disease biology. However, there are few tools to help assess the zoonotic potential of the enormous number of potential pathogens, the majority of which are undiscovered or unclassified and may be unlikely to infect or cause disease in humans. We adapt a new biological machine learning technique - phylofactorization - to partition viruses into clades based on their non-human host range and whether or not there exist evidence they have infected humans. Our cladistic analyses identify clades of viruses with common within-clade patterns - unusually high or low propensity for spillover. Phylofactorization by spillover yields many clades of viruses containing few to no representatives that have spilled over to humans, including the families Papillomaviridae and Herpesviridae, and the genus Parvovirus. Removal of these non-zoonotic clades from previous trait-based analyses changed the relative significance of traits determining spillover due to strong associations of traits with non-zoonotic clades. Phylofactorization by host breadth yielded clades with unusually high host breadth, including the family Togaviridae. We identify putative life-history traits differentiating clades' host breadth and propensities for zoonosis, and discuss how these results can prioritize sequencing-based surveillance of emerging infectious diseases.
- Downloaded 337 times
- Download rankings, all-time:
- Site-wide: 51,052 out of 94,912
- In epidemiology: 784 out of 1,556
- Year to date:
- Site-wide: 57,625 out of 94,912
- Since beginning of last month:
- Site-wide: 72,070 out of 94,912
Downloads over time
Distribution of downloads per paper, site-wide
- 18 Dec 2019: We're pleased to announce PanLingua, a new tool that enables you to search for machine-translated bioRxiv preprints using more than 100 different languages.
- 21 May 2019: PLOS Biology has published a community page about Rxivist.org and its design.
- 10 May 2019: The paper analyzing the Rxivist dataset has been published at eLife.
- 1 Mar 2019: We now have summary statistics about bioRxiv downloads and submissions.
- 8 Feb 2019: Data from Altmetric is now available on the Rxivist details page for every preprint. Look for the "donut" under the download metrics.
- 30 Jan 2019: preLights has featured the Rxivist preprint and written about our findings.
- 22 Jan 2019: Nature just published an article about Rxivist and our data.
- 13 Jan 2019: The Rxivist preprint is live!