Rxivist logo

Database-integrated genome screening (DIGS): exploring genomes heuristically using sequence similarity search tools and a relational database.

By Henan Zhu, Tristan Dennis, Joseph Hughes, Robert J. Gifford

Posted 25 Apr 2018
bioRxiv DOI: 10.1101/246835

A significant fraction of most genomes is comprised of DNA sequences that have been incompletely investigated. This genomic 'dark matter' contains a wealth of useful biological information that can be recovered by systematically screening genomes in silico using sequence similarity search tools. Specialized computational tools are required to implement these screens efficiently. Here, we describe the database-integrated genome-screening (DIGS) tool: a computational framework for performing these investigations. To demonstrate, we screen mammalian genomes for endogenous viral elements (EVEs) derived from the Filoviridae, Parvoviridae, Circoviridae and Bornaviridae families, identifying numerous novel elements in addition to those that have been described previously. The DIGS tool provides a simple, robust framework for implementing a broad range of heuristic, sequence analysis-based explorations of genomic diversity.

Download data

  • Downloaded 375 times
  • Download rankings, all-time:
    • Site-wide: 100,947
    • In bioinformatics: 8,644
  • Year to date:
    • Site-wide: None
  • Since beginning of last month:
    • Site-wide: 131,536

Altmetric data

Downloads over time

Distribution of downloads per paper, site-wide