Rxivist logo

PathRacer: racing profile HMM paths on assembly graph

By Alexander Shlemov, Anton Korobeynikov

Posted 27 Feb 2019
bioRxiv DOI: 10.1101/562579

Recently large databases containing profile Hidden Markov Models (pHMMs) emerged. These pHMMs may represent the sequences of antibiotic resistance genes, or allelic variations amongst highly conserved housekeeping genes used for strain typing, etc. The typical application of such a database includes the alignment of contigs to pHMM hoping that the sequence of gene of interest is located within the single contig. Such a condition is often violated for metagenomes preventing the effective use of such databases. We present PathRacer - a novel standalone tool that aligns profile HMM directly to the assembly graph (performing the codon translation on fly for amino acid pHMMs). The tool provides the set of most probable paths traversed by a HMM through the whole assembly graph, regardless whether the sequence of interested is encoded on the single contig or scattered across the set of edges, therefore significantly improving the recovery of sequences of interest even from fragmented metagenome assemblies.

Download data

  • Downloaded 783 times
  • Download rankings, all-time:
    • Site-wide: 35,068
    • In bioinformatics: 3,833
  • Year to date:
    • Site-wide: 59,713
  • Since beginning of last month:
    • Site-wide: 108,291

Altmetric data

Downloads over time

Distribution of downloads per paper, site-wide