Rxivist logo

ExpansionHunter Denovo: A computational method for locating known and novel repeat expansions in short-read sequencing data

By Egor Dolzhenko, Mark F Bennett, Phillip Andrew Richmond, Brett Trost, Sai Chen, Joke J.F.A. van Vugt, Charlotte Nguyen, Giuseppe Narzisi, Vladimir G Gainullin, Andrew Gross, Bryan Lajoie, Ryan J Taft, Wyeth W. Wasserman, Stephen W. Scherer, Jan H. Veldink, David R Bentley, R K.C. Yuen, Melanie Bahlo, Michael A. Eberle

Posted 03 Dec 2019
bioRxiv DOI: 10.1101/863035 (published DOI: 10.1186/s13059-020-02017-z)

Expansions of short tandem repeats are responsible for over 40 monogenic disorders, and undoubtedly many more pathogenic repeat expansions (REs) remain to be discovered. Existing methods for detecting REs in short-read sequencing data require predefined repeat catalogs. However recent discoveries have emphasized the need for detection methods that do not require candidate repeats to be specified in advance. To address this need, we introduce ExpansionHunter Denovo, an efficient catalog-free method for genome-wide detection of REs. Analysis of real and simulated data shows that our method can identify large expansions of 41 out of 44 pathogenic repeats, including nine recently reported non-reference REs not discoverable via existing methods. ExpansionHunter Denovo is freely available at <https://github.com/Illumina/ExpansionHunterDenovo>

Download data

  • Downloaded 799 times
  • Download rankings, all-time:
    • Site-wide: 32,681
    • In bioinformatics: 3,628
  • Year to date:
    • Site-wide: 74,944
  • Since beginning of last month:
    • Site-wide: 82,910

Altmetric data

Downloads over time

Distribution of downloads per paper, site-wide