Identification of Pathogenic Structural Variants in Rare Disease Patients through Genome Sequencing
James M. Holt,
Camille L Birch,
Donna M Brown,
Melissa A Wilk,
Rebecca C Spillmann,
Alden Y Huang,
Jennefer N. Kohler,
Ellen F. Macnamara,
Undiagnosed Diseases Network,
Stanley F Nelson,
Elizabeth A Worthey
Posted 15 May 2019
bioRxiv DOI: 10.1101/627661
Posted 15 May 2019
Purpose: Clinical whole genome sequencing is becoming more common for determining the molecular diagnosis of rare disease. However, standard clinical practice often focuses on small variants such as single nucleotide variants and small insertions/deletions. This leaves a wide range of larger "structural variants" that are not commonly analyzed in patients. Methods: We developed a pipeline for processing structural variants for patients who received whole genome sequencing through the Undiagnosed Diseases Network (UDN). This pipeline called structural variants, stored them in an internal database, and filtered the variants based on internal frequencies and external annotations. The remaining variants were manually inspected and then interesting findings were reported as research variants to clinical sites in the UDN. Results: Of 477 analyzed UDN cases, 286 cases (≈ 60%) received at least one structural variant as a research finding. The variants in 16 cases (≈ 4%) are considered "Certain" or "Highly likely" molecularly diagnosed and another 4 cases are currently in review. Of those 20 cases, at least 13 were identified originally through our pipeline with one finding leading to identification of a new disease. As part of this paper, we have also released the collection of variant calls identified in our cohort along with heterozygous and homozygous call counts. This data is available at https://github.com/HudsonAlpha/UDN\_SV\_export. Conclusion: Structural variants are key genetic features that should be analyzed during routine clinical genomic analysis. For our UDN patients, structural variants helped solve ≈ 4% of the total number of cases (≈ 13% of all genome sequencing solves), a success rate we expect to improve with better tools and greater understanding of the human genome.
- Downloaded 1,195 times
- Download rankings, all-time:
- Site-wide: 22,651
- In genomics: 2,069
- Year to date:
- Site-wide: 24,805
- Since beginning of last month:
- Site-wide: 51,476
Downloads over time
Distribution of downloads per paper, site-wide
- 27 Nov 2020: The website and API now include results pulled from medRxiv as well as bioRxiv.
- 18 Dec 2019: We're pleased to announce PanLingua, a new tool that enables you to search for machine-translated bioRxiv preprints using more than 100 different languages.
- 21 May 2019: PLOS Biology has published a community page about Rxivist.org and its design.
- 10 May 2019: The paper analyzing the Rxivist dataset has been published at eLife.
- 1 Mar 2019: We now have summary statistics about bioRxiv downloads and submissions.
- 8 Feb 2019: Data from Altmetric is now available on the Rxivist details page for every preprint. Look for the "donut" under the download metrics.
- 30 Jan 2019: preLights has featured the Rxivist preprint and written about our findings.
- 22 Jan 2019: Nature just published an article about Rxivist and our data.
- 13 Jan 2019: The Rxivist preprint is live!