Rxivist combines preprints from bioRxiv with data from Twitter to help you find the papers being discussed in your field. Currently indexing 65,624 bioRxiv papers from 290,715 authors.
VEP-G2P: A Tool for Efficient, Flexible and Scalable Diagnostic Filtering of Genomic Variants
David J Moore,
Shona M. Kerr,
Malcolm G Dunlop,
Matthew E Hurles,
Caroline F Wright,
Helen V Firth,
David R FitzPatrick
Posted 13 Sep 2018
bioRxiv DOI: 10.1101/416552
Purpose: We aimed to develop an efficient, flexible, scalable and evidence-based approach to sequence-based diagnostic analysis and re-analysis of conditions with very large numbers of different causative genes. We then wished to define the expected rate of plausibly causative variants passing strict filtering in control compared with disease populations, in order to quantify background diagnostic "noise".
Methods: We developed G2P (www.ebi.ac.uk/gene2phenotype) as an online system to facilitate the development, validation, curation and distribution of large-scale, evidence-based datasets for use in diagnostic variant filtering. Each locus-genotype-mechanism-disease-evidence thread (LGMDET) associates an allelic requirement and a mutational consequence at a defined locus with a disease entity, a confidence level and evidence links. We then developed an extension to the Ensembl Variant Effect Predictor (VEP), VEP-G2P, which can filter based on G2P and other widely used gene panel curation systems. We compared the output of disease-associated and control whole exome sequence (WES) data using Developmental Disorders G2P (G2PDD; 2044 LGMDETs) and constitutional cancer predisposition G2P (G2PCancer; 128 LGMDETs).
Results: Using VEP-G2PDD on WES from DDD study probands, we have shown a sensitivity/precision of 97.3%/33% for causative de novo variants and 81.6%/22.7% for causative inherited variants. Many of the apparently diagnostic genotypes "missed" are likely false-positive reports, with lower minor allele frequencies and more severe predicted consequences being diagnostically discriminative features.
Conclusion: Case:control comparisons using VEP-G2PDD established an observed:expected ratio of 1:30,000 plausibly causative variants in proband WES to ~1:40,000 reportable but presumed-benign variants in controls. At least half the filtered variants in probands represent background "noise". Supporting phenotypic evidence is, therefore, necessary in genetically heterogeneous disorders. G2P and VEP-G2P provide a practical approach to optimize disease-specific filtering parameters in diagnostic genetic research.
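The kind of filtering the abstract describes can be illustrated with a small Python sketch. This is not the VEP-G2P implementation: the field names, the allele-frequency thresholds, the consequence list and the placeholder gene and disorder names are all assumptions made for illustration. It only shows how an LGMDET's allelic requirement might gate which genes are reported for one individual.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LGMDET:
    """One locus-genotype-mechanism-disease-evidence thread (illustrative fields)."""
    locus: str                # gene symbol
    allelic_requirement: str  # "monoallelic" or "biallelic" in this sketch
    mutation_consequence: str
    disease: str
    confidence: str
    evidence: tuple = ()


@dataclass(frozen=True)
class Variant:
    """A called variant in one individual (hypothetical, simplified record)."""
    gene: str
    consequence: str   # Sequence Ontology term, e.g. "stop_gained"
    allele_count: int  # 1 = heterozygous, 2 = homozygous
    max_af: float      # highest population allele frequency observed


# Assumed thresholds and consequence set, chosen only for illustration.
MAX_AF = {"monoallelic": 1e-4, "biallelic": 5e-3}
MIN_ALLELES = {"monoallelic": 1, "biallelic": 2}
SEVERE = frozenset({
    "stop_gained", "frameshift_variant", "splice_donor_variant",
    "splice_acceptor_variant", "missense_variant",
})


def filter_threads(variants, threads, consequences=SEVERE):
    """Return (thread, supporting variants) pairs whose allelic requirement is met."""
    hits = []
    for thread in threads:
        kept = [
            v for v in variants
            if v.gene == thread.locus
            and v.consequence in consequences
            and v.max_af <= MAX_AF[thread.allelic_requirement]
        ]
        # Count rare, severe alleles in the gene against the requirement.
        if sum(v.allele_count for v in kept) >= MIN_ALLELES[thread.allelic_requirement]:
            hits.append((thread, kept))
    return hits


# Placeholder panel and calls; gene and disorder names are not real.
panel = [
    LGMDET("GENE_A", "monoallelic", "loss of function", "Disorder A", "confirmed"),
    LGMDET("GENE_B", "biallelic", "loss of function", "Disorder B", "probable"),
]
calls = [
    Variant("GENE_A", "stop_gained", 1, 0.0),
    Variant("GENE_B", "missense_variant", 1, 0.001),
    Variant("GENE_C", "frameshift_variant", 2, 0.0),
]
hits = filter_threads(calls, panel)
print([thread.locus for thread, _ in hits])  # ['GENE_A']
```

In this toy run only GENE_A is reported: the single heterozygous GENE_B variant does not satisfy a biallelic requirement, and GENE_C is not on the panel. Tightening or loosening `MAX_AF` and `SEVERE` is the sort of disease-specific parameter tuning the conclusion refers to.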
- Downloaded 360 times
- Download rankings, all-time:
  - Site-wide: 29,520 out of 65,624
  - In genetics: 1,888 out of 3,709
- Year to date:
  - Site-wide: 30,400 out of 65,624
- Since beginning of last month:
  - Site-wide: 36,102 out of 65,624
Downloads over time
Distribution of downloads per paper, site-wide