Selective constraints and pathogenicity of mitochondrial DNA variants inferred from a novel database of 196,554 unrelated individuals
Elizabeth T. Cirulli,
William J Metcalf,
Joseph J Grzymski,
James T. Lu,
Nicole L Washington
Posted 08 Oct 2019
bioRxiv DOI: 10.1101/798264
Posted 08 Oct 2019
Robust characterization of mitochondrial variation provides an opportunity to map regions under high constraint, and identify essential functional domains. We sequenced the mitochondrial genomes of 196,554 unrelated individuals, and identified 15,035 unique variants. We found that 47% of the mitochondrial genome was invariant across the population, and generated a map of constrained mitochondrial regions. We find that the longest intervals in the mitochondrial genome without any variant were in the two rRNA genes (26 of 40 intervals >10bp long). We also showed that the 13 protein-coding genes in the mitochondrial genome did not tolerate loss-of-function variants. The only frameshift or nonsense variant observed at homoplasmic levels was a nonsense at the start codon of MT-ND1, which may be rescued by the methionine at amino acid position 3. Lastly, we applied these data to variants reported to be pathogenic for Leber Hereditary Optic Neuropathy (LHON). We found that 42% of variants (19 of the 45) reported to be pathogenic have a frequency above the maximum credible population allele frequency for an LHON-causing variant, including the primary LHON mutation m.14484T>C, which suggests that m.14484T>C cannot be causing LHON by itself. This result showed that allele frequency information across a large unselected population is important to assess the pathogenicity of variants in the context of rare mitochondrial disorders. We made HelixMTdb -- the list of variants and their allele frequency in 196,554 unrelated individuals -- publicly available.
- Downloaded 674 times
- Download rankings, all-time:
- Site-wide: 17,382 out of 78,244
- In genetics: 1,154 out of 4,208
- Year to date:
- Site-wide: 7,288 out of 78,244
- Since beginning of last month:
- Site-wide: 9,017 out of 78,244
Downloads over time
Distribution of downloads per paper, site-wide
- 18 Dec 2019: We're pleased to announce PanLingua, a new tool that enables you to search for machine-translated bioRxiv preprints using more than 100 different languages.
- 21 May 2019: PLOS Biology has published a community page about Rxivist.org and its design.
- 10 May 2019: The paper analyzing the Rxivist dataset has been published at eLife.
- 1 Mar 2019: We now have summary statistics about bioRxiv downloads and submissions.
- 8 Feb 2019: Data from Altmetric is now available on the Rxivist details page for every preprint. Look for the "donut" under the download metrics.
- 30 Jan 2019: preLights has featured the Rxivist preprint and written about our findings.
- 22 Jan 2019: Nature just published an article about Rxivist and our data.
- 13 Jan 2019: The Rxivist preprint is live!