Population sequencing data reveal a compendium of mutational processes in human germline
By
Vladimir B. Seplyarskiy,
Ruslan A. Soldatov,
Ryan J. McGinty,
Jakob M. Goldmann,
Ryan Hernandez,
Kathleen Barnes,
Adolfo Correa,
Esteban G. Burchard,
Patrick T. Ellinor,
Stephen T McGarvey,
Braxton D Mitchell,
Vasan S. Ramachandran,
Susan Redline,
Edwin Silverman,
Scott T Weiss,
Donna K. Arnett,
John Blangero,
Eric Boerwinkle,
Jiang He,
Courtney Montgomery,
DC Rao,
Jerome I. Rotter,
Jennifer A. Brody,
Yii-Der Ida Chen,
Lisa de las Fuentes,
Chii-Min Hwu,
Stephen S Rich,
Ani W. Manichaikul,
Josyf C Mychaleckyj,
Nicholette D Palmer,
Jennifer A. Smith,
Sharon LR Kardia,
Patricia A. Peyser,
Lawrence F Bielak,
Timothy D. O’Connor,
Leslie S Emery,
NHLBI Trans-Omics for Precision Medicine (TOPMed) Consortium,
TOPMed Population Genetics Working Group,
Christian Gilissen,
Wendy S.W. Wong,
Peter V. Kharchenko,
Shamil R. Sunyaev
Posted 11 Jan 2020
bioRxiv DOI: 10.1101/2020.01.10.893024
Mechanistic processes underlying human germline mutations remain largely unknown. Variation in mutation rate and spectra along the genome is informative about the biological mechanisms. We statistically decompose this variation into separate processes using a blind source separation technique. The analysis of a large-scale whole genome sequencing dataset (TOPMed) reveals nine processes that explain the variation in mutation properties between loci. Seven of these processes lend themselves to a biological interpretation. One process is driven by bulky DNA lesions that resolve asymmetrically with respect to transcription and replication. Two processes independently track direction of replication fork and replication timing. We identify a mutagenic effect of active demethylation primarily acting in regulatory regions. We also demonstrate that a recently discovered mutagenic process specific to oocytes can be localized solely from population sequencing data. This process is spread across all chromosomes and is highly asymmetric with respect to the direction of transcription, suggesting a major role of DNA damage.
Download data
- Downloaded 1,219 times
- Download rankings, all-time:
- Site-wide: 14,488
- In genetics: 744
- Year to date:
- Site-wide: 23,082
- Since beginning of last month:
- Site-wide: 23,082
Altmetric data
Downloads over time
Distribution of downloads per paper, site-wide
PanLingua
News
- 27 Nov 2020: The website and API now include results pulled from medRxiv as well as bioRxiv.
- 18 Dec 2019: We're pleased to announce PanLingua, a new tool that enables you to search for machine-translated bioRxiv preprints using more than 100 different languages.
- 21 May 2019: PLOS Biology has published a community page about Rxivist.org and its design.
- 10 May 2019: The paper analyzing the Rxivist dataset has been published at eLife.
- 1 Mar 2019: We now have summary statistics about bioRxiv downloads and submissions.
- 8 Feb 2019: Data from Altmetric is now available on the Rxivist details page for every preprint. Look for the "donut" under the download metrics.
- 30 Jan 2019: preLights has featured the Rxivist preprint and written about our findings.
- 22 Jan 2019: Nature just published an article about Rxivist and our data.
- 13 Jan 2019: The Rxivist preprint is live!