Genome-wide survey of parent-of-origin specific associations across clinical traits derived from electronic health records
Hye In Kim,
Regeneron Genetics Center,
Geisinger Regeneron DiscovEHR Collaboration,
Alan R Shuldiner,
Cristopher Van Hout
Posted 11 Dec 2020
medRxiv DOI: 10.1101/2020.12.08.20246199
Posted 11 Dec 2020
Parent-of-origin (PoO) effects refer to the differential phenotypic impact of genetic variants dependent on their parental inheritance. Genetic variants in imprinted genes can have PoO specific effects on complex traits, but these effects may be poorly captured by models that do not differentiate the parental origin of the variant. The aim of this study was to screen genome-wide imputed sequence for PoO effects on electronic health records (EHR) derived clinical traits in 134,049 individuals of European ancestry from the DiscovEHR study. Using pairwise kinship estimates from genetic data and demographic data, we identified 22,051 offspring with at least one parent present in the DiscovEHR study. We then assigned the PoO of ~9 million variants in the heterozygous offspring using two methods. First, when one of the parental genotypes was homozygous, we determined PoO based on apparent Mendelian segregation. Second, we estimated PoO by comparing parental and offspring haplotypes around the variant allele. Using these PoO assignments, we performed genome-wide PoO association analyses across 154 quantitative traits including lab test results and biometric measures and 612 binary traits of ICD10 3-digit codes extracted from EHR in the DiscovEHR study. Out of 732 PoO associations meeting a significance threshold of P <5x10-8, we attempted to replicate 274 PoO associations in the UK Biobank study, consisting of 462,453 individuals and including 5,015 offspring with at least one parent, and replicated 9 PoO associations with nominal significance threshold P <0.05. In summary, the current study characterizes PoO effects of genetic variants genome-wide on a broad range of clinical traits derived from EHR in a large population study enriched for familial relationships. Our results suggest that 1) PoO specific effects are frequently captured by a standard additive model and that 2) statistical power to detect PoO specific effects remains modest even in large studies. Nonetheless, accurately modeling PoO effects of genetic variants has the potential to improve our understanding of the mechanism of the association and finding new associations that are not captured by the additive model.
- Downloaded 327 times
- Download rankings, all-time:
- Site-wide: 113,580
- In genetic and genomic medicine: 657
- Year to date:
- Site-wide: 141,753
- Since beginning of last month:
- Site-wide: 66,093
Downloads over time
Distribution of downloads per paper, site-wide
- 27 Nov 2020: The website and API now include results pulled from medRxiv as well as bioRxiv.
- 18 Dec 2019: We're pleased to announce PanLingua, a new tool that enables you to search for machine-translated bioRxiv preprints using more than 100 different languages.
- 21 May 2019: PLOS Biology has published a community page about Rxivist.org and its design.
- 10 May 2019: The paper analyzing the Rxivist dataset has been published at eLife.
- 1 Mar 2019: We now have summary statistics about bioRxiv downloads and submissions.
- 8 Feb 2019: Data from Altmetric is now available on the Rxivist details page for every preprint. Look for the "donut" under the download metrics.
- 30 Jan 2019: preLights has featured the Rxivist preprint and written about our findings.
- 22 Jan 2019: Nature just published an article about Rxivist and our data.
- 13 Jan 2019: The Rxivist preprint is live!