The human functional genome defined by genetic diversity

By Julia di Iulio, Istvan Bartha, Emily H.M. Wong, Hung-Chun Yu, Michael Hicks, Naisha Shah, Victor Lavrenko, Ewen F. Kirkness, Martin M Fabani, Dongchan Yang, Inkyung Jung, William H. Biggs, Bing Ren, J. Craig Venter, Amalio Telenti

Posted 21 Oct 2016
bioRxiv DOI: 10.1101/082362 (published DOI: 10.1038/s41588-018-0062-7)

Large-scale efforts to sequence whole human genomes provide extensive data on the non-coding portion of the genome. We used variation information from 11,257 human genomes to describe the spectrum of sequence conservation in the population. We established the genome-wide variability of each nucleotide in the context of its surrounding sequence in order to identify departures from expectation at the population level (context-dependent conservation). We characterized the population diversity of functional elements in the genome and identified the coordination of conserved sequences of distal and cis enhancers, chromatin marks, promoters, coding and intronic regions. The most context-dependent conserved regions of the genome are associated with unique functional annotations and a genomic organization that spreads up to one megabase. Importantly, these regions are enriched more than 100-fold for non-coding pathogenic variants. This analysis of human genetic diversity thus provides a detailed view of sequence conservation, functional constraint and genomic organization of the human genome. Specifically, it identifies highly conserved non-coding sequences that are not captured by analysis of interspecies conservation and are greatly enriched in disease variants.
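The idea of context-dependent conservation described in the abstract can be illustrated with a toy sketch. The abstract does not specify the context length, window size, or scoring formula, so everything below is an assumption made for illustration: the function names `context_rates` and `window_score`, the 3-base context, and the "observed minus expected" score are hypothetical and are not the authors' method. The sketch estimates how often each sequence context carries a variant across the whole sequence, then asks whether a given window carries fewer variants than its contexts would predict.

```python
from collections import defaultdict


def context_rates(seq, variant_positions, k=3):
    """Fraction of positions carrying a variant, per k-mer context centred on the position.

    k is an illustrative assumption; the abstract does not state a context length.
    """
    half = k // 2
    seen = defaultdict(int)
    varied = defaultdict(int)
    for i in range(half, len(seq) - half):
        ctx = seq[i - half:i + half + 1]
        seen[ctx] += 1
        if i in variant_positions:
            varied[ctx] += 1
    return {ctx: varied[ctx] / seen[ctx] for ctx in seen}


def window_score(seq, variant_positions, rates, start, end, k=3):
    """Observed minus expected variant count in the half-open window [start, end).

    Negative values mean fewer variants than the contexts predict,
    i.e. the window looks more conserved than expected.
    """
    half = k // 2
    lo = max(start, half)
    hi = min(end, len(seq) - half)
    observed = sum(1 for i in range(lo, hi) if i in variant_positions)
    expected = sum(rates[seq[i - half:i + half + 1]] for i in range(lo, hi))
    return observed - expected


# Toy data (invented): a short repeat with variants at positions 1 and 4 only.
seq = "ACGACGACGACG"
variants = {1, 4}
rates = context_rates(seq, variants)
left = window_score(seq, variants, rates, 0, 6)
right = window_score(seq, variants, rates, 6, 12)
print(rates["ACG"], left, right)
```

In this toy input both halves contain the same contexts, but only the left half carries variants, so the right half scores below zero and would be flagged as more conserved than expected. A real analysis would need far more data per context and a statistical model of the expectation, which this sketch does not attempt.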

Download data

  • Downloaded 2,258 times
  • Download rankings, all-time:
    • Site-wide: 8,771
    • In genomics: 859
  • Year to date:
    • Site-wide: 118,779
  • Since beginning of last month:
    • Site-wide: 82,279
