Rxivist logo

Functional characterization of 3D-protein structures informed by human genetic diversity

By Michael Hicks, Istvan Bartha, Julia di Iulio, Ruben Abagyan, J. Craig Venter, Amalio Telenti

Posted 29 Aug 2017
bioRxiv DOI: 10.1101/182287 (published DOI: 10.1073/pnas.1820813116)

Sequence variation data of the human proteome can be used to analyze 3-dimensional (3D) protein structures to derive functional insights. We used genetic variant data from nearly 150,000 individuals to analyze 3D positional conservation in 4,390 protein structures using 481,708 missense and 264,257 synonymous variants. Sixty percent of protein structures harbor at least one intolerant 3D site as defined by significant depletion of observed over expected missense variation. We established an Angstrom-scale distribution of annotated pathogenic missense variants and showed that they accumulate in proximity to the most intolerant 3D sites. Structural intolerance data correlated with experimental functional read-outs in vitro. The 3D structural intolerance analysis revealed characteristic features of ligand binding pockets, orthosteric and allosteric sites. The identification of novel functional 3D sites based on human genetic data helps to validate, rank or predict drug target binding sites in vivo.

Download data

  • Downloaded 1,001 times
  • Download rankings, all-time:
    • Site-wide: 29,720
    • In genomics: 2,555
  • Year to date:
    • Site-wide: 32,118
  • Since beginning of last month:
    • Site-wide: 149,570

Altmetric data

Downloads over time

Distribution of downloads per paper, site-wide