Rxivist logo

A method for independent estimation of false localisation rate for phosphoproteomics

By Kerry A Ramsbottom, Ananth A Prakash, Yasset Perez-Riverol, Oscar Martin Camacho, Maria Martin, Juan Antonio Vizcaino, Eric W. Deutsch, Andrew R Jones

Posted 19 Oct 2021
bioRxiv DOI: 10.1101/2021.10.18.464791

Phosphoproteomics methods are commonly employed in labs to identify and quantify the sites of phosphorylation on proteins. In recent years, various software tools have been developed, incorporating scores or statistics related to whether a given phosphosite has been correctly identified, or to estimate the global false localisation rate (FLR) within a given data set for all sites reported. These scores have generally been calibrated using synthetic data sets, and their statistical reliability on real datasets is largely unknown. As a result, there is considerable problem in the field of reporting incorrectly localised phosphosites, due to inadequate statistical control. In this work, we develop the concept of using scoring and ranking modifications on a decoy amino acid, i.e. one that cannot be modified, to allow for independent estimation of global FLR. We test a variety of different amino acids to act as the decoy, on both synthetic and real data sets, demonstrating that the amino acid selection can make a substantial difference to the estimated global FLR. We conclude that while several different amino acids might be appropriate, the most reliable FLR results were achieved using alanine and leucine as decoys, although we have a preference for alanine due to the risk of potential confusion between leucine and isoleucine amino acids. We propose that the phosphoproteomics field should adopt the use of a decoy amino acid, so that there is better control of false reporting in the literature, and in public databases that re-distribute the data.

Download data

  • Downloaded 175 times
  • Download rankings, all-time:
    • Site-wide: 152,414
    • In bioinformatics: 11,632
  • Year to date:
    • Site-wide: None
  • Since beginning of last month:
    • Site-wide: 59,572

Altmetric data

Downloads over time

Distribution of downloads per paper, site-wide