
The proBAM and proBed standard formats: enabling a seamless integration of genomics and proteomics data.

By Gerben Menschaert, Xiaojing Wang, Andrew R Jones, Fawaz Ghali, David Fenyö, Volodimir Olexiouk, Bing Zhang, Eric W Deutsch, Tobias Ternent, Juan Antonio Vizcaíno

Posted 20 Jun 2017
bioRxiv DOI: 10.1101/152579 (published DOI: 10.1186/s13059-017-1377-x)

On behalf of the Human Proteome Organization (HUPO) Proteomics Standards Initiative (PSI), we introduce two novel standard data formats, proBAM and proBed, developed to address the current challenges of integrating mass spectrometry-based proteomics data with genomics and transcriptomics information in proteogenomics studies. proBAM and proBed are adaptations of the well-defined, widely used file formats SAM/BAM and BED, respectively, and both have been extended to meet the specific requirements of proteomics data. Existing popular genomics tools such as SAMtools and Bedtools, and several widely used genome browsers, can therefore manipulate and visualize these formats out of the box. In addition, several dedicated software tools that properly support the proteomics information in these formats are now available, providing functionality such as file generation, file conversion, and data analysis. All documentation related to the formats, including the detailed file format specifications and example files, is accessible at http://www.psidev.info/probam and http://www.psidev.info/probed.
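
To illustrate why a BED-derived format can be read by generic tooling, the following minimal Python sketch splits a tab-separated line into the twelve standard BED columns and keeps any trailing columns as uninterpreted extras. It assumes that the format-specific columns are appended after the standard ones; the function name `parse_bed_like`, the example line, and the extra values `EXTRA_A` and `EXTRA_B` are hypothetical and are not taken from the proBed specification, which should be consulted for the actual column definitions.

```python
# Standard BED12 column names. Format-specific columns are NOT modeled here;
# they are kept as raw strings (an assumption made for illustration only).
BED12_FIELDS = [
    "chrom", "chromStart", "chromEnd", "name", "score", "strand",
    "thickStart", "thickEnd", "itemRgb", "blockCount", "blockSizes", "blockStarts",
]


def parse_bed_like(line):
    """Split a tab-separated BED-like line into (standard fields, extra columns)."""
    cols = line.rstrip("\n").split("\t")
    if len(cols) < len(BED12_FIELDS):
        raise ValueError(f"expected at least 12 columns, got {len(cols)}")

    record = dict(zip(BED12_FIELDS, cols[:12]))

    # Coordinates and the block count are integers in BED.
    for key in ("chromStart", "chromEnd", "thickStart", "thickEnd", "blockCount"):
        record[key] = int(record[key])

    # Block sizes and starts are comma-separated lists (a trailing comma is allowed).
    for key in ("blockSizes", "blockStarts"):
        record[key] = [int(x) for x in record[key].split(",") if x]

    extras = cols[12:]  # anything beyond BED12 is left uninterpreted
    return record, extras


# Hypothetical example line: a feature spanning two blocks on chr1.
example = "\t".join([
    "chr1", "1000", "1100", "peptide_1", "900", "+",
    "1000", "1100", "0", "2", "30,30", "0,70",
    "EXTRA_A", "EXTRA_B",
])

record, extras = parse_bed_like(example)
print(record["chrom"], record["chromStart"], record["chromEnd"], extras)
```

A reader that only understands the standard columns can stop at `record`, which is the sense in which existing genomics tools may handle such files without modification, while proteomics-aware tools would additionally interpret the extra columns.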

Download data

  • Downloaded 491 times
  • Download rankings, all-time:
    • Site-wide: 77,951
    • In bioinformatics: 7,128
  • Year to date:
    • Site-wide: 161,159
  • Since beginning of last month:
    • Site-wide: 124,188
