
opentsv prevents the corruption of scientific data by Excel

By Peter De Rijk, Svenn D’Hert, Mojca Strazisar

Posted 16 Dec 2018
bioRxiv DOI: 10.1101/497370

Microsoft Excel is widely used by researchers to edit tab- or comma-separated data files. However, Excel often corrupts the data when opening these files, most notably by converting some gene names to dates. Although this problem has been pointed out before, we show that every year hundreds of published papers still come with supplementary data files containing these errors. Opentsv was developed to circumvent this problem at the root by providing an easy and transparent way to open delimited data files in Excel without these conversions. Opentsv is freely available at <https://github.com/derijkp/opentsv>.
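To make the kind of corruption described above concrete, the following is a minimal sketch of how one might scan a delimited file for cells that look like gene names already converted to dates. It is not part of opentsv and does not reflect how opentsv works. The function name `find_date_like_cells` and the pattern `DATE_LIKE` are hypothetical, and the pattern assumes the English day-month form (for example `1-Sep` for `SEPT1`) in which such converted values commonly appear.

```python
import csv
import io
import re

# Assumption: converted gene names show up as day-month values such as
# "1-Sep" or "2-Mar" (English month abbreviations). Other locales or
# formats would need a different pattern.
DATE_LIKE = re.compile(
    r"^\d{1,2}-(Jan|Feb|Mar|Apr|May|Jun|Jul|Aug|Sep|Oct|Nov|Dec)$",
    re.IGNORECASE,
)


def find_date_like_cells(text, delimiter="\t"):
    """Return (row, column, value) for every cell that looks like a date.

    Rows and columns are numbered from 1. This is a hypothetical helper
    for illustration only; it flags suspicious cells and cannot recover
    the original gene name.
    """
    hits = []
    reader = csv.reader(io.StringIO(text), delimiter=delimiter)
    for row_idx, row in enumerate(reader, start=1):
        for col_idx, cell in enumerate(row, start=1):
            if DATE_LIKE.match(cell.strip()):
                hits.append((row_idx, col_idx, cell))
    return hits


# Made-up sample data: two cells look like dates, one is a normal gene name.
sample = "gene\tvalue\n1-Sep\t0.5\nTP53\t1.2\n2-Mar\t0.9\n"
for row, col, value in find_date_like_cells(sample):
    print(f"row {row}, column {col}: {value!r} looks like a converted gene name")
```

A check like this can only flag damage after the fact, which is why preventing the conversion when the file is opened, as the abstract describes, addresses the problem at its source.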

Download data

  • Downloaded 638 times
  • Download rankings, all-time:
    • Site-wide: 22,544 out of 88,687
    • In bioinformatics: 3,211 out of 8,385
  • Year to date:
    • Site-wide: 29,346 out of 88,687
  • Since beginning of last month:
    • Site-wide: 17,424 out of 88,687
