Rxivist logo

GFF3sort: a novel tool to sort GFF3 files for tabix indexing

By Tao Zhu, Chengzhen Liang, Zhigang Meng, Sandui Guo, Rui Zhang

Posted 04 Jun 2017
bioRxiv DOI: 10.1101/145938 (published DOI: 10.1186/s12859-017-1930-3)

Background: The traditional method of visualizing gene annotation data in JBrowse is converting GFF3 files to JSON format, which is time-consuming. The latest version of JBrowse supports rendering sorted GFF3 files indexed by tabix, a novel strategy that is more convenient than the original conversion process. However, current tools available for GFF3 file sorting have some limitations and their sorting results would lead to erroneous rendering in JBrowse. Results: We developed GFF3sort, a script to sort GFF3 files for tabix indexing. Specifically designed for JBrowse rendering, GFF3sort can properly deal with the order of features that have the same chromosome and start position, either by remembering their original orders or by conducting parent-child topology sorting. Based on our test datasets from seven species, GFF3sort produced accurate sorting results with acceptable efficiency compared with currently available tools Conclusions: GFF3sort is a novel tool to sort GFF3 files for tabix indexing. We anticipate that GFF3sort will be useful to help with genome annotation data processing and visualization.

Download data

  • Downloaded 571 times
  • Download rankings, all-time:
    • Site-wide: 51,642
    • In bioinformatics: 5,257
  • Year to date:
    • Site-wide: 134,209
  • Since beginning of last month:
    • Site-wide: 139,846

Altmetric data

Downloads over time

Distribution of downloads per paper, site-wide