
BaRTv1.0: an improved barley reference transcript dataset to determine accurate changes in the barley transcriptome using RNA-seq

By Paulo Rapazote-Flores, Micha Bayer, Linda Milne, Claus-Dieter Mayer, John Fuller, Wenbin Guo, Peter E Hedley, Jenny Morris, Claire Halpin, Jason Kam, Sarah M. McKim, Monika Zwirek, M. Cristina Casao, Abdellah Barakate, Miriam Schreiber, Gordon Stephen, Runxuan Zhang, John W.S. Brown, Robbie Waugh, Craig Simpson

Posted 16 May 2019
bioRxiv DOI: 10.1101/638106 (published DOI: 10.1186/s12864-019-6243-7)

Background: The time-consuming computational assembly and quantification of gene expression and splicing from RNA-seq data vary considerably in their results. Recent fast non-alignment tools such as Kallisto and Salmon overcome these problems, but they require a high-quality, comprehensive reference transcript dataset (RTD), which is rarely available in plants.

Results: A high-quality, non-redundant barley gene RTD and database (Barley Reference Transcripts, BaRTv1.0) has been generated. BaRTv1.0 was constructed from a range of tissues, cultivars and abiotic treatments, with transcripts assembled and aligned to the barley cv. Morex reference genome (Mascher et al., 2017). Full-length cDNAs from the barley variety Haruna nijo (Matsumoto et al., 2011) were used to determine transcript coverage, and high-resolution RT-PCR validated alternatively spliced (AS) transcripts of 86 genes in five different organs and tissues. These methods were used as benchmarks to select an optimal barley RTD. BaRTv1.0-Quantification of Alternatively Spliced Isoforms (QUASI) was also generated to overcome inaccurate quantification caused by variation in the 5' and 3' UTR ends of transcripts. BaRTv1.0-QUASI was used for accurate transcript quantification of RNA-seq data from five barley organs/tissues. This analysis identified 20,972 significantly differentially expressed genes, 2,791 differentially alternatively spliced genes and 2,768 transcripts with differential transcript usage.

Conclusion: A high-confidence barley reference transcript dataset consisting of 60,444 genes with 177,240 transcripts has been generated. Compared to current barley transcripts, BaRTv1.0 transcripts are generally longer, are less fragmented and have improved gene models that are well supported by splice junction reads. Precise transcript quantification using BaRTv1.0 allows routine analysis of gene expression and AS.

Download data

  • Downloaded 707 times
  • Download rankings, all-time:
    • Site-wide: 43,253
    • In genomics: 3,441
  • Year to date:
    • Site-wide: 107,933
  • Since beginning of last month:
    • Site-wide: 134,086
