Rxivist logo

SMRT-Cappable-seq reveals complex operon variants in bacteria

By Bo Yan, Matthew Boitano, Tyson Clark, Laurence Ettwiller

Posted 09 Feb 2018
bioRxiv DOI: 10.1101/262964 (published DOI: 10.1038/s41467-018-05997-6)

Current methods for genome-wide analysis of gene expression requires shredding original transcripts into small fragments for short-read sequencing. In bacteria, the resulting fragmented information hides operon complexity. Additionally, in-vivo processing of transcripts confounds the accurate identification of the 5 and 3 prime ends of operons. Here we developed a novel methodology called SMRT-Cappable-seq that combines the isolation of unfragmented primary transcripts with single-molecule long read sequencing. Applied to E. coli, this technology results in an unprecedented definition of the transcriptome with 34 % of the known operons being extended by at least one gene. Furthermore, 40% of transcription termination sites have read-through that alters the gene content of the operons. As a result, most of the bacterial genes are present in multiple operon variants reminiscent of eukaryotic splicing. By providing an unprecedented granularity in the operon structure, this study represents an important resource for the study of prokaryotic gene network and regulation.

Download data

  • Downloaded 2,290 times
  • Download rankings, all-time:
    • Site-wide: 7,206
    • In genomics: 757
  • Year to date:
    • Site-wide: 105,662
  • Since beginning of last month:
    • Site-wide: 42,092

Altmetric data

Downloads over time

Distribution of downloads per paper, site-wide