Rxivist logo

Universal correction of enzymatic sequence bias reveals molecular signatures of protein/DNA interactions

By André L. Martins, Ninad M. Walavalkar, Warren D. Anderson, Chongzhi Zang, Michael J Guertin

Posted 30 Jan 2017
bioRxiv DOI: 10.1101/104364 (published DOI: 10.1093/nar/gkx1053)

Coupling molecular biology to high throughput sequencing has revolutionized the study of biology. Molecular genomics techniques are continually refined to provide higher resolution mapping of nucleic acid interactions and structure. Sequence preferences of enzymes can interfere with the accurate interpretation of these data. We developed seqOutBias to characterize enzymatic sequence bias from experimental data and scale individual sequence reads to correct intrinsic enzymatic sequence biases. SeqOutBias efficiently corrects DNase-seq, TACh-seq, ATAC-seq, MNase-seq, and PRO-seq data. We show that seqOutBias correction facilitates identification of true molecular signatures resulting from transcription factors and RNA polymerase interacting with DNA.

Download data

  • Downloaded 1,253 times
  • Download rankings, all-time:
    • Site-wide: 18,948
    • In genomics: 1,802
  • Year to date:
    • Site-wide: 89,189
  • Since beginning of last month:
    • Site-wide: 152,009

Altmetric data

Downloads over time

Distribution of downloads per paper, site-wide