Rxivist logo

Rxivist combines preprints from bioRxiv with data from Twitter to help you find the papers being discussed in your field. Currently indexing 71,071 bioRxiv papers from 310,049 authors.

A map of direct TF-DNA interactions in the human genome

By Marius Gheorghe, Geir K. Sandve, Aziz Khan, Jeanne Chèneby, Benoit Ballester, Anthony Mathelier

Posted 17 Aug 2018
bioRxiv DOI: 10.1101/394205 (published DOI: 10.1093/nar/gky1210)

Chromatin immunoprecipitation followed by sequencing (ChIP-seq) is the most popular assay to identify genomic regions, called ChIP-seq peaks, that are bound in vivo by transcription factors (TFs). These regions are derived from direct TF-DNA interactions, indirect binding of the TF to the DNA (through a co-binding partner), nonspecific binding to the DNA, and noise/bias/artifacts. Delineating the bona fide direct TF-DNA interactions within the ChIP-seq peaks remains challenging. We developed a dedicated software, ChIP-eat, that combines computational TF binding models and ChIP-seq peaks to automatically predict direct TF-DNA interactions. Our work culminated with predicted interactions covering >4% of the human genome, obtained by uniformly processing 1,983 ChIP-seq peak data sets from the ReMap database for 232 unique TFs. The predictions were a posteriori assessed using protein binding microarray and ChIP-exo data, and were predominantly found in high quality ChIP-seq peaks. The set of predicted direct TF-DNA interactions suggested that high-occupancy target regions are likely not derived from direct binding of the TFs to the DNA. Our predictions derived co-binding TFs supported by protein-protein interaction data and defined cis-regulatory modules enriched for disease- and trait-associated SNPs. Finally, we provide this collection of direct TF-DNA interactions and cis-regulatory modules in the human genome through the UniBind web-interface (http://unibind.uio.no).

Download data

  • Downloaded 954 times
  • Download rankings, all-time:
    • Site-wide: 9,015 out of 71,071
    • In bioinformatics: 1,556 out of 6,949
  • Year to date:
    • Site-wide: 55,023 out of 71,071
  • Since beginning of last month:
    • Site-wide: 55,328 out of 71,071

Altmetric data


Downloads over time

Distribution of downloads per paper, site-wide


PanLingua

Sign up for the Rxivist weekly newsletter! (Click here for more details.)


News