Rxivist logo

HPRep: Quantifying reproducibility in HiChIP and PLAC-seq datasets

By Jonathan D Rosen, Yuchen Yang, Armen Abnousi, Jiawen Chen, Michael Song, Ian R Jones, Yin Shen, Ming Hu, Yun Li

Posted 23 Nov 2020
bioRxiv DOI: 10.1101/2020.11.23.394239

HiChIP and PLAC-seq are emerging technologies for studying genome-wide long-range chromatin interactions mediated by protein of interest, enabling more sensitive and cost-efficient interrogation of protein-centric chromatin conformation. However, due to the unbalanced read distribution introduced by protein immunoprecipitation, existing reproducibility measures developed for Hi-C data are not appropriate for the analysis of HiChIP and PLAC-seq data. Here, we present HPRep, a stratified and weighted correlation metric derived from normalized contact counts, to quantify reproducibility in HiChIP and PLAC-seq data. We applied HPRep to multiple real datasets and demonstrate that HPRep outperforms existing reproducibility measures developed for Hi-C data. Specifically, we applied HPRep to H3K4me3 PLAC-seq data from mouse embryonic stem cells and mouse brain tissues, as well as H3K27ac HiChIP data from human lymphoblastoid cell line GM12878 and leukemia cell line K562, showing that HPRep can more clearly separate among pseudo-replicates, real replicates, and non-replicates. Furthermore, in an H3K4me3 PLAC-seq dataset consisting of 11 samples from four human brain cell types, HPRep demonstrates expected clustering of data which could not be achieved by existing methods developed for Hi-C data, highlighting the need of a reproducibility metric tailored to HiChIP and PLAC-seq data.

Download data

  • Downloaded 235 times
  • Download rankings, all-time:
    • Site-wide: 133,542
    • In genomics: 7,159
  • Year to date:
    • Site-wide: 70,169
  • Since beginning of last month:
    • Site-wide: 76,051

Altmetric data

Downloads over time

Distribution of downloads per paper, site-wide