Rxivist logo

Distinct genomic and epigenomic features demarcate hypomethylated blocks in colon cancer

By Mahfuza Sharmin, H├ęctor Corrada Bravo, Sridhar Hannenhalli

Posted 09 Oct 2015
bioRxiv DOI: 10.1101/028803 (published DOI: 10.1186/s12885-016-2128-1)

Background. Large mega base-pair genomic regions show robust alterations in DNA methylation levels in multiple cancers, a vast majority of which are hypo-methylated in cancers. These regions are generally bounded by CpG islands, overlap with Lamin Associated Domains and Large organized chromatin lysine modifications, and are associated with stochastic variability in gene expression. Given the size and consistency of hypo-methylated blocks (HMB) across cancer types, their immediate causes are likely to be encoded in the genomic region near HMB boundaries, in terms of specific genomic or epigenomic signatures. However, a detailed characterization of the HMB boundaries has not been reported. Method. Here, we focused on ~13k HMBs, encompassing approximately half the genome, identified in colon cancer. We analyzed a number of distinguishing features at the HMB boundaries including transcription factor (TF) binding motifs, various epigenomic marks, and chromatin structural features. Result. We found that the classical promoter epigenomic mark, H3K4me3, is highly enriched at HMB boundaries, as are CTCF bound sites. HMB boundaries harbor distinct combinations of TF motifs. Our Random Forest model based on TF motifs can accurately distinguish boundaries not only from regions inside and outside HMBs, but surprisingly, from active promoters as well. Interestingly, the distinguishing TFs and their interacting proteins are involved in chromatin modification. Finally, HMB boundaries significantly coincide with the boundaries of Topologically Associating Domains of the chromatin. Conclusion. Our analyses suggest that the overall architecture of HMBs is guided by pre-existing chromatin architecture, and are associated with aberrant activity of promoter-like sequences at the boundary.

Download data

  • Downloaded 411 times
  • Download rankings, all-time:
    • Site-wide: 102,413
    • In bioinformatics: 8,476
  • Year to date:
    • Site-wide: 138,921
  • Since beginning of last month:
    • Site-wide: 173,160

Altmetric data

Downloads over time

Distribution of downloads per paper, site-wide