An integrative ENCODE resource for cancer genomics
By
Jing Zhang,
Donghoon Lee,
Vineet Dhiman,
Peng Jiang,
Jie Xu,
Patrick McGillivray,
Hongbo Yang,
Jason Liu,
William Meyerson,
Declan Clarke,
Mengting Gu,
Shantao Li,
Shaoke Lou,
Jinrui Xu,
Lucas Lochovsky,
Matthew Ung,
Lijia Ma,
Shan Yu,
Qin Cao,
Arif Harmanci,
Koon-Kiu Yan,
Anurag Sethi,
Gamze Gursoy,
Michael Rutenberg Schoenberg,
Joel Rozowsky,
Jonathan Warrell,
Prashant Emani,
Yucheng T Yang,
Timur Galeev,
Xiangmeng Kong,
Shuang Liu,
Xiaotong Li,
Jayanth Krishnan,
Yanlin Feng,
Juan Carlos Rivera-Mulia,
Jessica Adrian,
James R. Broach,
Michael Bolt,
Jennifer Moran,
Dominic Fitzgerald,
Vishnu Dileep,
Tingting Liu,
Shenglin Mei,
Takayo Sasaki,
Claudia Trevilla-Garcia,
Su Wang,
Yanli Wang,
Chongzhi Zang,
Daifeng Wang,
Robert Klein,
Michael Snyder,
David M. Gilbert,
Kevin Yip,
Chao Cheng,
Feng Yue,
X Shirley Liu,
Kevin White,
Mark Gerstein
Posted 18 Jul 2019
bioRxiv DOI: 10.1101/706424
ENCODE comprises thousands of functional genomics datasets, and the encyclopedia covers hundreds of cell types, providing a universal annotation for genome interpretation. However, for particular applications, it may be advantageous to use a customized annotation. Here, we develop such a custom annotation by leveraging advanced assays, such as eCLIP, Hi-C, and whole-genome STARR-seq on a number of data-rich ENCODE cell types. A key aspect of this annotation is comprehensive and experimentally derived networks of both transcription factors and RNA-binding proteins (TFs and RBPs). Cancer, a disease of system-wide dysregulation, is an ideal application for such a network-based annotation. Specifically, for cancer-associated cell types, we put regulators into hierarchies and measure their network change (rewiring) during oncogenesis. We also extensively survey TF-RBP crosstalk, highlighting how SUB1, a previously uncharacterized RBP, drives aberrant tumor expression and amplifies the effect of MYC, a well-known oncogenic TF. Furthermore, we show how our annotation allows us to place oncogenic transformations in the context of a broad cell space; here, many normal-to-tumor transitions move towards a stem-like state, while oncogene knockdowns show an opposing trend. Finally, we organize the resource into a coherent workflow to prioritize key elements and variants, in addition to regulators. We showcase the application of this prioritization to somatic burdening, cancer differential expression and GWAS. Targeted validations of the prioritized regulators, elements and variants using siRNA knockdowns, CRISPR-based editing, and luciferase assays demonstrate the value of the ENCODE resource.
Download data
- Downloaded 1,470 times
- Download rankings, all-time:
- Site-wide: 21,791
- In genomics: 1,883
- Year to date:
- Site-wide: 170,247
- Since beginning of last month:
- Site-wide: 148,584
Altmetric data
Downloads over time
Distribution of downloads per paper, site-wide
PanLingua
News
- 27 Nov 2020: The website and API now include results pulled from medRxiv as well as bioRxiv.
- 18 Dec 2019: We're pleased to announce PanLingua, a new tool that enables you to search for machine-translated bioRxiv preprints using more than 100 different languages.
- 21 May 2019: PLOS Biology has published a community page about Rxivist.org and its design.
- 10 May 2019: The paper analyzing the Rxivist dataset has been published at eLife.
- 1 Mar 2019: We now have summary statistics about bioRxiv downloads and submissions.
- 8 Feb 2019: Data from Altmetric is now available on the Rxivist details page for every preprint. Look for the "donut" under the download metrics.
- 30 Jan 2019: preLights has featured the Rxivist preprint and written about our findings.
- 22 Jan 2019: Nature just published an article about Rxivist and our data.
- 13 Jan 2019: The Rxivist preprint is live!