Rxivist logo

Whole genome base-wise aggregation and functional prediction for human non-coding regulatory variants

By Shijie Zhang, Yukun He, Huanhuan Liu, Haoyu Zhai, Dandan Huang, Xianfu Yi, Xiaobao Dong, Zhao Wang, Ke Zhao, Yao Zhou, Jianhua Wang, Hongcheng Yao, Hang Xu, Zhenglu Yang, Pak Chung Sham, Kexin Chen, Mulin Jun Li

Posted 20 Mar 2019
bioRxiv DOI: 10.1101/583237

Predicting the functional or pathogenic regulatory variants in the human non-coding genome facilitates the interpretation of disease causation. While numerous prediction methods are available, their performance is inconsistent or restricted to specific tasks, which raises the demand of developing comprehensive integration for those methods. Here, we compile whole genome base-wise aggregations, regBase, that incorporate largest prediction scores. Building on different assumptions of causality, we train three composite models to score functional, pathogenic and cancer driver non-coding regulatory variants respectively. We demonstrate the superior and stable performance of our models using independent benchmarks and show great success to fine-map causal regulatory variants. We believe that regBase database together with three composite models will be useful in different areas of human genetic studies, such as annotation-based casual variant fine-mapping, pathogenic variant discovery as well as cancer driver mutation identification. regBase is freely available at https://github.com/mulinlab/regBase.

Download data

  • Downloaded 551 times
  • Download rankings, all-time:
    • Site-wide: 28,793 out of 92,738
    • In genomics: 2,976 out of 5,845
  • Year to date:
    • Site-wide: 37,978 out of 92,738
  • Since beginning of last month:
    • Site-wide: 63,334 out of 92,738

Altmetric data

Downloads over time

Distribution of downloads per paper, site-wide


Sign up for the Rxivist weekly newsletter! (Click here for more details.)