Genomic analyses of 10,376 individuals provides comprehensive map of genetic variations, structure and reference haplotypes for Chinese population

By Peikuan Cong, Weiyang Bai, Jinchen Li, Nan Li, Sirui Gai, Saber Khederzadeh, Yuheng Liu, Mochang Qiu, Xiaowei Zhu, Pianpian Zhao, Jiangwei Xia, Shihui Yu, Weiwei Zhao, Junquan Liu, Penglin Guan, Yu Qian, Jianguo Tao, Mengyuan Yang, Geng Tian, Shuyang Xie, Keqi Liu, Beisha Tang, Houfeng Zheng

Posted 08 Feb 2021
bioRxiv DOI: 10.1101/2021.02.06.430086

Here, we initiated the Westlake BioBank for Chinese (WBBC) pilot project with 4,535 whole-genome sequenced individuals and 5,841 high-density genotyped individuals. We identified 80.99 million SNPs and INDELs, of which 38.6% are novel. Genetic evidence of Chinese population structure supported the corresponding geographical boundaries of the Qinling-Huaihe Line and the Nanling Mountains. The genetic architecture within North Han was more homogeneous than that within South Han, and the effective population size history of Lingnan began to deviate from that of the other three regions 6 thousand years ago. In addition, we identified a novel locus (SNX29) under selection pressure and confirmed several loci associated with alcohol metabolism and histocompatibility systems. We observed significant selection on genes involved in epidermal cell differentiation and skin development only in southern Chinese. Finally, we provide an online imputation server (https://wbbc.westlake.edu.cn/) that could yield higher imputation accuracy than existing panels, especially for lower-frequency variants.

Download data

  • Downloaded 392 times
  • Download rankings, all-time:
    • Site-wide: 77,062
    • In genetics: 3,381
  • Year to date:
    • Site-wide: 11,143
  • Since beginning of last month:
    • Site-wide: 23,577
