Rxivist logo

High-coverage, long-read sequencing of Han Chinese trio reference samples.

By Ying-Chih Wang, Nathan D. Olson, Gintaras Deikus, Hardik Shah, Aaron M Wenger, Jonathan Trow, Chunlin Xiao, Stephen Sherry, Marc L. Salit, Justin M. Zook, Melissa Smith, Robert Sebra

Posted 28 Feb 2019
bioRxiv DOI: 10.1101/562611 (published DOI: 10.1038/s41597-019-0098-2)

Single-molecule long-read sequencing datasets were generated for a son-father-mother trio of Han Chinese descent that is part of the Genome In a Bottle (GIAB) consortium portfolio. The dataset was generated using the Pacific Biosciences Sequel System. The son and each parent were sequenced to an average coverage of 60 and 30, respectively, with N50 subread lengths between 16 and 18 kb. Raw reads and reads aligned to both the GRCh37 and GRCh38 are available at the NCBI GIAB ftp site (ftp://ftp-trace.ncbi.nlm.nih.gov/giab/ftp/data/ChineseTrio/) and the raw read data is archived in NCBI SRA (SRX4739017, SRX4739121, and SRX4739122). This dataset is available for anyone to develop and evaluate long-read bioinformatics methods.

Download data

  • Downloaded 438 times
  • Download rankings, all-time:
    • Site-wide: 71,618
    • In genomics: 4,923
  • Year to date:
    • Site-wide: 144,933
  • Since beginning of last month:
    • Site-wide: None

Altmetric data

Downloads over time

Distribution of downloads per paper, site-wide