Rxivist logo

The Genome Sequence Archive Family: Towards Explosive Data Growth and Diverse Data Types

By Tingting Chen, Xu Chen, Sisi Zhang, Junwei Zhu, Bixia Tang, Anke Wang, Lili Dong, Zhewen Zhang, Caixia Yu, Yanling Sun, Lianjiang Chi, Huanxin Chen, Shuang Zhai, Yubin Sun, Li Lan, Xin Zhang, Jingfa Xiao, Yiming Bao, Yanqing Wang, Zhang Zhang, Wenming Zhao

Posted 01 Jul 2021
bioRxiv DOI: 10.1101/2021.06.29.449849

The Genome Sequence Archive (GSA) is a data repository for archiving raw sequence data, which provides data storing and sharing services for worldwide scientific communities. Considering explosive data growth with diverse data types, here we present the GSA family by expanding into a set of resources for raw data archive with different purposes, namely, GSA (https://ngdc.cncb.ac.cn/gsa/), GSA for Human (GSA-Human, https://ngdc.cncb.ac.cn/gsa-human/), and Open Archive for Miscellaneous Data (OMIX, https://ngdc.cncb.ac.cn/omix/). Compared with the 2017 version, GSA has been significantly updated in data model, online functionalities, and web interfaces. GSA-Human, as a new partner of GSA, is a data repository specialized in human genetics-related data with controlled access and security. OMIX, as a critical complement to the two resources mentioned above, is an open archive for miscellaneous data. Together, all these resources form a family of resources dedicated to archiving explosive data with diverse types, accept data submissions from all over the world and provide free open access to all publicly available data in support of worldwide research activities.

Download data

  • Downloaded 91 times
  • Download rankings, all-time:
    • Site-wide: 168,291
    • In bioinformatics: 12,518
  • Year to date:
    • Site-wide: 144,269
  • Since beginning of last month:
    • Site-wide: 92,568

Altmetric data

Downloads over time

Distribution of downloads per paper, site-wide