Rxivist logo

DRAMS: A Tool to Detect and Re-Align Mixed-up Samples for Integrative Studies of Multi-omics Data

By Yi Jiang, Gina Giase, Kay Grennan, Annie W. Shieh, Yan Xia, Lide Han, Quan Wang, Qiang Wei, Rui Chen, Sihan Liu, Kevin P White, Chao Chen, Bingshan Li, Chunyu Liu

Posted 06 Nov 2019
bioRxiv DOI: 10.1101/831537 (published DOI: 10.1371/journal.pcbi.1007522)

Studies of complex disorders benefit from integrative analyses of multiple omics data. Yet, sample mix-ups frequently occur in multi-omics studies, weakening statistical power and risking false findings. Accurately aligning sample information, genotype, and corresponding omics data is critical for integrative analyses. We developed DRAMS ( https://github.com/Yi-Jiang/DRAMS ) to Detect and Re-Align Mixed-up Samples to address the sample mix-up problem. It uses a logistic regression model followed by a modified topological sorting algorithm to identify the potential true IDs based on data relationships of multi-omics. According to tests using simulated data, the more types of omics data used or the smaller the proportion of mix-ups, the better that DRAMS performs. Applying DRAMS to real data from the PsychENCODE BrainGVEX project, we detected and corrected 201 (12.5% of total data generated) mix-ups. Of the 21 mix-ups involving errors of racial identity, DRAMS re-assigned all samples to the correct racial cluster in the 1000 Genomes project. In doing so, quantitative trait loci (QTL) (FDR<0.01) increased by an average of 1.62-fold. The use of DRAMS in multi-omics studies will strengthen statistical power of the study and improve quality of the results. Even though very limited studies have multi-omics data in place, we expect such data will increase quickly with the needs of DRAMS.

Download data

  • Downloaded 225 times
  • Download rankings, all-time:
    • Site-wide: 108,161
    • In bioinformatics: 9,096
  • Year to date:
    • Site-wide: 132,332
  • Since beginning of last month:
    • Site-wide: 104,143

Altmetric data

Downloads over time

Distribution of downloads per paper, site-wide