Rxivist logo

Rxivist combines preprints from bioRxiv with data from Twitter to help you find the papers being discussed in your field. Currently indexing 67,545 bioRxiv papers from 297,698 authors.

projectR: An R/Bioconductor package for transfer learning via PCA, NMF, correlation, and clustering

By Gaurav Sharma, Carlo Colantuoni, Loyal A. Goff, Elana J. Fertig, Genevieve Stein-O’Brien

Posted 06 Aug 2019
bioRxiv DOI: 10.1101/726547

Motivation: Dimension reduction techniques are widely used to interpret high-dimensional biological data. Features learned from these methods are used to discover both technical artifacts and novel biological phenomena. Such feature discovery is critically import to large single-cell datasets, where lack of a ground truth limits validation and interpretation. Transfer learning (TL) can be used to relate the features learned from one source dataset to a new target dataset to perform biologically-driven validation by evaluating their use in or association with additional sample annotations in that independent target dataset. Results: We developed an R/Bioconductor package, projectR, to perform TL for analyses of genomics data via TL of clustering, correlation, and factorization methods. We then demonstrate the utility TL for integrated data analysis with an example for spatial single-cell analysis. Availability: projectR is available on Bioconductor and at https://github.com/genesofeve/projectR.

Download data

  • Downloaded 359 times
  • Download rankings, all-time:
    • Site-wide: 30,902 out of 67,516
    • In bioinformatics: 3,978 out of 6,648
  • Year to date:
    • Site-wide: 11,385 out of 67,516
  • Since beginning of last month:
    • Site-wide: 24,018 out of 67,516

Altmetric data


Downloads over time

Distribution of downloads per paper, site-wide


Sign up for the Rxivist weekly newsletter! (Click here for more details.)


News