stLearn: integrating spatial location, tissue morphology and gene expression to find cell types, cell-cell interactions and spatial trajectories within undissociated tissues
Spatial transcriptomics is an emerging technology that adds spatial dimensionality and tissue morphology to the genome-wide transcriptional profile of cells in an undissociated tissue. Integrating these three types of data creates vast potential for deciphering novel biology of cell types in their native morphological context. Here we developed integrative analysis approaches that use all three data types to first find cell types, then reconstruct cell type evolution within a tissue, and finally search for tissue regions with high cell-cell interaction activity. First, to normalise gene expression, we compute a distance measure using morphological similarity and neighbourhood smoothing. The normalised data are then used to find clusters that represent transcriptional profiles of specific cell types and cellular phenotypes. Clusters are further sub-clustered if their cells are spatially separated. Analysing anatomical regions in three mouse brain sections and 12 human brain datasets, we found the spatial clustering method to be more accurate and sensitive than other methods. Second, we introduce a method to calculate transcriptional states by pseudo-space-time (PST) distance. PST distance is a function of physical distance (spatial distance) and gene expression distance (pseudotime distance) that estimates the pairwise similarity between the transcriptional profiles of cells within a tissue. We reconstruct spatial transition gradients within and between cell types that are connected locally within a cluster, or globally between clusters, by a directed minimum spanning tree optimisation approach for PST distance. The PST algorithm could model the spatial transition from non-invasive to invasive cells within a breast cancer dataset. Third, we utilise spatial information and gene expression profiles to identify locations in the tissue where there is both high ligand-receptor interaction activity and diverse cell type co-localisation. These tissue locations are predicted to be hotspots where cell-cell interactions are more likely to occur. We detected tissue regions and ligand-receptor pairs that were significantly enriched compared with the background distribution across breast cancer tissue. Together, these three algorithms, implemented in the comprehensive Python software stLearn, allow for the elucidation of biological processes within healthy and diseased tissues.
### Competing Interest Statement
The authors have declared no competing interest.
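The pseudo-space-time idea in the abstract can be illustrated with a small, self-contained sketch. It is not the stLearn implementation. The abstract says only that PST distance is "a function of" spatial and gene expression distance, so the weighted sum, the `weight` parameter, the max-scaling step and all function names below are assumptions made for illustration. The tree is built with Prim's algorithm on a symmetric matrix and its edges are simply oriented away from a chosen root, which is a simplification of the directed minimum spanning tree optimisation the abstract mentions.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def pairwise(points):
    """Full pairwise Euclidean distance matrix as a list of lists."""
    n = len(points)
    return [[euclidean(points[i], points[j]) for j in range(n)] for i in range(n)]


def scale(matrix):
    """Divide by the largest entry so both distance terms lie in [0, 1]."""
    top = max(max(row) for row in matrix)
    if top == 0:
        return [row[:] for row in matrix]
    return [[value / top for value in row] for row in matrix]


def pst_distance(coords, expression, weight=0.5):
    """Assumed form: weight * spatial + (1 - weight) * expression distance."""
    spatial = scale(pairwise(coords))
    expr = scale(pairwise(expression))
    n = len(coords)
    return [
        [weight * spatial[i][j] + (1 - weight) * expr[i][j] for j in range(n)]
        for i in range(n)
    ]


def spanning_tree(dist, root=0):
    """Prim's algorithm; returns (parent, child) edges pointing away from root."""
    n = len(dist)
    in_tree = {root}
    edges = []
    while len(in_tree) < n:
        # Pick the cheapest edge that leaves the current tree.
        parent, child = min(
            ((i, j) for i in in_tree for j in range(n) if j not in in_tree),
            key=lambda edge: dist[edge[0]][edge[1]],
        )
        edges.append((parent, child))
        in_tree.add(child)
    return edges


# Toy data: four spots on a line with a gradual shift in a two-gene profile.
coords = [(0, 0), (1, 0), (2, 0), (10, 0)]
expression = [(1.0, 0.0), (0.9, 0.1), (0.5, 0.5), (0.0, 1.0)]

pst = pst_distance(coords, expression, weight=0.5)
print(spanning_tree(pst, root=0))
```

With these toy inputs the tree follows the spots in order along the line, because neighbouring spots are close both physically and transcriptionally. Shifting `weight` towards 0 or 1 lets expression or physical distance dominate, which is one way to explore how the two terms trade off.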