12 Introduction to spatial transcriptomics

Instructor: Daianna Gonzalez-Padilla

You might also be interested in this recent blog post by Leo https://lcolladotor.github.io/2024/05/23/humanpilot-first-spatially-resolved-transcriptomics-study-using-visium/ and the companion walk through video

For a journal club presentation on the HumanPilot paper, check this video:

In recent years, with constant improvements in the current sequencing technologies and the generation of more sophisticated omics methodologies and bioinformatic pipelines, we have been constantly demonstrating that specific cell types and cell-to-cell interactions play critical roles in the definition of numerous diseases and development-related processes. In fact, cell type-specific associations have been established for a number of diseases and disorders. Thus, understanding the cellular context and the spatial location in which normal and deregulated cellular events occur is necessary to unveil the molecular underpinnings of disease pathologies and malfunctions of the organisms.

Spatial transcriptomics technologies are molecular profiling methods developed to measure gene expression levels in a tissue sample at the spatial resolution. These methods have been improved and expanded over time and are widely applied to study a wide range of biological processes and have provided numerous insights into disease and development mechanisms.

In particular, the 10x Genomics Visium platform is a technology that spatially profiles the transcriptome of frozen and fixed tissue sections in combination with histology.

12.1 3’ Visium spatial technology

This is the Visium technology more frequently used and it captures polyadenilated transcripts within individual spatially barcoded spots.

In the Visium expression slide there are 4 capture areas, each of 6.5 (+1.5) mm\(^2\) with ~5k barcoded spots (55 µm in diameter each), within which mRNAs are captured by polyT primers that contain a read for sequencing (see below), a UMI (unique molecular identifier), and a spatial barcode. In this way, all RNAs trapped in the same spot are tagged with the same spot-specific barcode and we can computationally trace the original location of the transcripts.

Figure 1: Schematic representation of the Visium capture areas and spots. Source: SciLifeLab (2023).

12.2 Spatial data visualization

In order to interactively visualize example spatial data we’ll use the shiny web application of spatialLIBD: http://spatial.libd.org/spatialLIBD/. This web application allows to browse the human dorsolateral pre-frontal cortex (DLPFC) spatial transcriptomics data generated at the LIBD using the 10x Genomics Visium platform. In total there are 12 DLPFC tissue sections from 3 donors, each spanning six classical histological layers plus the white matter (WM).

Figure 2: Human DLPFC tissue section. Spot plot depicting the 6 classical histological layers (L1-L6) and the white matter (WM) in a human DLPFC sample.

12.2.1 Spot-level data exploration

With this tool you can:

  • Observe per-spot QC metrics and gene expression levels
  • Explore spot clusters in the tissue sections
  • Visualize the spot data on reduced dimensions
  • Manually annotate spots to layers and export your manual annotations
  • Customize the spatial images

📝 Exercise 1: visualize the clustering of spots in all tissue sections using the different discrete variables to plot. Which one recapitulates better the six histological layers (plus the white matter) of the human DLPFC?

📝 Exercise 2: explore the expression of SNAP25 (neuronal marker gene), MOBP (oligodendrocyte/WM marker gene), and PCP4 (layer 5 marker gene) in each DLPFC tissue section. What do you observe? Are there any spatial patterns in the expression of these genes?

12.2.2 Layer-level data exploration

Layer-level data result from pseudo-bulking the spot-level data, i.e. from aggregating spot data from all spots assigned to a given layer.

At this level the tool allows to:

  • Visualize the gene expression data at the layer level in reduced dimensions
  • Plot the layer-level lognorm or raw expression of a gene across all tissue sections and extract DEGs among layers (ANOVA model), in a specific layer compared to the rest (enrichment model) or compared to another layer (pairwise model)
  • Assess the enrichment of your own sets of genes of interest among the DEGs from these spatial DLPFC data
  • Correlate gene-wise statistics for DE between sn/scRNA-seq data clusters/cell populations with the DE statistics in the human DLPFC layers provided in this study. This can be used to label your sn/scRNA-seq groups or clusters with the more molecularly-defined histological layers

📝 Exercise 3: plot the expression of SNAP25, MOBP, and PCP4 in the different layers of each DLPFC tissue section. Are there any significant differences in the expression of these genes between layers under any of the statistical models for DGE?

Bibliography

  1. SciLifeLab (2023). 10X Genomics Visium for Fresh Frozen samples. Web site: https://ngisweden.scilifelab.se/methods/10x-visium/

  2. 10x Genomics (n.d.). Whole transcriptome discovery in the tissue context. Web site: https://www.10xgenomics.com/platforms/visium

© 2011-2023. All thoughts and opinions here are my own. The icon was designed by Mauricio Guzmán and is inspired by Huichol culture; it represents my community building interests.

Published with Bookdown