Title
Functional analysis of single-cell transcriptomics
Tutorial details
- Date: Sunday, September 18th
- Time: 09:00 to 13:00 CEST (Slot 22)
- Format: Face-to-face
- Room: TBD
Instructors
- Pau Badia i Mompel, PhD candidate, Saez-Rodrigez Group; Heidelberg University Hospital, Institute for Computational Biomedicine, Bioquant, Heidelberg (Germany)
- Robin Browaeys, Data Mining and Modeling for Biomedicine, VIB Center for Inflammation Research, Ghent, (Belgium)
- Daniel Dimitrov, PhD Candidate, Saez-Rodriguez Group; Heidelberg University Hospital, Institute for Computational Biomedicine, Bioquant, Heidelberg (Germany)
Summary
Recent advances in omics technologies have led to a rapid increase in the popularity and applications of single-cell data-sets. Standard analyses and workflows solely focus on basic preprocessing steps followed by the identification of differentially expressed genes, and their subsequent use in cell-type annotation and characterization of biological processes. In this tutorial, we show how prior knowledge can be used to extend each of the aforementioned steps, as well as to extract clear biological insights. Furthermore, we provide an introduction to the state-of-the-art intercellular communication methods, as tools for systems-level hypothesis generation tools in single-cell data. We thus cover a diverse set of prior knowledge resources and show how these can be used to support and extend analysis steps ranging from quality control, cell-type annotation and transcription factor and cytokine activity inference. Finally, we show how advanced functional omics analyses can be used to refine cell-cell communication predictions.
Intended audience
Broad audience with some experience in omics data-analysis and interest in analyzing single-cell transcriptomics.
Prerequisites
Maximum number of attendees
30
Material required (for participants)
Laptop with at least 4GB of RAM memory and 10 GB of free disk space, with the package management system Conda installed.
Programme
- Brief summary to single-cell transcriptomics pre-processing (25 min)
To start, we will briefly go over the basics of single-cell transcriptomics: reading raw data, QC filtering, normalization, feature selection, dimensionality reduction and visualization using the scanpy framework
- Prior-knowledge-driven cell-type annotation (20 min)
We will introduce the concept of gene sets, and how to use them to aid in cell identity annotation using over-representation analysis and PanglaoDB, a database of cell type-specific marker genes.
- Gene set enrichment of biological terms (20 min)
Then we will show different gene set data-base resources like MSigDB, and how they can be coupled to a statistical method like GSEA or over-representation analysis to infer enrichment of biological terms in scRNA-seq.
- Transcription factor and cytokine activity estimation from prior-knowledge footprints (30 min)
Afterwards, we will introduce the concept of footprints and compare them to gene sets. As examples, we will show how to infer transcription factors and cytokine activities with the decoupleR framework, making use of the gene regulatory network DoRothEA and CytoSig’s cytokine signatures, respectively.
- Break (15 min)
- Cell-cell communication inference (30 min)
We will introduce the audience to the relevance, concepts, and assumptions behind intercellular communication inference. We will showcase these with a hands-on tutorial on ligand-receptor interaction prediction methods, including CellPhoneDB, NATMI, and SingleCellSignalR, via LIANA.
- Modeling intercellular communication by linking ligands to target genes (25 minutes)
We will then show how one could go beyond ligand-receptor interactions by incorporating prior-knowledge of intracellular signalling. This part will largely focus on introducing the audience to the NicheNet method.
- Connecting biological activities to intercellular communication (15 min)
Finally, we will summarize the lessons from the tutorial by connecting the results from different analyses, and thus highlighting their potential as complementary hypothesis-generation tools.
TIME | CONTENT |
---|---|
9:00 - 9:25 | Brief summary to single-cell transcriptomics pre-processing To start, we will briefly go over the basics of single-cell transcriptomics: reading raw data, QC filtering, normalization, feature selection, dimensionality reduction and visualization using the scanpy framework |
9:25 - 9:45 | Prior-knowledge-driven cell-type annotation We will introduce the concept of gene sets, and how to use them to aid in cell identity annotation using over-representation analysis and PanglaoDB, a database of cell type-specific marker genes. |
9:45 - 10:05 | Gene set enrichment of biological terms Then we will show different gene set data-base resources like MSigDB, and how they can be coupled to a statistical method like GSEA or over-representation analysis to infer enrichment of biological terms in scRNA-seq. |
10:05 - 10:35 | Transcription factor and cytokine activity estimation from prior-knowledge footprints Afterwards, we will introduce the concept of footprints and compare them to gene sets. As examples, we will show how to infer transcription factors and cytokine activities with the decoupleR framework, making use of the gene regulatory network DoRothEA and CytoSig’s cytokine signatures, respectively. |
Break | |
10:50 - 11:20 | Cell-cell communication inference We will introduce the audience to the relevance, concepts, and assumptions behind intercellular communication inference. We will showcase these with a hands-on tutorial on ligand-receptor interaction prediction methods, including CellPhoneDB, NATMI, and SingleCellSignalR, via LIANA. |
11:20 - 11:45 | Modeling intercellular communication by linking ligands to target genes We will then show how one could go beyond ligand-receptor interactions by incorporating prior-knowledge of intracellular signalling. This part will largely focus on introducing the audience to the NicheNet method. |
11:45 - 12:00 | Connecting biological activities to intercellular communication Finally, we will summarize the lessons from the tutorial by connecting the results from different analyses, and thus highlighting their potential as complementary hypothesis-generation tools. |