Course Title: Single-Cell Genomics Data Processing Training Course
Executive Summary
This intensive two-week course equips participants with the knowledge and skills necessary to effectively process and analyze single-cell genomics data. Covering a range of topics from experimental design to advanced computational techniques, the course provides hands-on experience in quality control, data normalization, dimensionality reduction, cell clustering, differential gene expression analysis, and trajectory inference. Participants will gain proficiency in using industry-standard software and tools, as well as learn best practices for data interpretation and visualization. The curriculum emphasizes reproducibility and rigor in single-cell data analysis, enabling researchers to extract meaningful biological insights from complex datasets. This course is designed for researchers with a basic understanding of genomics and programming who wish to specialize in single-cell data analysis.
Introduction
Single-cell genomics has revolutionized our understanding of cellular heterogeneity and function, offering unprecedented insights into development, disease, and evolution. The ability to profile gene expression, chromatin accessibility, and other molecular features at single-cell resolution has led to breakthroughs in various fields, including immunology, cancer biology, and neuroscience. However, the analysis of single-cell data presents unique computational challenges due to its high dimensionality, noise, and complexity. This course provides a comprehensive introduction to the principles and practices of single-cell genomics data processing, covering the essential steps from raw data to biological interpretation. Through a combination of lectures, hands-on exercises, and case studies, participants will learn how to effectively analyze single-cell datasets and extract meaningful biological insights. The course aims to bridge the gap between experimental design and data analysis, empowering researchers to leverage the full potential of single-cell genomics.
Course Outcomes
- Understand the principles of single-cell genomics and its applications.
- Perform quality control and pre-processing of single-cell RNA-seq data.
- Apply normalization and batch correction methods to remove technical artifacts.
- Perform dimensionality reduction and cell clustering to identify cell types and states.
- Conduct differential gene expression analysis to identify marker genes and pathways.
- Infer cell trajectories and pseudotime to study developmental processes.
- Visualize and interpret single-cell data using various bioinformatics tools.
Training Methodologies
- Interactive lectures and discussions.
- Hands-on workshops with real-world datasets.
- Case studies of published single-cell genomics research.
- Individual and group programming exercises.
- Guest lectures from leading experts in the field.
- Online resources and tutorials.
- Project-based learning and peer review.
Benefits to Participants
- Gain expertise in processing and analyzing single-cell genomics data.
- Develop proficiency in using industry-standard bioinformatics tools.
- Improve skills in data visualization and interpretation.
- Enhance understanding of single-cell genomics experimental design.
- Expand professional network through interaction with experts and peers.
- Increase competitiveness in the job market.
- Contribute to cutting-edge research in single-cell biology.
Benefits to Sending Organization
- Enhance research capabilities in single-cell genomics.
- Improve data quality and reproducibility.
- Increase efficiency in data analysis and interpretation.
- Foster collaboration and knowledge sharing among researchers.
- Attract and retain top talent in the field.
- Gain a competitive advantage in research funding and publications.
- Contribute to advancements in biomedical research and healthcare.
Target Participants
- Graduate students in biology, bioinformatics, or related fields.
- Postdoctoral researchers in genomics, immunology, or cancer biology.
- Research scientists in pharmaceutical and biotechnology companies.
- Bioinformaticians and data scientists working with single-cell data.
- Principal investigators interested in incorporating single-cell genomics into their research.
- Clinical researchers seeking to understand disease mechanisms at single-cell resolution.
- Core facility staff providing single-cell genomics services.
Week 1: Foundational Concepts and Data Preprocessing
Module 1: Introduction to Single-Cell Genomics
- Overview of single-cell technologies (RNA-seq, ATAC-seq, etc.).
- Applications of single-cell genomics in various fields.
- Experimental design considerations for single-cell experiments.
- Data formats and file structures for single-cell data.
- Overview of common single-cell analysis workflows.
- Introduction to command-line interface (CLI) and scripting.
- Setting up the analysis environment (e.g., conda).
Module 2: Quality Control and Preprocessing
- Quality assessment of raw sequencing data (FastQC).
- Read alignment to the reference genome (STAR, Cell Ranger).
- Cell barcode and unique molecular identifier (UMI) processing.
- Filtering of low-quality cells and genes.
- Removal of doublet cells (DoubletFinder, Scrublet).
- Exploration of quality metrics using visualization tools.
- Hands-on exercise: Quality control and filtering of a single-cell RNA-seq dataset.
Module 3: Normalization and Batch Correction
- Normalization methods for single-cell data (CPM, TPM, Seurat’s NormalizeData).
- Log transformation and scaling of gene expression values.
- Batch effect correction methods (Combat, Harmony).
- Integration of multiple single-cell datasets.
- Evaluation of batch correction performance.
- Hands-on exercise: Normalization and batch correction of a single-cell dataset.
- Discussion of the assumptions and limitations of different methods.
Module 4: Dimensionality Reduction
- Introduction to dimensionality reduction techniques (PCA, t-SNE, UMAP).
- Selection of highly variable genes (HVGs).
- Performing PCA and visualizing principal components.
- Implementing t-SNE and UMAP for non-linear dimensionality reduction.
- Parameter tuning and optimization for t-SNE and UMAP.
- Hands-on exercise: Dimensionality reduction of a single-cell dataset.
- Interpretation of dimensionality reduction plots.
Module 5: Cell Clustering
- Introduction to cell clustering algorithms (k-means, Louvain, Leiden).
- Graph-based clustering methods.
- Determining the optimal number of clusters.
- Visualization of cell clusters on dimensionality reduction plots.
- Cluster stability analysis.
- Hands-on exercise: Cell clustering of a single-cell dataset.
- Comparison of different clustering algorithms.
Week 2: Advanced Analysis and Biological Interpretation
Module 6: Differential Gene Expression Analysis
- Identifying differentially expressed genes between cell clusters.
- Statistical tests for differential expression (Wilcoxon rank-sum test, t-test).
- Multiple hypothesis correction (Benjamini-Hochberg).
- Volcano plots and heatmap visualization of differential expression results.
- Pathway enrichment analysis (GO, KEGG).
- Hands-on exercise: Differential gene expression analysis of a single-cell dataset.
- Functional interpretation of differentially expressed genes.
Module 7: Cell Type Annotation
- Using known marker genes to annotate cell types.
- Automated cell type annotation tools (CellAssign, SingleR).
- Integration with public reference datasets.
- Manual curation and refinement of cell type annotations.
- Validation of cell type annotations using orthogonal methods.
- Hands-on exercise: Cell type annotation of a single-cell dataset.
- Discussion of cell type identity and heterogeneity.
Module 8: Trajectory Inference and Pseudotime Analysis
- Introduction to trajectory inference methods (Monocle, Slingshot).
- Inferring cell trajectories and developmental lineages.
- Pseudotime ordering of cells along trajectories.
- Identifying genes that change along pseudotime.
- Visualization of cell trajectories.
- Hands-on exercise: Trajectory inference and pseudotime analysis of a single-cell dataset.
- Interpretation of trajectory inference results.
Module 9: Advanced Single-Cell Analysis Techniques
- Single-cell ATAC-seq data analysis.
- Multi-omics integration (RNA-seq + ATAC-seq).
- Spatial transcriptomics data analysis.
- Cell-cell interaction analysis.
- Copy number variation analysis in single cells.
- Hands-on exercise: Application of advanced single-cell analysis techniques.
- Discussion of emerging trends in single-cell genomics.
Module 10: Data Visualization and Interpretation
- Best practices for data visualization in single-cell genomics.
- Creating publication-quality figures.
- Effective communication of single-cell genomics results.
- Tools for interactive data exploration (Shiny, R Markdown).
- Reproducible research practices.
- Capstone project presentation: Analysis of a single-cell dataset and presentation of results.
- Course wrap-up and future directions.
Action Plan for Implementation
- Apply the learned skills to ongoing or planned research projects.
- Share knowledge and best practices with colleagues.
- Develop standard operating procedures (SOPs) for single-cell data analysis.
- Explore new single-cell analysis tools and techniques.
- Participate in single-cell genomics conferences and workshops.
- Contribute to open-source bioinformatics projects.
- Seek opportunities to collaborate with experts in the field.
Course Features
- Lecture 0
- Quiz 0
- Skill level All levels
- Students 0
- Certificate No
- Assessments Self





