Cargando…
scGEM: Unveiling the Nested Tree-Structured Gene Co-Expressing Modules in Single Cell Transcriptome Data
SIMPLE SUMMARY: Single-cell RNA sequencing has significantly contributed to the discovery of heterogenous cellular programs that are highly expressed by distinct cell subtypes. Yet, the quantification of cell subtype-specific and shared gene co-expressing modules (GEMs) associated with cell differen...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10486867/ https://www.ncbi.nlm.nih.gov/pubmed/37686554 http://dx.doi.org/10.3390/cancers15174277 |
Sumario: | SIMPLE SUMMARY: Single-cell RNA sequencing has significantly contributed to the discovery of heterogenous cellular programs that are highly expressed by distinct cell subtypes. Yet, the quantification of cell subtype-specific and shared gene co-expressing modules (GEMs) associated with cell differentiation is a challenge. Herein, we have developed scGEM to uncover such hidden GEMs and conducted a systematic evaluation of model performance as well as a comparison with existing methods. We demonstrate that scGEM has the potential to generate a better biological explanation of GEMs using simulated and real-world datasets. The positive impacts of this study can illuminate the interpretation of gene modules in single-cell transcriptome analysis and shed light on the cell–cell communication and regulatory network. ABSTRACT: Background: Single-cell transcriptome analysis has fundamentally changed biological research by allowing higher-resolution computational analysis of individual cells and subsets of cell types. However, few methods have met the need to recognize and quantify the underlying cellular programs that determine the specialization and differentiation of the cell types. Methods: In this study, we present scGEM, a nested tree-structured nonparametric Bayesian model, to reveal the gene co-expression modules (GEMs) reflecting transcriptome processes in single cells. Results: We show that scGEM can discover shared and specialized transcriptome signals across different cell types using peripheral blood mononuclear single cells and early brain development single cells. scGEM outperformed other methods in perplexity and topic coherence (p < 0.001) on our simulation data. Larger datasets, deeper trees and pre-trained models are shown to be positively associated with better scGEM performance. The GEMs obtained from triple-negative breast cancer single cells exhibited better correlations with lymphocyte infiltration (p = 0.009) and the cell cycle (p < 0.001) than other methods in additional validation on the bulk RNAseq dataset. Conclusions: Altogether, we demonstrate that scGEM can be used to model the hidden cellular functions of single cells, thereby unveiling the specialization and generalization of transcriptomic programs across different types of cells. |
---|