Cargando…
Celda: a Bayesian model to perform co-clustering of genes into modules and cells into subpopulations using single-cell RNA-seq data
Single-cell RNA-seq (scRNA-seq) has emerged as a powerful technique to quantify gene expression in individual cells and to elucidate the molecular and cellular building blocks of complex tissues. We developed a novel Bayesian hierarchical model called Cellular Latent Dirichlet Allocation (Celda) to...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9469931/ https://www.ncbi.nlm.nih.gov/pubmed/36110899 http://dx.doi.org/10.1093/nargab/lqac066 |
_version_ | 1784788740716101632 |
---|---|
author | Wang, Zhe Yang, Shiyi Koga, Yusuke Corbett, Sean E Shea, Conor V Johnson, W Evan Yajima, Masanao Campbell, Joshua D |
author_facet | Wang, Zhe Yang, Shiyi Koga, Yusuke Corbett, Sean E Shea, Conor V Johnson, W Evan Yajima, Masanao Campbell, Joshua D |
author_sort | Wang, Zhe |
collection | PubMed |
description | Single-cell RNA-seq (scRNA-seq) has emerged as a powerful technique to quantify gene expression in individual cells and to elucidate the molecular and cellular building blocks of complex tissues. We developed a novel Bayesian hierarchical model called Cellular Latent Dirichlet Allocation (Celda) to perform co-clustering of genes into transcriptional modules and cells into subpopulations. Celda can quantify the probabilistic contribution of each gene to each module, each module to each cell population and each cell population to each sample. In a peripheral blood mononuclear cell dataset, Celda identified a subpopulation of proliferating T cells and a plasma cell which were missed by two other common single-cell workflows. Celda also identified transcriptional modules that could be used to characterize unique and shared biological programs across cell types. Finally, Celda outperformed other approaches for clustering genes into modules on simulated data. Celda presents a novel method for characterizing transcriptional programs and cellular heterogeneity in scRNA-seq data. |
format | Online Article Text |
id | pubmed-9469931 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-94699312022-09-14 Celda: a Bayesian model to perform co-clustering of genes into modules and cells into subpopulations using single-cell RNA-seq data Wang, Zhe Yang, Shiyi Koga, Yusuke Corbett, Sean E Shea, Conor V Johnson, W Evan Yajima, Masanao Campbell, Joshua D NAR Genom Bioinform Standard Article Single-cell RNA-seq (scRNA-seq) has emerged as a powerful technique to quantify gene expression in individual cells and to elucidate the molecular and cellular building blocks of complex tissues. We developed a novel Bayesian hierarchical model called Cellular Latent Dirichlet Allocation (Celda) to perform co-clustering of genes into transcriptional modules and cells into subpopulations. Celda can quantify the probabilistic contribution of each gene to each module, each module to each cell population and each cell population to each sample. In a peripheral blood mononuclear cell dataset, Celda identified a subpopulation of proliferating T cells and a plasma cell which were missed by two other common single-cell workflows. Celda also identified transcriptional modules that could be used to characterize unique and shared biological programs across cell types. Finally, Celda outperformed other approaches for clustering genes into modules on simulated data. Celda presents a novel method for characterizing transcriptional programs and cellular heterogeneity in scRNA-seq data. Oxford University Press 2022-09-13 /pmc/articles/PMC9469931/ /pubmed/36110899 http://dx.doi.org/10.1093/nargab/lqac066 Text en © The Author(s) 2022. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Standard Article Wang, Zhe Yang, Shiyi Koga, Yusuke Corbett, Sean E Shea, Conor V Johnson, W Evan Yajima, Masanao Campbell, Joshua D Celda: a Bayesian model to perform co-clustering of genes into modules and cells into subpopulations using single-cell RNA-seq data |
title | Celda: a Bayesian model to perform co-clustering of genes into modules and cells into subpopulations using single-cell RNA-seq data |
title_full | Celda: a Bayesian model to perform co-clustering of genes into modules and cells into subpopulations using single-cell RNA-seq data |
title_fullStr | Celda: a Bayesian model to perform co-clustering of genes into modules and cells into subpopulations using single-cell RNA-seq data |
title_full_unstemmed | Celda: a Bayesian model to perform co-clustering of genes into modules and cells into subpopulations using single-cell RNA-seq data |
title_short | Celda: a Bayesian model to perform co-clustering of genes into modules and cells into subpopulations using single-cell RNA-seq data |
title_sort | celda: a bayesian model to perform co-clustering of genes into modules and cells into subpopulations using single-cell rna-seq data |
topic | Standard Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9469931/ https://www.ncbi.nlm.nih.gov/pubmed/36110899 http://dx.doi.org/10.1093/nargab/lqac066 |
work_keys_str_mv | AT wangzhe celdaabayesianmodeltoperformcoclusteringofgenesintomodulesandcellsintosubpopulationsusingsinglecellrnaseqdata AT yangshiyi celdaabayesianmodeltoperformcoclusteringofgenesintomodulesandcellsintosubpopulationsusingsinglecellrnaseqdata AT kogayusuke celdaabayesianmodeltoperformcoclusteringofgenesintomodulesandcellsintosubpopulationsusingsinglecellrnaseqdata AT corbettseane celdaabayesianmodeltoperformcoclusteringofgenesintomodulesandcellsintosubpopulationsusingsinglecellrnaseqdata AT sheaconorv celdaabayesianmodeltoperformcoclusteringofgenesintomodulesandcellsintosubpopulationsusingsinglecellrnaseqdata AT johnsonwevan celdaabayesianmodeltoperformcoclusteringofgenesintomodulesandcellsintosubpopulationsusingsinglecellrnaseqdata AT yajimamasanao celdaabayesianmodeltoperformcoclusteringofgenesintomodulesandcellsintosubpopulationsusingsinglecellrnaseqdata AT campbelljoshuad celdaabayesianmodeltoperformcoclusteringofgenesintomodulesandcellsintosubpopulationsusingsinglecellrnaseqdata |