Cargando…

Celda: a Bayesian model to perform co-clustering of genes into modules and cells into subpopulations using single-cell RNA-seq data

Single-cell RNA-seq (scRNA-seq) has emerged as a powerful technique to quantify gene expression in individual cells and to elucidate the molecular and cellular building blocks of complex tissues. We developed a novel Bayesian hierarchical model called Cellular Latent Dirichlet Allocation (Celda) to...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Zhe, Yang, Shiyi, Koga, Yusuke, Corbett, Sean E, Shea, Conor V, Johnson, W Evan, Yajima, Masanao, Campbell, Joshua D
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9469931/
https://www.ncbi.nlm.nih.gov/pubmed/36110899
http://dx.doi.org/10.1093/nargab/lqac066
_version_ 1784788740716101632
author Wang, Zhe
Yang, Shiyi
Koga, Yusuke
Corbett, Sean E
Shea, Conor V
Johnson, W Evan
Yajima, Masanao
Campbell, Joshua D
author_facet Wang, Zhe
Yang, Shiyi
Koga, Yusuke
Corbett, Sean E
Shea, Conor V
Johnson, W Evan
Yajima, Masanao
Campbell, Joshua D
author_sort Wang, Zhe
collection PubMed
description Single-cell RNA-seq (scRNA-seq) has emerged as a powerful technique to quantify gene expression in individual cells and to elucidate the molecular and cellular building blocks of complex tissues. We developed a novel Bayesian hierarchical model called Cellular Latent Dirichlet Allocation (Celda) to perform co-clustering of genes into transcriptional modules and cells into subpopulations. Celda can quantify the probabilistic contribution of each gene to each module, each module to each cell population and each cell population to each sample. In a peripheral blood mononuclear cell dataset, Celda identified a subpopulation of proliferating T cells and a plasma cell which were missed by two other common single-cell workflows. Celda also identified transcriptional modules that could be used to characterize unique and shared biological programs across cell types. Finally, Celda outperformed other approaches for clustering genes into modules on simulated data. Celda presents a novel method for characterizing transcriptional programs and cellular heterogeneity in scRNA-seq data.
format Online
Article
Text
id pubmed-9469931
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-94699312022-09-14 Celda: a Bayesian model to perform co-clustering of genes into modules and cells into subpopulations using single-cell RNA-seq data Wang, Zhe Yang, Shiyi Koga, Yusuke Corbett, Sean E Shea, Conor V Johnson, W Evan Yajima, Masanao Campbell, Joshua D NAR Genom Bioinform Standard Article Single-cell RNA-seq (scRNA-seq) has emerged as a powerful technique to quantify gene expression in individual cells and to elucidate the molecular and cellular building blocks of complex tissues. We developed a novel Bayesian hierarchical model called Cellular Latent Dirichlet Allocation (Celda) to perform co-clustering of genes into transcriptional modules and cells into subpopulations. Celda can quantify the probabilistic contribution of each gene to each module, each module to each cell population and each cell population to each sample. In a peripheral blood mononuclear cell dataset, Celda identified a subpopulation of proliferating T cells and a plasma cell which were missed by two other common single-cell workflows. Celda also identified transcriptional modules that could be used to characterize unique and shared biological programs across cell types. Finally, Celda outperformed other approaches for clustering genes into modules on simulated data. Celda presents a novel method for characterizing transcriptional programs and cellular heterogeneity in scRNA-seq data. Oxford University Press 2022-09-13 /pmc/articles/PMC9469931/ /pubmed/36110899 http://dx.doi.org/10.1093/nargab/lqac066 Text en © The Author(s) 2022. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Standard Article
Wang, Zhe
Yang, Shiyi
Koga, Yusuke
Corbett, Sean E
Shea, Conor V
Johnson, W Evan
Yajima, Masanao
Campbell, Joshua D
Celda: a Bayesian model to perform co-clustering of genes into modules and cells into subpopulations using single-cell RNA-seq data
title Celda: a Bayesian model to perform co-clustering of genes into modules and cells into subpopulations using single-cell RNA-seq data
title_full Celda: a Bayesian model to perform co-clustering of genes into modules and cells into subpopulations using single-cell RNA-seq data
title_fullStr Celda: a Bayesian model to perform co-clustering of genes into modules and cells into subpopulations using single-cell RNA-seq data
title_full_unstemmed Celda: a Bayesian model to perform co-clustering of genes into modules and cells into subpopulations using single-cell RNA-seq data
title_short Celda: a Bayesian model to perform co-clustering of genes into modules and cells into subpopulations using single-cell RNA-seq data
title_sort celda: a bayesian model to perform co-clustering of genes into modules and cells into subpopulations using single-cell rna-seq data
topic Standard Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9469931/
https://www.ncbi.nlm.nih.gov/pubmed/36110899
http://dx.doi.org/10.1093/nargab/lqac066
work_keys_str_mv AT wangzhe celdaabayesianmodeltoperformcoclusteringofgenesintomodulesandcellsintosubpopulationsusingsinglecellrnaseqdata
AT yangshiyi celdaabayesianmodeltoperformcoclusteringofgenesintomodulesandcellsintosubpopulationsusingsinglecellrnaseqdata
AT kogayusuke celdaabayesianmodeltoperformcoclusteringofgenesintomodulesandcellsintosubpopulationsusingsinglecellrnaseqdata
AT corbettseane celdaabayesianmodeltoperformcoclusteringofgenesintomodulesandcellsintosubpopulationsusingsinglecellrnaseqdata
AT sheaconorv celdaabayesianmodeltoperformcoclusteringofgenesintomodulesandcellsintosubpopulationsusingsinglecellrnaseqdata
AT johnsonwevan celdaabayesianmodeltoperformcoclusteringofgenesintomodulesandcellsintosubpopulationsusingsinglecellrnaseqdata
AT yajimamasanao celdaabayesianmodeltoperformcoclusteringofgenesintomodulesandcellsintosubpopulationsusingsinglecellrnaseqdata
AT campbelljoshuad celdaabayesianmodeltoperformcoclusteringofgenesintomodulesandcellsintosubpopulationsusingsinglecellrnaseqdata