Cargando…

ZINBMM: a general mixture model for simultaneous clustering and gene selection using single-cell transcriptomic data

Clustering is a critical component of single-cell RNA sequencing (scRNA-seq) data analysis and can help reveal cell types and infer cell lineages. Despite considerable successes, there are few methods tailored to investigating cluster-specific genes contributing to cell heterogeneity, which can prom...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Yang, Wu, Mingcong, Ma, Shuangge, Wu, Mengyun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10496184/
https://www.ncbi.nlm.nih.gov/pubmed/37697330
http://dx.doi.org/10.1186/s13059-023-03046-0
Descripción
Sumario:Clustering is a critical component of single-cell RNA sequencing (scRNA-seq) data analysis and can help reveal cell types and infer cell lineages. Despite considerable successes, there are few methods tailored to investigating cluster-specific genes contributing to cell heterogeneity, which can promote biological understanding of cell heterogeneity. In this study, we propose a zero-inflated negative binomial mixture model (ZINBMM) that simultaneously achieves effective scRNA-seq data clustering and gene selection. ZINBMM conducts a systemic analysis on raw counts, accommodating both batch effects and dropout events. Simulations and the analysis of five scRNA-seq datasets demonstrate the practical applicability of ZINBMM. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s13059-023-03046-0.