
scConsensus: combining supervised and unsupervised clustering for cell type identification in single-cell RNA sequencing data


Bibliographic Details
Main authors: Ranjan, Bobby; Schmidt, Florian; Sun, Wenjie; Park, Jinyu; Honardoost, Mohammad Amin; Tan, Joanna; Arul Rayan, Nirmala; Prabhakar, Shyam
Format: Online Article Text
Language: English
Published: BioMed Central 2021
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8042883/
https://www.ncbi.nlm.nih.gov/pubmed/33845760
http://dx.doi.org/10.1186/s12859-021-04028-4
Description
Summary: BACKGROUND: Clustering is a crucial step in the analysis of single-cell data. Clusters identified in an unsupervised manner are typically annotated to cell types based on differentially expressed genes. In contrast, supervised methods use a reference panel of labelled transcriptomes to guide both clustering and cell type identification. Supervised and unsupervised clustering approaches each have distinct advantages and limitations, so they can lead to different but often complementary clustering results. A consensus approach leveraging the merits of both clustering paradigms could therefore result in a more accurate clustering and a more precise cell type annotation. RESULTS: We present scConsensus, an R framework for generating a consensus clustering by (1) integrating results from both unsupervised and supervised approaches and (2) refining the consensus clusters using differentially expressed genes. The value of our approach is demonstrated on several existing single-cell RNA sequencing datasets, including data from sorted PBMC sub-populations. CONCLUSIONS: scConsensus combines the merits of unsupervised and supervised approaches to partition cells with better cluster separation and homogeneity, thereby increasing our confidence in detecting distinct cell types. scConsensus is implemented in R and is freely available on GitHub at https://github.com/prabhakarlab/scConsensus. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12859-021-04028-4.
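
Illustrative sketch: the idea behind step (1) of the summary, integrating an unsupervised and a supervised labelling of the same cells, can be shown with a small Python example. This is not the scConsensus implementation (which is an R package) and it does not reproduce its actual rules. The function name `consensus_labels`, the `min_cells` threshold, and the toy labels are assumptions made for illustration only. Step (2), refinement using differentially expressed genes, is not shown.

```python
from collections import Counter


def consensus_labels(unsupervised, supervised, min_cells=2):
    """Cross two per-cell label vectors into consensus labels.

    Each cell is assigned the pair (unsupervised cluster, supervised label).
    Pairs supported by fewer than `min_cells` cells fall back to the
    unsupervised cluster alone. `min_cells` is a hypothetical parameter,
    not one taken from scConsensus.
    """
    if len(unsupervised) != len(supervised):
        raise ValueError("label vectors must have the same length")

    pairs = list(zip(unsupervised, supervised))
    counts = Counter(pairs)  # the cells of a contingency table, as a dict

    return [
        f"{u}_{s}" if counts[(u, s)] >= min_cells else str(u)
        for u, s in pairs
    ]


# Toy data: seven cells, two unsupervised clusters, three reference labels.
unsup = [0, 0, 0, 1, 1, 1, 1]
sup = ["T cell", "T cell", "B cell", "B cell", "B cell", "NK cell", "NK cell"]
labels = consensus_labels(unsup, sup)
print(labels)
```

In this toy case the unsupervised cluster 1 is split into two consensus groups because the supervised labels disagree within it, while the single cell labelled "B cell" in cluster 0 is too rare to form its own group and keeps the cluster label.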