Cargando…

Evaluation of single-cell RNA-seq clustering algorithms on cancer tumor datasets

Tumors are complex biological entities that comprise cell types of different origins, with different mutational profiles and different patterns of transcriptional dysregulation. The exploration of data related to cancer biology requires careful analytical methods to reflect the heterogeneity of cell...

Descripción completa

Detalles Bibliográficos
Autores principales: Mahalanabis, Alaina, Turinsky, Andrei L., Husić, Mia, Christensen, Erik, Luo, Ping, Naidas, Alaine, Brudno, Michael, Pugh, Trevor, Ramani, Arun K., Shooshtari, Parisa
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Research Network of Computational and Structural Biotechnology 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9677128/
https://www.ncbi.nlm.nih.gov/pubmed/36420149
http://dx.doi.org/10.1016/j.csbj.2022.10.029
_version_ 1784833744588242944
author Mahalanabis, Alaina
Turinsky, Andrei L.
Husić, Mia
Christensen, Erik
Luo, Ping
Naidas, Alaine
Brudno, Michael
Pugh, Trevor
Ramani, Arun K.
Shooshtari, Parisa
author_facet Mahalanabis, Alaina
Turinsky, Andrei L.
Husić, Mia
Christensen, Erik
Luo, Ping
Naidas, Alaine
Brudno, Michael
Pugh, Trevor
Ramani, Arun K.
Shooshtari, Parisa
author_sort Mahalanabis, Alaina
collection PubMed
description Tumors are complex biological entities that comprise cell types of different origins, with different mutational profiles and different patterns of transcriptional dysregulation. The exploration of data related to cancer biology requires careful analytical methods to reflect the heterogeneity of cell populations in cancer samples. Single-cell techniques are now able to capture the transcriptional profiles of individual cells. However, the complexity of RNA-seq data, especially in cancer samples, makes it challenging to cluster single-cell profiles into groups that reflect the underlying cell types. We have developed a framework for a systematic examination of single-cell RNA-seq clustering algorithms for cancer data, which uses a range of well-established metrics to generate a unified quality score and algorithm ranking. To demonstrate this framework, we examined clustering performance of 15 different single-cell RNA-seq clustering algorithms on eight different cancer datasets. Our results suggest that the single-cell RNA-seq clustering algorithms fall into distinct groups by performance, with the highest clustering quality on non-malignant cells achieved by three algorithms: Seurat, bigSCale and Cell Ranger. However, for malignant cells, two additional algorithms often reach a better performance, namely Monocle and SC3. Their ability to detect known rare cell types was also among the best, along with Seurat. Our approach and results can be used by a broad audience of practitioners who analyze single-cell transcriptomic data in cancer research.
format Online
Article
Text
id pubmed-9677128
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Research Network of Computational and Structural Biotechnology
record_format MEDLINE/PubMed
spelling pubmed-96771282022-11-22 Evaluation of single-cell RNA-seq clustering algorithms on cancer tumor datasets Mahalanabis, Alaina Turinsky, Andrei L. Husić, Mia Christensen, Erik Luo, Ping Naidas, Alaine Brudno, Michael Pugh, Trevor Ramani, Arun K. Shooshtari, Parisa Comput Struct Biotechnol J Research Article Tumors are complex biological entities that comprise cell types of different origins, with different mutational profiles and different patterns of transcriptional dysregulation. The exploration of data related to cancer biology requires careful analytical methods to reflect the heterogeneity of cell populations in cancer samples. Single-cell techniques are now able to capture the transcriptional profiles of individual cells. However, the complexity of RNA-seq data, especially in cancer samples, makes it challenging to cluster single-cell profiles into groups that reflect the underlying cell types. We have developed a framework for a systematic examination of single-cell RNA-seq clustering algorithms for cancer data, which uses a range of well-established metrics to generate a unified quality score and algorithm ranking. To demonstrate this framework, we examined clustering performance of 15 different single-cell RNA-seq clustering algorithms on eight different cancer datasets. Our results suggest that the single-cell RNA-seq clustering algorithms fall into distinct groups by performance, with the highest clustering quality on non-malignant cells achieved by three algorithms: Seurat, bigSCale and Cell Ranger. However, for malignant cells, two additional algorithms often reach a better performance, namely Monocle and SC3. Their ability to detect known rare cell types was also among the best, along with Seurat. Our approach and results can be used by a broad audience of practitioners who analyze single-cell transcriptomic data in cancer research. Research Network of Computational and Structural Biotechnology 2022-10-26 /pmc/articles/PMC9677128/ /pubmed/36420149 http://dx.doi.org/10.1016/j.csbj.2022.10.029 Text en © 2022 Published by Elsevier B.V. on behalf of Research Network of Computational and Structural Biotechnology. https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Research Article
Mahalanabis, Alaina
Turinsky, Andrei L.
Husić, Mia
Christensen, Erik
Luo, Ping
Naidas, Alaine
Brudno, Michael
Pugh, Trevor
Ramani, Arun K.
Shooshtari, Parisa
Evaluation of single-cell RNA-seq clustering algorithms on cancer tumor datasets
title Evaluation of single-cell RNA-seq clustering algorithms on cancer tumor datasets
title_full Evaluation of single-cell RNA-seq clustering algorithms on cancer tumor datasets
title_fullStr Evaluation of single-cell RNA-seq clustering algorithms on cancer tumor datasets
title_full_unstemmed Evaluation of single-cell RNA-seq clustering algorithms on cancer tumor datasets
title_short Evaluation of single-cell RNA-seq clustering algorithms on cancer tumor datasets
title_sort evaluation of single-cell rna-seq clustering algorithms on cancer tumor datasets
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9677128/
https://www.ncbi.nlm.nih.gov/pubmed/36420149
http://dx.doi.org/10.1016/j.csbj.2022.10.029
work_keys_str_mv AT mahalanabisalaina evaluationofsinglecellrnaseqclusteringalgorithmsoncancertumordatasets
AT turinskyandreil evaluationofsinglecellrnaseqclusteringalgorithmsoncancertumordatasets
AT husicmia evaluationofsinglecellrnaseqclusteringalgorithmsoncancertumordatasets
AT christensenerik evaluationofsinglecellrnaseqclusteringalgorithmsoncancertumordatasets
AT luoping evaluationofsinglecellrnaseqclusteringalgorithmsoncancertumordatasets
AT naidasalaine evaluationofsinglecellrnaseqclusteringalgorithmsoncancertumordatasets
AT brudnomichael evaluationofsinglecellrnaseqclusteringalgorithmsoncancertumordatasets
AT pughtrevor evaluationofsinglecellrnaseqclusteringalgorithmsoncancertumordatasets
AT ramaniarunk evaluationofsinglecellrnaseqclusteringalgorithmsoncancertumordatasets
AT shooshtariparisa evaluationofsinglecellrnaseqclusteringalgorithmsoncancertumordatasets