Cargando…
Evaluation of single-cell RNA-seq clustering algorithms on cancer tumor datasets
Tumors are complex biological entities that comprise cell types of different origins, with different mutational profiles and different patterns of transcriptional dysregulation. The exploration of data related to cancer biology requires careful analytical methods to reflect the heterogeneity of cell...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Research Network of Computational and Structural Biotechnology
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9677128/ https://www.ncbi.nlm.nih.gov/pubmed/36420149 http://dx.doi.org/10.1016/j.csbj.2022.10.029 |
_version_ | 1784833744588242944 |
---|---|
author | Mahalanabis, Alaina Turinsky, Andrei L. Husić, Mia Christensen, Erik Luo, Ping Naidas, Alaine Brudno, Michael Pugh, Trevor Ramani, Arun K. Shooshtari, Parisa |
author_facet | Mahalanabis, Alaina Turinsky, Andrei L. Husić, Mia Christensen, Erik Luo, Ping Naidas, Alaine Brudno, Michael Pugh, Trevor Ramani, Arun K. Shooshtari, Parisa |
author_sort | Mahalanabis, Alaina |
collection | PubMed |
description | Tumors are complex biological entities that comprise cell types of different origins, with different mutational profiles and different patterns of transcriptional dysregulation. The exploration of data related to cancer biology requires careful analytical methods to reflect the heterogeneity of cell populations in cancer samples. Single-cell techniques are now able to capture the transcriptional profiles of individual cells. However, the complexity of RNA-seq data, especially in cancer samples, makes it challenging to cluster single-cell profiles into groups that reflect the underlying cell types. We have developed a framework for a systematic examination of single-cell RNA-seq clustering algorithms for cancer data, which uses a range of well-established metrics to generate a unified quality score and algorithm ranking. To demonstrate this framework, we examined clustering performance of 15 different single-cell RNA-seq clustering algorithms on eight different cancer datasets. Our results suggest that the single-cell RNA-seq clustering algorithms fall into distinct groups by performance, with the highest clustering quality on non-malignant cells achieved by three algorithms: Seurat, bigSCale and Cell Ranger. However, for malignant cells, two additional algorithms often reach a better performance, namely Monocle and SC3. Their ability to detect known rare cell types was also among the best, along with Seurat. Our approach and results can be used by a broad audience of practitioners who analyze single-cell transcriptomic data in cancer research. |
format | Online Article Text |
id | pubmed-9677128 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Research Network of Computational and Structural Biotechnology |
record_format | MEDLINE/PubMed |
spelling | pubmed-96771282022-11-22 Evaluation of single-cell RNA-seq clustering algorithms on cancer tumor datasets Mahalanabis, Alaina Turinsky, Andrei L. Husić, Mia Christensen, Erik Luo, Ping Naidas, Alaine Brudno, Michael Pugh, Trevor Ramani, Arun K. Shooshtari, Parisa Comput Struct Biotechnol J Research Article Tumors are complex biological entities that comprise cell types of different origins, with different mutational profiles and different patterns of transcriptional dysregulation. The exploration of data related to cancer biology requires careful analytical methods to reflect the heterogeneity of cell populations in cancer samples. Single-cell techniques are now able to capture the transcriptional profiles of individual cells. However, the complexity of RNA-seq data, especially in cancer samples, makes it challenging to cluster single-cell profiles into groups that reflect the underlying cell types. We have developed a framework for a systematic examination of single-cell RNA-seq clustering algorithms for cancer data, which uses a range of well-established metrics to generate a unified quality score and algorithm ranking. To demonstrate this framework, we examined clustering performance of 15 different single-cell RNA-seq clustering algorithms on eight different cancer datasets. Our results suggest that the single-cell RNA-seq clustering algorithms fall into distinct groups by performance, with the highest clustering quality on non-malignant cells achieved by three algorithms: Seurat, bigSCale and Cell Ranger. However, for malignant cells, two additional algorithms often reach a better performance, namely Monocle and SC3. Their ability to detect known rare cell types was also among the best, along with Seurat. Our approach and results can be used by a broad audience of practitioners who analyze single-cell transcriptomic data in cancer research. Research Network of Computational and Structural Biotechnology 2022-10-26 /pmc/articles/PMC9677128/ /pubmed/36420149 http://dx.doi.org/10.1016/j.csbj.2022.10.029 Text en © 2022 Published by Elsevier B.V. on behalf of Research Network of Computational and Structural Biotechnology. https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). |
spellingShingle | Research Article Mahalanabis, Alaina Turinsky, Andrei L. Husić, Mia Christensen, Erik Luo, Ping Naidas, Alaine Brudno, Michael Pugh, Trevor Ramani, Arun K. Shooshtari, Parisa Evaluation of single-cell RNA-seq clustering algorithms on cancer tumor datasets |
title | Evaluation of single-cell RNA-seq clustering algorithms on cancer tumor datasets |
title_full | Evaluation of single-cell RNA-seq clustering algorithms on cancer tumor datasets |
title_fullStr | Evaluation of single-cell RNA-seq clustering algorithms on cancer tumor datasets |
title_full_unstemmed | Evaluation of single-cell RNA-seq clustering algorithms on cancer tumor datasets |
title_short | Evaluation of single-cell RNA-seq clustering algorithms on cancer tumor datasets |
title_sort | evaluation of single-cell rna-seq clustering algorithms on cancer tumor datasets |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9677128/ https://www.ncbi.nlm.nih.gov/pubmed/36420149 http://dx.doi.org/10.1016/j.csbj.2022.10.029 |
work_keys_str_mv | AT mahalanabisalaina evaluationofsinglecellrnaseqclusteringalgorithmsoncancertumordatasets AT turinskyandreil evaluationofsinglecellrnaseqclusteringalgorithmsoncancertumordatasets AT husicmia evaluationofsinglecellrnaseqclusteringalgorithmsoncancertumordatasets AT christensenerik evaluationofsinglecellrnaseqclusteringalgorithmsoncancertumordatasets AT luoping evaluationofsinglecellrnaseqclusteringalgorithmsoncancertumordatasets AT naidasalaine evaluationofsinglecellrnaseqclusteringalgorithmsoncancertumordatasets AT brudnomichael evaluationofsinglecellrnaseqclusteringalgorithmsoncancertumordatasets AT pughtrevor evaluationofsinglecellrnaseqclusteringalgorithmsoncancertumordatasets AT ramaniarunk evaluationofsinglecellrnaseqclusteringalgorithmsoncancertumordatasets AT shooshtariparisa evaluationofsinglecellrnaseqclusteringalgorithmsoncancertumordatasets |