Cargando…
Evaluation of Cell Type Annotation R Packages on Single-cell RNA-seq Data
Annotating cell types is a critical step in single-cell RNA sequencing (scRNA-seq) data analysis. Some supervised or semi-supervised classification methods have recently emerged to enable automated cell type identification. However, comprehensive evaluations of these methods are lacking. Moreover, i...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8602772/ https://www.ncbi.nlm.nih.gov/pubmed/33359678 http://dx.doi.org/10.1016/j.gpb.2020.07.004 |
_version_ | 1784601631650742272 |
---|---|
author | Huang, Qianhui Liu, Yu Du, Yuheng Garmire, Lana X. |
author_facet | Huang, Qianhui Liu, Yu Du, Yuheng Garmire, Lana X. |
author_sort | Huang, Qianhui |
collection | PubMed |
description | Annotating cell types is a critical step in single-cell RNA sequencing (scRNA-seq) data analysis. Some supervised or semi-supervised classification methods have recently emerged to enable automated cell type identification. However, comprehensive evaluations of these methods are lacking. Moreover, it is not clear whether some classification methods originally designed for analyzing other bulk omics data are adaptable to scRNA-seq analysis. In this study, we evaluated ten cell type annotation methods publicly available as R packages. Eight of them are popular methods developed specifically for single-cell research, including Seurat, scmap, SingleR, CHETAH, SingleCellNet, scID, Garnett, and SCINA. The other two methods were repurposed from deconvoluting DNA methylation data, i.e., linear constrained projection (CP) and robust partial correlations (RPC). We conducted systematic comparisons on a wide variety of public scRNA-seq datasets as well as simulation data. We assessed the accuracy through intra-dataset and inter-dataset predictions; the robustness over practical challenges such as gene filtering, high similarity among cell types, and increased cell type classes; as well as the detection of rare and unknown cell types. Overall, methods such as Seurat, SingleR, CP, RPC, and SingleCellNet performed well, with Seurat being the best at annotating major cell types. Additionally, Seurat, SingleR, CP, and RPC were more robust against downsampling. However, Seurat did have a major drawback at predicting rare cell populations, and it was suboptimal at differentiating cell types highly similar to each other, compared to SingleR and RPC. All the code and data are available from https://github.com/qianhuiSenn/scRNA_cell_deconv_benchmark. |
format | Online Article Text |
id | pubmed-8602772 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Elsevier |
record_format | MEDLINE/PubMed |
spelling | pubmed-86027722021-11-24 Evaluation of Cell Type Annotation R Packages on Single-cell RNA-seq Data Huang, Qianhui Liu, Yu Du, Yuheng Garmire, Lana X. Genomics Proteomics Bioinformatics Original Research Annotating cell types is a critical step in single-cell RNA sequencing (scRNA-seq) data analysis. Some supervised or semi-supervised classification methods have recently emerged to enable automated cell type identification. However, comprehensive evaluations of these methods are lacking. Moreover, it is not clear whether some classification methods originally designed for analyzing other bulk omics data are adaptable to scRNA-seq analysis. In this study, we evaluated ten cell type annotation methods publicly available as R packages. Eight of them are popular methods developed specifically for single-cell research, including Seurat, scmap, SingleR, CHETAH, SingleCellNet, scID, Garnett, and SCINA. The other two methods were repurposed from deconvoluting DNA methylation data, i.e., linear constrained projection (CP) and robust partial correlations (RPC). We conducted systematic comparisons on a wide variety of public scRNA-seq datasets as well as simulation data. We assessed the accuracy through intra-dataset and inter-dataset predictions; the robustness over practical challenges such as gene filtering, high similarity among cell types, and increased cell type classes; as well as the detection of rare and unknown cell types. Overall, methods such as Seurat, SingleR, CP, RPC, and SingleCellNet performed well, with Seurat being the best at annotating major cell types. Additionally, Seurat, SingleR, CP, and RPC were more robust against downsampling. However, Seurat did have a major drawback at predicting rare cell populations, and it was suboptimal at differentiating cell types highly similar to each other, compared to SingleR and RPC. All the code and data are available from https://github.com/qianhuiSenn/scRNA_cell_deconv_benchmark. Elsevier 2021-04 2020-12-24 /pmc/articles/PMC8602772/ /pubmed/33359678 http://dx.doi.org/10.1016/j.gpb.2020.07.004 Text en © 2021 Beijing Institute of Genomics https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Original Research Huang, Qianhui Liu, Yu Du, Yuheng Garmire, Lana X. Evaluation of Cell Type Annotation R Packages on Single-cell RNA-seq Data |
title | Evaluation of Cell Type Annotation R Packages on Single-cell RNA-seq Data |
title_full | Evaluation of Cell Type Annotation R Packages on Single-cell RNA-seq Data |
title_fullStr | Evaluation of Cell Type Annotation R Packages on Single-cell RNA-seq Data |
title_full_unstemmed | Evaluation of Cell Type Annotation R Packages on Single-cell RNA-seq Data |
title_short | Evaluation of Cell Type Annotation R Packages on Single-cell RNA-seq Data |
title_sort | evaluation of cell type annotation r packages on single-cell rna-seq data |
topic | Original Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8602772/ https://www.ncbi.nlm.nih.gov/pubmed/33359678 http://dx.doi.org/10.1016/j.gpb.2020.07.004 |
work_keys_str_mv | AT huangqianhui evaluationofcelltypeannotationrpackagesonsinglecellrnaseqdata AT liuyu evaluationofcelltypeannotationrpackagesonsinglecellrnaseqdata AT duyuheng evaluationofcelltypeannotationrpackagesonsinglecellrnaseqdata AT garmirelanax evaluationofcelltypeannotationrpackagesonsinglecellrnaseqdata |