Cargando…

Evaluation of Cell Type Annotation R Packages on Single-cell RNA-seq Data

Annotating cell types is a critical step in single-cell RNA sequencing (scRNA-seq) data analysis. Some supervised or semi-supervised classification methods have recently emerged to enable automated cell type identification. However, comprehensive evaluations of these methods are lacking. Moreover, i...

Descripción completa

Detalles Bibliográficos
Autores principales: Huang, Qianhui, Liu, Yu, Du, Yuheng, Garmire, Lana X.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8602772/
https://www.ncbi.nlm.nih.gov/pubmed/33359678
http://dx.doi.org/10.1016/j.gpb.2020.07.004
_version_ 1784601631650742272
author Huang, Qianhui
Liu, Yu
Du, Yuheng
Garmire, Lana X.
author_facet Huang, Qianhui
Liu, Yu
Du, Yuheng
Garmire, Lana X.
author_sort Huang, Qianhui
collection PubMed
description Annotating cell types is a critical step in single-cell RNA sequencing (scRNA-seq) data analysis. Some supervised or semi-supervised classification methods have recently emerged to enable automated cell type identification. However, comprehensive evaluations of these methods are lacking. Moreover, it is not clear whether some classification methods originally designed for analyzing other bulk omics data are adaptable to scRNA-seq analysis. In this study, we evaluated ten cell type annotation methods publicly available as R packages. Eight of them are popular methods developed specifically for single-cell research, including Seurat, scmap, SingleR, CHETAH, SingleCellNet, scID, Garnett, and SCINA. The other two methods were repurposed from deconvoluting DNA methylation data, i.e., linear constrained projection (CP) and robust partial correlations (RPC). We conducted systematic comparisons on a wide variety of public scRNA-seq datasets as well as simulation data. We assessed the accuracy through intra-dataset and inter-dataset predictions; the robustness over practical challenges such as gene filtering, high similarity among cell types, and increased cell type classes; as well as the detection of rare and unknown cell types. Overall, methods such as Seurat, SingleR, CP, RPC, and SingleCellNet performed well, with Seurat being the best at annotating major cell types. Additionally, Seurat, SingleR, CP, and RPC were more robust against downsampling. However, Seurat did have a major drawback at predicting rare cell populations, and it was suboptimal at differentiating cell types highly similar to each other, compared to SingleR and RPC. All the code and data are available from https://github.com/qianhuiSenn/scRNA_cell_deconv_benchmark.
format Online
Article
Text
id pubmed-8602772
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-86027722021-11-24 Evaluation of Cell Type Annotation R Packages on Single-cell RNA-seq Data Huang, Qianhui Liu, Yu Du, Yuheng Garmire, Lana X. Genomics Proteomics Bioinformatics Original Research Annotating cell types is a critical step in single-cell RNA sequencing (scRNA-seq) data analysis. Some supervised or semi-supervised classification methods have recently emerged to enable automated cell type identification. However, comprehensive evaluations of these methods are lacking. Moreover, it is not clear whether some classification methods originally designed for analyzing other bulk omics data are adaptable to scRNA-seq analysis. In this study, we evaluated ten cell type annotation methods publicly available as R packages. Eight of them are popular methods developed specifically for single-cell research, including Seurat, scmap, SingleR, CHETAH, SingleCellNet, scID, Garnett, and SCINA. The other two methods were repurposed from deconvoluting DNA methylation data, i.e., linear constrained projection (CP) and robust partial correlations (RPC). We conducted systematic comparisons on a wide variety of public scRNA-seq datasets as well as simulation data. We assessed the accuracy through intra-dataset and inter-dataset predictions; the robustness over practical challenges such as gene filtering, high similarity among cell types, and increased cell type classes; as well as the detection of rare and unknown cell types. Overall, methods such as Seurat, SingleR, CP, RPC, and SingleCellNet performed well, with Seurat being the best at annotating major cell types. Additionally, Seurat, SingleR, CP, and RPC were more robust against downsampling. However, Seurat did have a major drawback at predicting rare cell populations, and it was suboptimal at differentiating cell types highly similar to each other, compared to SingleR and RPC. All the code and data are available from https://github.com/qianhuiSenn/scRNA_cell_deconv_benchmark. Elsevier 2021-04 2020-12-24 /pmc/articles/PMC8602772/ /pubmed/33359678 http://dx.doi.org/10.1016/j.gpb.2020.07.004 Text en © 2021 Beijing Institute of Genomics https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Original Research
Huang, Qianhui
Liu, Yu
Du, Yuheng
Garmire, Lana X.
Evaluation of Cell Type Annotation R Packages on Single-cell RNA-seq Data
title Evaluation of Cell Type Annotation R Packages on Single-cell RNA-seq Data
title_full Evaluation of Cell Type Annotation R Packages on Single-cell RNA-seq Data
title_fullStr Evaluation of Cell Type Annotation R Packages on Single-cell RNA-seq Data
title_full_unstemmed Evaluation of Cell Type Annotation R Packages on Single-cell RNA-seq Data
title_short Evaluation of Cell Type Annotation R Packages on Single-cell RNA-seq Data
title_sort evaluation of cell type annotation r packages on single-cell rna-seq data
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8602772/
https://www.ncbi.nlm.nih.gov/pubmed/33359678
http://dx.doi.org/10.1016/j.gpb.2020.07.004
work_keys_str_mv AT huangqianhui evaluationofcelltypeannotationrpackagesonsinglecellrnaseqdata
AT liuyu evaluationofcelltypeannotationrpackagesonsinglecellrnaseqdata
AT duyuheng evaluationofcelltypeannotationrpackagesonsinglecellrnaseqdata
AT garmirelanax evaluationofcelltypeannotationrpackagesonsinglecellrnaseqdata