Cargando…

Accuracy, robustness and scalability of dimensionality reduction methods for single-cell RNA-seq analysis

BACKGROUND: Dimensionality reduction is an indispensable analytic component for many areas of single-cell RNA sequencing (scRNA-seq) data analysis. Proper dimensionality reduction can allow for effective noise removal and facilitate many downstream analyses that include cell clustering and lineage r...

Descripción completa

Detalles Bibliográficos
Autores principales: Sun, Shiquan, Zhu, Jiaqiang, Ma, Ying, Zhou, Xiang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6902413/
https://www.ncbi.nlm.nih.gov/pubmed/31823809
http://dx.doi.org/10.1186/s13059-019-1898-6
_version_ 1783477663658147840
author Sun, Shiquan
Zhu, Jiaqiang
Ma, Ying
Zhou, Xiang
author_facet Sun, Shiquan
Zhu, Jiaqiang
Ma, Ying
Zhou, Xiang
author_sort Sun, Shiquan
collection PubMed
description BACKGROUND: Dimensionality reduction is an indispensable analytic component for many areas of single-cell RNA sequencing (scRNA-seq) data analysis. Proper dimensionality reduction can allow for effective noise removal and facilitate many downstream analyses that include cell clustering and lineage reconstruction. Unfortunately, despite the critical importance of dimensionality reduction in scRNA-seq analysis and the vast number of dimensionality reduction methods developed for scRNA-seq studies, few comprehensive comparison studies have been performed to evaluate the effectiveness of different dimensionality reduction methods in scRNA-seq. RESULTS: We aim to fill this critical knowledge gap by providing a comparative evaluation of a variety of commonly used dimensionality reduction methods for scRNA-seq studies. Specifically, we compare 18 different dimensionality reduction methods on 30 publicly available scRNA-seq datasets that cover a range of sequencing techniques and sample sizes. We evaluate the performance of different dimensionality reduction methods for neighborhood preserving in terms of their ability to recover features of the original expression matrix, and for cell clustering and lineage reconstruction in terms of their accuracy and robustness. We also evaluate the computational scalability of different dimensionality reduction methods by recording their computational cost. CONCLUSIONS: Based on the comprehensive evaluation results, we provide important guidelines for choosing dimensionality reduction methods for scRNA-seq data analysis. We also provide all analysis scripts used in the present study at www.xzlab.org/reproduce.html.
format Online
Article
Text
id pubmed-6902413
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-69024132019-12-11 Accuracy, robustness and scalability of dimensionality reduction methods for single-cell RNA-seq analysis Sun, Shiquan Zhu, Jiaqiang Ma, Ying Zhou, Xiang Genome Biol Research BACKGROUND: Dimensionality reduction is an indispensable analytic component for many areas of single-cell RNA sequencing (scRNA-seq) data analysis. Proper dimensionality reduction can allow for effective noise removal and facilitate many downstream analyses that include cell clustering and lineage reconstruction. Unfortunately, despite the critical importance of dimensionality reduction in scRNA-seq analysis and the vast number of dimensionality reduction methods developed for scRNA-seq studies, few comprehensive comparison studies have been performed to evaluate the effectiveness of different dimensionality reduction methods in scRNA-seq. RESULTS: We aim to fill this critical knowledge gap by providing a comparative evaluation of a variety of commonly used dimensionality reduction methods for scRNA-seq studies. Specifically, we compare 18 different dimensionality reduction methods on 30 publicly available scRNA-seq datasets that cover a range of sequencing techniques and sample sizes. We evaluate the performance of different dimensionality reduction methods for neighborhood preserving in terms of their ability to recover features of the original expression matrix, and for cell clustering and lineage reconstruction in terms of their accuracy and robustness. We also evaluate the computational scalability of different dimensionality reduction methods by recording their computational cost. CONCLUSIONS: Based on the comprehensive evaluation results, we provide important guidelines for choosing dimensionality reduction methods for scRNA-seq data analysis. We also provide all analysis scripts used in the present study at www.xzlab.org/reproduce.html. BioMed Central 2019-12-10 /pmc/articles/PMC6902413/ /pubmed/31823809 http://dx.doi.org/10.1186/s13059-019-1898-6 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Sun, Shiquan
Zhu, Jiaqiang
Ma, Ying
Zhou, Xiang
Accuracy, robustness and scalability of dimensionality reduction methods for single-cell RNA-seq analysis
title Accuracy, robustness and scalability of dimensionality reduction methods for single-cell RNA-seq analysis
title_full Accuracy, robustness and scalability of dimensionality reduction methods for single-cell RNA-seq analysis
title_fullStr Accuracy, robustness and scalability of dimensionality reduction methods for single-cell RNA-seq analysis
title_full_unstemmed Accuracy, robustness and scalability of dimensionality reduction methods for single-cell RNA-seq analysis
title_short Accuracy, robustness and scalability of dimensionality reduction methods for single-cell RNA-seq analysis
title_sort accuracy, robustness and scalability of dimensionality reduction methods for single-cell rna-seq analysis
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6902413/
https://www.ncbi.nlm.nih.gov/pubmed/31823809
http://dx.doi.org/10.1186/s13059-019-1898-6
work_keys_str_mv AT sunshiquan accuracyrobustnessandscalabilityofdimensionalityreductionmethodsforsinglecellrnaseqanalysis
AT zhujiaqiang accuracyrobustnessandscalabilityofdimensionalityreductionmethodsforsinglecellrnaseqanalysis
AT maying accuracyrobustnessandscalabilityofdimensionalityreductionmethodsforsinglecellrnaseqanalysis
AT zhouxiang accuracyrobustnessandscalabilityofdimensionalityreductionmethodsforsinglecellrnaseqanalysis