Cargando…

A Comparison for Dimensionality Reduction Methods of Single-Cell RNA-seq Data

Single-cell RNA sequencing (scRNA-seq) is a high-throughput sequencing technology performed at the level of an individual cell, which can have a potential to understand cellular heterogeneity. However, scRNA-seq data are high-dimensional, noisy, and sparse data. Dimension reduction is an important s...

Descripción completa

Detalles Bibliográficos
Autores principales: Xiang, Ruizhi, Wang, Wencan, Yang, Lei, Wang, Shiyuan, Xu, Chaohan, Chen, Xiaowen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8021860/
https://www.ncbi.nlm.nih.gov/pubmed/33833778
http://dx.doi.org/10.3389/fgene.2021.646936
_version_ 1783674822707904512
author Xiang, Ruizhi
Wang, Wencan
Yang, Lei
Wang, Shiyuan
Xu, Chaohan
Chen, Xiaowen
author_facet Xiang, Ruizhi
Wang, Wencan
Yang, Lei
Wang, Shiyuan
Xu, Chaohan
Chen, Xiaowen
author_sort Xiang, Ruizhi
collection PubMed
description Single-cell RNA sequencing (scRNA-seq) is a high-throughput sequencing technology performed at the level of an individual cell, which can have a potential to understand cellular heterogeneity. However, scRNA-seq data are high-dimensional, noisy, and sparse data. Dimension reduction is an important step in downstream analysis of scRNA-seq. Therefore, several dimension reduction methods have been developed. We developed a strategy to evaluate the stability, accuracy, and computing cost of 10 dimensionality reduction methods using 30 simulation datasets and five real datasets. Additionally, we investigated the sensitivity of all the methods to hyperparameter tuning and gave users appropriate suggestions. We found that t-distributed stochastic neighbor embedding (t-SNE) yielded the best overall performance with the highest accuracy and computing cost. Meanwhile, uniform manifold approximation and projection (UMAP) exhibited the highest stability, as well as moderate accuracy and the second highest computing cost. UMAP well preserves the original cohesion and separation of cell populations. In addition, it is worth noting that users need to set the hyperparameters according to the specific situation before using the dimensionality reduction methods based on non-linear model and neural network.
format Online
Article
Text
id pubmed-8021860
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-80218602021-04-07 A Comparison for Dimensionality Reduction Methods of Single-Cell RNA-seq Data Xiang, Ruizhi Wang, Wencan Yang, Lei Wang, Shiyuan Xu, Chaohan Chen, Xiaowen Front Genet Genetics Single-cell RNA sequencing (scRNA-seq) is a high-throughput sequencing technology performed at the level of an individual cell, which can have a potential to understand cellular heterogeneity. However, scRNA-seq data are high-dimensional, noisy, and sparse data. Dimension reduction is an important step in downstream analysis of scRNA-seq. Therefore, several dimension reduction methods have been developed. We developed a strategy to evaluate the stability, accuracy, and computing cost of 10 dimensionality reduction methods using 30 simulation datasets and five real datasets. Additionally, we investigated the sensitivity of all the methods to hyperparameter tuning and gave users appropriate suggestions. We found that t-distributed stochastic neighbor embedding (t-SNE) yielded the best overall performance with the highest accuracy and computing cost. Meanwhile, uniform manifold approximation and projection (UMAP) exhibited the highest stability, as well as moderate accuracy and the second highest computing cost. UMAP well preserves the original cohesion and separation of cell populations. In addition, it is worth noting that users need to set the hyperparameters according to the specific situation before using the dimensionality reduction methods based on non-linear model and neural network. Frontiers Media S.A. 2021-03-23 /pmc/articles/PMC8021860/ /pubmed/33833778 http://dx.doi.org/10.3389/fgene.2021.646936 Text en Copyright © 2021 Xiang, Wang, Yang, Wang, Xu and Chen. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Xiang, Ruizhi
Wang, Wencan
Yang, Lei
Wang, Shiyuan
Xu, Chaohan
Chen, Xiaowen
A Comparison for Dimensionality Reduction Methods of Single-Cell RNA-seq Data
title A Comparison for Dimensionality Reduction Methods of Single-Cell RNA-seq Data
title_full A Comparison for Dimensionality Reduction Methods of Single-Cell RNA-seq Data
title_fullStr A Comparison for Dimensionality Reduction Methods of Single-Cell RNA-seq Data
title_full_unstemmed A Comparison for Dimensionality Reduction Methods of Single-Cell RNA-seq Data
title_short A Comparison for Dimensionality Reduction Methods of Single-Cell RNA-seq Data
title_sort comparison for dimensionality reduction methods of single-cell rna-seq data
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8021860/
https://www.ncbi.nlm.nih.gov/pubmed/33833778
http://dx.doi.org/10.3389/fgene.2021.646936
work_keys_str_mv AT xiangruizhi acomparisonfordimensionalityreductionmethodsofsinglecellrnaseqdata
AT wangwencan acomparisonfordimensionalityreductionmethodsofsinglecellrnaseqdata
AT yanglei acomparisonfordimensionalityreductionmethodsofsinglecellrnaseqdata
AT wangshiyuan acomparisonfordimensionalityreductionmethodsofsinglecellrnaseqdata
AT xuchaohan acomparisonfordimensionalityreductionmethodsofsinglecellrnaseqdata
AT chenxiaowen acomparisonfordimensionalityreductionmethodsofsinglecellrnaseqdata
AT xiangruizhi comparisonfordimensionalityreductionmethodsofsinglecellrnaseqdata
AT wangwencan comparisonfordimensionalityreductionmethodsofsinglecellrnaseqdata
AT yanglei comparisonfordimensionalityreductionmethodsofsinglecellrnaseqdata
AT wangshiyuan comparisonfordimensionalityreductionmethodsofsinglecellrnaseqdata
AT xuchaohan comparisonfordimensionalityreductionmethodsofsinglecellrnaseqdata
AT chenxiaowen comparisonfordimensionalityreductionmethodsofsinglecellrnaseqdata