Cargando…

SHARP: hyperfast and accurate processing of single-cell RNA-seq data via ensemble random projection

To process large-scale single-cell RNA-sequencing (scRNA-seq) data effectively without excessive distortion during dimension reduction, we present SHARP, an ensemble random projection-based algorithm that is scalable to clustering 10 million cells. Comprehensive benchmarking tests on 17 public scRNA...

Descripción completa

Detalles Bibliográficos
Autores principales: Wan, Shibiao, Kim, Junil, Won, Kyoung Jae
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7050522/
https://www.ncbi.nlm.nih.gov/pubmed/31992615
http://dx.doi.org/10.1101/gr.254557.119
_version_ 1783502625693499392
author Wan, Shibiao
Kim, Junil
Won, Kyoung Jae
author_facet Wan, Shibiao
Kim, Junil
Won, Kyoung Jae
author_sort Wan, Shibiao
collection PubMed
description To process large-scale single-cell RNA-sequencing (scRNA-seq) data effectively without excessive distortion during dimension reduction, we present SHARP, an ensemble random projection-based algorithm that is scalable to clustering 10 million cells. Comprehensive benchmarking tests on 17 public scRNA-seq data sets show that SHARP outperforms existing methods in terms of speed and accuracy. Particularly, for large-size data sets (more than 40,000 cells), SHARP runs faster than other competitors while maintaining high clustering accuracy and robustness. To the best of our knowledge, SHARP is the only R-based tool that is scalable to clustering scRNA-seq data with 10 million cells.
format Online
Article
Text
id pubmed-7050522
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Cold Spring Harbor Laboratory Press
record_format MEDLINE/PubMed
spelling pubmed-70505222020-08-01 SHARP: hyperfast and accurate processing of single-cell RNA-seq data via ensemble random projection Wan, Shibiao Kim, Junil Won, Kyoung Jae Genome Res Method To process large-scale single-cell RNA-sequencing (scRNA-seq) data effectively without excessive distortion during dimension reduction, we present SHARP, an ensemble random projection-based algorithm that is scalable to clustering 10 million cells. Comprehensive benchmarking tests on 17 public scRNA-seq data sets show that SHARP outperforms existing methods in terms of speed and accuracy. Particularly, for large-size data sets (more than 40,000 cells), SHARP runs faster than other competitors while maintaining high clustering accuracy and robustness. To the best of our knowledge, SHARP is the only R-based tool that is scalable to clustering scRNA-seq data with 10 million cells. Cold Spring Harbor Laboratory Press 2020-02 /pmc/articles/PMC7050522/ /pubmed/31992615 http://dx.doi.org/10.1101/gr.254557.119 Text en © 2020 Wan et al.; Published by Cold Spring Harbor Laboratory Press http://creativecommons.org/licenses/by-nc/4.0/ This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see http://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/.
spellingShingle Method
Wan, Shibiao
Kim, Junil
Won, Kyoung Jae
SHARP: hyperfast and accurate processing of single-cell RNA-seq data via ensemble random projection
title SHARP: hyperfast and accurate processing of single-cell RNA-seq data via ensemble random projection
title_full SHARP: hyperfast and accurate processing of single-cell RNA-seq data via ensemble random projection
title_fullStr SHARP: hyperfast and accurate processing of single-cell RNA-seq data via ensemble random projection
title_full_unstemmed SHARP: hyperfast and accurate processing of single-cell RNA-seq data via ensemble random projection
title_short SHARP: hyperfast and accurate processing of single-cell RNA-seq data via ensemble random projection
title_sort sharp: hyperfast and accurate processing of single-cell rna-seq data via ensemble random projection
topic Method
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7050522/
https://www.ncbi.nlm.nih.gov/pubmed/31992615
http://dx.doi.org/10.1101/gr.254557.119
work_keys_str_mv AT wanshibiao sharphyperfastandaccurateprocessingofsinglecellrnaseqdataviaensemblerandomprojection
AT kimjunil sharphyperfastandaccurateprocessingofsinglecellrnaseqdataviaensemblerandomprojection
AT wonkyoungjae sharphyperfastandaccurateprocessingofsinglecellrnaseqdataviaensemblerandomprojection