SHARP: hyperfast and accurate processing of single-cell RNA-seq data via ensemble random projection
To process large-scale single-cell RNA-sequencing (scRNA-seq) data effectively without excessive distortion during dimension reduction, we present SHARP, an ensemble random projection-based algorithm that is scalable to clustering 10 million cells. Comprehensive benchmarking tests on 17 public scRNA...
Main authors: | Wan, Shibiao; Kim, Junil; Won, Kyoung Jae |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Cold Spring Harbor Laboratory Press, 2020 |
Subjects: | Method |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7050522/ https://www.ncbi.nlm.nih.gov/pubmed/31992615 http://dx.doi.org/10.1101/gr.254557.119 |
_version_ | 1783502625693499392 |
---|---|
author | Wan, Shibiao; Kim, Junil; Won, Kyoung Jae |
author_sort | Wan, Shibiao |
collection | PubMed |
description | To process large-scale single-cell RNA-sequencing (scRNA-seq) data effectively without excessive distortion during dimension reduction, we present SHARP, an ensemble random projection-based algorithm that is scalable to clustering 10 million cells. Comprehensive benchmarking tests on 17 public scRNA-seq data sets show that SHARP outperforms existing methods in terms of speed and accuracy. Particularly, for large-size data sets (more than 40,000 cells), SHARP runs faster than other competitors while maintaining high clustering accuracy and robustness. To the best of our knowledge, SHARP is the only R-based tool that is scalable to clustering scRNA-seq data with 10 million cells. |
format | Online Article Text |
id | pubmed-7050522 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Cold Spring Harbor Laboratory Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-7050522; 2020-08-01; SHARP: hyperfast and accurate processing of single-cell RNA-seq data via ensemble random projection; Wan, Shibiao; Kim, Junil; Won, Kyoung Jae; Genome Res; Method; Cold Spring Harbor Laboratory Press; 2020-02; /pmc/articles/PMC7050522/; /pubmed/31992615; http://dx.doi.org/10.1101/gr.254557.119; Text; en; © 2020 Wan et al.; Published by Cold Spring Harbor Laboratory Press. http://creativecommons.org/licenses/by-nc/4.0/ This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see http://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/. |
title | SHARP: hyperfast and accurate processing of single-cell RNA-seq data via ensemble random projection |
topic | Method |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7050522/ https://www.ncbi.nlm.nih.gov/pubmed/31992615 http://dx.doi.org/10.1101/gr.254557.119 |