Cargando…

Single-cell data clustering based on sparse optimization and low-rank matrix factorization

Unsupervised clustering is a fundamental step of single-cell RNA-sequencing (scRNA-seq) data analysis. This issue has inspired several clustering methods to classify cells in scRNA-seq data. However, accurate prediction of the cell clusters remains a substantial challenge. In this study, we propose...

Descripción completa

Detalles Bibliográficos
Autores principales: Hu, Yinlei, Li, Bin, Chen, Falai, Qu, Kun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8495739/
https://www.ncbi.nlm.nih.gov/pubmed/33787873
http://dx.doi.org/10.1093/g3journal/jkab098
_version_ 1784579610836467712
author Hu, Yinlei
Li, Bin
Chen, Falai
Qu, Kun
author_facet Hu, Yinlei
Li, Bin
Chen, Falai
Qu, Kun
author_sort Hu, Yinlei
collection PubMed
description Unsupervised clustering is a fundamental step of single-cell RNA-sequencing (scRNA-seq) data analysis. This issue has inspired several clustering methods to classify cells in scRNA-seq data. However, accurate prediction of the cell clusters remains a substantial challenge. In this study, we propose a new algorithm for scRNA-seq data clustering based on Sparse Optimization and low-rank matrix factorization (scSO). We applied our scSO algorithm to analyze multiple benchmark datasets and showed that the cluster number predicted by scSO was close to the number of reference cell types and that most cells were correctly classified. Our scSO algorithm is available at https://github.com/QuKunLab/scSO. Overall, this study demonstrates a potent cell clustering approach that can help researchers distinguish cell types in single- scRNA-seq data.
format Online
Article
Text
id pubmed-8495739
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-84957392021-10-07 Single-cell data clustering based on sparse optimization and low-rank matrix factorization Hu, Yinlei Li, Bin Chen, Falai Qu, Kun G3 (Bethesda) Software and Data Resources Unsupervised clustering is a fundamental step of single-cell RNA-sequencing (scRNA-seq) data analysis. This issue has inspired several clustering methods to classify cells in scRNA-seq data. However, accurate prediction of the cell clusters remains a substantial challenge. In this study, we propose a new algorithm for scRNA-seq data clustering based on Sparse Optimization and low-rank matrix factorization (scSO). We applied our scSO algorithm to analyze multiple benchmark datasets and showed that the cluster number predicted by scSO was close to the number of reference cell types and that most cells were correctly classified. Our scSO algorithm is available at https://github.com/QuKunLab/scSO. Overall, this study demonstrates a potent cell clustering approach that can help researchers distinguish cell types in single- scRNA-seq data. Oxford University Press 2021-03-31 /pmc/articles/PMC8495739/ /pubmed/33787873 http://dx.doi.org/10.1093/g3journal/jkab098 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of Genetics Society of America. https://creativecommons.org/licenses/by-nc-nd/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivs licence (http://creativecommons.org/licenses/by-nc-nd/4.0/ (https://creativecommons.org/licenses/by-nc-nd/4.0/) ), which permits non-commercial reproduction and distribution of the work, in any medium, provided the original work is not altered or transformed in any way, and that the work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Software and Data Resources
Hu, Yinlei
Li, Bin
Chen, Falai
Qu, Kun
Single-cell data clustering based on sparse optimization and low-rank matrix factorization
title Single-cell data clustering based on sparse optimization and low-rank matrix factorization
title_full Single-cell data clustering based on sparse optimization and low-rank matrix factorization
title_fullStr Single-cell data clustering based on sparse optimization and low-rank matrix factorization
title_full_unstemmed Single-cell data clustering based on sparse optimization and low-rank matrix factorization
title_short Single-cell data clustering based on sparse optimization and low-rank matrix factorization
title_sort single-cell data clustering based on sparse optimization and low-rank matrix factorization
topic Software and Data Resources
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8495739/
https://www.ncbi.nlm.nih.gov/pubmed/33787873
http://dx.doi.org/10.1093/g3journal/jkab098
work_keys_str_mv AT huyinlei singlecelldataclusteringbasedonsparseoptimizationandlowrankmatrixfactorization
AT libin singlecelldataclusteringbasedonsparseoptimizationandlowrankmatrixfactorization
AT chenfalai singlecelldataclusteringbasedonsparseoptimizationandlowrankmatrixfactorization
AT qukun singlecelldataclusteringbasedonsparseoptimizationandlowrankmatrixfactorization