Cargando…

scAce: an adaptive embedding and clustering method for single-cell gene expression data

MOTIVATION: Since the development of single-cell RNA sequencing (scRNA-seq) technologies, clustering analysis of single-cell gene expression data has been an essential tool for distinguishing cell types and identifying novel cell types. Even though many methods have been available for scRNA-seq clus...

Descripción completa

Detalles Bibliográficos
Autores principales: He, Xinwei, Qian, Kun, Wang, Ziqian, Zeng, Shirou, Li, Hongwei, Li, Wei Vivian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10500084/
https://www.ncbi.nlm.nih.gov/pubmed/37672035
http://dx.doi.org/10.1093/bioinformatics/btad546
_version_ 1785105849193070592
author He, Xinwei
Qian, Kun
Wang, Ziqian
Zeng, Shirou
Li, Hongwei
Li, Wei Vivian
author_facet He, Xinwei
Qian, Kun
Wang, Ziqian
Zeng, Shirou
Li, Hongwei
Li, Wei Vivian
author_sort He, Xinwei
collection PubMed
description MOTIVATION: Since the development of single-cell RNA sequencing (scRNA-seq) technologies, clustering analysis of single-cell gene expression data has been an essential tool for distinguishing cell types and identifying novel cell types. Even though many methods have been available for scRNA-seq clustering analysis, the majority of them are constrained by the requirement on predetermined cluster numbers or the dependence on selected initial cluster assignment. RESULTS: In this article, we propose an adaptive embedding and clustering method named scAce, which constructs a variational autoencoder to simultaneously learn cell embeddings and cluster assignments. In the scAce method, we develop an adaptive cluster merging approach which achieves improved clustering results without the need to estimate the number of clusters in advance. In addition, scAce provides an option to perform clustering enhancement, which can update and enhance cluster assignments based on previous clustering results from other methods. Based on computational analysis of both simulated and real datasets, we demonstrate that scAce outperforms state-of-the-art clustering methods for scRNA-seq data, and achieves better clustering accuracy and robustness. AVAILABILITY AND IMPLEMENTATION: The scAce package is implemented in python 3.8 and is freely available from https://github.com/sldyns/scAce.
format Online
Article
Text
id pubmed-10500084
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-105000842023-09-15 scAce: an adaptive embedding and clustering method for single-cell gene expression data He, Xinwei Qian, Kun Wang, Ziqian Zeng, Shirou Li, Hongwei Li, Wei Vivian Bioinformatics Original Paper MOTIVATION: Since the development of single-cell RNA sequencing (scRNA-seq) technologies, clustering analysis of single-cell gene expression data has been an essential tool for distinguishing cell types and identifying novel cell types. Even though many methods have been available for scRNA-seq clustering analysis, the majority of them are constrained by the requirement on predetermined cluster numbers or the dependence on selected initial cluster assignment. RESULTS: In this article, we propose an adaptive embedding and clustering method named scAce, which constructs a variational autoencoder to simultaneously learn cell embeddings and cluster assignments. In the scAce method, we develop an adaptive cluster merging approach which achieves improved clustering results without the need to estimate the number of clusters in advance. In addition, scAce provides an option to perform clustering enhancement, which can update and enhance cluster assignments based on previous clustering results from other methods. Based on computational analysis of both simulated and real datasets, we demonstrate that scAce outperforms state-of-the-art clustering methods for scRNA-seq data, and achieves better clustering accuracy and robustness. AVAILABILITY AND IMPLEMENTATION: The scAce package is implemented in python 3.8 and is freely available from https://github.com/sldyns/scAce. Oxford University Press 2023-09-06 /pmc/articles/PMC10500084/ /pubmed/37672035 http://dx.doi.org/10.1093/bioinformatics/btad546 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Paper
He, Xinwei
Qian, Kun
Wang, Ziqian
Zeng, Shirou
Li, Hongwei
Li, Wei Vivian
scAce: an adaptive embedding and clustering method for single-cell gene expression data
title scAce: an adaptive embedding and clustering method for single-cell gene expression data
title_full scAce: an adaptive embedding and clustering method for single-cell gene expression data
title_fullStr scAce: an adaptive embedding and clustering method for single-cell gene expression data
title_full_unstemmed scAce: an adaptive embedding and clustering method for single-cell gene expression data
title_short scAce: an adaptive embedding and clustering method for single-cell gene expression data
title_sort scace: an adaptive embedding and clustering method for single-cell gene expression data
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10500084/
https://www.ncbi.nlm.nih.gov/pubmed/37672035
http://dx.doi.org/10.1093/bioinformatics/btad546
work_keys_str_mv AT hexinwei scaceanadaptiveembeddingandclusteringmethodforsinglecellgeneexpressiondata
AT qiankun scaceanadaptiveembeddingandclusteringmethodforsinglecellgeneexpressiondata
AT wangziqian scaceanadaptiveembeddingandclusteringmethodforsinglecellgeneexpressiondata
AT zengshirou scaceanadaptiveembeddingandclusteringmethodforsinglecellgeneexpressiondata
AT lihongwei scaceanadaptiveembeddingandclusteringmethodforsinglecellgeneexpressiondata
AT liweivivian scaceanadaptiveembeddingandclusteringmethodforsinglecellgeneexpressiondata