Cargando…

scFseCluster: a feature selection-enhanced clustering for single-cell RNA-seq data

Single-cell RNA sequencing (scRNA-seq) enables researchers to reveal previously unknown cell heterogeneity and functional diversity, which is impossible with bulk RNA sequencing. Clustering approaches are widely used for analyzing scRNA-seq data and identifying cell types and states. In the past few...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Zongqin, Xie, Xiaojun, Liu, Shouyang, Ji, Zhiwei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Life Science Alliance LLC 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10547911/
https://www.ncbi.nlm.nih.gov/pubmed/37788907
http://dx.doi.org/10.26508/lsa.202302103
_version_ 1785115160362352640
author Wang, Zongqin
Xie, Xiaojun
Liu, Shouyang
Ji, Zhiwei
author_facet Wang, Zongqin
Xie, Xiaojun
Liu, Shouyang
Ji, Zhiwei
author_sort Wang, Zongqin
collection PubMed
description Single-cell RNA sequencing (scRNA-seq) enables researchers to reveal previously unknown cell heterogeneity and functional diversity, which is impossible with bulk RNA sequencing. Clustering approaches are widely used for analyzing scRNA-seq data and identifying cell types and states. In the past few years, various advanced computational strategies emerged. However, the low generalization and high computational cost are the main bottlenecks of existing methods. In this study, we established a novel computational framework, scFseCluster, for scRNA-seq clustering analysis. scFseCluster incorporates a metaheuristic algorithm (Feature Selection based on Quantum Squirrel Search Algorithm) to extract the optimal gene set, which largely guarantees the performance of cell clustering. We conducted simulation experiments in several aspects to verify the performance of the proposed approach. scFseCluster performed very well on eight benchmark scRNA-seq datasets because of the optimal gene sets obtained using the Feature Selection based on Quantum Squirrel Search Algorithm. The comparative study demonstrated the significant advantages of scFseCluster over seven State-of-the-Art algorithms. In addition, our analysis shows that feature selection on high-variable genes can significantly improve clustering performance. In conclusion, our study demonstrates that scFseCluster is a highly versatile tool for enhancing scRNA-seq data clustering analysis.
format Online
Article
Text
id pubmed-10547911
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Life Science Alliance LLC
record_format MEDLINE/PubMed
spelling pubmed-105479112023-10-05 scFseCluster: a feature selection-enhanced clustering for single-cell RNA-seq data Wang, Zongqin Xie, Xiaojun Liu, Shouyang Ji, Zhiwei Life Sci Alliance Methods Single-cell RNA sequencing (scRNA-seq) enables researchers to reveal previously unknown cell heterogeneity and functional diversity, which is impossible with bulk RNA sequencing. Clustering approaches are widely used for analyzing scRNA-seq data and identifying cell types and states. In the past few years, various advanced computational strategies emerged. However, the low generalization and high computational cost are the main bottlenecks of existing methods. In this study, we established a novel computational framework, scFseCluster, for scRNA-seq clustering analysis. scFseCluster incorporates a metaheuristic algorithm (Feature Selection based on Quantum Squirrel Search Algorithm) to extract the optimal gene set, which largely guarantees the performance of cell clustering. We conducted simulation experiments in several aspects to verify the performance of the proposed approach. scFseCluster performed very well on eight benchmark scRNA-seq datasets because of the optimal gene sets obtained using the Feature Selection based on Quantum Squirrel Search Algorithm. The comparative study demonstrated the significant advantages of scFseCluster over seven State-of-the-Art algorithms. In addition, our analysis shows that feature selection on high-variable genes can significantly improve clustering performance. In conclusion, our study demonstrates that scFseCluster is a highly versatile tool for enhancing scRNA-seq data clustering analysis. Life Science Alliance LLC 2023-10-03 /pmc/articles/PMC10547911/ /pubmed/37788907 http://dx.doi.org/10.26508/lsa.202302103 Text en © 2023 Wang et al. https://creativecommons.org/licenses/by/4.0/This article is available under a Creative Commons License (Attribution 4.0 International, as described at https://creativecommons.org/licenses/by/4.0/).
spellingShingle Methods
Wang, Zongqin
Xie, Xiaojun
Liu, Shouyang
Ji, Zhiwei
scFseCluster: a feature selection-enhanced clustering for single-cell RNA-seq data
title scFseCluster: a feature selection-enhanced clustering for single-cell RNA-seq data
title_full scFseCluster: a feature selection-enhanced clustering for single-cell RNA-seq data
title_fullStr scFseCluster: a feature selection-enhanced clustering for single-cell RNA-seq data
title_full_unstemmed scFseCluster: a feature selection-enhanced clustering for single-cell RNA-seq data
title_short scFseCluster: a feature selection-enhanced clustering for single-cell RNA-seq data
title_sort scfsecluster: a feature selection-enhanced clustering for single-cell rna-seq data
topic Methods
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10547911/
https://www.ncbi.nlm.nih.gov/pubmed/37788907
http://dx.doi.org/10.26508/lsa.202302103
work_keys_str_mv AT wangzongqin scfseclusterafeatureselectionenhancedclusteringforsinglecellrnaseqdata
AT xiexiaojun scfseclusterafeatureselectionenhancedclusteringforsinglecellrnaseqdata
AT liushouyang scfseclusterafeatureselectionenhancedclusteringforsinglecellrnaseqdata
AT jizhiwei scfseclusterafeatureselectionenhancedclusteringforsinglecellrnaseqdata