Cargando…
On the use of QDE-SVM for gene feature selection and cell type classification from scRNA-seq data
Cell type identification is one of the fundamental tasks in single-cell RNA sequencing (scRNA-seq) studies. It is a key step to facilitate downstream interpretations such as differential expression, trajectory inference, etc. scRNA-seq data contains technical variations that could affect the interpr...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10586655/ https://www.ncbi.nlm.nih.gov/pubmed/37856458 http://dx.doi.org/10.1371/journal.pone.0292961 |
_version_ | 1785123191746723840 |
---|---|
author | Ng, Grace Yee Lin Tan, Shing Chiang Ong, Chia Sui |
author_facet | Ng, Grace Yee Lin Tan, Shing Chiang Ong, Chia Sui |
author_sort | Ng, Grace Yee Lin |
collection | PubMed |
description | Cell type identification is one of the fundamental tasks in single-cell RNA sequencing (scRNA-seq) studies. It is a key step to facilitate downstream interpretations such as differential expression, trajectory inference, etc. scRNA-seq data contains technical variations that could affect the interpretation of the cell types. Therefore, gene selection, also known as feature selection in data science, plays an important role in selecting informative genes for scRNA-seq cell type identification. Generally speaking, feature selection methods are categorized into filter-, wrapper-, and embedded-based approaches. From the existing literature, methods from filter- and embedded-based approaches are widely applied in scRNA-seq gene selection tasks. The wrapper-based method that gives promising results in other fields has yet been extensively utilized for selecting gene features from scRNA-seq data; in addition, most of the existing wrapper methods used in this field are clustering instead of classification-based. With a large number of annotated data available today, this study applied a classification-based approach as an alternative to the clustering-based wrapper method. In our work, a quantum-inspired differential evolution (QDE) wrapped with a classification method was introduced to select a subset of genes from twelve well-known scRNA-seq transcriptomic datasets to identify cell types. In particular, the QDE was combined with different machine-learning (ML) classifiers namely logistic regression, decision tree, support vector machine (SVM) with linear and radial basis function kernels, as well as extreme learning machine. The linear SVM wrapped with QDE, namely QDE-SVM, was chosen by referring to the feature selection results from the experiment. QDE-SVM showed a superior cell type classification performance among QDE wrapping with other ML classifiers as well as the recent wrapper methods (i.e., FSCAM, SSD-LAHC, MA-HS, and BSF). QDE-SVM achieved an average accuracy of 0.9559, while the other wrapper methods achieved average accuracies in the range of 0.8292 to 0.8872. |
format | Online Article Text |
id | pubmed-10586655 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-105866552023-10-20 On the use of QDE-SVM for gene feature selection and cell type classification from scRNA-seq data Ng, Grace Yee Lin Tan, Shing Chiang Ong, Chia Sui PLoS One Research Article Cell type identification is one of the fundamental tasks in single-cell RNA sequencing (scRNA-seq) studies. It is a key step to facilitate downstream interpretations such as differential expression, trajectory inference, etc. scRNA-seq data contains technical variations that could affect the interpretation of the cell types. Therefore, gene selection, also known as feature selection in data science, plays an important role in selecting informative genes for scRNA-seq cell type identification. Generally speaking, feature selection methods are categorized into filter-, wrapper-, and embedded-based approaches. From the existing literature, methods from filter- and embedded-based approaches are widely applied in scRNA-seq gene selection tasks. The wrapper-based method that gives promising results in other fields has yet been extensively utilized for selecting gene features from scRNA-seq data; in addition, most of the existing wrapper methods used in this field are clustering instead of classification-based. With a large number of annotated data available today, this study applied a classification-based approach as an alternative to the clustering-based wrapper method. In our work, a quantum-inspired differential evolution (QDE) wrapped with a classification method was introduced to select a subset of genes from twelve well-known scRNA-seq transcriptomic datasets to identify cell types. In particular, the QDE was combined with different machine-learning (ML) classifiers namely logistic regression, decision tree, support vector machine (SVM) with linear and radial basis function kernels, as well as extreme learning machine. The linear SVM wrapped with QDE, namely QDE-SVM, was chosen by referring to the feature selection results from the experiment. QDE-SVM showed a superior cell type classification performance among QDE wrapping with other ML classifiers as well as the recent wrapper methods (i.e., FSCAM, SSD-LAHC, MA-HS, and BSF). QDE-SVM achieved an average accuracy of 0.9559, while the other wrapper methods achieved average accuracies in the range of 0.8292 to 0.8872. Public Library of Science 2023-10-19 /pmc/articles/PMC10586655/ /pubmed/37856458 http://dx.doi.org/10.1371/journal.pone.0292961 Text en © 2023 Ng et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Ng, Grace Yee Lin Tan, Shing Chiang Ong, Chia Sui On the use of QDE-SVM for gene feature selection and cell type classification from scRNA-seq data |
title | On the use of QDE-SVM for gene feature selection and cell type classification from scRNA-seq data |
title_full | On the use of QDE-SVM for gene feature selection and cell type classification from scRNA-seq data |
title_fullStr | On the use of QDE-SVM for gene feature selection and cell type classification from scRNA-seq data |
title_full_unstemmed | On the use of QDE-SVM for gene feature selection and cell type classification from scRNA-seq data |
title_short | On the use of QDE-SVM for gene feature selection and cell type classification from scRNA-seq data |
title_sort | on the use of qde-svm for gene feature selection and cell type classification from scrna-seq data |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10586655/ https://www.ncbi.nlm.nih.gov/pubmed/37856458 http://dx.doi.org/10.1371/journal.pone.0292961 |
work_keys_str_mv | AT nggraceyeelin ontheuseofqdesvmforgenefeatureselectionandcelltypeclassificationfromscrnaseqdata AT tanshingchiang ontheuseofqdesvmforgenefeatureselectionandcelltypeclassificationfromscrnaseqdata AT ongchiasui ontheuseofqdesvmforgenefeatureselectionandcelltypeclassificationfromscrnaseqdata |