Cargando…

On the use of QDE-SVM for gene feature selection and cell type classification from scRNA-seq data

Cell type identification is one of the fundamental tasks in single-cell RNA sequencing (scRNA-seq) studies. It is a key step to facilitate downstream interpretations such as differential expression, trajectory inference, etc. scRNA-seq data contains technical variations that could affect the interpr...

Descripción completa

Detalles Bibliográficos
Autores principales: Ng, Grace Yee Lin, Tan, Shing Chiang, Ong, Chia Sui
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10586655/
https://www.ncbi.nlm.nih.gov/pubmed/37856458
http://dx.doi.org/10.1371/journal.pone.0292961
_version_ 1785123191746723840
author Ng, Grace Yee Lin
Tan, Shing Chiang
Ong, Chia Sui
author_facet Ng, Grace Yee Lin
Tan, Shing Chiang
Ong, Chia Sui
author_sort Ng, Grace Yee Lin
collection PubMed
description Cell type identification is one of the fundamental tasks in single-cell RNA sequencing (scRNA-seq) studies. It is a key step to facilitate downstream interpretations such as differential expression, trajectory inference, etc. scRNA-seq data contains technical variations that could affect the interpretation of the cell types. Therefore, gene selection, also known as feature selection in data science, plays an important role in selecting informative genes for scRNA-seq cell type identification. Generally speaking, feature selection methods are categorized into filter-, wrapper-, and embedded-based approaches. From the existing literature, methods from filter- and embedded-based approaches are widely applied in scRNA-seq gene selection tasks. The wrapper-based method that gives promising results in other fields has yet been extensively utilized for selecting gene features from scRNA-seq data; in addition, most of the existing wrapper methods used in this field are clustering instead of classification-based. With a large number of annotated data available today, this study applied a classification-based approach as an alternative to the clustering-based wrapper method. In our work, a quantum-inspired differential evolution (QDE) wrapped with a classification method was introduced to select a subset of genes from twelve well-known scRNA-seq transcriptomic datasets to identify cell types. In particular, the QDE was combined with different machine-learning (ML) classifiers namely logistic regression, decision tree, support vector machine (SVM) with linear and radial basis function kernels, as well as extreme learning machine. The linear SVM wrapped with QDE, namely QDE-SVM, was chosen by referring to the feature selection results from the experiment. QDE-SVM showed a superior cell type classification performance among QDE wrapping with other ML classifiers as well as the recent wrapper methods (i.e., FSCAM, SSD-LAHC, MA-HS, and BSF). QDE-SVM achieved an average accuracy of 0.9559, while the other wrapper methods achieved average accuracies in the range of 0.8292 to 0.8872.
format Online
Article
Text
id pubmed-10586655
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-105866552023-10-20 On the use of QDE-SVM for gene feature selection and cell type classification from scRNA-seq data Ng, Grace Yee Lin Tan, Shing Chiang Ong, Chia Sui PLoS One Research Article Cell type identification is one of the fundamental tasks in single-cell RNA sequencing (scRNA-seq) studies. It is a key step to facilitate downstream interpretations such as differential expression, trajectory inference, etc. scRNA-seq data contains technical variations that could affect the interpretation of the cell types. Therefore, gene selection, also known as feature selection in data science, plays an important role in selecting informative genes for scRNA-seq cell type identification. Generally speaking, feature selection methods are categorized into filter-, wrapper-, and embedded-based approaches. From the existing literature, methods from filter- and embedded-based approaches are widely applied in scRNA-seq gene selection tasks. The wrapper-based method that gives promising results in other fields has yet been extensively utilized for selecting gene features from scRNA-seq data; in addition, most of the existing wrapper methods used in this field are clustering instead of classification-based. With a large number of annotated data available today, this study applied a classification-based approach as an alternative to the clustering-based wrapper method. In our work, a quantum-inspired differential evolution (QDE) wrapped with a classification method was introduced to select a subset of genes from twelve well-known scRNA-seq transcriptomic datasets to identify cell types. In particular, the QDE was combined with different machine-learning (ML) classifiers namely logistic regression, decision tree, support vector machine (SVM) with linear and radial basis function kernels, as well as extreme learning machine. The linear SVM wrapped with QDE, namely QDE-SVM, was chosen by referring to the feature selection results from the experiment. QDE-SVM showed a superior cell type classification performance among QDE wrapping with other ML classifiers as well as the recent wrapper methods (i.e., FSCAM, SSD-LAHC, MA-HS, and BSF). QDE-SVM achieved an average accuracy of 0.9559, while the other wrapper methods achieved average accuracies in the range of 0.8292 to 0.8872. Public Library of Science 2023-10-19 /pmc/articles/PMC10586655/ /pubmed/37856458 http://dx.doi.org/10.1371/journal.pone.0292961 Text en © 2023 Ng et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Ng, Grace Yee Lin
Tan, Shing Chiang
Ong, Chia Sui
On the use of QDE-SVM for gene feature selection and cell type classification from scRNA-seq data
title On the use of QDE-SVM for gene feature selection and cell type classification from scRNA-seq data
title_full On the use of QDE-SVM for gene feature selection and cell type classification from scRNA-seq data
title_fullStr On the use of QDE-SVM for gene feature selection and cell type classification from scRNA-seq data
title_full_unstemmed On the use of QDE-SVM for gene feature selection and cell type classification from scRNA-seq data
title_short On the use of QDE-SVM for gene feature selection and cell type classification from scRNA-seq data
title_sort on the use of qde-svm for gene feature selection and cell type classification from scrna-seq data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10586655/
https://www.ncbi.nlm.nih.gov/pubmed/37856458
http://dx.doi.org/10.1371/journal.pone.0292961
work_keys_str_mv AT nggraceyeelin ontheuseofqdesvmforgenefeatureselectionandcelltypeclassificationfromscrnaseqdata
AT tanshingchiang ontheuseofqdesvmforgenefeatureselectionandcelltypeclassificationfromscrnaseqdata
AT ongchiasui ontheuseofqdesvmforgenefeatureselectionandcelltypeclassificationfromscrnaseqdata