Cargando…

A new hybrid algorithm for three-stage gene selection based on whale optimization

In biomedical data mining, the gene dimension is often much larger than the sample size. To solve this problem, we need to use a feature selection algorithm to select feature gene subsets with a strong correlation with phenotype to ensure the accuracy of subsequent analysis. This paper presents a ne...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Junjian, Qu, Chiwen, Zhang, Lupeng, Tang, Yifan, Li, Jinlong, Feng, Huicong, Zeng, Xiaomin, Peng, Xiaoning
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9992521/
https://www.ncbi.nlm.nih.gov/pubmed/36882446
http://dx.doi.org/10.1038/s41598-023-30862-y
_version_ 1784902327694524416
author Liu, Junjian
Qu, Chiwen
Zhang, Lupeng
Tang, Yifan
Li, Jinlong
Feng, Huicong
Zeng, Xiaomin
Peng, Xiaoning
author_facet Liu, Junjian
Qu, Chiwen
Zhang, Lupeng
Tang, Yifan
Li, Jinlong
Feng, Huicong
Zeng, Xiaomin
Peng, Xiaoning
author_sort Liu, Junjian
collection PubMed
description In biomedical data mining, the gene dimension is often much larger than the sample size. To solve this problem, we need to use a feature selection algorithm to select feature gene subsets with a strong correlation with phenotype to ensure the accuracy of subsequent analysis. This paper presents a new three-stage hybrid feature gene selection method, that combines a variance filter, extremely randomized tree, and whale optimization algorithm. First, a variance filter is used to reduce the dimension of the feature gene space, and an extremely randomized tree is used to further reduce the feature gene set. Finally, the whale optimization algorithm is used to select the optimal feature gene subset. We evaluate the proposed method with three different classifiers in seven published gene expression profile datasets and compare it with other advanced feature selection algorithms. The results show that the proposed method has significant advantages in a variety of evaluation indicators.
format Online
Article
Text
id pubmed-9992521
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-99925212023-03-09 A new hybrid algorithm for three-stage gene selection based on whale optimization Liu, Junjian Qu, Chiwen Zhang, Lupeng Tang, Yifan Li, Jinlong Feng, Huicong Zeng, Xiaomin Peng, Xiaoning Sci Rep Article In biomedical data mining, the gene dimension is often much larger than the sample size. To solve this problem, we need to use a feature selection algorithm to select feature gene subsets with a strong correlation with phenotype to ensure the accuracy of subsequent analysis. This paper presents a new three-stage hybrid feature gene selection method, that combines a variance filter, extremely randomized tree, and whale optimization algorithm. First, a variance filter is used to reduce the dimension of the feature gene space, and an extremely randomized tree is used to further reduce the feature gene set. Finally, the whale optimization algorithm is used to select the optimal feature gene subset. We evaluate the proposed method with three different classifiers in seven published gene expression profile datasets and compare it with other advanced feature selection algorithms. The results show that the proposed method has significant advantages in a variety of evaluation indicators. Nature Publishing Group UK 2023-03-07 /pmc/articles/PMC9992521/ /pubmed/36882446 http://dx.doi.org/10.1038/s41598-023-30862-y Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Liu, Junjian
Qu, Chiwen
Zhang, Lupeng
Tang, Yifan
Li, Jinlong
Feng, Huicong
Zeng, Xiaomin
Peng, Xiaoning
A new hybrid algorithm for three-stage gene selection based on whale optimization
title A new hybrid algorithm for three-stage gene selection based on whale optimization
title_full A new hybrid algorithm for three-stage gene selection based on whale optimization
title_fullStr A new hybrid algorithm for three-stage gene selection based on whale optimization
title_full_unstemmed A new hybrid algorithm for three-stage gene selection based on whale optimization
title_short A new hybrid algorithm for three-stage gene selection based on whale optimization
title_sort new hybrid algorithm for three-stage gene selection based on whale optimization
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9992521/
https://www.ncbi.nlm.nih.gov/pubmed/36882446
http://dx.doi.org/10.1038/s41598-023-30862-y
work_keys_str_mv AT liujunjian anewhybridalgorithmforthreestagegeneselectionbasedonwhaleoptimization
AT quchiwen anewhybridalgorithmforthreestagegeneselectionbasedonwhaleoptimization
AT zhanglupeng anewhybridalgorithmforthreestagegeneselectionbasedonwhaleoptimization
AT tangyifan anewhybridalgorithmforthreestagegeneselectionbasedonwhaleoptimization
AT lijinlong anewhybridalgorithmforthreestagegeneselectionbasedonwhaleoptimization
AT fenghuicong anewhybridalgorithmforthreestagegeneselectionbasedonwhaleoptimization
AT zengxiaomin anewhybridalgorithmforthreestagegeneselectionbasedonwhaleoptimization
AT pengxiaoning anewhybridalgorithmforthreestagegeneselectionbasedonwhaleoptimization
AT liujunjian newhybridalgorithmforthreestagegeneselectionbasedonwhaleoptimization
AT quchiwen newhybridalgorithmforthreestagegeneselectionbasedonwhaleoptimization
AT zhanglupeng newhybridalgorithmforthreestagegeneselectionbasedonwhaleoptimization
AT tangyifan newhybridalgorithmforthreestagegeneselectionbasedonwhaleoptimization
AT lijinlong newhybridalgorithmforthreestagegeneselectionbasedonwhaleoptimization
AT fenghuicong newhybridalgorithmforthreestagegeneselectionbasedonwhaleoptimization
AT zengxiaomin newhybridalgorithmforthreestagegeneselectionbasedonwhaleoptimization
AT pengxiaoning newhybridalgorithmforthreestagegeneselectionbasedonwhaleoptimization