Cargando…
A new hybrid algorithm for three-stage gene selection based on whale optimization
In biomedical data mining, the gene dimension is often much larger than the sample size. To solve this problem, we need to use a feature selection algorithm to select feature gene subsets with a strong correlation with phenotype to ensure the accuracy of subsequent analysis. This paper presents a ne...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9992521/ https://www.ncbi.nlm.nih.gov/pubmed/36882446 http://dx.doi.org/10.1038/s41598-023-30862-y |
_version_ | 1784902327694524416 |
---|---|
author | Liu, Junjian Qu, Chiwen Zhang, Lupeng Tang, Yifan Li, Jinlong Feng, Huicong Zeng, Xiaomin Peng, Xiaoning |
author_facet | Liu, Junjian Qu, Chiwen Zhang, Lupeng Tang, Yifan Li, Jinlong Feng, Huicong Zeng, Xiaomin Peng, Xiaoning |
author_sort | Liu, Junjian |
collection | PubMed |
description | In biomedical data mining, the gene dimension is often much larger than the sample size. To solve this problem, we need to use a feature selection algorithm to select feature gene subsets with a strong correlation with phenotype to ensure the accuracy of subsequent analysis. This paper presents a new three-stage hybrid feature gene selection method, that combines a variance filter, extremely randomized tree, and whale optimization algorithm. First, a variance filter is used to reduce the dimension of the feature gene space, and an extremely randomized tree is used to further reduce the feature gene set. Finally, the whale optimization algorithm is used to select the optimal feature gene subset. We evaluate the proposed method with three different classifiers in seven published gene expression profile datasets and compare it with other advanced feature selection algorithms. The results show that the proposed method has significant advantages in a variety of evaluation indicators. |
format | Online Article Text |
id | pubmed-9992521 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-99925212023-03-09 A new hybrid algorithm for three-stage gene selection based on whale optimization Liu, Junjian Qu, Chiwen Zhang, Lupeng Tang, Yifan Li, Jinlong Feng, Huicong Zeng, Xiaomin Peng, Xiaoning Sci Rep Article In biomedical data mining, the gene dimension is often much larger than the sample size. To solve this problem, we need to use a feature selection algorithm to select feature gene subsets with a strong correlation with phenotype to ensure the accuracy of subsequent analysis. This paper presents a new three-stage hybrid feature gene selection method, that combines a variance filter, extremely randomized tree, and whale optimization algorithm. First, a variance filter is used to reduce the dimension of the feature gene space, and an extremely randomized tree is used to further reduce the feature gene set. Finally, the whale optimization algorithm is used to select the optimal feature gene subset. We evaluate the proposed method with three different classifiers in seven published gene expression profile datasets and compare it with other advanced feature selection algorithms. The results show that the proposed method has significant advantages in a variety of evaluation indicators. Nature Publishing Group UK 2023-03-07 /pmc/articles/PMC9992521/ /pubmed/36882446 http://dx.doi.org/10.1038/s41598-023-30862-y Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Liu, Junjian Qu, Chiwen Zhang, Lupeng Tang, Yifan Li, Jinlong Feng, Huicong Zeng, Xiaomin Peng, Xiaoning A new hybrid algorithm for three-stage gene selection based on whale optimization |
title | A new hybrid algorithm for three-stage gene selection based on whale optimization |
title_full | A new hybrid algorithm for three-stage gene selection based on whale optimization |
title_fullStr | A new hybrid algorithm for three-stage gene selection based on whale optimization |
title_full_unstemmed | A new hybrid algorithm for three-stage gene selection based on whale optimization |
title_short | A new hybrid algorithm for three-stage gene selection based on whale optimization |
title_sort | new hybrid algorithm for three-stage gene selection based on whale optimization |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9992521/ https://www.ncbi.nlm.nih.gov/pubmed/36882446 http://dx.doi.org/10.1038/s41598-023-30862-y |
work_keys_str_mv | AT liujunjian anewhybridalgorithmforthreestagegeneselectionbasedonwhaleoptimization AT quchiwen anewhybridalgorithmforthreestagegeneselectionbasedonwhaleoptimization AT zhanglupeng anewhybridalgorithmforthreestagegeneselectionbasedonwhaleoptimization AT tangyifan anewhybridalgorithmforthreestagegeneselectionbasedonwhaleoptimization AT lijinlong anewhybridalgorithmforthreestagegeneselectionbasedonwhaleoptimization AT fenghuicong anewhybridalgorithmforthreestagegeneselectionbasedonwhaleoptimization AT zengxiaomin anewhybridalgorithmforthreestagegeneselectionbasedonwhaleoptimization AT pengxiaoning anewhybridalgorithmforthreestagegeneselectionbasedonwhaleoptimization AT liujunjian newhybridalgorithmforthreestagegeneselectionbasedonwhaleoptimization AT quchiwen newhybridalgorithmforthreestagegeneselectionbasedonwhaleoptimization AT zhanglupeng newhybridalgorithmforthreestagegeneselectionbasedonwhaleoptimization AT tangyifan newhybridalgorithmforthreestagegeneselectionbasedonwhaleoptimization AT lijinlong newhybridalgorithmforthreestagegeneselectionbasedonwhaleoptimization AT fenghuicong newhybridalgorithmforthreestagegeneselectionbasedonwhaleoptimization AT zengxiaomin newhybridalgorithmforthreestagegeneselectionbasedonwhaleoptimization AT pengxiaoning newhybridalgorithmforthreestagegeneselectionbasedonwhaleoptimization |