Cargando…

A two-stage hybrid biomarker selection method based on ensemble filter and binary differential evolution incorporating binary African vultures optimization

BACKGROUND: In the field of genomics and personalized medicine, it is a key issue to find biomarkers directly related to the diagnosis of specific diseases from high-throughput gene microarray data. Feature selection technology can discover biomarkers with disease classification information. RESULTS...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Wei, Chi, Yuhuan, Yu, Kun, Xie, Weidong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10072044/
https://www.ncbi.nlm.nih.gov/pubmed/37016297
http://dx.doi.org/10.1186/s12859-023-05247-7
_version_ 1785019307844960256
author Li, Wei
Chi, Yuhuan
Yu, Kun
Xie, Weidong
author_facet Li, Wei
Chi, Yuhuan
Yu, Kun
Xie, Weidong
author_sort Li, Wei
collection PubMed
description BACKGROUND: In the field of genomics and personalized medicine, it is a key issue to find biomarkers directly related to the diagnosis of specific diseases from high-throughput gene microarray data. Feature selection technology can discover biomarkers with disease classification information. RESULTS: We use support vector machines as classifiers and use the five-fold cross-validation average classification accuracy, recall, precision and F1 score as evaluation metrics to evaluate the identified biomarkers. Experimental results show classification accuracy above 0.93, recall above 0.92, precision above 0.91, and F1 score above 0.94 on eight microarray datasets. METHOD: This paper proposes a two-stage hybrid biomarker selection method based on ensemble filter and binary differential evolution incorporating binary African vultures optimization (EF-BDBA), which can effectively reduce the dimension of microarray data and obtain optimal biomarkers. In the first stage, we propose an ensemble filter feature selection method. The method combines an improved fast correlation-based filter algorithm with Fisher score. obviously redundant and irrelevant features can be filtered out to initially reduce the dimensionality of the microarray data. In the second stage, the optimal feature subset is selected using an improved binary differential evolution incorporating an improved binary African vultures optimization algorithm. The African vultures optimization algorithm has excellent global optimization ability. It has not been systematically applied to feature selection problems, especially for gene microarray data. We combine it with a differential evolution algorithm to improve population diversity. CONCLUSION: Compared with traditional feature selection methods and advanced hybrid methods, the proposed method achieves higher classification accuracy and identifies excellent biomarkers while retaining fewer features. The experimental results demonstrate the effectiveness and advancement of our proposed algorithmic model.
format Online
Article
Text
id pubmed-10072044
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-100720442023-04-04 A two-stage hybrid biomarker selection method based on ensemble filter and binary differential evolution incorporating binary African vultures optimization Li, Wei Chi, Yuhuan Yu, Kun Xie, Weidong BMC Bioinformatics Research BACKGROUND: In the field of genomics and personalized medicine, it is a key issue to find biomarkers directly related to the diagnosis of specific diseases from high-throughput gene microarray data. Feature selection technology can discover biomarkers with disease classification information. RESULTS: We use support vector machines as classifiers and use the five-fold cross-validation average classification accuracy, recall, precision and F1 score as evaluation metrics to evaluate the identified biomarkers. Experimental results show classification accuracy above 0.93, recall above 0.92, precision above 0.91, and F1 score above 0.94 on eight microarray datasets. METHOD: This paper proposes a two-stage hybrid biomarker selection method based on ensemble filter and binary differential evolution incorporating binary African vultures optimization (EF-BDBA), which can effectively reduce the dimension of microarray data and obtain optimal biomarkers. In the first stage, we propose an ensemble filter feature selection method. The method combines an improved fast correlation-based filter algorithm with Fisher score. obviously redundant and irrelevant features can be filtered out to initially reduce the dimensionality of the microarray data. In the second stage, the optimal feature subset is selected using an improved binary differential evolution incorporating an improved binary African vultures optimization algorithm. The African vultures optimization algorithm has excellent global optimization ability. It has not been systematically applied to feature selection problems, especially for gene microarray data. We combine it with a differential evolution algorithm to improve population diversity. CONCLUSION: Compared with traditional feature selection methods and advanced hybrid methods, the proposed method achieves higher classification accuracy and identifies excellent biomarkers while retaining fewer features. The experimental results demonstrate the effectiveness and advancement of our proposed algorithmic model. BioMed Central 2023-04-04 /pmc/articles/PMC10072044/ /pubmed/37016297 http://dx.doi.org/10.1186/s12859-023-05247-7 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Li, Wei
Chi, Yuhuan
Yu, Kun
Xie, Weidong
A two-stage hybrid biomarker selection method based on ensemble filter and binary differential evolution incorporating binary African vultures optimization
title A two-stage hybrid biomarker selection method based on ensemble filter and binary differential evolution incorporating binary African vultures optimization
title_full A two-stage hybrid biomarker selection method based on ensemble filter and binary differential evolution incorporating binary African vultures optimization
title_fullStr A two-stage hybrid biomarker selection method based on ensemble filter and binary differential evolution incorporating binary African vultures optimization
title_full_unstemmed A two-stage hybrid biomarker selection method based on ensemble filter and binary differential evolution incorporating binary African vultures optimization
title_short A two-stage hybrid biomarker selection method based on ensemble filter and binary differential evolution incorporating binary African vultures optimization
title_sort two-stage hybrid biomarker selection method based on ensemble filter and binary differential evolution incorporating binary african vultures optimization
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10072044/
https://www.ncbi.nlm.nih.gov/pubmed/37016297
http://dx.doi.org/10.1186/s12859-023-05247-7
work_keys_str_mv AT liwei atwostagehybridbiomarkerselectionmethodbasedonensemblefilterandbinarydifferentialevolutionincorporatingbinaryafricanvulturesoptimization
AT chiyuhuan atwostagehybridbiomarkerselectionmethodbasedonensemblefilterandbinarydifferentialevolutionincorporatingbinaryafricanvulturesoptimization
AT yukun atwostagehybridbiomarkerselectionmethodbasedonensemblefilterandbinarydifferentialevolutionincorporatingbinaryafricanvulturesoptimization
AT xieweidong atwostagehybridbiomarkerselectionmethodbasedonensemblefilterandbinarydifferentialevolutionincorporatingbinaryafricanvulturesoptimization
AT liwei twostagehybridbiomarkerselectionmethodbasedonensemblefilterandbinarydifferentialevolutionincorporatingbinaryafricanvulturesoptimization
AT chiyuhuan twostagehybridbiomarkerselectionmethodbasedonensemblefilterandbinarydifferentialevolutionincorporatingbinaryafricanvulturesoptimization
AT yukun twostagehybridbiomarkerselectionmethodbasedonensemblefilterandbinarydifferentialevolutionincorporatingbinaryafricanvulturesoptimization
AT xieweidong twostagehybridbiomarkerselectionmethodbasedonensemblefilterandbinarydifferentialevolutionincorporatingbinaryafricanvulturesoptimization