Cargando…
Elitist Binary Wolf Search Algorithm for Heuristic Feature Selection in High-Dimensional Bioinformatics Datasets
Due to the high-dimensional characteristics of dataset, we propose a new method based on the Wolf Search Algorithm (WSA) for optimising the feature selection problem. The proposed approach uses the natural strategy established by Charles Darwin; that is, ‘It is not the strongest of the species that...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5489518/ https://www.ncbi.nlm.nih.gov/pubmed/28659577 http://dx.doi.org/10.1038/s41598-017-04037-5 |
_version_ | 1783246803579174912 |
---|---|
author | Li, Jinyan Fong, Simon Wong, Raymond K. Millham, Richard Wong, Kelvin K. L. |
author_facet | Li, Jinyan Fong, Simon Wong, Raymond K. Millham, Richard Wong, Kelvin K. L. |
author_sort | Li, Jinyan |
collection | PubMed |
description | Due to the high-dimensional characteristics of dataset, we propose a new method based on the Wolf Search Algorithm (WSA) for optimising the feature selection problem. The proposed approach uses the natural strategy established by Charles Darwin; that is, ‘It is not the strongest of the species that survives, but the most adaptable’. This means that in the evolution of a swarm, the elitists are motivated to quickly obtain more and better resources. The memory function helps the proposed method to avoid repeat searches for the worst position in order to enhance the effectiveness of the search, while the binary strategy simplifies the feature selection problem into a similar problem of function optimisation. Furthermore, the wrapper strategy gathers these strengthened wolves with the classifier of extreme learning machine to find a sub-dataset with a reasonable number of features that offers the maximum correctness of global classification models. The experimental results from the six public high-dimensional bioinformatics datasets tested demonstrate that the proposed method can best some of the conventional feature selection methods up to 29% in classification accuracy, and outperform previous WSAs by up to 99.81% in computational time. |
format | Online Article Text |
id | pubmed-5489518 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-54895182017-07-05 Elitist Binary Wolf Search Algorithm for Heuristic Feature Selection in High-Dimensional Bioinformatics Datasets Li, Jinyan Fong, Simon Wong, Raymond K. Millham, Richard Wong, Kelvin K. L. Sci Rep Article Due to the high-dimensional characteristics of dataset, we propose a new method based on the Wolf Search Algorithm (WSA) for optimising the feature selection problem. The proposed approach uses the natural strategy established by Charles Darwin; that is, ‘It is not the strongest of the species that survives, but the most adaptable’. This means that in the evolution of a swarm, the elitists are motivated to quickly obtain more and better resources. The memory function helps the proposed method to avoid repeat searches for the worst position in order to enhance the effectiveness of the search, while the binary strategy simplifies the feature selection problem into a similar problem of function optimisation. Furthermore, the wrapper strategy gathers these strengthened wolves with the classifier of extreme learning machine to find a sub-dataset with a reasonable number of features that offers the maximum correctness of global classification models. The experimental results from the six public high-dimensional bioinformatics datasets tested demonstrate that the proposed method can best some of the conventional feature selection methods up to 29% in classification accuracy, and outperform previous WSAs by up to 99.81% in computational time. Nature Publishing Group UK 2017-06-28 /pmc/articles/PMC5489518/ /pubmed/28659577 http://dx.doi.org/10.1038/s41598-017-04037-5 Text en © The Author(s) 2017 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Li, Jinyan Fong, Simon Wong, Raymond K. Millham, Richard Wong, Kelvin K. L. Elitist Binary Wolf Search Algorithm for Heuristic Feature Selection in High-Dimensional Bioinformatics Datasets |
title | Elitist Binary Wolf Search Algorithm for Heuristic Feature Selection in High-Dimensional Bioinformatics Datasets |
title_full | Elitist Binary Wolf Search Algorithm for Heuristic Feature Selection in High-Dimensional Bioinformatics Datasets |
title_fullStr | Elitist Binary Wolf Search Algorithm for Heuristic Feature Selection in High-Dimensional Bioinformatics Datasets |
title_full_unstemmed | Elitist Binary Wolf Search Algorithm for Heuristic Feature Selection in High-Dimensional Bioinformatics Datasets |
title_short | Elitist Binary Wolf Search Algorithm for Heuristic Feature Selection in High-Dimensional Bioinformatics Datasets |
title_sort | elitist binary wolf search algorithm for heuristic feature selection in high-dimensional bioinformatics datasets |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5489518/ https://www.ncbi.nlm.nih.gov/pubmed/28659577 http://dx.doi.org/10.1038/s41598-017-04037-5 |
work_keys_str_mv | AT lijinyan elitistbinarywolfsearchalgorithmforheuristicfeatureselectioninhighdimensionalbioinformaticsdatasets AT fongsimon elitistbinarywolfsearchalgorithmforheuristicfeatureselectioninhighdimensionalbioinformaticsdatasets AT wongraymondk elitistbinarywolfsearchalgorithmforheuristicfeatureselectioninhighdimensionalbioinformaticsdatasets AT millhamrichard elitistbinarywolfsearchalgorithmforheuristicfeatureselectioninhighdimensionalbioinformaticsdatasets AT wongkelvinkl elitistbinarywolfsearchalgorithmforheuristicfeatureselectioninhighdimensionalbioinformaticsdatasets |