Cargando…
A novel feature selection algorithm based on damping oscillation theory
Feature selection is an important task in big data analysis and information retrieval processing. It reduces the number of features by removing noise, extraneous data. In this paper, one feature subset selection algorithm based on damping oscillation theory and support vector machine classifier is p...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8345869/ https://www.ncbi.nlm.nih.gov/pubmed/34358234 http://dx.doi.org/10.1371/journal.pone.0255307 |
_version_ | 1783734732023922688 |
---|---|
author | Wang, Fujun Wang, Xing |
author_facet | Wang, Fujun Wang, Xing |
author_sort | Wang, Fujun |
collection | PubMed |
description | Feature selection is an important task in big data analysis and information retrieval processing. It reduces the number of features by removing noise, extraneous data. In this paper, one feature subset selection algorithm based on damping oscillation theory and support vector machine classifier is proposed. This algorithm is called the Maximum Kendall coefficient Maximum Euclidean Distance Improved Gray Wolf Optimization algorithm (MKMDIGWO). In MKMDIGWO, first, a filter model based on Kendall coefficient and Euclidean distance is proposed, which is used to measure the correlation and redundancy of the candidate feature subset. Second, the wrapper model is an improved grey wolf optimization algorithm, in which its position update formula has been improved in order to achieve optimal results. Third, the filter model and the wrapper model are dynamically adjusted by the damping oscillation theory to achieve the effect of finding an optimal feature subset. Therefore, MKMDIGWO achieves both the efficiency of the filter model and the high precision of the wrapper model. Experimental results on five UCI public data sets and two microarray data sets have demonstrated the higher classification accuracy of the MKMDIGWO algorithm than that of other four state-of-the-art algorithms. The maximum ACC value of the MKMDIGWO algorithm is at least 0.5% higher than other algorithms on 10 data sets. |
format | Online Article Text |
id | pubmed-8345869 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-83458692021-08-07 A novel feature selection algorithm based on damping oscillation theory Wang, Fujun Wang, Xing PLoS One Research Article Feature selection is an important task in big data analysis and information retrieval processing. It reduces the number of features by removing noise, extraneous data. In this paper, one feature subset selection algorithm based on damping oscillation theory and support vector machine classifier is proposed. This algorithm is called the Maximum Kendall coefficient Maximum Euclidean Distance Improved Gray Wolf Optimization algorithm (MKMDIGWO). In MKMDIGWO, first, a filter model based on Kendall coefficient and Euclidean distance is proposed, which is used to measure the correlation and redundancy of the candidate feature subset. Second, the wrapper model is an improved grey wolf optimization algorithm, in which its position update formula has been improved in order to achieve optimal results. Third, the filter model and the wrapper model are dynamically adjusted by the damping oscillation theory to achieve the effect of finding an optimal feature subset. Therefore, MKMDIGWO achieves both the efficiency of the filter model and the high precision of the wrapper model. Experimental results on five UCI public data sets and two microarray data sets have demonstrated the higher classification accuracy of the MKMDIGWO algorithm than that of other four state-of-the-art algorithms. The maximum ACC value of the MKMDIGWO algorithm is at least 0.5% higher than other algorithms on 10 data sets. Public Library of Science 2021-08-06 /pmc/articles/PMC8345869/ /pubmed/34358234 http://dx.doi.org/10.1371/journal.pone.0255307 Text en © 2021 Wang, Wang https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Wang, Fujun Wang, Xing A novel feature selection algorithm based on damping oscillation theory |
title | A novel feature selection algorithm based on damping oscillation theory |
title_full | A novel feature selection algorithm based on damping oscillation theory |
title_fullStr | A novel feature selection algorithm based on damping oscillation theory |
title_full_unstemmed | A novel feature selection algorithm based on damping oscillation theory |
title_short | A novel feature selection algorithm based on damping oscillation theory |
title_sort | novel feature selection algorithm based on damping oscillation theory |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8345869/ https://www.ncbi.nlm.nih.gov/pubmed/34358234 http://dx.doi.org/10.1371/journal.pone.0255307 |
work_keys_str_mv | AT wangfujun anovelfeatureselectionalgorithmbasedondampingoscillationtheory AT wangxing anovelfeatureselectionalgorithmbasedondampingoscillationtheory AT wangfujun novelfeatureselectionalgorithmbasedondampingoscillationtheory AT wangxing novelfeatureselectionalgorithmbasedondampingoscillationtheory |