Cargando…

A novel feature selection algorithm based on damping oscillation theory

Feature selection is an important task in big data analysis and information retrieval processing. It reduces the number of features by removing noise, extraneous data. In this paper, one feature subset selection algorithm based on damping oscillation theory and support vector machine classifier is p...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Fujun, Wang, Xing
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8345869/
https://www.ncbi.nlm.nih.gov/pubmed/34358234
http://dx.doi.org/10.1371/journal.pone.0255307
_version_ 1783734732023922688
author Wang, Fujun
Wang, Xing
author_facet Wang, Fujun
Wang, Xing
author_sort Wang, Fujun
collection PubMed
description Feature selection is an important task in big data analysis and information retrieval processing. It reduces the number of features by removing noise, extraneous data. In this paper, one feature subset selection algorithm based on damping oscillation theory and support vector machine classifier is proposed. This algorithm is called the Maximum Kendall coefficient Maximum Euclidean Distance Improved Gray Wolf Optimization algorithm (MKMDIGWO). In MKMDIGWO, first, a filter model based on Kendall coefficient and Euclidean distance is proposed, which is used to measure the correlation and redundancy of the candidate feature subset. Second, the wrapper model is an improved grey wolf optimization algorithm, in which its position update formula has been improved in order to achieve optimal results. Third, the filter model and the wrapper model are dynamically adjusted by the damping oscillation theory to achieve the effect of finding an optimal feature subset. Therefore, MKMDIGWO achieves both the efficiency of the filter model and the high precision of the wrapper model. Experimental results on five UCI public data sets and two microarray data sets have demonstrated the higher classification accuracy of the MKMDIGWO algorithm than that of other four state-of-the-art algorithms. The maximum ACC value of the MKMDIGWO algorithm is at least 0.5% higher than other algorithms on 10 data sets.
format Online
Article
Text
id pubmed-8345869
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-83458692021-08-07 A novel feature selection algorithm based on damping oscillation theory Wang, Fujun Wang, Xing PLoS One Research Article Feature selection is an important task in big data analysis and information retrieval processing. It reduces the number of features by removing noise, extraneous data. In this paper, one feature subset selection algorithm based on damping oscillation theory and support vector machine classifier is proposed. This algorithm is called the Maximum Kendall coefficient Maximum Euclidean Distance Improved Gray Wolf Optimization algorithm (MKMDIGWO). In MKMDIGWO, first, a filter model based on Kendall coefficient and Euclidean distance is proposed, which is used to measure the correlation and redundancy of the candidate feature subset. Second, the wrapper model is an improved grey wolf optimization algorithm, in which its position update formula has been improved in order to achieve optimal results. Third, the filter model and the wrapper model are dynamically adjusted by the damping oscillation theory to achieve the effect of finding an optimal feature subset. Therefore, MKMDIGWO achieves both the efficiency of the filter model and the high precision of the wrapper model. Experimental results on five UCI public data sets and two microarray data sets have demonstrated the higher classification accuracy of the MKMDIGWO algorithm than that of other four state-of-the-art algorithms. The maximum ACC value of the MKMDIGWO algorithm is at least 0.5% higher than other algorithms on 10 data sets. Public Library of Science 2021-08-06 /pmc/articles/PMC8345869/ /pubmed/34358234 http://dx.doi.org/10.1371/journal.pone.0255307 Text en © 2021 Wang, Wang https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Wang, Fujun
Wang, Xing
A novel feature selection algorithm based on damping oscillation theory
title A novel feature selection algorithm based on damping oscillation theory
title_full A novel feature selection algorithm based on damping oscillation theory
title_fullStr A novel feature selection algorithm based on damping oscillation theory
title_full_unstemmed A novel feature selection algorithm based on damping oscillation theory
title_short A novel feature selection algorithm based on damping oscillation theory
title_sort novel feature selection algorithm based on damping oscillation theory
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8345869/
https://www.ncbi.nlm.nih.gov/pubmed/34358234
http://dx.doi.org/10.1371/journal.pone.0255307
work_keys_str_mv AT wangfujun anovelfeatureselectionalgorithmbasedondampingoscillationtheory
AT wangxing anovelfeatureselectionalgorithmbasedondampingoscillationtheory
AT wangfujun novelfeatureselectionalgorithmbasedondampingoscillationtheory
AT wangxing novelfeatureselectionalgorithmbasedondampingoscillationtheory