Cargando…

uEFS: An efficient and comprehensive ensemble-based feature selection methodology to select informative features

Feature selection is considered to be one of the most critical methods for choosing appropriate features from a larger set of items. This task requires two basic steps: ranking and filtering. Of these, the former necessitates the ranking of all features, while the latter involves filtering out all i...

Descripción completa

Detalles Bibliográficos
Autores principales: Ali, Maqbool, Ali, Syed Imran, Kim, Dohyeong, Hur, Taeho, Bang, Jaehun, Lee, Sungyoung, Kang, Byeong Ho, Hussain, Maqbool
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6112679/
https://www.ncbi.nlm.nih.gov/pubmed/30153294
http://dx.doi.org/10.1371/journal.pone.0202705
_version_ 1783350891445747712
author Ali, Maqbool
Ali, Syed Imran
Kim, Dohyeong
Hur, Taeho
Bang, Jaehun
Lee, Sungyoung
Kang, Byeong Ho
Hussain, Maqbool
author_facet Ali, Maqbool
Ali, Syed Imran
Kim, Dohyeong
Hur, Taeho
Bang, Jaehun
Lee, Sungyoung
Kang, Byeong Ho
Hussain, Maqbool
author_sort Ali, Maqbool
collection PubMed
description Feature selection is considered to be one of the most critical methods for choosing appropriate features from a larger set of items. This task requires two basic steps: ranking and filtering. Of these, the former necessitates the ranking of all features, while the latter involves filtering out all irrelevant features based on some threshold value. In this regard, several feature selection methods with well-documented capabilities and limitations have already been proposed. Similarly, feature ranking is also nontrivial, as it requires the designation of an optimal cutoff value so as to properly select important features from a list of candidate features. However, the availability of a comprehensive feature ranking and a filtering approach, which alleviates the existing limitations and provides an efficient mechanism for achieving optimal results, is a major problem. Keeping in view these facts, we present an efficient and comprehensive univariate ensemble-based feature selection (uEFS) methodology to select informative features from an input dataset. For the uEFS methodology, we first propose a unified features scoring (UFS) algorithm to generate a final ranked list of features following a comprehensive evaluation of a feature set. For defining cutoff points to remove irrelevant features, we subsequently present a threshold value selection (TVS) algorithm to select a subset of features that are deemed important for the classifier construction. The uEFS methodology is evaluated using standard benchmark datasets. The extensive experimental results show that our proposed uEFS methodology provides competitive accuracy and achieved (1) on average around a 7% increase in f-measure, and (2) on average around a 5% increase in predictive accuracy as compared with state-of-the-art methods.
format Online
Article
Text
id pubmed-6112679
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-61126792018-09-17 uEFS: An efficient and comprehensive ensemble-based feature selection methodology to select informative features Ali, Maqbool Ali, Syed Imran Kim, Dohyeong Hur, Taeho Bang, Jaehun Lee, Sungyoung Kang, Byeong Ho Hussain, Maqbool PLoS One Research Article Feature selection is considered to be one of the most critical methods for choosing appropriate features from a larger set of items. This task requires two basic steps: ranking and filtering. Of these, the former necessitates the ranking of all features, while the latter involves filtering out all irrelevant features based on some threshold value. In this regard, several feature selection methods with well-documented capabilities and limitations have already been proposed. Similarly, feature ranking is also nontrivial, as it requires the designation of an optimal cutoff value so as to properly select important features from a list of candidate features. However, the availability of a comprehensive feature ranking and a filtering approach, which alleviates the existing limitations and provides an efficient mechanism for achieving optimal results, is a major problem. Keeping in view these facts, we present an efficient and comprehensive univariate ensemble-based feature selection (uEFS) methodology to select informative features from an input dataset. For the uEFS methodology, we first propose a unified features scoring (UFS) algorithm to generate a final ranked list of features following a comprehensive evaluation of a feature set. For defining cutoff points to remove irrelevant features, we subsequently present a threshold value selection (TVS) algorithm to select a subset of features that are deemed important for the classifier construction. The uEFS methodology is evaluated using standard benchmark datasets. The extensive experimental results show that our proposed uEFS methodology provides competitive accuracy and achieved (1) on average around a 7% increase in f-measure, and (2) on average around a 5% increase in predictive accuracy as compared with state-of-the-art methods. Public Library of Science 2018-08-28 /pmc/articles/PMC6112679/ /pubmed/30153294 http://dx.doi.org/10.1371/journal.pone.0202705 Text en © 2018 Ali et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Ali, Maqbool
Ali, Syed Imran
Kim, Dohyeong
Hur, Taeho
Bang, Jaehun
Lee, Sungyoung
Kang, Byeong Ho
Hussain, Maqbool
uEFS: An efficient and comprehensive ensemble-based feature selection methodology to select informative features
title uEFS: An efficient and comprehensive ensemble-based feature selection methodology to select informative features
title_full uEFS: An efficient and comprehensive ensemble-based feature selection methodology to select informative features
title_fullStr uEFS: An efficient and comprehensive ensemble-based feature selection methodology to select informative features
title_full_unstemmed uEFS: An efficient and comprehensive ensemble-based feature selection methodology to select informative features
title_short uEFS: An efficient and comprehensive ensemble-based feature selection methodology to select informative features
title_sort uefs: an efficient and comprehensive ensemble-based feature selection methodology to select informative features
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6112679/
https://www.ncbi.nlm.nih.gov/pubmed/30153294
http://dx.doi.org/10.1371/journal.pone.0202705
work_keys_str_mv AT alimaqbool uefsanefficientandcomprehensiveensemblebasedfeatureselectionmethodologytoselectinformativefeatures
AT alisyedimran uefsanefficientandcomprehensiveensemblebasedfeatureselectionmethodologytoselectinformativefeatures
AT kimdohyeong uefsanefficientandcomprehensiveensemblebasedfeatureselectionmethodologytoselectinformativefeatures
AT hurtaeho uefsanefficientandcomprehensiveensemblebasedfeatureselectionmethodologytoselectinformativefeatures
AT bangjaehun uefsanefficientandcomprehensiveensemblebasedfeatureselectionmethodologytoselectinformativefeatures
AT leesungyoung uefsanefficientandcomprehensiveensemblebasedfeatureselectionmethodologytoselectinformativefeatures
AT kangbyeongho uefsanefficientandcomprehensiveensemblebasedfeatureselectionmethodologytoselectinformativefeatures
AT hussainmaqbool uefsanefficientandcomprehensiveensemblebasedfeatureselectionmethodologytoselectinformativefeatures