Cargando…

A Bootstrap Framework for Aggregating within and between Feature Selection Methods

In the past decade, big data has become increasingly prevalent in a large number of applications. As a result, datasets suffering from noise and redundancy issues have necessitated the use of feature selection across multiple domains. However, a common concern in feature selection is that different...

Descripción completa

Detalles Bibliográficos
Autores principales:	Salman, Reem, Alzaatreh, Ayman, Sulieman, Hana, Faisal, Shaimaa
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2021
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7914949/ https://www.ncbi.nlm.nih.gov/pubmed/33561948 http://dx.doi.org/10.3390/e23020200

_version_	1783657123372072960
author	Salman, Reem Alzaatreh, Ayman Sulieman, Hana Faisal, Shaimaa
author_facet	Salman, Reem Alzaatreh, Ayman Sulieman, Hana Faisal, Shaimaa
author_sort	Salman, Reem
collection	PubMed
description	In the past decade, big data has become increasingly prevalent in a large number of applications. As a result, datasets suffering from noise and redundancy issues have necessitated the use of feature selection across multiple domains. However, a common concern in feature selection is that different approaches can give very different results when applied to similar datasets. Aggregating the results of different selection methods helps to resolve this concern and control the diversity of selected feature subsets. In this work, we implemented a general framework for the ensemble of multiple feature selection methods. Based on diversified datasets generated from the original set of observations, we aggregated the importance scores generated by multiple feature selection techniques using two methods: the Within Aggregation Method (WAM), which refers to aggregating importance scores within a single feature selection; and the Between Aggregation Method (BAM), which refers to aggregating importance scores between multiple feature selection methods. We applied the proposed framework on 13 real datasets with diverse performances and characteristics. The experimental evaluation showed that WAM provides an effective tool for determining the best feature selection method for a given dataset. WAM has also shown greater stability than BAM in terms of identifying important features. The computational demands of the two methods appeared to be comparable. The results of this work suggest that by applying both WAM and BAM, practitioners can gain a deeper understanding of the feature selection process.
format	Online Article Text
id	pubmed-7914949
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-79149492021-03-01 A Bootstrap Framework for Aggregating within and between Feature Selection Methods Salman, Reem Alzaatreh, Ayman Sulieman, Hana Faisal, Shaimaa Entropy (Basel) Article In the past decade, big data has become increasingly prevalent in a large number of applications. As a result, datasets suffering from noise and redundancy issues have necessitated the use of feature selection across multiple domains. However, a common concern in feature selection is that different approaches can give very different results when applied to similar datasets. Aggregating the results of different selection methods helps to resolve this concern and control the diversity of selected feature subsets. In this work, we implemented a general framework for the ensemble of multiple feature selection methods. Based on diversified datasets generated from the original set of observations, we aggregated the importance scores generated by multiple feature selection techniques using two methods: the Within Aggregation Method (WAM), which refers to aggregating importance scores within a single feature selection; and the Between Aggregation Method (BAM), which refers to aggregating importance scores between multiple feature selection methods. We applied the proposed framework on 13 real datasets with diverse performances and characteristics. The experimental evaluation showed that WAM provides an effective tool for determining the best feature selection method for a given dataset. WAM has also shown greater stability than BAM in terms of identifying important features. The computational demands of the two methods appeared to be comparable. The results of this work suggest that by applying both WAM and BAM, practitioners can gain a deeper understanding of the feature selection process. MDPI 2021-02-06 /pmc/articles/PMC7914949/ /pubmed/33561948 http://dx.doi.org/10.3390/e23020200 Text en © 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Salman, Reem Alzaatreh, Ayman Sulieman, Hana Faisal, Shaimaa A Bootstrap Framework for Aggregating within and between Feature Selection Methods
title	A Bootstrap Framework for Aggregating within and between Feature Selection Methods
title_full	A Bootstrap Framework for Aggregating within and between Feature Selection Methods
title_fullStr	A Bootstrap Framework for Aggregating within and between Feature Selection Methods
title_full_unstemmed	A Bootstrap Framework for Aggregating within and between Feature Selection Methods
title_short	A Bootstrap Framework for Aggregating within and between Feature Selection Methods
title_sort	bootstrap framework for aggregating within and between feature selection methods
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7914949/ https://www.ncbi.nlm.nih.gov/pubmed/33561948 http://dx.doi.org/10.3390/e23020200
work_keys_str_mv	AT salmanreem abootstrapframeworkforaggregatingwithinandbetweenfeatureselectionmethods AT alzaatrehayman abootstrapframeworkforaggregatingwithinandbetweenfeatureselectionmethods AT suliemanhana abootstrapframeworkforaggregatingwithinandbetweenfeatureselectionmethods AT faisalshaimaa abootstrapframeworkforaggregatingwithinandbetweenfeatureselectionmethods AT salmanreem bootstrapframeworkforaggregatingwithinandbetweenfeatureselectionmethods AT alzaatrehayman bootstrapframeworkforaggregatingwithinandbetweenfeatureselectionmethods AT suliemanhana bootstrapframeworkforaggregatingwithinandbetweenfeatureselectionmethods AT faisalshaimaa bootstrapframeworkforaggregatingwithinandbetweenfeatureselectionmethods

A Bootstrap Framework for Aggregating within and between Feature Selection Methods

Ejemplares similares