Cargando…

Improving feature selection performance using pairwise pre-evaluation

BACKGROUND: Biological data such as microarrays contain a huge number of features. Thus, it is necessary to select a small number of novel features to characterize the entire dataset. All combinations of the features subset must be evaluated to produce an ideal feature subset, but this is impossible...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Songlu, Oh, Sejong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4992252/
https://www.ncbi.nlm.nih.gov/pubmed/27544506
http://dx.doi.org/10.1186/s12859-016-1178-3
_version_ 1782448984168595456
author Li, Songlu
Oh, Sejong
author_facet Li, Songlu
Oh, Sejong
author_sort Li, Songlu
collection PubMed
description BACKGROUND: Biological data such as microarrays contain a huge number of features. Thus, it is necessary to select a small number of novel features to characterize the entire dataset. All combinations of the features subset must be evaluated to produce an ideal feature subset, but this is impossible using currently available computing power. Feature selection or feature subset selection provides a sub-optimal solution within a reasonable amount of time. RESULTS: In this study, we propose an improved feature selection method that uses information based on all the pairwise evaluations for a given dataset. We modify the original feature selection algorithms to use pre-evaluation information. The pre-evaluation captures the quality and interactions between two features. The feature subset should be improved by using the top ranking pairs for two features in the selection process. CONCLUSIONS: Experimental results demonstrated that the proposed method improved the quality of the feature subset produced by modified feature selection algorithms. The proposed method can be applied to microarray and other high-dimensional data.
format Online
Article
Text
id pubmed-4992252
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-49922522016-08-31 Improving feature selection performance using pairwise pre-evaluation Li, Songlu Oh, Sejong BMC Bioinformatics Methodology Article BACKGROUND: Biological data such as microarrays contain a huge number of features. Thus, it is necessary to select a small number of novel features to characterize the entire dataset. All combinations of the features subset must be evaluated to produce an ideal feature subset, but this is impossible using currently available computing power. Feature selection or feature subset selection provides a sub-optimal solution within a reasonable amount of time. RESULTS: In this study, we propose an improved feature selection method that uses information based on all the pairwise evaluations for a given dataset. We modify the original feature selection algorithms to use pre-evaluation information. The pre-evaluation captures the quality and interactions between two features. The feature subset should be improved by using the top ranking pairs for two features in the selection process. CONCLUSIONS: Experimental results demonstrated that the proposed method improved the quality of the feature subset produced by modified feature selection algorithms. The proposed method can be applied to microarray and other high-dimensional data. BioMed Central 2016-08-20 /pmc/articles/PMC4992252/ /pubmed/27544506 http://dx.doi.org/10.1186/s12859-016-1178-3 Text en © The Author(s). 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Li, Songlu
Oh, Sejong
Improving feature selection performance using pairwise pre-evaluation
title Improving feature selection performance using pairwise pre-evaluation
title_full Improving feature selection performance using pairwise pre-evaluation
title_fullStr Improving feature selection performance using pairwise pre-evaluation
title_full_unstemmed Improving feature selection performance using pairwise pre-evaluation
title_short Improving feature selection performance using pairwise pre-evaluation
title_sort improving feature selection performance using pairwise pre-evaluation
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4992252/
https://www.ncbi.nlm.nih.gov/pubmed/27544506
http://dx.doi.org/10.1186/s12859-016-1178-3
work_keys_str_mv AT lisonglu improvingfeatureselectionperformanceusingpairwisepreevaluation
AT ohsejong improvingfeatureselectionperformanceusingpairwisepreevaluation