Cargando…
Improving feature selection performance using pairwise pre-evaluation
BACKGROUND: Biological data such as microarrays contain a huge number of features. Thus, it is necessary to select a small number of novel features to characterize the entire dataset. All combinations of the features subset must be evaluated to produce an ideal feature subset, but this is impossible...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4992252/ https://www.ncbi.nlm.nih.gov/pubmed/27544506 http://dx.doi.org/10.1186/s12859-016-1178-3 |
_version_ | 1782448984168595456 |
---|---|
author | Li, Songlu Oh, Sejong |
author_facet | Li, Songlu Oh, Sejong |
author_sort | Li, Songlu |
collection | PubMed |
description | BACKGROUND: Biological data such as microarrays contain a huge number of features. Thus, it is necessary to select a small number of novel features to characterize the entire dataset. All combinations of the features subset must be evaluated to produce an ideal feature subset, but this is impossible using currently available computing power. Feature selection or feature subset selection provides a sub-optimal solution within a reasonable amount of time. RESULTS: In this study, we propose an improved feature selection method that uses information based on all the pairwise evaluations for a given dataset. We modify the original feature selection algorithms to use pre-evaluation information. The pre-evaluation captures the quality and interactions between two features. The feature subset should be improved by using the top ranking pairs for two features in the selection process. CONCLUSIONS: Experimental results demonstrated that the proposed method improved the quality of the feature subset produced by modified feature selection algorithms. The proposed method can be applied to microarray and other high-dimensional data. |
format | Online Article Text |
id | pubmed-4992252 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-49922522016-08-31 Improving feature selection performance using pairwise pre-evaluation Li, Songlu Oh, Sejong BMC Bioinformatics Methodology Article BACKGROUND: Biological data such as microarrays contain a huge number of features. Thus, it is necessary to select a small number of novel features to characterize the entire dataset. All combinations of the features subset must be evaluated to produce an ideal feature subset, but this is impossible using currently available computing power. Feature selection or feature subset selection provides a sub-optimal solution within a reasonable amount of time. RESULTS: In this study, we propose an improved feature selection method that uses information based on all the pairwise evaluations for a given dataset. We modify the original feature selection algorithms to use pre-evaluation information. The pre-evaluation captures the quality and interactions between two features. The feature subset should be improved by using the top ranking pairs for two features in the selection process. CONCLUSIONS: Experimental results demonstrated that the proposed method improved the quality of the feature subset produced by modified feature selection algorithms. The proposed method can be applied to microarray and other high-dimensional data. BioMed Central 2016-08-20 /pmc/articles/PMC4992252/ /pubmed/27544506 http://dx.doi.org/10.1186/s12859-016-1178-3 Text en © The Author(s). 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Article Li, Songlu Oh, Sejong Improving feature selection performance using pairwise pre-evaluation |
title | Improving feature selection performance using pairwise pre-evaluation |
title_full | Improving feature selection performance using pairwise pre-evaluation |
title_fullStr | Improving feature selection performance using pairwise pre-evaluation |
title_full_unstemmed | Improving feature selection performance using pairwise pre-evaluation |
title_short | Improving feature selection performance using pairwise pre-evaluation |
title_sort | improving feature selection performance using pairwise pre-evaluation |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4992252/ https://www.ncbi.nlm.nih.gov/pubmed/27544506 http://dx.doi.org/10.1186/s12859-016-1178-3 |
work_keys_str_mv | AT lisonglu improvingfeatureselectionperformanceusingpairwisepreevaluation AT ohsejong improvingfeatureselectionperformanceusingpairwisepreevaluation |