Cargando…
Multi-task feature selection in microarray data by binary integer programming
A major challenge in microarray classification is that the number of features is typically orders of magnitude larger than the number of examples. In this paper, we propose a novel feature filter algorithm to select the feature subset with maximal discriminative power and minimal redundancy by solvi...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4043987/ https://www.ncbi.nlm.nih.gov/pubmed/24564944 http://dx.doi.org/10.1186/1753-6561-7-S7-S5 |
_version_ | 1782319047336001536 |
---|---|
author | Lan, Liang Vucetic, Slobodan |
author_facet | Lan, Liang Vucetic, Slobodan |
author_sort | Lan, Liang |
collection | PubMed |
description | A major challenge in microarray classification is that the number of features is typically orders of magnitude larger than the number of examples. In this paper, we propose a novel feature filter algorithm to select the feature subset with maximal discriminative power and minimal redundancy by solving a quadratic objective function with binary integer constraints. To improve the computational efficiency, the binary integer constraints are relaxed and a low-rank approximation to the quadratic term is applied. The proposed feature selection algorithm was extended to solve multi-task microarray classification problems. We compared the single-task version of the proposed feature selection algorithm with 9 existing feature selection methods on 4 benchmark microarray data sets. The empirical results show that the proposed method achieved the most accurate predictions overall. We also evaluated the multi-task version of the proposed algorithm on 8 multi-task microarray datasets. The multi-task feature selection algorithm resulted in significantly higher accuracy than when using the single-task feature selection methods. |
format | Online Article Text |
id | pubmed-4043987 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-40439872014-06-17 Multi-task feature selection in microarray data by binary integer programming Lan, Liang Vucetic, Slobodan BMC Proc Proceedings A major challenge in microarray classification is that the number of features is typically orders of magnitude larger than the number of examples. In this paper, we propose a novel feature filter algorithm to select the feature subset with maximal discriminative power and minimal redundancy by solving a quadratic objective function with binary integer constraints. To improve the computational efficiency, the binary integer constraints are relaxed and a low-rank approximation to the quadratic term is applied. The proposed feature selection algorithm was extended to solve multi-task microarray classification problems. We compared the single-task version of the proposed feature selection algorithm with 9 existing feature selection methods on 4 benchmark microarray data sets. The empirical results show that the proposed method achieved the most accurate predictions overall. We also evaluated the multi-task version of the proposed algorithm on 8 multi-task microarray datasets. The multi-task feature selection algorithm resulted in significantly higher accuracy than when using the single-task feature selection methods. BioMed Central 2013-12-20 /pmc/articles/PMC4043987/ /pubmed/24564944 http://dx.doi.org/10.1186/1753-6561-7-S7-S5 Text en Copyright © 2013 Lan and Vucetic; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Proceedings Lan, Liang Vucetic, Slobodan Multi-task feature selection in microarray data by binary integer programming |
title | Multi-task feature selection in microarray data by binary integer programming |
title_full | Multi-task feature selection in microarray data by binary integer programming |
title_fullStr | Multi-task feature selection in microarray data by binary integer programming |
title_full_unstemmed | Multi-task feature selection in microarray data by binary integer programming |
title_short | Multi-task feature selection in microarray data by binary integer programming |
title_sort | multi-task feature selection in microarray data by binary integer programming |
topic | Proceedings |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4043987/ https://www.ncbi.nlm.nih.gov/pubmed/24564944 http://dx.doi.org/10.1186/1753-6561-7-S7-S5 |
work_keys_str_mv | AT lanliang multitaskfeatureselectioninmicroarraydatabybinaryintegerprogramming AT vuceticslobodan multitaskfeatureselectioninmicroarraydatabybinaryintegerprogramming |