Cargando…

Multi-task feature selection in microarray data by binary integer programming

A major challenge in microarray classification is that the number of features is typically orders of magnitude larger than the number of examples. In this paper, we propose a novel feature filter algorithm to select the feature subset with maximal discriminative power and minimal redundancy by solvi...

Descripción completa

Detalles Bibliográficos
Autores principales: Lan, Liang, Vucetic, Slobodan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4043987/
https://www.ncbi.nlm.nih.gov/pubmed/24564944
http://dx.doi.org/10.1186/1753-6561-7-S7-S5
_version_ 1782319047336001536
author Lan, Liang
Vucetic, Slobodan
author_facet Lan, Liang
Vucetic, Slobodan
author_sort Lan, Liang
collection PubMed
description A major challenge in microarray classification is that the number of features is typically orders of magnitude larger than the number of examples. In this paper, we propose a novel feature filter algorithm to select the feature subset with maximal discriminative power and minimal redundancy by solving a quadratic objective function with binary integer constraints. To improve the computational efficiency, the binary integer constraints are relaxed and a low-rank approximation to the quadratic term is applied. The proposed feature selection algorithm was extended to solve multi-task microarray classification problems. We compared the single-task version of the proposed feature selection algorithm with 9 existing feature selection methods on 4 benchmark microarray data sets. The empirical results show that the proposed method achieved the most accurate predictions overall. We also evaluated the multi-task version of the proposed algorithm on 8 multi-task microarray datasets. The multi-task feature selection algorithm resulted in significantly higher accuracy than when using the single-task feature selection methods.
format Online
Article
Text
id pubmed-4043987
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-40439872014-06-17 Multi-task feature selection in microarray data by binary integer programming Lan, Liang Vucetic, Slobodan BMC Proc Proceedings A major challenge in microarray classification is that the number of features is typically orders of magnitude larger than the number of examples. In this paper, we propose a novel feature filter algorithm to select the feature subset with maximal discriminative power and minimal redundancy by solving a quadratic objective function with binary integer constraints. To improve the computational efficiency, the binary integer constraints are relaxed and a low-rank approximation to the quadratic term is applied. The proposed feature selection algorithm was extended to solve multi-task microarray classification problems. We compared the single-task version of the proposed feature selection algorithm with 9 existing feature selection methods on 4 benchmark microarray data sets. The empirical results show that the proposed method achieved the most accurate predictions overall. We also evaluated the multi-task version of the proposed algorithm on 8 multi-task microarray datasets. The multi-task feature selection algorithm resulted in significantly higher accuracy than when using the single-task feature selection methods. BioMed Central 2013-12-20 /pmc/articles/PMC4043987/ /pubmed/24564944 http://dx.doi.org/10.1186/1753-6561-7-S7-S5 Text en Copyright © 2013 Lan and Vucetic; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Proceedings
Lan, Liang
Vucetic, Slobodan
Multi-task feature selection in microarray data by binary integer programming
title Multi-task feature selection in microarray data by binary integer programming
title_full Multi-task feature selection in microarray data by binary integer programming
title_fullStr Multi-task feature selection in microarray data by binary integer programming
title_full_unstemmed Multi-task feature selection in microarray data by binary integer programming
title_short Multi-task feature selection in microarray data by binary integer programming
title_sort multi-task feature selection in microarray data by binary integer programming
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4043987/
https://www.ncbi.nlm.nih.gov/pubmed/24564944
http://dx.doi.org/10.1186/1753-6561-7-S7-S5
work_keys_str_mv AT lanliang multitaskfeatureselectioninmicroarraydatabybinaryintegerprogramming
AT vuceticslobodan multitaskfeatureselectioninmicroarraydatabybinaryintegerprogramming