Cargando…

A Predictive Based Regression Algorithm for Gene Network Selection

Gene selection has become a common task in most gene expression studies. The objective of such research is often to identify the smallest possible set of genes that can still achieve good predictive performance. To do so, many of the recently proposed classification methods require some form of dime...

Descripción completa

Detalles Bibliográficos
Autores principales: Guerrier, Stéphane, Mili, Nabil, Molinari, Roberto, Orso, Samuel, Avella-Medina, Marco, Ma, Yanyuan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4908120/
https://www.ncbi.nlm.nih.gov/pubmed/27379155
http://dx.doi.org/10.3389/fgene.2016.00097
_version_ 1782437624750800896
author Guerrier, Stéphane
Mili, Nabil
Molinari, Roberto
Orso, Samuel
Avella-Medina, Marco
Ma, Yanyuan
author_facet Guerrier, Stéphane
Mili, Nabil
Molinari, Roberto
Orso, Samuel
Avella-Medina, Marco
Ma, Yanyuan
author_sort Guerrier, Stéphane
collection PubMed
description Gene selection has become a common task in most gene expression studies. The objective of such research is often to identify the smallest possible set of genes that can still achieve good predictive performance. To do so, many of the recently proposed classification methods require some form of dimension-reduction of the problem which finally provide a single model as an output and, in most cases, rely on the likelihood function in order to achieve variable selection. We propose a new prediction-based objective function that can be tailored to the requirements of practitioners and can be used to assess and interpret a given problem. Based on cross-validation techniques and the idea of importance sampling, our proposal scans low-dimensional models under the assumption of sparsity and, for each of them, estimates their objective function to assess their predictive power in order to select. Two applications on cancer data sets and a simulation study show that the proposal compares favorably with competing alternatives such as, for example, Elastic Net and Support Vector Machine. Indeed, the proposed method not only selects smaller models for better, or at least comparable, classification errors but also provides a set of selected models instead of a single one, allowing to construct a network of possible models for a target prediction accuracy level.
format Online
Article
Text
id pubmed-4908120
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-49081202016-07-04 A Predictive Based Regression Algorithm for Gene Network Selection Guerrier, Stéphane Mili, Nabil Molinari, Roberto Orso, Samuel Avella-Medina, Marco Ma, Yanyuan Front Genet Genetics Gene selection has become a common task in most gene expression studies. The objective of such research is often to identify the smallest possible set of genes that can still achieve good predictive performance. To do so, many of the recently proposed classification methods require some form of dimension-reduction of the problem which finally provide a single model as an output and, in most cases, rely on the likelihood function in order to achieve variable selection. We propose a new prediction-based objective function that can be tailored to the requirements of practitioners and can be used to assess and interpret a given problem. Based on cross-validation techniques and the idea of importance sampling, our proposal scans low-dimensional models under the assumption of sparsity and, for each of them, estimates their objective function to assess their predictive power in order to select. Two applications on cancer data sets and a simulation study show that the proposal compares favorably with competing alternatives such as, for example, Elastic Net and Support Vector Machine. Indeed, the proposed method not only selects smaller models for better, or at least comparable, classification errors but also provides a set of selected models instead of a single one, allowing to construct a network of possible models for a target prediction accuracy level. Frontiers Media S.A. 2016-06-15 /pmc/articles/PMC4908120/ /pubmed/27379155 http://dx.doi.org/10.3389/fgene.2016.00097 Text en Copyright © 2016 Guerrier, Mili, Molinari, Orso, Avella-Medina and Ma. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Guerrier, Stéphane
Mili, Nabil
Molinari, Roberto
Orso, Samuel
Avella-Medina, Marco
Ma, Yanyuan
A Predictive Based Regression Algorithm for Gene Network Selection
title A Predictive Based Regression Algorithm for Gene Network Selection
title_full A Predictive Based Regression Algorithm for Gene Network Selection
title_fullStr A Predictive Based Regression Algorithm for Gene Network Selection
title_full_unstemmed A Predictive Based Regression Algorithm for Gene Network Selection
title_short A Predictive Based Regression Algorithm for Gene Network Selection
title_sort predictive based regression algorithm for gene network selection
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4908120/
https://www.ncbi.nlm.nih.gov/pubmed/27379155
http://dx.doi.org/10.3389/fgene.2016.00097
work_keys_str_mv AT guerrierstephane apredictivebasedregressionalgorithmforgenenetworkselection
AT milinabil apredictivebasedregressionalgorithmforgenenetworkselection
AT molinariroberto apredictivebasedregressionalgorithmforgenenetworkselection
AT orsosamuel apredictivebasedregressionalgorithmforgenenetworkselection
AT avellamedinamarco apredictivebasedregressionalgorithmforgenenetworkselection
AT mayanyuan apredictivebasedregressionalgorithmforgenenetworkselection
AT guerrierstephane predictivebasedregressionalgorithmforgenenetworkselection
AT milinabil predictivebasedregressionalgorithmforgenenetworkselection
AT molinariroberto predictivebasedregressionalgorithmforgenenetworkselection
AT orsosamuel predictivebasedregressionalgorithmforgenenetworkselection
AT avellamedinamarco predictivebasedregressionalgorithmforgenenetworkselection
AT mayanyuan predictivebasedregressionalgorithmforgenenetworkselection