Cargando…
A Predictive Based Regression Algorithm for Gene Network Selection
Gene selection has become a common task in most gene expression studies. The objective of such research is often to identify the smallest possible set of genes that can still achieve good predictive performance. To do so, many of the recently proposed classification methods require some form of dime...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4908120/ https://www.ncbi.nlm.nih.gov/pubmed/27379155 http://dx.doi.org/10.3389/fgene.2016.00097 |
_version_ | 1782437624750800896 |
---|---|
author | Guerrier, Stéphane Mili, Nabil Molinari, Roberto Orso, Samuel Avella-Medina, Marco Ma, Yanyuan |
author_facet | Guerrier, Stéphane Mili, Nabil Molinari, Roberto Orso, Samuel Avella-Medina, Marco Ma, Yanyuan |
author_sort | Guerrier, Stéphane |
collection | PubMed |
description | Gene selection has become a common task in most gene expression studies. The objective of such research is often to identify the smallest possible set of genes that can still achieve good predictive performance. To do so, many of the recently proposed classification methods require some form of dimension-reduction of the problem which finally provide a single model as an output and, in most cases, rely on the likelihood function in order to achieve variable selection. We propose a new prediction-based objective function that can be tailored to the requirements of practitioners and can be used to assess and interpret a given problem. Based on cross-validation techniques and the idea of importance sampling, our proposal scans low-dimensional models under the assumption of sparsity and, for each of them, estimates their objective function to assess their predictive power in order to select. Two applications on cancer data sets and a simulation study show that the proposal compares favorably with competing alternatives such as, for example, Elastic Net and Support Vector Machine. Indeed, the proposed method not only selects smaller models for better, or at least comparable, classification errors but also provides a set of selected models instead of a single one, allowing to construct a network of possible models for a target prediction accuracy level. |
format | Online Article Text |
id | pubmed-4908120 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-49081202016-07-04 A Predictive Based Regression Algorithm for Gene Network Selection Guerrier, Stéphane Mili, Nabil Molinari, Roberto Orso, Samuel Avella-Medina, Marco Ma, Yanyuan Front Genet Genetics Gene selection has become a common task in most gene expression studies. The objective of such research is often to identify the smallest possible set of genes that can still achieve good predictive performance. To do so, many of the recently proposed classification methods require some form of dimension-reduction of the problem which finally provide a single model as an output and, in most cases, rely on the likelihood function in order to achieve variable selection. We propose a new prediction-based objective function that can be tailored to the requirements of practitioners and can be used to assess and interpret a given problem. Based on cross-validation techniques and the idea of importance sampling, our proposal scans low-dimensional models under the assumption of sparsity and, for each of them, estimates their objective function to assess their predictive power in order to select. Two applications on cancer data sets and a simulation study show that the proposal compares favorably with competing alternatives such as, for example, Elastic Net and Support Vector Machine. Indeed, the proposed method not only selects smaller models for better, or at least comparable, classification errors but also provides a set of selected models instead of a single one, allowing to construct a network of possible models for a target prediction accuracy level. Frontiers Media S.A. 2016-06-15 /pmc/articles/PMC4908120/ /pubmed/27379155 http://dx.doi.org/10.3389/fgene.2016.00097 Text en Copyright © 2016 Guerrier, Mili, Molinari, Orso, Avella-Medina and Ma. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Guerrier, Stéphane Mili, Nabil Molinari, Roberto Orso, Samuel Avella-Medina, Marco Ma, Yanyuan A Predictive Based Regression Algorithm for Gene Network Selection |
title | A Predictive Based Regression Algorithm for Gene Network Selection |
title_full | A Predictive Based Regression Algorithm for Gene Network Selection |
title_fullStr | A Predictive Based Regression Algorithm for Gene Network Selection |
title_full_unstemmed | A Predictive Based Regression Algorithm for Gene Network Selection |
title_short | A Predictive Based Regression Algorithm for Gene Network Selection |
title_sort | predictive based regression algorithm for gene network selection |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4908120/ https://www.ncbi.nlm.nih.gov/pubmed/27379155 http://dx.doi.org/10.3389/fgene.2016.00097 |
work_keys_str_mv | AT guerrierstephane apredictivebasedregressionalgorithmforgenenetworkselection AT milinabil apredictivebasedregressionalgorithmforgenenetworkselection AT molinariroberto apredictivebasedregressionalgorithmforgenenetworkselection AT orsosamuel apredictivebasedregressionalgorithmforgenenetworkselection AT avellamedinamarco apredictivebasedregressionalgorithmforgenenetworkselection AT mayanyuan apredictivebasedregressionalgorithmforgenenetworkselection AT guerrierstephane predictivebasedregressionalgorithmforgenenetworkselection AT milinabil predictivebasedregressionalgorithmforgenenetworkselection AT molinariroberto predictivebasedregressionalgorithmforgenenetworkselection AT orsosamuel predictivebasedregressionalgorithmforgenenetworkselection AT avellamedinamarco predictivebasedregressionalgorithmforgenenetworkselection AT mayanyuan predictivebasedregressionalgorithmforgenenetworkselection |