Cargando…

Refining gene signatures: a Bayesian approach

BACKGROUND: In high density arrays, the identification of relevant genes for disease classification is complicated by not only the curse of dimensionality but also the highly correlated nature of the array data. In this paper, we are interested in the question of how many and which genes should be s...

Descripción completa

Detalles Bibliográficos
Autores principales: Djebbari, Amira, Labbe, Aurélie
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2804684/
https://www.ncbi.nlm.nih.gov/pubmed/20003289
http://dx.doi.org/10.1186/1471-2105-10-410
_version_ 1782176174061912064
author Djebbari, Amira
Labbe, Aurélie
author_facet Djebbari, Amira
Labbe, Aurélie
author_sort Djebbari, Amira
collection PubMed
description BACKGROUND: In high density arrays, the identification of relevant genes for disease classification is complicated by not only the curse of dimensionality but also the highly correlated nature of the array data. In this paper, we are interested in the question of how many and which genes should be selected for a disease class prediction. Our work consists of a Bayesian supervised statistical learning approach to refine gene signatures with a regularization which penalizes for the correlation between the variables selected. RESULTS: Our simulation results show that we can most often recover the correct subset of genes that predict the class as compared to other methods, even when accuracy and subset size remain the same. On real microarray datasets, we show that our approach can refine gene signatures to obtain either the same or better predictive performance than other existing methods with a smaller number of genes. CONCLUSIONS: Our novel Bayesian approach includes a prior which penalizes highly correlated features in model selection and is able to extract key genes in the highly correlated context of microarray data. The methodology in the paper is described in the context of microarray data, but can be applied to any array data (such as micro RNA, for example) as a first step towards predictive modeling of cancer pathways. A user-friendly software implementation of the method is available.
format Text
id pubmed-2804684
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-28046842010-01-12 Refining gene signatures: a Bayesian approach Djebbari, Amira Labbe, Aurélie BMC Bioinformatics Research article BACKGROUND: In high density arrays, the identification of relevant genes for disease classification is complicated by not only the curse of dimensionality but also the highly correlated nature of the array data. In this paper, we are interested in the question of how many and which genes should be selected for a disease class prediction. Our work consists of a Bayesian supervised statistical learning approach to refine gene signatures with a regularization which penalizes for the correlation between the variables selected. RESULTS: Our simulation results show that we can most often recover the correct subset of genes that predict the class as compared to other methods, even when accuracy and subset size remain the same. On real microarray datasets, we show that our approach can refine gene signatures to obtain either the same or better predictive performance than other existing methods with a smaller number of genes. CONCLUSIONS: Our novel Bayesian approach includes a prior which penalizes highly correlated features in model selection and is able to extract key genes in the highly correlated context of microarray data. The methodology in the paper is described in the context of microarray data, but can be applied to any array data (such as micro RNA, for example) as a first step towards predictive modeling of cancer pathways. A user-friendly software implementation of the method is available. BioMed Central 2009-12-10 /pmc/articles/PMC2804684/ /pubmed/20003289 http://dx.doi.org/10.1186/1471-2105-10-410 Text en Copyright ©2009 Djebbari and Labbe; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research article
Djebbari, Amira
Labbe, Aurélie
Refining gene signatures: a Bayesian approach
title Refining gene signatures: a Bayesian approach
title_full Refining gene signatures: a Bayesian approach
title_fullStr Refining gene signatures: a Bayesian approach
title_full_unstemmed Refining gene signatures: a Bayesian approach
title_short Refining gene signatures: a Bayesian approach
title_sort refining gene signatures: a bayesian approach
topic Research article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2804684/
https://www.ncbi.nlm.nih.gov/pubmed/20003289
http://dx.doi.org/10.1186/1471-2105-10-410
work_keys_str_mv AT djebbariamira refininggenesignaturesabayesianapproach
AT labbeaurelie refininggenesignaturesabayesianapproach