Cargando…
An Integrative Framework for Bayesian Variable Selection with Informative Priors for Identifying Genes and Pathways
The discovery of genetic or genomic markers plays a central role in the development of personalized medicine. A notable challenge exists when dealing with the high dimensionality of the data sets, as thousands of genes or millions of genetic variants are collected on a relatively small number of sub...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3700986/ https://www.ncbi.nlm.nih.gov/pubmed/23844055 http://dx.doi.org/10.1371/journal.pone.0067672 |
_version_ | 1782275575873798144 |
---|---|
author | Peng, Bin Zhu, Dianwen Ander, Bradley P. Zhang, Xiaoshuai Xue, Fuzhong Sharp, Frank R. Yang, Xiaowei |
author_facet | Peng, Bin Zhu, Dianwen Ander, Bradley P. Zhang, Xiaoshuai Xue, Fuzhong Sharp, Frank R. Yang, Xiaowei |
author_sort | Peng, Bin |
collection | PubMed |
description | The discovery of genetic or genomic markers plays a central role in the development of personalized medicine. A notable challenge exists when dealing with the high dimensionality of the data sets, as thousands of genes or millions of genetic variants are collected on a relatively small number of subjects. Traditional gene-wise selection methods using univariate analyses face difficulty to incorporate correlational, structural, or functional structures amongst the molecular measures. For microarray gene expression data, we first summarize solutions in dealing with ‘large p, small n’ problems, and then propose an integrative Bayesian variable selection (iBVS) framework for simultaneously identifying causal or marker genes and regulatory pathways. A novel partial least squares (PLS) g-prior for iBVS is developed to allow the incorporation of prior knowledge on gene-gene interactions or functional relationships. From the point view of systems biology, iBVS enables user to directly target the joint effects of multiple genes and pathways in a hierarchical modeling diagram to predict disease status or phenotype. The estimated posterior selection probabilities offer probabilitic and biological interpretations. Both simulated data and a set of microarray data in predicting stroke status are used in validating the performance of iBVS in a Probit model with binary outcomes. iBVS offers a general framework for effective discovery of various molecular biomarkers by combining data-based statistics and knowledge-based priors. Guidelines on making posterior inferences, determining Bayesian significance levels, and improving computational efficiencies are also discussed. |
format | Online Article Text |
id | pubmed-3700986 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-37009862013-07-10 An Integrative Framework for Bayesian Variable Selection with Informative Priors for Identifying Genes and Pathways Peng, Bin Zhu, Dianwen Ander, Bradley P. Zhang, Xiaoshuai Xue, Fuzhong Sharp, Frank R. Yang, Xiaowei PLoS One Research Article The discovery of genetic or genomic markers plays a central role in the development of personalized medicine. A notable challenge exists when dealing with the high dimensionality of the data sets, as thousands of genes or millions of genetic variants are collected on a relatively small number of subjects. Traditional gene-wise selection methods using univariate analyses face difficulty to incorporate correlational, structural, or functional structures amongst the molecular measures. For microarray gene expression data, we first summarize solutions in dealing with ‘large p, small n’ problems, and then propose an integrative Bayesian variable selection (iBVS) framework for simultaneously identifying causal or marker genes and regulatory pathways. A novel partial least squares (PLS) g-prior for iBVS is developed to allow the incorporation of prior knowledge on gene-gene interactions or functional relationships. From the point view of systems biology, iBVS enables user to directly target the joint effects of multiple genes and pathways in a hierarchical modeling diagram to predict disease status or phenotype. The estimated posterior selection probabilities offer probabilitic and biological interpretations. Both simulated data and a set of microarray data in predicting stroke status are used in validating the performance of iBVS in a Probit model with binary outcomes. iBVS offers a general framework for effective discovery of various molecular biomarkers by combining data-based statistics and knowledge-based priors. Guidelines on making posterior inferences, determining Bayesian significance levels, and improving computational efficiencies are also discussed. Public Library of Science 2013-07-03 /pmc/articles/PMC3700986/ /pubmed/23844055 http://dx.doi.org/10.1371/journal.pone.0067672 Text en © 2013 Peng et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Peng, Bin Zhu, Dianwen Ander, Bradley P. Zhang, Xiaoshuai Xue, Fuzhong Sharp, Frank R. Yang, Xiaowei An Integrative Framework for Bayesian Variable Selection with Informative Priors for Identifying Genes and Pathways |
title | An Integrative Framework for Bayesian Variable Selection with Informative Priors for Identifying Genes and Pathways |
title_full | An Integrative Framework for Bayesian Variable Selection with Informative Priors for Identifying Genes and Pathways |
title_fullStr | An Integrative Framework for Bayesian Variable Selection with Informative Priors for Identifying Genes and Pathways |
title_full_unstemmed | An Integrative Framework for Bayesian Variable Selection with Informative Priors for Identifying Genes and Pathways |
title_short | An Integrative Framework for Bayesian Variable Selection with Informative Priors for Identifying Genes and Pathways |
title_sort | integrative framework for bayesian variable selection with informative priors for identifying genes and pathways |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3700986/ https://www.ncbi.nlm.nih.gov/pubmed/23844055 http://dx.doi.org/10.1371/journal.pone.0067672 |
work_keys_str_mv | AT pengbin anintegrativeframeworkforbayesianvariableselectionwithinformativepriorsforidentifyinggenesandpathways AT zhudianwen anintegrativeframeworkforbayesianvariableselectionwithinformativepriorsforidentifyinggenesandpathways AT anderbradleyp anintegrativeframeworkforbayesianvariableselectionwithinformativepriorsforidentifyinggenesandpathways AT zhangxiaoshuai anintegrativeframeworkforbayesianvariableselectionwithinformativepriorsforidentifyinggenesandpathways AT xuefuzhong anintegrativeframeworkforbayesianvariableselectionwithinformativepriorsforidentifyinggenesandpathways AT sharpfrankr anintegrativeframeworkforbayesianvariableselectionwithinformativepriorsforidentifyinggenesandpathways AT yangxiaowei anintegrativeframeworkforbayesianvariableselectionwithinformativepriorsforidentifyinggenesandpathways AT pengbin integrativeframeworkforbayesianvariableselectionwithinformativepriorsforidentifyinggenesandpathways AT zhudianwen integrativeframeworkforbayesianvariableselectionwithinformativepriorsforidentifyinggenesandpathways AT anderbradleyp integrativeframeworkforbayesianvariableselectionwithinformativepriorsforidentifyinggenesandpathways AT zhangxiaoshuai integrativeframeworkforbayesianvariableselectionwithinformativepriorsforidentifyinggenesandpathways AT xuefuzhong integrativeframeworkforbayesianvariableselectionwithinformativepriorsforidentifyinggenesandpathways AT sharpfrankr integrativeframeworkforbayesianvariableselectionwithinformativepriorsforidentifyinggenesandpathways AT yangxiaowei integrativeframeworkforbayesianvariableselectionwithinformativepriorsforidentifyinggenesandpathways |