Cargando…

An Integrative Framework for Bayesian Variable Selection with Informative Priors for Identifying Genes and Pathways

The discovery of genetic or genomic markers plays a central role in the development of personalized medicine. A notable challenge exists when dealing with the high dimensionality of the data sets, as thousands of genes or millions of genetic variants are collected on a relatively small number of sub...

Descripción completa

Detalles Bibliográficos
Autores principales: Peng, Bin, Zhu, Dianwen, Ander, Bradley P., Zhang, Xiaoshuai, Xue, Fuzhong, Sharp, Frank R., Yang, Xiaowei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3700986/
https://www.ncbi.nlm.nih.gov/pubmed/23844055
http://dx.doi.org/10.1371/journal.pone.0067672
_version_ 1782275575873798144
author Peng, Bin
Zhu, Dianwen
Ander, Bradley P.
Zhang, Xiaoshuai
Xue, Fuzhong
Sharp, Frank R.
Yang, Xiaowei
author_facet Peng, Bin
Zhu, Dianwen
Ander, Bradley P.
Zhang, Xiaoshuai
Xue, Fuzhong
Sharp, Frank R.
Yang, Xiaowei
author_sort Peng, Bin
collection PubMed
description The discovery of genetic or genomic markers plays a central role in the development of personalized medicine. A notable challenge exists when dealing with the high dimensionality of the data sets, as thousands of genes or millions of genetic variants are collected on a relatively small number of subjects. Traditional gene-wise selection methods using univariate analyses face difficulty to incorporate correlational, structural, or functional structures amongst the molecular measures. For microarray gene expression data, we first summarize solutions in dealing with ‘large p, small n’ problems, and then propose an integrative Bayesian variable selection (iBVS) framework for simultaneously identifying causal or marker genes and regulatory pathways. A novel partial least squares (PLS) g-prior for iBVS is developed to allow the incorporation of prior knowledge on gene-gene interactions or functional relationships. From the point view of systems biology, iBVS enables user to directly target the joint effects of multiple genes and pathways in a hierarchical modeling diagram to predict disease status or phenotype. The estimated posterior selection probabilities offer probabilitic and biological interpretations. Both simulated data and a set of microarray data in predicting stroke status are used in validating the performance of iBVS in a Probit model with binary outcomes. iBVS offers a general framework for effective discovery of various molecular biomarkers by combining data-based statistics and knowledge-based priors. Guidelines on making posterior inferences, determining Bayesian significance levels, and improving computational efficiencies are also discussed.
format Online
Article
Text
id pubmed-3700986
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-37009862013-07-10 An Integrative Framework for Bayesian Variable Selection with Informative Priors for Identifying Genes and Pathways Peng, Bin Zhu, Dianwen Ander, Bradley P. Zhang, Xiaoshuai Xue, Fuzhong Sharp, Frank R. Yang, Xiaowei PLoS One Research Article The discovery of genetic or genomic markers plays a central role in the development of personalized medicine. A notable challenge exists when dealing with the high dimensionality of the data sets, as thousands of genes or millions of genetic variants are collected on a relatively small number of subjects. Traditional gene-wise selection methods using univariate analyses face difficulty to incorporate correlational, structural, or functional structures amongst the molecular measures. For microarray gene expression data, we first summarize solutions in dealing with ‘large p, small n’ problems, and then propose an integrative Bayesian variable selection (iBVS) framework for simultaneously identifying causal or marker genes and regulatory pathways. A novel partial least squares (PLS) g-prior for iBVS is developed to allow the incorporation of prior knowledge on gene-gene interactions or functional relationships. From the point view of systems biology, iBVS enables user to directly target the joint effects of multiple genes and pathways in a hierarchical modeling diagram to predict disease status or phenotype. The estimated posterior selection probabilities offer probabilitic and biological interpretations. Both simulated data and a set of microarray data in predicting stroke status are used in validating the performance of iBVS in a Probit model with binary outcomes. iBVS offers a general framework for effective discovery of various molecular biomarkers by combining data-based statistics and knowledge-based priors. Guidelines on making posterior inferences, determining Bayesian significance levels, and improving computational efficiencies are also discussed. Public Library of Science 2013-07-03 /pmc/articles/PMC3700986/ /pubmed/23844055 http://dx.doi.org/10.1371/journal.pone.0067672 Text en © 2013 Peng et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Peng, Bin
Zhu, Dianwen
Ander, Bradley P.
Zhang, Xiaoshuai
Xue, Fuzhong
Sharp, Frank R.
Yang, Xiaowei
An Integrative Framework for Bayesian Variable Selection with Informative Priors for Identifying Genes and Pathways
title An Integrative Framework for Bayesian Variable Selection with Informative Priors for Identifying Genes and Pathways
title_full An Integrative Framework for Bayesian Variable Selection with Informative Priors for Identifying Genes and Pathways
title_fullStr An Integrative Framework for Bayesian Variable Selection with Informative Priors for Identifying Genes and Pathways
title_full_unstemmed An Integrative Framework for Bayesian Variable Selection with Informative Priors for Identifying Genes and Pathways
title_short An Integrative Framework for Bayesian Variable Selection with Informative Priors for Identifying Genes and Pathways
title_sort integrative framework for bayesian variable selection with informative priors for identifying genes and pathways
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3700986/
https://www.ncbi.nlm.nih.gov/pubmed/23844055
http://dx.doi.org/10.1371/journal.pone.0067672
work_keys_str_mv AT pengbin anintegrativeframeworkforbayesianvariableselectionwithinformativepriorsforidentifyinggenesandpathways
AT zhudianwen anintegrativeframeworkforbayesianvariableselectionwithinformativepriorsforidentifyinggenesandpathways
AT anderbradleyp anintegrativeframeworkforbayesianvariableselectionwithinformativepriorsforidentifyinggenesandpathways
AT zhangxiaoshuai anintegrativeframeworkforbayesianvariableselectionwithinformativepriorsforidentifyinggenesandpathways
AT xuefuzhong anintegrativeframeworkforbayesianvariableselectionwithinformativepriorsforidentifyinggenesandpathways
AT sharpfrankr anintegrativeframeworkforbayesianvariableselectionwithinformativepriorsforidentifyinggenesandpathways
AT yangxiaowei anintegrativeframeworkforbayesianvariableselectionwithinformativepriorsforidentifyinggenesandpathways
AT pengbin integrativeframeworkforbayesianvariableselectionwithinformativepriorsforidentifyinggenesandpathways
AT zhudianwen integrativeframeworkforbayesianvariableselectionwithinformativepriorsforidentifyinggenesandpathways
AT anderbradleyp integrativeframeworkforbayesianvariableselectionwithinformativepriorsforidentifyinggenesandpathways
AT zhangxiaoshuai integrativeframeworkforbayesianvariableselectionwithinformativepriorsforidentifyinggenesandpathways
AT xuefuzhong integrativeframeworkforbayesianvariableselectionwithinformativepriorsforidentifyinggenesandpathways
AT sharpfrankr integrativeframeworkforbayesianvariableselectionwithinformativepriorsforidentifyinggenesandpathways
AT yangxiaowei integrativeframeworkforbayesianvariableselectionwithinformativepriorsforidentifyinggenesandpathways