Cargando…

Boosting Probabilistic Graphical Model Inference by Incorporating Prior Knowledge from Multiple Sources

Inferring regulatory networks from experimental data via probabilistic graphical models is a popular framework to gain insights into biological systems. However, the inherent noise in experimental data coupled with a limited sample size reduces the performance of network reverse engineering. Prior k...

Descripción completa

Detalles Bibliográficos
Autores principales: Praveen, Paurush, Fröhlich, Holger
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3691143/
https://www.ncbi.nlm.nih.gov/pubmed/23826291
http://dx.doi.org/10.1371/journal.pone.0067410
_version_ 1782274422758965248
author Praveen, Paurush
Fröhlich, Holger
author_facet Praveen, Paurush
Fröhlich, Holger
author_sort Praveen, Paurush
collection PubMed
description Inferring regulatory networks from experimental data via probabilistic graphical models is a popular framework to gain insights into biological systems. However, the inherent noise in experimental data coupled with a limited sample size reduces the performance of network reverse engineering. Prior knowledge from existing sources of biological information can address this low signal to noise problem by biasing the network inference towards biologically plausible network structures. Although integrating various sources of information is desirable, their heterogeneous nature makes this task challenging. We propose two computational methods to incorporate various information sources into a probabilistic consensus structure prior to be used in graphical model inference. Our first model, called Latent Factor Model (LFM), assumes a high degree of correlation among external information sources and reconstructs a hidden variable as a common source in a Bayesian manner. The second model, a Noisy-OR, picks up the strongest support for an interaction among information sources in a probabilistic fashion. Our extensive computational studies on KEGG signaling pathways as well as on gene expression data from breast cancer and yeast heat shock response reveal that both approaches can significantly enhance the reconstruction accuracy of Bayesian Networks compared to other competing methods as well as to the situation without any prior. Our framework allows for using diverse information sources, like pathway databases, GO terms and protein domain data, etc. and is flexible enough to integrate new sources, if available.
format Online
Article
Text
id pubmed-3691143
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-36911432013-07-03 Boosting Probabilistic Graphical Model Inference by Incorporating Prior Knowledge from Multiple Sources Praveen, Paurush Fröhlich, Holger PLoS One Research Article Inferring regulatory networks from experimental data via probabilistic graphical models is a popular framework to gain insights into biological systems. However, the inherent noise in experimental data coupled with a limited sample size reduces the performance of network reverse engineering. Prior knowledge from existing sources of biological information can address this low signal to noise problem by biasing the network inference towards biologically plausible network structures. Although integrating various sources of information is desirable, their heterogeneous nature makes this task challenging. We propose two computational methods to incorporate various information sources into a probabilistic consensus structure prior to be used in graphical model inference. Our first model, called Latent Factor Model (LFM), assumes a high degree of correlation among external information sources and reconstructs a hidden variable as a common source in a Bayesian manner. The second model, a Noisy-OR, picks up the strongest support for an interaction among information sources in a probabilistic fashion. Our extensive computational studies on KEGG signaling pathways as well as on gene expression data from breast cancer and yeast heat shock response reveal that both approaches can significantly enhance the reconstruction accuracy of Bayesian Networks compared to other competing methods as well as to the situation without any prior. Our framework allows for using diverse information sources, like pathway databases, GO terms and protein domain data, etc. and is flexible enough to integrate new sources, if available. Public Library of Science 2013-06-24 /pmc/articles/PMC3691143/ /pubmed/23826291 http://dx.doi.org/10.1371/journal.pone.0067410 Text en © 2013 Praveen, Fröhlich http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Praveen, Paurush
Fröhlich, Holger
Boosting Probabilistic Graphical Model Inference by Incorporating Prior Knowledge from Multiple Sources
title Boosting Probabilistic Graphical Model Inference by Incorporating Prior Knowledge from Multiple Sources
title_full Boosting Probabilistic Graphical Model Inference by Incorporating Prior Knowledge from Multiple Sources
title_fullStr Boosting Probabilistic Graphical Model Inference by Incorporating Prior Knowledge from Multiple Sources
title_full_unstemmed Boosting Probabilistic Graphical Model Inference by Incorporating Prior Knowledge from Multiple Sources
title_short Boosting Probabilistic Graphical Model Inference by Incorporating Prior Knowledge from Multiple Sources
title_sort boosting probabilistic graphical model inference by incorporating prior knowledge from multiple sources
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3691143/
https://www.ncbi.nlm.nih.gov/pubmed/23826291
http://dx.doi.org/10.1371/journal.pone.0067410
work_keys_str_mv AT praveenpaurush boostingprobabilisticgraphicalmodelinferencebyincorporatingpriorknowledgefrommultiplesources
AT frohlichholger boostingprobabilisticgraphicalmodelinferencebyincorporatingpriorknowledgefrommultiplesources