Cargando…
Boosting Probabilistic Graphical Model Inference by Incorporating Prior Knowledge from Multiple Sources
Inferring regulatory networks from experimental data via probabilistic graphical models is a popular framework to gain insights into biological systems. However, the inherent noise in experimental data coupled with a limited sample size reduces the performance of network reverse engineering. Prior k...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3691143/ https://www.ncbi.nlm.nih.gov/pubmed/23826291 http://dx.doi.org/10.1371/journal.pone.0067410 |
_version_ | 1782274422758965248 |
---|---|
author | Praveen, Paurush Fröhlich, Holger |
author_facet | Praveen, Paurush Fröhlich, Holger |
author_sort | Praveen, Paurush |
collection | PubMed |
description | Inferring regulatory networks from experimental data via probabilistic graphical models is a popular framework to gain insights into biological systems. However, the inherent noise in experimental data coupled with a limited sample size reduces the performance of network reverse engineering. Prior knowledge from existing sources of biological information can address this low signal to noise problem by biasing the network inference towards biologically plausible network structures. Although integrating various sources of information is desirable, their heterogeneous nature makes this task challenging. We propose two computational methods to incorporate various information sources into a probabilistic consensus structure prior to be used in graphical model inference. Our first model, called Latent Factor Model (LFM), assumes a high degree of correlation among external information sources and reconstructs a hidden variable as a common source in a Bayesian manner. The second model, a Noisy-OR, picks up the strongest support for an interaction among information sources in a probabilistic fashion. Our extensive computational studies on KEGG signaling pathways as well as on gene expression data from breast cancer and yeast heat shock response reveal that both approaches can significantly enhance the reconstruction accuracy of Bayesian Networks compared to other competing methods as well as to the situation without any prior. Our framework allows for using diverse information sources, like pathway databases, GO terms and protein domain data, etc. and is flexible enough to integrate new sources, if available. |
format | Online Article Text |
id | pubmed-3691143 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-36911432013-07-03 Boosting Probabilistic Graphical Model Inference by Incorporating Prior Knowledge from Multiple Sources Praveen, Paurush Fröhlich, Holger PLoS One Research Article Inferring regulatory networks from experimental data via probabilistic graphical models is a popular framework to gain insights into biological systems. However, the inherent noise in experimental data coupled with a limited sample size reduces the performance of network reverse engineering. Prior knowledge from existing sources of biological information can address this low signal to noise problem by biasing the network inference towards biologically plausible network structures. Although integrating various sources of information is desirable, their heterogeneous nature makes this task challenging. We propose two computational methods to incorporate various information sources into a probabilistic consensus structure prior to be used in graphical model inference. Our first model, called Latent Factor Model (LFM), assumes a high degree of correlation among external information sources and reconstructs a hidden variable as a common source in a Bayesian manner. The second model, a Noisy-OR, picks up the strongest support for an interaction among information sources in a probabilistic fashion. Our extensive computational studies on KEGG signaling pathways as well as on gene expression data from breast cancer and yeast heat shock response reveal that both approaches can significantly enhance the reconstruction accuracy of Bayesian Networks compared to other competing methods as well as to the situation without any prior. Our framework allows for using diverse information sources, like pathway databases, GO terms and protein domain data, etc. and is flexible enough to integrate new sources, if available. Public Library of Science 2013-06-24 /pmc/articles/PMC3691143/ /pubmed/23826291 http://dx.doi.org/10.1371/journal.pone.0067410 Text en © 2013 Praveen, Fröhlich http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Praveen, Paurush Fröhlich, Holger Boosting Probabilistic Graphical Model Inference by Incorporating Prior Knowledge from Multiple Sources |
title | Boosting Probabilistic Graphical Model Inference by Incorporating Prior Knowledge from Multiple Sources |
title_full | Boosting Probabilistic Graphical Model Inference by Incorporating Prior Knowledge from Multiple Sources |
title_fullStr | Boosting Probabilistic Graphical Model Inference by Incorporating Prior Knowledge from Multiple Sources |
title_full_unstemmed | Boosting Probabilistic Graphical Model Inference by Incorporating Prior Knowledge from Multiple Sources |
title_short | Boosting Probabilistic Graphical Model Inference by Incorporating Prior Knowledge from Multiple Sources |
title_sort | boosting probabilistic graphical model inference by incorporating prior knowledge from multiple sources |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3691143/ https://www.ncbi.nlm.nih.gov/pubmed/23826291 http://dx.doi.org/10.1371/journal.pone.0067410 |
work_keys_str_mv | AT praveenpaurush boostingprobabilisticgraphicalmodelinferencebyincorporatingpriorknowledgefrommultiplesources AT frohlichholger boostingprobabilisticgraphicalmodelinferencebyincorporatingpriorknowledgefrommultiplesources |