Cargando…
Reconstruction of large-scale regulatory networks based on perturbation graphs and transitive reduction: improved methods and their evaluation
BACKGROUND: The data-driven inference of intracellular networks is one of the key challenges of computational and systems biology. As suggested by recent works, a simple yet effective approach for reconstructing regulatory networks comprises the following two steps. First, the observed effects induc...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4231426/ https://www.ncbi.nlm.nih.gov/pubmed/23924435 http://dx.doi.org/10.1186/1752-0509-7-73 |
_version_ | 1782344442892517376 |
---|---|
author | Pinna, Andrea Heise, Sandra Flassig, Robert J Fuente, Alberto de la Klamt, Steffen |
author_facet | Pinna, Andrea Heise, Sandra Flassig, Robert J Fuente, Alberto de la Klamt, Steffen |
author_sort | Pinna, Andrea |
collection | PubMed |
description | BACKGROUND: The data-driven inference of intracellular networks is one of the key challenges of computational and systems biology. As suggested by recent works, a simple yet effective approach for reconstructing regulatory networks comprises the following two steps. First, the observed effects induced by directed perturbations are collected in a signed and directed perturbation graph (PG). In a second step, Transitive Reduction (TR) is used to identify and eliminate those edges in the PG that can be explained by paths and are therefore likely to reflect indirect effects. RESULTS: In this work we introduce novel variants for PG generation and TR, leading to significantly improved performances. The key modifications concern: (i) use of novel statistical criteria for deriving a high-quality PG from experimental data; (ii) the application of local TR which allows only short paths to explain (and remove) a given edge; and (iii) a novel strategy to rank the edges with respect to their confidence. To compare the new methods with existing ones we not only apply them to a recent DREAM network inference challenge but also to a novel and unprecedented synthetic compendium consisting of 30 5000-gene networks simulated with varying biological and measurement error variances resulting in a total of 270 datasets. The benchmarks clearly demonstrate the superior reconstruction performance of the novel PG and TR variants compared to existing approaches. Moreover, the benchmark enabled us to draw some general conclusions. For example, it turns out that local TR restricted to paths with a length of only two is often sufficient or even favorable. We also demonstrate that considering edge weights is highly beneficial for TR whereas consideration of edge signs is of minor importance. We explain these observations from a graph-theoretical perspective and discuss the consequences with respect to a greatly reduced computational demand to conduct TR. Finally, as a realistic application scenario, we use our framework for inferring gene interactions in yeast based on a library of gene expression data measured in mutants with single knockouts of transcription factors. The reconstructed network shows a significant enrichment of known interactions, especially within the 100 most confident (and for experimental validation most relevant) edges. CONCLUSIONS: This paper presents several major achievements. The novel methods introduced herein can be seen as state of the art for inference techniques relying on perturbation graphs and transitive reduction. Another key result of the study is the generation of a new and unprecedented large-scale in silico benchmark dataset accounting for different noise levels and providing a solid basis for unbiased testing of network inference methodologies. Finally, applying our approach to Saccharomyces cerevisiae suggested several new gene interactions with high confidence awaiting experimental validation. |
format | Online Article Text |
id | pubmed-4231426 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-42314262014-11-18 Reconstruction of large-scale regulatory networks based on perturbation graphs and transitive reduction: improved methods and their evaluation Pinna, Andrea Heise, Sandra Flassig, Robert J Fuente, Alberto de la Klamt, Steffen BMC Syst Biol Methodology Article BACKGROUND: The data-driven inference of intracellular networks is one of the key challenges of computational and systems biology. As suggested by recent works, a simple yet effective approach for reconstructing regulatory networks comprises the following two steps. First, the observed effects induced by directed perturbations are collected in a signed and directed perturbation graph (PG). In a second step, Transitive Reduction (TR) is used to identify and eliminate those edges in the PG that can be explained by paths and are therefore likely to reflect indirect effects. RESULTS: In this work we introduce novel variants for PG generation and TR, leading to significantly improved performances. The key modifications concern: (i) use of novel statistical criteria for deriving a high-quality PG from experimental data; (ii) the application of local TR which allows only short paths to explain (and remove) a given edge; and (iii) a novel strategy to rank the edges with respect to their confidence. To compare the new methods with existing ones we not only apply them to a recent DREAM network inference challenge but also to a novel and unprecedented synthetic compendium consisting of 30 5000-gene networks simulated with varying biological and measurement error variances resulting in a total of 270 datasets. The benchmarks clearly demonstrate the superior reconstruction performance of the novel PG and TR variants compared to existing approaches. Moreover, the benchmark enabled us to draw some general conclusions. For example, it turns out that local TR restricted to paths with a length of only two is often sufficient or even favorable. We also demonstrate that considering edge weights is highly beneficial for TR whereas consideration of edge signs is of minor importance. We explain these observations from a graph-theoretical perspective and discuss the consequences with respect to a greatly reduced computational demand to conduct TR. Finally, as a realistic application scenario, we use our framework for inferring gene interactions in yeast based on a library of gene expression data measured in mutants with single knockouts of transcription factors. The reconstructed network shows a significant enrichment of known interactions, especially within the 100 most confident (and for experimental validation most relevant) edges. CONCLUSIONS: This paper presents several major achievements. The novel methods introduced herein can be seen as state of the art for inference techniques relying on perturbation graphs and transitive reduction. Another key result of the study is the generation of a new and unprecedented large-scale in silico benchmark dataset accounting for different noise levels and providing a solid basis for unbiased testing of network inference methodologies. Finally, applying our approach to Saccharomyces cerevisiae suggested several new gene interactions with high confidence awaiting experimental validation. BioMed Central 2013-08-08 /pmc/articles/PMC4231426/ /pubmed/23924435 http://dx.doi.org/10.1186/1752-0509-7-73 Text en Copyright © 2013 Pinna et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methodology Article Pinna, Andrea Heise, Sandra Flassig, Robert J Fuente, Alberto de la Klamt, Steffen Reconstruction of large-scale regulatory networks based on perturbation graphs and transitive reduction: improved methods and their evaluation |
title | Reconstruction of large-scale regulatory networks based on perturbation graphs and transitive reduction: improved methods and their evaluation |
title_full | Reconstruction of large-scale regulatory networks based on perturbation graphs and transitive reduction: improved methods and their evaluation |
title_fullStr | Reconstruction of large-scale regulatory networks based on perturbation graphs and transitive reduction: improved methods and their evaluation |
title_full_unstemmed | Reconstruction of large-scale regulatory networks based on perturbation graphs and transitive reduction: improved methods and their evaluation |
title_short | Reconstruction of large-scale regulatory networks based on perturbation graphs and transitive reduction: improved methods and their evaluation |
title_sort | reconstruction of large-scale regulatory networks based on perturbation graphs and transitive reduction: improved methods and their evaluation |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4231426/ https://www.ncbi.nlm.nih.gov/pubmed/23924435 http://dx.doi.org/10.1186/1752-0509-7-73 |
work_keys_str_mv | AT pinnaandrea reconstructionoflargescaleregulatorynetworksbasedonperturbationgraphsandtransitivereductionimprovedmethodsandtheirevaluation AT heisesandra reconstructionoflargescaleregulatorynetworksbasedonperturbationgraphsandtransitivereductionimprovedmethodsandtheirevaluation AT flassigrobertj reconstructionoflargescaleregulatorynetworksbasedonperturbationgraphsandtransitivereductionimprovedmethodsandtheirevaluation AT fuentealbertodela reconstructionoflargescaleregulatorynetworksbasedonperturbationgraphsandtransitivereductionimprovedmethodsandtheirevaluation AT klamtsteffen reconstructionoflargescaleregulatorynetworksbasedonperturbationgraphsandtransitivereductionimprovedmethodsandtheirevaluation |