Cargando…

Reconstruction of large-scale regulatory networks based on perturbation graphs and transitive reduction: improved methods and their evaluation

BACKGROUND: The data-driven inference of intracellular networks is one of the key challenges of computational and systems biology. As suggested by recent works, a simple yet effective approach for reconstructing regulatory networks comprises the following two steps. First, the observed effects induc...

Descripción completa

Detalles Bibliográficos
Autores principales: Pinna, Andrea, Heise, Sandra, Flassig, Robert J, Fuente, Alberto de la, Klamt, Steffen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4231426/
https://www.ncbi.nlm.nih.gov/pubmed/23924435
http://dx.doi.org/10.1186/1752-0509-7-73
_version_ 1782344442892517376
author Pinna, Andrea
Heise, Sandra
Flassig, Robert J
Fuente, Alberto de la
Klamt, Steffen
author_facet Pinna, Andrea
Heise, Sandra
Flassig, Robert J
Fuente, Alberto de la
Klamt, Steffen
author_sort Pinna, Andrea
collection PubMed
description BACKGROUND: The data-driven inference of intracellular networks is one of the key challenges of computational and systems biology. As suggested by recent works, a simple yet effective approach for reconstructing regulatory networks comprises the following two steps. First, the observed effects induced by directed perturbations are collected in a signed and directed perturbation graph (PG). In a second step, Transitive Reduction (TR) is used to identify and eliminate those edges in the PG that can be explained by paths and are therefore likely to reflect indirect effects. RESULTS: In this work we introduce novel variants for PG generation and TR, leading to significantly improved performances. The key modifications concern: (i) use of novel statistical criteria for deriving a high-quality PG from experimental data; (ii) the application of local TR which allows only short paths to explain (and remove) a given edge; and (iii) a novel strategy to rank the edges with respect to their confidence. To compare the new methods with existing ones we not only apply them to a recent DREAM network inference challenge but also to a novel and unprecedented synthetic compendium consisting of 30 5000-gene networks simulated with varying biological and measurement error variances resulting in a total of 270 datasets. The benchmarks clearly demonstrate the superior reconstruction performance of the novel PG and TR variants compared to existing approaches. Moreover, the benchmark enabled us to draw some general conclusions. For example, it turns out that local TR restricted to paths with a length of only two is often sufficient or even favorable. We also demonstrate that considering edge weights is highly beneficial for TR whereas consideration of edge signs is of minor importance. We explain these observations from a graph-theoretical perspective and discuss the consequences with respect to a greatly reduced computational demand to conduct TR. Finally, as a realistic application scenario, we use our framework for inferring gene interactions in yeast based on a library of gene expression data measured in mutants with single knockouts of transcription factors. The reconstructed network shows a significant enrichment of known interactions, especially within the 100 most confident (and for experimental validation most relevant) edges. CONCLUSIONS: This paper presents several major achievements. The novel methods introduced herein can be seen as state of the art for inference techniques relying on perturbation graphs and transitive reduction. Another key result of the study is the generation of a new and unprecedented large-scale in silico benchmark dataset accounting for different noise levels and providing a solid basis for unbiased testing of network inference methodologies. Finally, applying our approach to Saccharomyces cerevisiae suggested several new gene interactions with high confidence awaiting experimental validation.
format Online
Article
Text
id pubmed-4231426
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-42314262014-11-18 Reconstruction of large-scale regulatory networks based on perturbation graphs and transitive reduction: improved methods and their evaluation Pinna, Andrea Heise, Sandra Flassig, Robert J Fuente, Alberto de la Klamt, Steffen BMC Syst Biol Methodology Article BACKGROUND: The data-driven inference of intracellular networks is one of the key challenges of computational and systems biology. As suggested by recent works, a simple yet effective approach for reconstructing regulatory networks comprises the following two steps. First, the observed effects induced by directed perturbations are collected in a signed and directed perturbation graph (PG). In a second step, Transitive Reduction (TR) is used to identify and eliminate those edges in the PG that can be explained by paths and are therefore likely to reflect indirect effects. RESULTS: In this work we introduce novel variants for PG generation and TR, leading to significantly improved performances. The key modifications concern: (i) use of novel statistical criteria for deriving a high-quality PG from experimental data; (ii) the application of local TR which allows only short paths to explain (and remove) a given edge; and (iii) a novel strategy to rank the edges with respect to their confidence. To compare the new methods with existing ones we not only apply them to a recent DREAM network inference challenge but also to a novel and unprecedented synthetic compendium consisting of 30 5000-gene networks simulated with varying biological and measurement error variances resulting in a total of 270 datasets. The benchmarks clearly demonstrate the superior reconstruction performance of the novel PG and TR variants compared to existing approaches. Moreover, the benchmark enabled us to draw some general conclusions. For example, it turns out that local TR restricted to paths with a length of only two is often sufficient or even favorable. We also demonstrate that considering edge weights is highly beneficial for TR whereas consideration of edge signs is of minor importance. We explain these observations from a graph-theoretical perspective and discuss the consequences with respect to a greatly reduced computational demand to conduct TR. Finally, as a realistic application scenario, we use our framework for inferring gene interactions in yeast based on a library of gene expression data measured in mutants with single knockouts of transcription factors. The reconstructed network shows a significant enrichment of known interactions, especially within the 100 most confident (and for experimental validation most relevant) edges. CONCLUSIONS: This paper presents several major achievements. The novel methods introduced herein can be seen as state of the art for inference techniques relying on perturbation graphs and transitive reduction. Another key result of the study is the generation of a new and unprecedented large-scale in silico benchmark dataset accounting for different noise levels and providing a solid basis for unbiased testing of network inference methodologies. Finally, applying our approach to Saccharomyces cerevisiae suggested several new gene interactions with high confidence awaiting experimental validation. BioMed Central 2013-08-08 /pmc/articles/PMC4231426/ /pubmed/23924435 http://dx.doi.org/10.1186/1752-0509-7-73 Text en Copyright © 2013 Pinna et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Pinna, Andrea
Heise, Sandra
Flassig, Robert J
Fuente, Alberto de la
Klamt, Steffen
Reconstruction of large-scale regulatory networks based on perturbation graphs and transitive reduction: improved methods and their evaluation
title Reconstruction of large-scale regulatory networks based on perturbation graphs and transitive reduction: improved methods and their evaluation
title_full Reconstruction of large-scale regulatory networks based on perturbation graphs and transitive reduction: improved methods and their evaluation
title_fullStr Reconstruction of large-scale regulatory networks based on perturbation graphs and transitive reduction: improved methods and their evaluation
title_full_unstemmed Reconstruction of large-scale regulatory networks based on perturbation graphs and transitive reduction: improved methods and their evaluation
title_short Reconstruction of large-scale regulatory networks based on perturbation graphs and transitive reduction: improved methods and their evaluation
title_sort reconstruction of large-scale regulatory networks based on perturbation graphs and transitive reduction: improved methods and their evaluation
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4231426/
https://www.ncbi.nlm.nih.gov/pubmed/23924435
http://dx.doi.org/10.1186/1752-0509-7-73
work_keys_str_mv AT pinnaandrea reconstructionoflargescaleregulatorynetworksbasedonperturbationgraphsandtransitivereductionimprovedmethodsandtheirevaluation
AT heisesandra reconstructionoflargescaleregulatorynetworksbasedonperturbationgraphsandtransitivereductionimprovedmethodsandtheirevaluation
AT flassigrobertj reconstructionoflargescaleregulatorynetworksbasedonperturbationgraphsandtransitivereductionimprovedmethodsandtheirevaluation
AT fuentealbertodela reconstructionoflargescaleregulatorynetworksbasedonperturbationgraphsandtransitivereductionimprovedmethodsandtheirevaluation
AT klamtsteffen reconstructionoflargescaleregulatorynetworksbasedonperturbationgraphsandtransitivereductionimprovedmethodsandtheirevaluation