Cargando…

Reconstructing gene-regulatory networks from time series, knock-out data, and prior knowledge

BACKGROUND: Cellular processes are controlled by gene-regulatory networks. Several computational methods are currently used to learn the structure of gene-regulatory networks from data. This study focusses on time series gene expression and gene knock-out data in order to identify the underlying net...

Descripción completa

Detalles Bibliográficos
Autores principales: Geier, Florian, Timmer, Jens, Fleck, Christian
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1839889/
https://www.ncbi.nlm.nih.gov/pubmed/17408501
http://dx.doi.org/10.1186/1752-0509-1-11
_version_ 1782132869807734784
author Geier, Florian
Timmer, Jens
Fleck, Christian
author_facet Geier, Florian
Timmer, Jens
Fleck, Christian
author_sort Geier, Florian
collection PubMed
description BACKGROUND: Cellular processes are controlled by gene-regulatory networks. Several computational methods are currently used to learn the structure of gene-regulatory networks from data. This study focusses on time series gene expression and gene knock-out data in order to identify the underlying network structure. We compare the performance of different network reconstruction methods using synthetic data generated from an ensemble of reference networks. Data requirements as well as optimal experiments for the reconstruction of gene-regulatory networks are investigated. Additionally, the impact of prior knowledge on network reconstruction as well as the effect of unobserved cellular processes is studied. RESULTS: We identify linear Gaussian dynamic Bayesian networks and variable selection based on F-statistics as suitable methods for the reconstruction of gene-regulatory networks from time series data. Commonly used discrete dynamic Bayesian networks perform inferior and this result can be attributed to the inevitable information loss by discretization of expression data. It is shown that short time series generated under transcription factor knock-out are optimal experiments in order to reveal the structure of gene regulatory networks. Relative to the level of observational noise, we give estimates for the required amount of gene expression data in order to accurately reconstruct gene-regulatory networks. The benefit of using of prior knowledge within a Bayesian learning framework is found to be limited to conditions of small gene expression data size. Unobserved processes, like protein-protein interactions, induce dependencies between gene expression levels similar to direct transcriptional regulation. We show that these dependencies cannot be distinguished from transcription factor mediated gene regulation on the basis of gene expression data alone. CONCLUSION: Currently available data size and data quality make the reconstruction of gene networks from gene expression data a challenge. In this study, we identify an optimal type of experiment, requirements on the gene expression data quality and size as well as appropriate reconstruction methods in order to reverse engineer gene regulatory networks from time series data.
format Text
id pubmed-1839889
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-18398892007-04-02 Reconstructing gene-regulatory networks from time series, knock-out data, and prior knowledge Geier, Florian Timmer, Jens Fleck, Christian BMC Syst Biol Research Article BACKGROUND: Cellular processes are controlled by gene-regulatory networks. Several computational methods are currently used to learn the structure of gene-regulatory networks from data. This study focusses on time series gene expression and gene knock-out data in order to identify the underlying network structure. We compare the performance of different network reconstruction methods using synthetic data generated from an ensemble of reference networks. Data requirements as well as optimal experiments for the reconstruction of gene-regulatory networks are investigated. Additionally, the impact of prior knowledge on network reconstruction as well as the effect of unobserved cellular processes is studied. RESULTS: We identify linear Gaussian dynamic Bayesian networks and variable selection based on F-statistics as suitable methods for the reconstruction of gene-regulatory networks from time series data. Commonly used discrete dynamic Bayesian networks perform inferior and this result can be attributed to the inevitable information loss by discretization of expression data. It is shown that short time series generated under transcription factor knock-out are optimal experiments in order to reveal the structure of gene regulatory networks. Relative to the level of observational noise, we give estimates for the required amount of gene expression data in order to accurately reconstruct gene-regulatory networks. The benefit of using of prior knowledge within a Bayesian learning framework is found to be limited to conditions of small gene expression data size. Unobserved processes, like protein-protein interactions, induce dependencies between gene expression levels similar to direct transcriptional regulation. We show that these dependencies cannot be distinguished from transcription factor mediated gene regulation on the basis of gene expression data alone. CONCLUSION: Currently available data size and data quality make the reconstruction of gene networks from gene expression data a challenge. In this study, we identify an optimal type of experiment, requirements on the gene expression data quality and size as well as appropriate reconstruction methods in order to reverse engineer gene regulatory networks from time series data. BioMed Central 2007-02-02 /pmc/articles/PMC1839889/ /pubmed/17408501 http://dx.doi.org/10.1186/1752-0509-1-11 Text en Copyright © 2007 Geier et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Geier, Florian
Timmer, Jens
Fleck, Christian
Reconstructing gene-regulatory networks from time series, knock-out data, and prior knowledge
title Reconstructing gene-regulatory networks from time series, knock-out data, and prior knowledge
title_full Reconstructing gene-regulatory networks from time series, knock-out data, and prior knowledge
title_fullStr Reconstructing gene-regulatory networks from time series, knock-out data, and prior knowledge
title_full_unstemmed Reconstructing gene-regulatory networks from time series, knock-out data, and prior knowledge
title_short Reconstructing gene-regulatory networks from time series, knock-out data, and prior knowledge
title_sort reconstructing gene-regulatory networks from time series, knock-out data, and prior knowledge
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1839889/
https://www.ncbi.nlm.nih.gov/pubmed/17408501
http://dx.doi.org/10.1186/1752-0509-1-11
work_keys_str_mv AT geierflorian reconstructinggeneregulatorynetworksfromtimeseriesknockoutdataandpriorknowledge
AT timmerjens reconstructinggeneregulatorynetworksfromtimeseriesknockoutdataandpriorknowledge
AT fleckchristian reconstructinggeneregulatorynetworksfromtimeseriesknockoutdataandpriorknowledge