Cargando…
Reconstructing gene-regulatory networks from time series, knock-out data, and prior knowledge
BACKGROUND: Cellular processes are controlled by gene-regulatory networks. Several computational methods are currently used to learn the structure of gene-regulatory networks from data. This study focusses on time series gene expression and gene knock-out data in order to identify the underlying net...
Autores principales: | , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2007
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1839889/ https://www.ncbi.nlm.nih.gov/pubmed/17408501 http://dx.doi.org/10.1186/1752-0509-1-11 |
_version_ | 1782132869807734784 |
---|---|
author | Geier, Florian Timmer, Jens Fleck, Christian |
author_facet | Geier, Florian Timmer, Jens Fleck, Christian |
author_sort | Geier, Florian |
collection | PubMed |
description | BACKGROUND: Cellular processes are controlled by gene-regulatory networks. Several computational methods are currently used to learn the structure of gene-regulatory networks from data. This study focusses on time series gene expression and gene knock-out data in order to identify the underlying network structure. We compare the performance of different network reconstruction methods using synthetic data generated from an ensemble of reference networks. Data requirements as well as optimal experiments for the reconstruction of gene-regulatory networks are investigated. Additionally, the impact of prior knowledge on network reconstruction as well as the effect of unobserved cellular processes is studied. RESULTS: We identify linear Gaussian dynamic Bayesian networks and variable selection based on F-statistics as suitable methods for the reconstruction of gene-regulatory networks from time series data. Commonly used discrete dynamic Bayesian networks perform inferior and this result can be attributed to the inevitable information loss by discretization of expression data. It is shown that short time series generated under transcription factor knock-out are optimal experiments in order to reveal the structure of gene regulatory networks. Relative to the level of observational noise, we give estimates for the required amount of gene expression data in order to accurately reconstruct gene-regulatory networks. The benefit of using of prior knowledge within a Bayesian learning framework is found to be limited to conditions of small gene expression data size. Unobserved processes, like protein-protein interactions, induce dependencies between gene expression levels similar to direct transcriptional regulation. We show that these dependencies cannot be distinguished from transcription factor mediated gene regulation on the basis of gene expression data alone. CONCLUSION: Currently available data size and data quality make the reconstruction of gene networks from gene expression data a challenge. In this study, we identify an optimal type of experiment, requirements on the gene expression data quality and size as well as appropriate reconstruction methods in order to reverse engineer gene regulatory networks from time series data. |
format | Text |
id | pubmed-1839889 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2007 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-18398892007-04-02 Reconstructing gene-regulatory networks from time series, knock-out data, and prior knowledge Geier, Florian Timmer, Jens Fleck, Christian BMC Syst Biol Research Article BACKGROUND: Cellular processes are controlled by gene-regulatory networks. Several computational methods are currently used to learn the structure of gene-regulatory networks from data. This study focusses on time series gene expression and gene knock-out data in order to identify the underlying network structure. We compare the performance of different network reconstruction methods using synthetic data generated from an ensemble of reference networks. Data requirements as well as optimal experiments for the reconstruction of gene-regulatory networks are investigated. Additionally, the impact of prior knowledge on network reconstruction as well as the effect of unobserved cellular processes is studied. RESULTS: We identify linear Gaussian dynamic Bayesian networks and variable selection based on F-statistics as suitable methods for the reconstruction of gene-regulatory networks from time series data. Commonly used discrete dynamic Bayesian networks perform inferior and this result can be attributed to the inevitable information loss by discretization of expression data. It is shown that short time series generated under transcription factor knock-out are optimal experiments in order to reveal the structure of gene regulatory networks. Relative to the level of observational noise, we give estimates for the required amount of gene expression data in order to accurately reconstruct gene-regulatory networks. The benefit of using of prior knowledge within a Bayesian learning framework is found to be limited to conditions of small gene expression data size. Unobserved processes, like protein-protein interactions, induce dependencies between gene expression levels similar to direct transcriptional regulation. We show that these dependencies cannot be distinguished from transcription factor mediated gene regulation on the basis of gene expression data alone. CONCLUSION: Currently available data size and data quality make the reconstruction of gene networks from gene expression data a challenge. In this study, we identify an optimal type of experiment, requirements on the gene expression data quality and size as well as appropriate reconstruction methods in order to reverse engineer gene regulatory networks from time series data. BioMed Central 2007-02-02 /pmc/articles/PMC1839889/ /pubmed/17408501 http://dx.doi.org/10.1186/1752-0509-1-11 Text en Copyright © 2007 Geier et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Geier, Florian Timmer, Jens Fleck, Christian Reconstructing gene-regulatory networks from time series, knock-out data, and prior knowledge |
title | Reconstructing gene-regulatory networks from time series, knock-out data, and prior knowledge |
title_full | Reconstructing gene-regulatory networks from time series, knock-out data, and prior knowledge |
title_fullStr | Reconstructing gene-regulatory networks from time series, knock-out data, and prior knowledge |
title_full_unstemmed | Reconstructing gene-regulatory networks from time series, knock-out data, and prior knowledge |
title_short | Reconstructing gene-regulatory networks from time series, knock-out data, and prior knowledge |
title_sort | reconstructing gene-regulatory networks from time series, knock-out data, and prior knowledge |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1839889/ https://www.ncbi.nlm.nih.gov/pubmed/17408501 http://dx.doi.org/10.1186/1752-0509-1-11 |
work_keys_str_mv | AT geierflorian reconstructinggeneregulatorynetworksfromtimeseriesknockoutdataandpriorknowledge AT timmerjens reconstructinggeneregulatorynetworksfromtimeseriesknockoutdataandpriorknowledge AT fleckchristian reconstructinggeneregulatorynetworksfromtimeseriesknockoutdataandpriorknowledge |