Cargando…

High-order dynamic Bayesian Network learning with hidden common causes for causal gene regulatory network

BACKGROUND: Inferring gene regulatory network (GRN) has been an important topic in Bioinformatics. Many computational methods infer the GRN from high-throughput expression data. Due to the presence of time delays in the regulatory relationships, High-Order Dynamic Bayesian Network (HO-DBN) is a good...

Descripción completa

Detalles Bibliográficos
Autores principales: Lo, Leung-Yau, Wong, Man-Leung, Lee, Kin-Hong, Leung, Kwong-Sak
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4659244/
https://www.ncbi.nlm.nih.gov/pubmed/26608050
http://dx.doi.org/10.1186/s12859-015-0823-6
_version_ 1782402600471101440
author Lo, Leung-Yau
Wong, Man-Leung
Lee, Kin-Hong
Leung, Kwong-Sak
author_facet Lo, Leung-Yau
Wong, Man-Leung
Lee, Kin-Hong
Leung, Kwong-Sak
author_sort Lo, Leung-Yau
collection PubMed
description BACKGROUND: Inferring gene regulatory network (GRN) has been an important topic in Bioinformatics. Many computational methods infer the GRN from high-throughput expression data. Due to the presence of time delays in the regulatory relationships, High-Order Dynamic Bayesian Network (HO-DBN) is a good model of GRN. However, previous GRN inference methods assume causal sufficiency, i.e. no unobserved common cause. This assumption is convenient but unrealistic, because it is possible that relevant factors have not even been conceived of and therefore un-measured. Therefore an inference method that also handles hidden common cause(s) is highly desirable. Also, previous methods for discovering hidden common causes either do not handle multi-step time delays or restrict that the parents of hidden common causes are not observed genes. RESULTS: We have developed a discrete HO-DBN learning algorithm that can infer also hidden common cause(s) from discrete time series expression data, with some assumptions on the conditional distribution, but is less restrictive than previous methods. We assume that each hidden variable has only observed variables as children and parents, with at least two children and possibly no parents. We also make the simplifying assumption that children of hidden variable(s) are not linked to each other. Moreover, our proposed algorithm can also utilize multiple short time series (not necessarily of the same length), as long time series are difficult to obtain. CONCLUSIONS: We have performed extensive experiments using synthetic data on GRNs of size up to 100, with up to 10 hidden nodes. Experiment results show that our proposed algorithm can recover the causal GRNs adequately given the incomplete data. Using the limited real expression data and small subnetworks of the YEASTRACT network, we have also demonstrated the potential of our algorithm on real data, though more time series expression data is needed.
format Online
Article
Text
id pubmed-4659244
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-46592442015-11-26 High-order dynamic Bayesian Network learning with hidden common causes for causal gene regulatory network Lo, Leung-Yau Wong, Man-Leung Lee, Kin-Hong Leung, Kwong-Sak BMC Bioinformatics Research Article BACKGROUND: Inferring gene regulatory network (GRN) has been an important topic in Bioinformatics. Many computational methods infer the GRN from high-throughput expression data. Due to the presence of time delays in the regulatory relationships, High-Order Dynamic Bayesian Network (HO-DBN) is a good model of GRN. However, previous GRN inference methods assume causal sufficiency, i.e. no unobserved common cause. This assumption is convenient but unrealistic, because it is possible that relevant factors have not even been conceived of and therefore un-measured. Therefore an inference method that also handles hidden common cause(s) is highly desirable. Also, previous methods for discovering hidden common causes either do not handle multi-step time delays or restrict that the parents of hidden common causes are not observed genes. RESULTS: We have developed a discrete HO-DBN learning algorithm that can infer also hidden common cause(s) from discrete time series expression data, with some assumptions on the conditional distribution, but is less restrictive than previous methods. We assume that each hidden variable has only observed variables as children and parents, with at least two children and possibly no parents. We also make the simplifying assumption that children of hidden variable(s) are not linked to each other. Moreover, our proposed algorithm can also utilize multiple short time series (not necessarily of the same length), as long time series are difficult to obtain. CONCLUSIONS: We have performed extensive experiments using synthetic data on GRNs of size up to 100, with up to 10 hidden nodes. Experiment results show that our proposed algorithm can recover the causal GRNs adequately given the incomplete data. Using the limited real expression data and small subnetworks of the YEASTRACT network, we have also demonstrated the potential of our algorithm on real data, though more time series expression data is needed. BioMed Central 2015-11-25 /pmc/articles/PMC4659244/ /pubmed/26608050 http://dx.doi.org/10.1186/s12859-015-0823-6 Text en © Lo et al. 2015 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Lo, Leung-Yau
Wong, Man-Leung
Lee, Kin-Hong
Leung, Kwong-Sak
High-order dynamic Bayesian Network learning with hidden common causes for causal gene regulatory network
title High-order dynamic Bayesian Network learning with hidden common causes for causal gene regulatory network
title_full High-order dynamic Bayesian Network learning with hidden common causes for causal gene regulatory network
title_fullStr High-order dynamic Bayesian Network learning with hidden common causes for causal gene regulatory network
title_full_unstemmed High-order dynamic Bayesian Network learning with hidden common causes for causal gene regulatory network
title_short High-order dynamic Bayesian Network learning with hidden common causes for causal gene regulatory network
title_sort high-order dynamic bayesian network learning with hidden common causes for causal gene regulatory network
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4659244/
https://www.ncbi.nlm.nih.gov/pubmed/26608050
http://dx.doi.org/10.1186/s12859-015-0823-6
work_keys_str_mv AT loleungyau highorderdynamicbayesiannetworklearningwithhiddencommoncausesforcausalgeneregulatorynetwork
AT wongmanleung highorderdynamicbayesiannetworklearningwithhiddencommoncausesforcausalgeneregulatorynetwork
AT leekinhong highorderdynamicbayesiannetworklearningwithhiddencommoncausesforcausalgeneregulatorynetwork
AT leungkwongsak highorderdynamicbayesiannetworklearningwithhiddencommoncausesforcausalgeneregulatorynetwork