Cargando…

Time Delayed Causal Gene Regulatory Network Inference with Hidden Common Causes

Inferring the gene regulatory network (GRN) is crucial to understanding the working of the cell. Many computational methods attempt to infer the GRN from time series expression data, instead of through expensive and time-consuming experiments. However, existing methods make the convenient but unreal...

Descripción completa

Detalles Bibliográficos
Autores principales: Lo, Leung-Yau, Wong, Man-Leung, Lee, Kin-Hong, Leung, Kwong-Sak
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4578777/
https://www.ncbi.nlm.nih.gov/pubmed/26394325
http://dx.doi.org/10.1371/journal.pone.0138596
_version_ 1782391165239164928
author Lo, Leung-Yau
Wong, Man-Leung
Lee, Kin-Hong
Leung, Kwong-Sak
author_facet Lo, Leung-Yau
Wong, Man-Leung
Lee, Kin-Hong
Leung, Kwong-Sak
author_sort Lo, Leung-Yau
collection PubMed
description Inferring the gene regulatory network (GRN) is crucial to understanding the working of the cell. Many computational methods attempt to infer the GRN from time series expression data, instead of through expensive and time-consuming experiments. However, existing methods make the convenient but unrealistic assumption of causal sufficiency, i.e. all the relevant factors in the causal network have been observed and there are no unobserved common cause. In principle, in the real world, it is impossible to be certain that all relevant factors or common causes have been observed, because some factors may not have been conceived of, and therefore are impossible to measure. In view of this, we have developed a novel algorithm named HCC-CLINDE to infer an GRN from time series data allowing the presence of hidden common cause(s). We assume there is a sparse causal graph (possibly with cycles) of interest, where the variables are continuous and each causal link has a delay (possibly more than one time step). A small but unknown number of variables are not observed. Each unobserved variable has only observed variables as children and parents, with at least two children, and the children are not linked to each other. Since it is difficult to obtain very long time series, our algorithm is also capable of utilizing multiple short time series, which is more realistic. To our knowledge, our algorithm is far less restrictive than previous works. We have performed extensive experiments using synthetic data on GRNs of size up to 100, with up to 10 hidden nodes. The results show that our algorithm can adequately recover the true causal GRN and is robust to slight deviation from Gaussian distribution in the error terms. We have also demonstrated the potential of our algorithm on small YEASTRACT subnetworks using limited real data.
format Online
Article
Text
id pubmed-4578777
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-45787772015-10-01 Time Delayed Causal Gene Regulatory Network Inference with Hidden Common Causes Lo, Leung-Yau Wong, Man-Leung Lee, Kin-Hong Leung, Kwong-Sak PLoS One Research Article Inferring the gene regulatory network (GRN) is crucial to understanding the working of the cell. Many computational methods attempt to infer the GRN from time series expression data, instead of through expensive and time-consuming experiments. However, existing methods make the convenient but unrealistic assumption of causal sufficiency, i.e. all the relevant factors in the causal network have been observed and there are no unobserved common cause. In principle, in the real world, it is impossible to be certain that all relevant factors or common causes have been observed, because some factors may not have been conceived of, and therefore are impossible to measure. In view of this, we have developed a novel algorithm named HCC-CLINDE to infer an GRN from time series data allowing the presence of hidden common cause(s). We assume there is a sparse causal graph (possibly with cycles) of interest, where the variables are continuous and each causal link has a delay (possibly more than one time step). A small but unknown number of variables are not observed. Each unobserved variable has only observed variables as children and parents, with at least two children, and the children are not linked to each other. Since it is difficult to obtain very long time series, our algorithm is also capable of utilizing multiple short time series, which is more realistic. To our knowledge, our algorithm is far less restrictive than previous works. We have performed extensive experiments using synthetic data on GRNs of size up to 100, with up to 10 hidden nodes. The results show that our algorithm can adequately recover the true causal GRN and is robust to slight deviation from Gaussian distribution in the error terms. We have also demonstrated the potential of our algorithm on small YEASTRACT subnetworks using limited real data. Public Library of Science 2015-09-22 /pmc/articles/PMC4578777/ /pubmed/26394325 http://dx.doi.org/10.1371/journal.pone.0138596 Text en © 2015 Lo et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Lo, Leung-Yau
Wong, Man-Leung
Lee, Kin-Hong
Leung, Kwong-Sak
Time Delayed Causal Gene Regulatory Network Inference with Hidden Common Causes
title Time Delayed Causal Gene Regulatory Network Inference with Hidden Common Causes
title_full Time Delayed Causal Gene Regulatory Network Inference with Hidden Common Causes
title_fullStr Time Delayed Causal Gene Regulatory Network Inference with Hidden Common Causes
title_full_unstemmed Time Delayed Causal Gene Regulatory Network Inference with Hidden Common Causes
title_short Time Delayed Causal Gene Regulatory Network Inference with Hidden Common Causes
title_sort time delayed causal gene regulatory network inference with hidden common causes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4578777/
https://www.ncbi.nlm.nih.gov/pubmed/26394325
http://dx.doi.org/10.1371/journal.pone.0138596
work_keys_str_mv AT loleungyau timedelayedcausalgeneregulatorynetworkinferencewithhiddencommoncauses
AT wongmanleung timedelayedcausalgeneregulatorynetworkinferencewithhiddencommoncauses
AT leekinhong timedelayedcausalgeneregulatorynetworkinferencewithhiddencommoncauses
AT leungkwongsak timedelayedcausalgeneregulatorynetworkinferencewithhiddencommoncauses