Cargando…

Joint estimation of causal effects from observational and intervention gene expression data

BACKGROUND: In recent years, there has been great interest in using transcriptomic data to infer gene regulatory networks. For the time being, methodological development in this area has primarily made use of graphical Gaussian models for observational wild-type data, resulting in undirected graphs...

Descripción completa

Detalles Bibliográficos
Autores principales: Rau, Andrea, Jaffrézic, Florence, Nuel, Grégory
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3834107/
https://www.ncbi.nlm.nih.gov/pubmed/24172639
http://dx.doi.org/10.1186/1752-0509-7-111
_version_ 1782291935120064512
author Rau, Andrea
Jaffrézic, Florence
Nuel, Grégory
author_facet Rau, Andrea
Jaffrézic, Florence
Nuel, Grégory
author_sort Rau, Andrea
collection PubMed
description BACKGROUND: In recent years, there has been great interest in using transcriptomic data to infer gene regulatory networks. For the time being, methodological development in this area has primarily made use of graphical Gaussian models for observational wild-type data, resulting in undirected graphs that are not able to accurately highlight causal relationships among genes. In the present work, we seek to improve the estimation of causal effects among genes by jointly modeling observational transcriptomic data with arbitrarily complex intervention data obtained by performing partial, single, or multiple gene knock-outs or knock-downs. RESULTS: Using the framework of causal Gaussian Bayesian networks, we propose a Markov chain Monte Carlo algorithm with a Mallows proposal model and analytical likelihood maximization to sample from the posterior distribution of causal node orderings, and in turn, to estimate causal effects. The main advantage of the proposed algorithm over previously proposed methods is its flexibility to accommodate any kind of intervention design, including partial or multiple knock-out experiments. Using simulated data as well as data from the Dialogue for Reverse Engineering Assessments and Methods (DREAM) 2007 challenge, the proposed method was compared to two alternative approaches: one requiring a complete, single knock-out design, and one able to model only observational data. CONCLUSIONS: The proposed algorithm was found to perform as well as, and in most cases better, than the alternative methods in terms of accuracy for the estimation of causal effects. In addition, multiple knock-outs proved to contribute valuable additional information compared to single knock-outs. Finally, the simulation study confirmed that it is not possible to estimate the causal ordering of genes from observational data alone. In all cases, we found that the inclusion of intervention experiments enabled more accurate estimation of causal regulatory relationships than the use of wild-type data alone.
format Online
Article
Text
id pubmed-3834107
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-38341072013-11-25 Joint estimation of causal effects from observational and intervention gene expression data Rau, Andrea Jaffrézic, Florence Nuel, Grégory BMC Syst Biol Methodology Article BACKGROUND: In recent years, there has been great interest in using transcriptomic data to infer gene regulatory networks. For the time being, methodological development in this area has primarily made use of graphical Gaussian models for observational wild-type data, resulting in undirected graphs that are not able to accurately highlight causal relationships among genes. In the present work, we seek to improve the estimation of causal effects among genes by jointly modeling observational transcriptomic data with arbitrarily complex intervention data obtained by performing partial, single, or multiple gene knock-outs or knock-downs. RESULTS: Using the framework of causal Gaussian Bayesian networks, we propose a Markov chain Monte Carlo algorithm with a Mallows proposal model and analytical likelihood maximization to sample from the posterior distribution of causal node orderings, and in turn, to estimate causal effects. The main advantage of the proposed algorithm over previously proposed methods is its flexibility to accommodate any kind of intervention design, including partial or multiple knock-out experiments. Using simulated data as well as data from the Dialogue for Reverse Engineering Assessments and Methods (DREAM) 2007 challenge, the proposed method was compared to two alternative approaches: one requiring a complete, single knock-out design, and one able to model only observational data. CONCLUSIONS: The proposed algorithm was found to perform as well as, and in most cases better, than the alternative methods in terms of accuracy for the estimation of causal effects. In addition, multiple knock-outs proved to contribute valuable additional information compared to single knock-outs. Finally, the simulation study confirmed that it is not possible to estimate the causal ordering of genes from observational data alone. In all cases, we found that the inclusion of intervention experiments enabled more accurate estimation of causal regulatory relationships than the use of wild-type data alone. BioMed Central 2013-10-31 /pmc/articles/PMC3834107/ /pubmed/24172639 http://dx.doi.org/10.1186/1752-0509-7-111 Text en Copyright © 2013 Rau et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Rau, Andrea
Jaffrézic, Florence
Nuel, Grégory
Joint estimation of causal effects from observational and intervention gene expression data
title Joint estimation of causal effects from observational and intervention gene expression data
title_full Joint estimation of causal effects from observational and intervention gene expression data
title_fullStr Joint estimation of causal effects from observational and intervention gene expression data
title_full_unstemmed Joint estimation of causal effects from observational and intervention gene expression data
title_short Joint estimation of causal effects from observational and intervention gene expression data
title_sort joint estimation of causal effects from observational and intervention gene expression data
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3834107/
https://www.ncbi.nlm.nih.gov/pubmed/24172639
http://dx.doi.org/10.1186/1752-0509-7-111
work_keys_str_mv AT rauandrea jointestimationofcausaleffectsfromobservationalandinterventiongeneexpressiondata
AT jaffrezicflorence jointestimationofcausaleffectsfromobservationalandinterventiongeneexpressiondata
AT nuelgregory jointestimationofcausaleffectsfromobservationalandinterventiongeneexpressiondata