Cargando…

Learning Subject-Specific Directed Acyclic Graphs With Mixed Effects Structural Equation Models From Observational Data

The identification of causal relationships between random variables from large-scale observational data using directed acyclic graphs (DAG) is highly challenging. We propose a new mixed-effects structural equation model (mSEM) framework to estimate subject-specific DAGs, where we represent joint dis...

Descripción completa

Detalles Bibliográficos
Autores principales:	Li, Xiang, Xie, Shanghong, McColgan, Peter, Tabrizi, Sarah J., Scahill, Rachael I., Zeng, Donglin, Wang, Yuanjia
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Frontiers Media S.A. 2018
Materias:	Genetics
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6176748/ https://www.ncbi.nlm.nih.gov/pubmed/30333854 http://dx.doi.org/10.3389/fgene.2018.00430

_version_	1783361748264288256
author	Li, Xiang Xie, Shanghong McColgan, Peter Tabrizi, Sarah J. Scahill, Rachael I. Zeng, Donglin Wang, Yuanjia
author_facet	Li, Xiang Xie, Shanghong McColgan, Peter Tabrizi, Sarah J. Scahill, Rachael I. Zeng, Donglin Wang, Yuanjia
author_sort	Li, Xiang
collection	PubMed
description	The identification of causal relationships between random variables from large-scale observational data using directed acyclic graphs (DAG) is highly challenging. We propose a new mixed-effects structural equation model (mSEM) framework to estimate subject-specific DAGs, where we represent joint distribution of random variables in the DAG as a set of structural causal equations with mixed effects. The directed edges between nodes depend on observed exogenous covariates on each of the individual and unobserved latent variables. The strength of the connection is decomposed into a fixed-effect term representing the average causal effect given the covariates and a random effect term representing the latent causal effect due to unobserved pathways. The advantage of such decomposition is to capture essential asymmetric structural information and heterogeneity between DAGs in order to allow for the identification of causal structure with observational data. In addition, by pooling information across subject-specific DAGs, we can identify causal structure with a high probability and estimate subject-specific networks with a high precision. We propose a penalized likelihood-based approach to handle multi-dimensionality of the DAG model. We propose a fast, iterative computational algorithm, DAG-MM, to estimate parameters in mSEM and achieve desirable sparsity by hard-thresholding the edges. We theoretically prove the identifiability of mSEM. Using simulations and an application to protein signaling data, we show substantially improved performances when compared to existing methods and consistent results with a network estimated from interventional data. Lastly, we identify gray matter atrophy networks in regions of brain from patients with Huntington's disease and corroborate our findings using white matter connectivity data collected from an independent study.
format	Online Article Text
id	pubmed-6176748
institution	National Center for Biotechnology Information
language	English
publishDate	2018
publisher	Frontiers Media S.A.
record_format	MEDLINE/PubMed
spelling	pubmed-61767482018-10-17 Learning Subject-Specific Directed Acyclic Graphs With Mixed Effects Structural Equation Models From Observational Data Li, Xiang Xie, Shanghong McColgan, Peter Tabrizi, Sarah J. Scahill, Rachael I. Zeng, Donglin Wang, Yuanjia Front Genet Genetics The identification of causal relationships between random variables from large-scale observational data using directed acyclic graphs (DAG) is highly challenging. We propose a new mixed-effects structural equation model (mSEM) framework to estimate subject-specific DAGs, where we represent joint distribution of random variables in the DAG as a set of structural causal equations with mixed effects. The directed edges between nodes depend on observed exogenous covariates on each of the individual and unobserved latent variables. The strength of the connection is decomposed into a fixed-effect term representing the average causal effect given the covariates and a random effect term representing the latent causal effect due to unobserved pathways. The advantage of such decomposition is to capture essential asymmetric structural information and heterogeneity between DAGs in order to allow for the identification of causal structure with observational data. In addition, by pooling information across subject-specific DAGs, we can identify causal structure with a high probability and estimate subject-specific networks with a high precision. We propose a penalized likelihood-based approach to handle multi-dimensionality of the DAG model. We propose a fast, iterative computational algorithm, DAG-MM, to estimate parameters in mSEM and achieve desirable sparsity by hard-thresholding the edges. We theoretically prove the identifiability of mSEM. Using simulations and an application to protein signaling data, we show substantially improved performances when compared to existing methods and consistent results with a network estimated from interventional data. Lastly, we identify gray matter atrophy networks in regions of brain from patients with Huntington's disease and corroborate our findings using white matter connectivity data collected from an independent study. Frontiers Media S.A. 2018-10-02 /pmc/articles/PMC6176748/ /pubmed/30333854 http://dx.doi.org/10.3389/fgene.2018.00430 Text en Copyright © 2018 Li, Xie, McColgan, Tabrizi, Scahill, Zeng and Wang. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle	Genetics Li, Xiang Xie, Shanghong McColgan, Peter Tabrizi, Sarah J. Scahill, Rachael I. Zeng, Donglin Wang, Yuanjia Learning Subject-Specific Directed Acyclic Graphs With Mixed Effects Structural Equation Models From Observational Data
title	Learning Subject-Specific Directed Acyclic Graphs With Mixed Effects Structural Equation Models From Observational Data
title_full	Learning Subject-Specific Directed Acyclic Graphs With Mixed Effects Structural Equation Models From Observational Data
title_fullStr	Learning Subject-Specific Directed Acyclic Graphs With Mixed Effects Structural Equation Models From Observational Data
title_full_unstemmed	Learning Subject-Specific Directed Acyclic Graphs With Mixed Effects Structural Equation Models From Observational Data
title_short	Learning Subject-Specific Directed Acyclic Graphs With Mixed Effects Structural Equation Models From Observational Data
title_sort	learning subject-specific directed acyclic graphs with mixed effects structural equation models from observational data
topic	Genetics
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6176748/ https://www.ncbi.nlm.nih.gov/pubmed/30333854 http://dx.doi.org/10.3389/fgene.2018.00430
work_keys_str_mv	AT lixiang learningsubjectspecificdirectedacyclicgraphswithmixedeffectsstructuralequationmodelsfromobservationaldata AT xieshanghong learningsubjectspecificdirectedacyclicgraphswithmixedeffectsstructuralequationmodelsfromobservationaldata AT mccolganpeter learningsubjectspecificdirectedacyclicgraphswithmixedeffectsstructuralequationmodelsfromobservationaldata AT tabrizisarahj learningsubjectspecificdirectedacyclicgraphswithmixedeffectsstructuralequationmodelsfromobservationaldata AT scahillrachaeli learningsubjectspecificdirectedacyclicgraphswithmixedeffectsstructuralequationmodelsfromobservationaldata AT zengdonglin learningsubjectspecificdirectedacyclicgraphswithmixedeffectsstructuralequationmodelsfromobservationaldata AT wangyuanjia learningsubjectspecificdirectedacyclicgraphswithmixedeffectsstructuralequationmodelsfromobservationaldata

Learning Subject-Specific Directed Acyclic Graphs With Mixed Effects Structural Equation Models From Observational Data

Ejemplares similares