Cargando…

Simultaneous Clustering and Estimation of Heterogeneous Graphical Models

We consider joint estimation of multiple graphical models arising from heterogeneous and high-dimensional observations. Unlike most previous approaches which assume that the cluster structure is given in advance, an appealing feature of our method is to learn cluster structure while estimating heter...

Descripción completa

Detalles Bibliográficos
Autores principales: Hao, Botao, Sun, Will Wei, Liu, Yufeng, Cheng, Guang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6338433/
https://www.ncbi.nlm.nih.gov/pubmed/30662373
_version_ 1783388450501689344
author Hao, Botao
Sun, Will Wei
Liu, Yufeng
Cheng, Guang
author_facet Hao, Botao
Sun, Will Wei
Liu, Yufeng
Cheng, Guang
author_sort Hao, Botao
collection PubMed
description We consider joint estimation of multiple graphical models arising from heterogeneous and high-dimensional observations. Unlike most previous approaches which assume that the cluster structure is given in advance, an appealing feature of our method is to learn cluster structure while estimating heterogeneous graphical models. This is achieved via a high dimensional version of Expectation Conditional Maximization (ECM) algorithm (Meng and Rubin, 1993). A joint graphical lasso penalty is imposed on the conditional maximization step to extract both homogeneity and heterogeneity components across all clusters. Our algorithm is computationally efficient due to fast sparse learning routines and can be implemented without unsupervised learning knowledge. The superior performance of our method is demonstrated by extensive experiments and its application to a Glioblastoma cancer dataset reveals some new insights in understanding the Glioblastoma cancer. In theory, a non-asymptotic error bound is established for the output directly from our high dimensional ECM algorithm, and it consists of two quantities: statistical error (statistical accuracy) and optimization error (computational complexity). Such a result gives a theoretical guideline in terminating our ECM iterations.
format Online
Article
Text
id pubmed-6338433
institution National Center for Biotechnology Information
language English
publishDate 2018
record_format MEDLINE/PubMed
spelling pubmed-63384332019-01-18 Simultaneous Clustering and Estimation of Heterogeneous Graphical Models Hao, Botao Sun, Will Wei Liu, Yufeng Cheng, Guang J Mach Learn Res Article We consider joint estimation of multiple graphical models arising from heterogeneous and high-dimensional observations. Unlike most previous approaches which assume that the cluster structure is given in advance, an appealing feature of our method is to learn cluster structure while estimating heterogeneous graphical models. This is achieved via a high dimensional version of Expectation Conditional Maximization (ECM) algorithm (Meng and Rubin, 1993). A joint graphical lasso penalty is imposed on the conditional maximization step to extract both homogeneity and heterogeneity components across all clusters. Our algorithm is computationally efficient due to fast sparse learning routines and can be implemented without unsupervised learning knowledge. The superior performance of our method is demonstrated by extensive experiments and its application to a Glioblastoma cancer dataset reveals some new insights in understanding the Glioblastoma cancer. In theory, a non-asymptotic error bound is established for the output directly from our high dimensional ECM algorithm, and it consists of two quantities: statistical error (statistical accuracy) and optimization error (computational complexity). Such a result gives a theoretical guideline in terminating our ECM iterations. 2018-04 /pmc/articles/PMC6338433/ /pubmed/30662373 Text en License: CC-BY 4.0, see https://creativecommons.org/licenses/by/4.0/. Attribution requirements are provided at http://jmlr.org/papers/v18/17-019.html.
spellingShingle Article
Hao, Botao
Sun, Will Wei
Liu, Yufeng
Cheng, Guang
Simultaneous Clustering and Estimation of Heterogeneous Graphical Models
title Simultaneous Clustering and Estimation of Heterogeneous Graphical Models
title_full Simultaneous Clustering and Estimation of Heterogeneous Graphical Models
title_fullStr Simultaneous Clustering and Estimation of Heterogeneous Graphical Models
title_full_unstemmed Simultaneous Clustering and Estimation of Heterogeneous Graphical Models
title_short Simultaneous Clustering and Estimation of Heterogeneous Graphical Models
title_sort simultaneous clustering and estimation of heterogeneous graphical models
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6338433/
https://www.ncbi.nlm.nih.gov/pubmed/30662373
work_keys_str_mv AT haobotao simultaneousclusteringandestimationofheterogeneousgraphicalmodels
AT sunwillwei simultaneousclusteringandestimationofheterogeneousgraphicalmodels
AT liuyufeng simultaneousclusteringandestimationofheterogeneousgraphicalmodels
AT chengguang simultaneousclusteringandestimationofheterogeneousgraphicalmodels