Cargando…

Covariate-assisted spectral clustering

Biological and social systems consist of myriad interacting units. The interactions can be represented in the form of a graph or network. Measurements of these graphs can reveal the underlying structure of these interactions, which provides insight into the systems that generated the graphs. Moreove...

Descripción completa

Detalles Bibliográficos
Autores principales: Binkiewicz, N., Vogelstein, J. T., Rohe, K.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5793492/
https://www.ncbi.nlm.nih.gov/pubmed/29430032
http://dx.doi.org/10.1093/biomet/asx008
_version_ 1783296965390368768
author Binkiewicz, N.
Vogelstein, J. T.
Rohe, K.
author_facet Binkiewicz, N.
Vogelstein, J. T.
Rohe, K.
author_sort Binkiewicz, N.
collection PubMed
description Biological and social systems consist of myriad interacting units. The interactions can be represented in the form of a graph or network. Measurements of these graphs can reveal the underlying structure of these interactions, which provides insight into the systems that generated the graphs. Moreover, in applications such as connectomics, social networks, and genomics, graph data are accompanied by contextualizing measures on each node. We utilize these node covariates to help uncover latent communities in a graph, using a modification of spectral clustering. Statistical guarantees are provided under a joint mixture model that we call the node-contextualized stochastic blockmodel, including a bound on the misclustering rate. The bound is used to derive conditions for achieving perfect clustering. For most simulated cases, covariate-assisted spectral clustering yields results superior both to regularized spectral clustering without node covariates and to an adaptation of canonical correlation analysis. We apply our clustering method to large brain graphs derived from diffusion MRI data, using the node locations or neurological region membership as covariates. In both cases, covariate-assisted spectral clustering yields clusters that are easier to interpret neurologically.
format Online
Article
Text
id pubmed-5793492
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-57934922018-06-01 Covariate-assisted spectral clustering Binkiewicz, N. Vogelstein, J. T. Rohe, K. Biometrika Articles Biological and social systems consist of myriad interacting units. The interactions can be represented in the form of a graph or network. Measurements of these graphs can reveal the underlying structure of these interactions, which provides insight into the systems that generated the graphs. Moreover, in applications such as connectomics, social networks, and genomics, graph data are accompanied by contextualizing measures on each node. We utilize these node covariates to help uncover latent communities in a graph, using a modification of spectral clustering. Statistical guarantees are provided under a joint mixture model that we call the node-contextualized stochastic blockmodel, including a bound on the misclustering rate. The bound is used to derive conditions for achieving perfect clustering. For most simulated cases, covariate-assisted spectral clustering yields results superior both to regularized spectral clustering without node covariates and to an adaptation of canonical correlation analysis. We apply our clustering method to large brain graphs derived from diffusion MRI data, using the node locations or neurological region membership as covariates. In both cases, covariate-assisted spectral clustering yields clusters that are easier to interpret neurologically. Oxford University Press 2017-06 2017-03-19 /pmc/articles/PMC5793492/ /pubmed/29430032 http://dx.doi.org/10.1093/biomet/asx008 Text en © 2017 Biometrika Trust
spellingShingle Articles
Binkiewicz, N.
Vogelstein, J. T.
Rohe, K.
Covariate-assisted spectral clustering
title Covariate-assisted spectral clustering
title_full Covariate-assisted spectral clustering
title_fullStr Covariate-assisted spectral clustering
title_full_unstemmed Covariate-assisted spectral clustering
title_short Covariate-assisted spectral clustering
title_sort covariate-assisted spectral clustering
topic Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5793492/
https://www.ncbi.nlm.nih.gov/pubmed/29430032
http://dx.doi.org/10.1093/biomet/asx008
work_keys_str_mv AT binkiewiczn covariateassistedspectralclustering
AT vogelsteinjt covariateassistedspectralclustering
AT rohek covariateassistedspectralclustering