Cargando…

A framework for stability‐based module detection in correlation graphs

Graphs can be used to represent the direct and indirect relationships between variables, and elucidate complex relationships and interdependencies. Detecting structure within a graph is a challenging problem. This problem is studied over a range of fields and is sometimes termed community detection,...

Descripción completa

Detalles Bibliográficos
Autores principales: Tian, Mingmei, Blair, Rachael Hageman, Mu, Lina, Bonner, Matthew, Browne, Richard, Yu, Han
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Wiley Subscription Services, Inc., A Wiley Company 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7986843/
https://www.ncbi.nlm.nih.gov/pubmed/33777285
http://dx.doi.org/10.1002/sam.11495
_version_ 1783668520973762560
author Tian, Mingmei
Blair, Rachael Hageman
Mu, Lina
Bonner, Matthew
Browne, Richard
Yu, Han
author_facet Tian, Mingmei
Blair, Rachael Hageman
Mu, Lina
Bonner, Matthew
Browne, Richard
Yu, Han
author_sort Tian, Mingmei
collection PubMed
description Graphs can be used to represent the direct and indirect relationships between variables, and elucidate complex relationships and interdependencies. Detecting structure within a graph is a challenging problem. This problem is studied over a range of fields and is sometimes termed community detection, module detection, or graph partitioning. A popular class of algorithms for module detection relies on optimizing a function of modularity to identify the structure. In practice, graphs are often learned from the data, and thus prone to uncertainty. In these settings, the uncertainty of the network structure can become exaggerated by giving unreliable estimates of the module structure. In this work, we begin to address this challenge through the use of a nonparametric bootstrap approach to assessing the stability of module detection in a graph. Estimates of stability are presented at the level of the individual node, the inferred modules, and as an overall measure of performance for module detection in a given graph. Furthermore, bootstrap stability estimates are derived for complexity parameter selection that ultimately defines a graph from data in a way that optimizes stability. This approach is utilized in connection with correlation graphs but is generalizable to other graphs that are defined through the use of dissimilarity measures. We demonstrate our approach using a broad range of simulations and on a metabolomics dataset from the Beijing Olympics Air Pollution study. These approaches are implemented using bootcluster package that is available in the R programming language.
format Online
Article
Text
id pubmed-7986843
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Wiley Subscription Services, Inc., A Wiley Company
record_format MEDLINE/PubMed
spelling pubmed-79868432021-03-25 A framework for stability‐based module detection in correlation graphs Tian, Mingmei Blair, Rachael Hageman Mu, Lina Bonner, Matthew Browne, Richard Yu, Han Stat Anal Data Min Research Articles Graphs can be used to represent the direct and indirect relationships between variables, and elucidate complex relationships and interdependencies. Detecting structure within a graph is a challenging problem. This problem is studied over a range of fields and is sometimes termed community detection, module detection, or graph partitioning. A popular class of algorithms for module detection relies on optimizing a function of modularity to identify the structure. In practice, graphs are often learned from the data, and thus prone to uncertainty. In these settings, the uncertainty of the network structure can become exaggerated by giving unreliable estimates of the module structure. In this work, we begin to address this challenge through the use of a nonparametric bootstrap approach to assessing the stability of module detection in a graph. Estimates of stability are presented at the level of the individual node, the inferred modules, and as an overall measure of performance for module detection in a given graph. Furthermore, bootstrap stability estimates are derived for complexity parameter selection that ultimately defines a graph from data in a way that optimizes stability. This approach is utilized in connection with correlation graphs but is generalizable to other graphs that are defined through the use of dissimilarity measures. We demonstrate our approach using a broad range of simulations and on a metabolomics dataset from the Beijing Olympics Air Pollution study. These approaches are implemented using bootcluster package that is available in the R programming language. Wiley Subscription Services, Inc., A Wiley Company 2021-01-08 2021-04 /pmc/articles/PMC7986843/ /pubmed/33777285 http://dx.doi.org/10.1002/sam.11495 Text en © 2021 The Authors. Statistical Analysis and Data Mining published by Wiley Periodicals LLC. This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Articles
Tian, Mingmei
Blair, Rachael Hageman
Mu, Lina
Bonner, Matthew
Browne, Richard
Yu, Han
A framework for stability‐based module detection in correlation graphs
title A framework for stability‐based module detection in correlation graphs
title_full A framework for stability‐based module detection in correlation graphs
title_fullStr A framework for stability‐based module detection in correlation graphs
title_full_unstemmed A framework for stability‐based module detection in correlation graphs
title_short A framework for stability‐based module detection in correlation graphs
title_sort framework for stability‐based module detection in correlation graphs
topic Research Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7986843/
https://www.ncbi.nlm.nih.gov/pubmed/33777285
http://dx.doi.org/10.1002/sam.11495
work_keys_str_mv AT tianmingmei aframeworkforstabilitybasedmoduledetectionincorrelationgraphs
AT blairrachaelhageman aframeworkforstabilitybasedmoduledetectionincorrelationgraphs
AT mulina aframeworkforstabilitybasedmoduledetectionincorrelationgraphs
AT bonnermatthew aframeworkforstabilitybasedmoduledetectionincorrelationgraphs
AT brownerichard aframeworkforstabilitybasedmoduledetectionincorrelationgraphs
AT yuhan aframeworkforstabilitybasedmoduledetectionincorrelationgraphs
AT tianmingmei frameworkforstabilitybasedmoduledetectionincorrelationgraphs
AT blairrachaelhageman frameworkforstabilitybasedmoduledetectionincorrelationgraphs
AT mulina frameworkforstabilitybasedmoduledetectionincorrelationgraphs
AT bonnermatthew frameworkforstabilitybasedmoduledetectionincorrelationgraphs
AT brownerichard frameworkforstabilitybasedmoduledetectionincorrelationgraphs
AT yuhan frameworkforstabilitybasedmoduledetectionincorrelationgraphs