Cargando…
Identifying communities from multiplex biological networks by randomized optimization of modularity
The identification of communities, or modules, is a common operation in the analysis of large biological networks. The Disease Module Identification DREAM challenge established a framework to evaluate clustering approaches in a biomedical context, by testing the association of communities with GWAS-...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
F1000 Research Limited
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6107982/ https://www.ncbi.nlm.nih.gov/pubmed/30210790 http://dx.doi.org/10.12688/f1000research.15486.2 |
_version_ | 1783350066251038720 |
---|---|
author | Didier, Gilles Valdeolivas, Alberto Baudot, Anaïs |
author_facet | Didier, Gilles Valdeolivas, Alberto Baudot, Anaïs |
author_sort | Didier, Gilles |
collection | PubMed |
description | The identification of communities, or modules, is a common operation in the analysis of large biological networks. The Disease Module Identification DREAM challenge established a framework to evaluate clustering approaches in a biomedical context, by testing the association of communities with GWAS-derived common trait and disease genes. We implemented here several extensions of the MolTi software that detects communities by optimizing multiplex (and monoplex) network modularity. In particular, MolTi now runs a randomized version of the Louvain algorithm, can consider edge and layer weights, and performs recursive clustering. On simulated networks, the randomization procedure clearly improves the detection of communities. On the DREAM challenge benchmark, the results strongly depend on the selected GWAS dataset and enrichment p-value threshold. However, the randomization procedure, as well as the consideration of weighted edges and layers generally increases the number of trait and disease community detected. The new version of MolTi and the scripts used for the DMI DREAM challenge are available at: https://github.com/gilles-didier/MolTi-DREAM. |
format | Online Article Text |
id | pubmed-6107982 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | F1000 Research Limited |
record_format | MEDLINE/PubMed |
spelling | pubmed-61079822018-09-11 Identifying communities from multiplex biological networks by randomized optimization of modularity Didier, Gilles Valdeolivas, Alberto Baudot, Anaïs F1000Res Method Article The identification of communities, or modules, is a common operation in the analysis of large biological networks. The Disease Module Identification DREAM challenge established a framework to evaluate clustering approaches in a biomedical context, by testing the association of communities with GWAS-derived common trait and disease genes. We implemented here several extensions of the MolTi software that detects communities by optimizing multiplex (and monoplex) network modularity. In particular, MolTi now runs a randomized version of the Louvain algorithm, can consider edge and layer weights, and performs recursive clustering. On simulated networks, the randomization procedure clearly improves the detection of communities. On the DREAM challenge benchmark, the results strongly depend on the selected GWAS dataset and enrichment p-value threshold. However, the randomization procedure, as well as the consideration of weighted edges and layers generally increases the number of trait and disease community detected. The new version of MolTi and the scripts used for the DMI DREAM challenge are available at: https://github.com/gilles-didier/MolTi-DREAM. F1000 Research Limited 2018-11-22 /pmc/articles/PMC6107982/ /pubmed/30210790 http://dx.doi.org/10.12688/f1000research.15486.2 Text en Copyright: © 2018 Didier G et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Method Article Didier, Gilles Valdeolivas, Alberto Baudot, Anaïs Identifying communities from multiplex biological networks by randomized optimization of modularity |
title | Identifying communities from multiplex biological networks by randomized optimization of modularity |
title_full | Identifying communities from multiplex biological networks by randomized optimization of modularity |
title_fullStr | Identifying communities from multiplex biological networks by randomized optimization of modularity |
title_full_unstemmed | Identifying communities from multiplex biological networks by randomized optimization of modularity |
title_short | Identifying communities from multiplex biological networks by randomized optimization of modularity |
title_sort | identifying communities from multiplex biological networks by randomized optimization of modularity |
topic | Method Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6107982/ https://www.ncbi.nlm.nih.gov/pubmed/30210790 http://dx.doi.org/10.12688/f1000research.15486.2 |
work_keys_str_mv | AT didiergilles identifyingcommunitiesfrommultiplexbiologicalnetworksbyrandomizedoptimizationofmodularity AT valdeolivasalberto identifyingcommunitiesfrommultiplexbiologicalnetworksbyrandomizedoptimizationofmodularity AT baudotanais identifyingcommunitiesfrommultiplexbiologicalnetworksbyrandomizedoptimizationofmodularity |