Cargando…

Identifying communities from multiplex biological networks by randomized optimization of modularity

The identification of communities, or modules, is a common operation in the analysis of large biological networks. The Disease Module Identification DREAM challenge established a framework to evaluate clustering approaches in a biomedical context, by testing the association of communities with GWAS-...

Descripción completa

Detalles Bibliográficos
Autores principales: Didier, Gilles, Valdeolivas, Alberto, Baudot, Anaïs
Formato: Online Artículo Texto
Lenguaje:English
Publicado: F1000 Research Limited 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6107982/
https://www.ncbi.nlm.nih.gov/pubmed/30210790
http://dx.doi.org/10.12688/f1000research.15486.2
_version_ 1783350066251038720
author Didier, Gilles
Valdeolivas, Alberto
Baudot, Anaïs
author_facet Didier, Gilles
Valdeolivas, Alberto
Baudot, Anaïs
author_sort Didier, Gilles
collection PubMed
description The identification of communities, or modules, is a common operation in the analysis of large biological networks. The Disease Module Identification DREAM challenge established a framework to evaluate clustering approaches in a biomedical context, by testing the association of communities with GWAS-derived common trait and disease genes. We implemented here several extensions of the MolTi software that detects communities by optimizing multiplex (and monoplex) network modularity. In particular, MolTi now runs a randomized version of the Louvain algorithm, can consider edge and layer weights, and performs recursive clustering. On simulated networks, the randomization procedure clearly improves the detection of communities. On the DREAM challenge benchmark, the results strongly depend on the selected GWAS dataset and enrichment p-value threshold. However, the randomization procedure, as well as the consideration of weighted edges and layers generally increases the number of trait and disease community detected. The new version of MolTi and the scripts used for the DMI DREAM challenge are available at: https://github.com/gilles-didier/MolTi-DREAM.
format Online
Article
Text
id pubmed-6107982
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher F1000 Research Limited
record_format MEDLINE/PubMed
spelling pubmed-61079822018-09-11 Identifying communities from multiplex biological networks by randomized optimization of modularity Didier, Gilles Valdeolivas, Alberto Baudot, Anaïs F1000Res Method Article The identification of communities, or modules, is a common operation in the analysis of large biological networks. The Disease Module Identification DREAM challenge established a framework to evaluate clustering approaches in a biomedical context, by testing the association of communities with GWAS-derived common trait and disease genes. We implemented here several extensions of the MolTi software that detects communities by optimizing multiplex (and monoplex) network modularity. In particular, MolTi now runs a randomized version of the Louvain algorithm, can consider edge and layer weights, and performs recursive clustering. On simulated networks, the randomization procedure clearly improves the detection of communities. On the DREAM challenge benchmark, the results strongly depend on the selected GWAS dataset and enrichment p-value threshold. However, the randomization procedure, as well as the consideration of weighted edges and layers generally increases the number of trait and disease community detected. The new version of MolTi and the scripts used for the DMI DREAM challenge are available at: https://github.com/gilles-didier/MolTi-DREAM. F1000 Research Limited 2018-11-22 /pmc/articles/PMC6107982/ /pubmed/30210790 http://dx.doi.org/10.12688/f1000research.15486.2 Text en Copyright: © 2018 Didier G et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Method Article
Didier, Gilles
Valdeolivas, Alberto
Baudot, Anaïs
Identifying communities from multiplex biological networks by randomized optimization of modularity
title Identifying communities from multiplex biological networks by randomized optimization of modularity
title_full Identifying communities from multiplex biological networks by randomized optimization of modularity
title_fullStr Identifying communities from multiplex biological networks by randomized optimization of modularity
title_full_unstemmed Identifying communities from multiplex biological networks by randomized optimization of modularity
title_short Identifying communities from multiplex biological networks by randomized optimization of modularity
title_sort identifying communities from multiplex biological networks by randomized optimization of modularity
topic Method Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6107982/
https://www.ncbi.nlm.nih.gov/pubmed/30210790
http://dx.doi.org/10.12688/f1000research.15486.2
work_keys_str_mv AT didiergilles identifyingcommunitiesfrommultiplexbiologicalnetworksbyrandomizedoptimizationofmodularity
AT valdeolivasalberto identifyingcommunitiesfrommultiplexbiologicalnetworksbyrandomizedoptimizationofmodularity
AT baudotanais identifyingcommunitiesfrommultiplexbiologicalnetworksbyrandomizedoptimizationofmodularity