Detecting modules in biological networks by edge weight clustering and entropy significance
Detection of the modular structure of biological networks is of interest to researchers adopting a systems perspective for the analysis of omics data. Computational systems biology has provided a rich array of methods for network clustering. To date, the majority of approaches address this task thro...
Main authors: | Lecca, Paola; Re, Angela |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Frontiers Media S.A. 2015 |
Subjects: | Physiology |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4551098/ https://www.ncbi.nlm.nih.gov/pubmed/26379697 http://dx.doi.org/10.3389/fgene.2015.00265 |
_version_ | 1782387537607655424 |
---|---|
author | Lecca, Paola Re, Angela |
author_facet | Lecca, Paola Re, Angela |
author_sort | Lecca, Paola |
collection | PubMed |
description | Detection of the modular structure of biological networks is of interest to researchers adopting a systems perspective for the analysis of omics data. Computational systems biology has provided a rich array of methods for network clustering. To date, the majority of approaches address this task through a network node classification based on topological or external quantifiable properties of network nodes. Conversely, numerical properties of network edges are underused, even though the information content that can be associated with network edges has increased due to steady advances in molecular biology technology over the last decade. Properly accounting for network edges in the development of clustering approaches can be crucial for improving the quantitative interpretation of omics data, ultimately resulting in more biologically plausible models. In this study, we present a novel technique for network module detection, named WG-Cluster (Weighted Graph CLUSTERing). WG-Cluster's notable features, compared to current approaches, lie in: (1) the simultaneous exploitation of network node and edge weights to improve the biological interpretability of the connected components detected, (2) the assessment of their statistical significance, and (3) the identification of emerging topological properties in the detected connected components. WG-Cluster comprises three major steps: (i) an unsupervised version of an edge-based k-means algorithm detects sub-graphs with similar edge weights, (ii) a fast-greedy algorithm detects connected components, which are then scored and selected according to the statistical significance of their scores, and (iii) an analysis of the convolution between sub-graph mean edge weight and connected component score provides a summarizing view of the connected components. WG-Cluster can be applied to directed and undirected networks of different types of interacting entities and scales up to large omics data sets. Here, we show that WG-Cluster can be successfully used in the differential analysis of physical protein–protein interaction (PPI) networks. Specifically, applying WG-Cluster to a PPI network weighted by measurements of differential gene expression makes it possible to explore the changes in network topology under two distinct (normal vs. tumor) conditions. WG-Cluster code is available at https://sites.google.com/site/paolaleccapersonalpage/. |
format | Online Article Text |
id | pubmed-4551098 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-4551098 2015-09-14 Detecting modules in biological networks by edge weight clustering and entropy significance Lecca, Paola Re, Angela Front Genet Physiology Detection of the modular structure of biological networks is of interest to researchers adopting a systems perspective for the analysis of omics data. Computational systems biology has provided a rich array of methods for network clustering. To date, the majority of approaches address this task through a network node classification based on topological or external quantifiable properties of network nodes. Conversely, numerical properties of network edges are underused, even though the information content that can be associated with network edges has increased due to steady advances in molecular biology technology over the last decade. Properly accounting for network edges in the development of clustering approaches can be crucial for improving the quantitative interpretation of omics data, ultimately resulting in more biologically plausible models. In this study, we present a novel technique for network module detection, named WG-Cluster (Weighted Graph CLUSTERing). WG-Cluster's notable features, compared to current approaches, lie in: (1) the simultaneous exploitation of network node and edge weights to improve the biological interpretability of the connected components detected, (2) the assessment of their statistical significance, and (3) the identification of emerging topological properties in the detected connected components. WG-Cluster comprises three major steps: (i) an unsupervised version of an edge-based k-means algorithm detects sub-graphs with similar edge weights, (ii) a fast-greedy algorithm detects connected components, which are then scored and selected according to the statistical significance of their scores, and (iii) an analysis of the convolution between sub-graph mean edge weight and connected component score provides a summarizing view of the connected components. WG-Cluster can be applied to directed and undirected networks of different types of interacting entities and scales up to large omics data sets. Here, we show that WG-Cluster can be successfully used in the differential analysis of physical protein–protein interaction (PPI) networks. Specifically, applying WG-Cluster to a PPI network weighted by measurements of differential gene expression makes it possible to explore the changes in network topology under two distinct (normal vs. tumor) conditions. WG-Cluster code is available at https://sites.google.com/site/paolaleccapersonalpage/. Frontiers Media S.A. 2015-08-27 /pmc/articles/PMC4551098/ /pubmed/26379697 http://dx.doi.org/10.3389/fgene.2015.00265 Text en Copyright © 2015 Lecca and Re. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Physiology Lecca, Paola Re, Angela Detecting modules in biological networks by edge weight clustering and entropy significance |
title | Detecting modules in biological networks by edge weight clustering and entropy significance |
title_full | Detecting modules in biological networks by edge weight clustering and entropy significance |
title_fullStr | Detecting modules in biological networks by edge weight clustering and entropy significance |
title_full_unstemmed | Detecting modules in biological networks by edge weight clustering and entropy significance |
title_short | Detecting modules in biological networks by edge weight clustering and entropy significance |
title_sort | detecting modules in biological networks by edge weight clustering and entropy significance |
topic | Physiology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4551098/ https://www.ncbi.nlm.nih.gov/pubmed/26379697 http://dx.doi.org/10.3389/fgene.2015.00265 |
work_keys_str_mv | AT leccapaola detectingmodulesinbiologicalnetworksbyedgeweightclusteringandentropysignificance AT reangela detectingmodulesinbiologicalnetworksbyedgeweightclusteringandentropysignificance |
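
The first two steps named in the description (grouping edges by weight, then finding connected components within each group) can be illustrated with a minimal Python sketch. This is not the WG-Cluster code and does not reproduce the authors' method: the function names (`kmeans_1d`, `connected_components`, `weight_entropy`, `edge_weight_modules`) are hypothetical, the number of clusters `k` is supplied by hand rather than chosen automatically, a union-find pass stands in for the fast-greedy component search, and the Shannon entropy of normalised edge weights is only an assumed placeholder for the paper's scoring. The significance test and the convolution analysis are left out.

```python
import math
from collections import defaultdict


def kmeans_1d(values, k, iters=100):
    """Plain Lloyd's k-means on scalar values; returns one cluster label per value."""
    lo, hi = min(values), max(values)
    # Deterministic initialisation: k centres spread evenly over the value range.
    centres = [lo + (hi - lo) * (i + 0.5) / k for i in range(k)]
    labels = [0] * len(values)
    for _ in range(iters):
        # Assign every value to its nearest centre.
        labels = [min(range(k), key=lambda c: abs(v - centres[c])) for v in values]
        new_centres = []
        for c in range(k):
            members = [v for v, lab in zip(values, labels) if lab == c]
            # An empty cluster keeps its previous centre.
            new_centres.append(sum(members) / len(members) if members else centres[c])
        if new_centres == centres:
            break
        centres = new_centres
    return labels


def connected_components(edges):
    """Union-find over an undirected edge list; returns a list of node sets."""
    parent = {}

    def find(x):
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for u, v, _ in edges:
        parent[find(u)] = find(v)
    groups = defaultdict(set)
    for node in list(parent):
        groups[find(node)].add(node)
    return list(groups.values())


def weight_entropy(weights):
    """Shannon entropy (bits) of the normalised absolute edge weights.

    Assumed placeholder score, not the scoring used in the paper.
    """
    total = sum(abs(w) for w in weights)
    if total == 0:
        return 0.0
    probs = [abs(w) / total for w in weights if w != 0]
    return -sum(p * math.log2(p) for p in probs)


def edge_weight_modules(edges, k):
    """Group edges by weight, then split each group into connected components."""
    labels = kmeans_1d([w for _, _, w in edges], k)
    by_cluster = defaultdict(list)
    for edge, lab in zip(edges, labels):
        by_cluster[lab].append(edge)
    modules = []
    for lab, sub_edges in sorted(by_cluster.items()):
        for nodes in connected_components(sub_edges):
            # Both endpoints of an edge lie in the same component, so testing u suffices.
            ws = [w for u, _, w in sub_edges if u in nodes]
            modules.append({
                "cluster": lab,
                "nodes": nodes,
                "mean_weight": sum(ws) / len(ws),
                "entropy": weight_entropy(ws),
            })
    return modules


if __name__ == "__main__":
    # Toy weighted edge list (hypothetical data): (node, node, weight).
    toy_edges = [
        ("A", "B", 0.10), ("B", "C", 0.12), ("D", "E", 0.11),
        ("C", "D", 0.90), ("E", "F", 0.95),
    ]
    for module in edge_weight_modules(toy_edges, k=2):
        print(module)
```

Clustering on edge weights first means that two nodes can appear in more than one module when their incident edges fall into different weight groups, which is the behaviour an edge-centred approach is meant to expose.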