Cargando…

Modularity detection in protein-protein interaction networks

BACKGROUND: Many recent studies have investigated modularity in biological networks, and its role in functional and structural characterization of constituent biomolecules. A technique that has shown considerable promise in the domain of modularity detection is the Newman and Girvan (NG) algorithm,...

Descripción completa

Detalles Bibliográficos
Autores principales: Narayanan, Tejaswini, Gersten, Merril, Subramaniam, Shankar, Grama, Ananth
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3292542/
https://www.ncbi.nlm.nih.gov/pubmed/22206604
http://dx.doi.org/10.1186/1756-0500-4-569
_version_ 1782225292516917248
author Narayanan, Tejaswini
Gersten, Merril
Subramaniam, Shankar
Grama, Ananth
author_facet Narayanan, Tejaswini
Gersten, Merril
Subramaniam, Shankar
Grama, Ananth
author_sort Narayanan, Tejaswini
collection PubMed
description BACKGROUND: Many recent studies have investigated modularity in biological networks, and its role in functional and structural characterization of constituent biomolecules. A technique that has shown considerable promise in the domain of modularity detection is the Newman and Girvan (NG) algorithm, which relies on the number of shortest-paths across pairs of vertices in the network traversing a given edge, referred to as the betweenness of that edge. The edge with the highest betweenness is iteratively eliminated from the network, with the betweenness of the remaining edges recalculated in every iteration. This generates a complete dendrogram, from which modules are extracted by applying a quality metric called modularity denoted by Q. This exhaustive computation can be prohibitively expensive for large networks such as Protein-Protein Interaction Networks. In this paper, we present a novel optimization to the modularity detection algorithm, in terms of an efficient termination criterion based on a target edge betweenness value, using which the process of iterative edge removal may be terminated. RESULTS: We validate the robustness of our approach by applying our algorithm on real-world protein-protein interaction networks of Yeast, C.Elegans and Drosophila, and demonstrate that our algorithm consistently has significant computational gains in terms of reduced runtime, when compared to the NG algorithm. Furthermore, our algorithm produces modules comparable to those from the NG algorithm, qualitatively and quantitatively. We illustrate this using comparison metrics such as module distribution, module membership cardinality, modularity Q, and Jaccard Similarity Coefficient. CONCLUSIONS: We have presented an optimized approach for efficient modularity detection in networks. The intuition driving our approach is the extraction of holistic measures of centrality from graphs, which are representative of inherent modular structure of the underlying network, and the application of those measures to efficiently guide the modularity detection process. We have empirically evaluated our approach in the specific context of real-world large scale biological networks, and have demonstrated significant savings in computational time while maintaining comparable quality of detected modules.
format Online
Article
Text
id pubmed-3292542
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-32925422012-03-05 Modularity detection in protein-protein interaction networks Narayanan, Tejaswini Gersten, Merril Subramaniam, Shankar Grama, Ananth BMC Res Notes Research Article BACKGROUND: Many recent studies have investigated modularity in biological networks, and its role in functional and structural characterization of constituent biomolecules. A technique that has shown considerable promise in the domain of modularity detection is the Newman and Girvan (NG) algorithm, which relies on the number of shortest-paths across pairs of vertices in the network traversing a given edge, referred to as the betweenness of that edge. The edge with the highest betweenness is iteratively eliminated from the network, with the betweenness of the remaining edges recalculated in every iteration. This generates a complete dendrogram, from which modules are extracted by applying a quality metric called modularity denoted by Q. This exhaustive computation can be prohibitively expensive for large networks such as Protein-Protein Interaction Networks. In this paper, we present a novel optimization to the modularity detection algorithm, in terms of an efficient termination criterion based on a target edge betweenness value, using which the process of iterative edge removal may be terminated. RESULTS: We validate the robustness of our approach by applying our algorithm on real-world protein-protein interaction networks of Yeast, C.Elegans and Drosophila, and demonstrate that our algorithm consistently has significant computational gains in terms of reduced runtime, when compared to the NG algorithm. Furthermore, our algorithm produces modules comparable to those from the NG algorithm, qualitatively and quantitatively. We illustrate this using comparison metrics such as module distribution, module membership cardinality, modularity Q, and Jaccard Similarity Coefficient. CONCLUSIONS: We have presented an optimized approach for efficient modularity detection in networks. The intuition driving our approach is the extraction of holistic measures of centrality from graphs, which are representative of inherent modular structure of the underlying network, and the application of those measures to efficiently guide the modularity detection process. We have empirically evaluated our approach in the specific context of real-world large scale biological networks, and have demonstrated significant savings in computational time while maintaining comparable quality of detected modules. BioMed Central 2011-12-29 /pmc/articles/PMC3292542/ /pubmed/22206604 http://dx.doi.org/10.1186/1756-0500-4-569 Text en Copyright ©2011 Narayanan et al; licensee BioMed Central Ltd.. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Narayanan, Tejaswini
Gersten, Merril
Subramaniam, Shankar
Grama, Ananth
Modularity detection in protein-protein interaction networks
title Modularity detection in protein-protein interaction networks
title_full Modularity detection in protein-protein interaction networks
title_fullStr Modularity detection in protein-protein interaction networks
title_full_unstemmed Modularity detection in protein-protein interaction networks
title_short Modularity detection in protein-protein interaction networks
title_sort modularity detection in protein-protein interaction networks
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3292542/
https://www.ncbi.nlm.nih.gov/pubmed/22206604
http://dx.doi.org/10.1186/1756-0500-4-569
work_keys_str_mv AT narayanantejaswini modularitydetectioninproteinproteininteractionnetworks
AT gerstenmerril modularitydetectioninproteinproteininteractionnetworks
AT subramaniamshankar modularitydetectioninproteinproteininteractionnetworks
AT gramaananth modularitydetectioninproteinproteininteractionnetworks