Cargando…

A fast approach to global alignment of protein-protein interaction networks

BACKGROUND: Global network alignment has been proposed as an effective tool for computing functional orthology. Commonly used global alignment techniques such as IsoRank rely on a two-step process: the first step is an iterative diffusion-based approach for assigning similarity scores to all possibl...

Descripción completa

Detalles Bibliográficos
Autores principales: Kollias, Giorgos, Sathe, Madan, Mohammadi, Shahin, Grama, Ananth
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3672019/
https://www.ncbi.nlm.nih.gov/pubmed/23363457
http://dx.doi.org/10.1186/1756-0500-6-35
_version_ 1782272061744349184
author Kollias, Giorgos
Sathe, Madan
Mohammadi, Shahin
Grama, Ananth
author_facet Kollias, Giorgos
Sathe, Madan
Mohammadi, Shahin
Grama, Ananth
author_sort Kollias, Giorgos
collection PubMed
description BACKGROUND: Global network alignment has been proposed as an effective tool for computing functional orthology. Commonly used global alignment techniques such as IsoRank rely on a two-step process: the first step is an iterative diffusion-based approach for assigning similarity scores to all possible node pairs (matchings); the second step applies a maximum-weight bipartite matching algorithm to this similarity score matrix to identify orthologous node pairs. While demonstrably successful in identifying orthologies beyond those based on sequences, this two-step process is computationally expensive. Recent work on computation of node-pair similarity matrices has demonstrated that the computational cost of the first step can be significantly reduced. The use of these accelerated methods renders the bipartite matching step as the dominant computational cost. This motivates a critical assessment of the tradeoffs of computational cost and solution quality (matching quality, topological matches, and biological significance) associated with the bipartite matching step. In this paper we utilize the state-of-the-art core diffusion-based step in IsoRank for similarity matrix computation, and couple it with two heuristic bipartite matching algorithms – a matrix-based greedy approach, and a tunable, adaptive, auction-based matching algorithm developed by us. We then compare our implementations against the performance and quality characteristics of the solution produced by the reference IsoRank binary, which also implements an optimal matching algorithm. RESULTS: Using heuristic matching algorithms in the IsoRank pipeline exhibits dramatic speedup improvements; typically ×30 times faster for the total alignment process in most cases of interest. More surprisingly, these improvements in compute times are typically accompanied by better or comparable topological and biological quality for the network alignments generated. These measures are quantified by the number of conserved edges in the alignment graph, the percentage of enriched components, and the total number of covered Gene Ontology (GO) terms. CONCLUSIONS: We have demonstrated significant reductions in global network alignment computation times by coupling heuristic bipartite matching methods with the similarity scoring step of the IsoRank procedure. Our heuristic matching techniques maintain comparable – if not better – quality in resulting alignments. A consequence of our work is that network-alignment based orthologies can be computed within minutes (as compared to hours) on typical protein interaction networks, enabling a more comprehensive tuning of alignment parameters for refined orthologies.
format Online
Article
Text
id pubmed-3672019
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-36720192013-06-10 A fast approach to global alignment of protein-protein interaction networks Kollias, Giorgos Sathe, Madan Mohammadi, Shahin Grama, Ananth BMC Res Notes Research Article BACKGROUND: Global network alignment has been proposed as an effective tool for computing functional orthology. Commonly used global alignment techniques such as IsoRank rely on a two-step process: the first step is an iterative diffusion-based approach for assigning similarity scores to all possible node pairs (matchings); the second step applies a maximum-weight bipartite matching algorithm to this similarity score matrix to identify orthologous node pairs. While demonstrably successful in identifying orthologies beyond those based on sequences, this two-step process is computationally expensive. Recent work on computation of node-pair similarity matrices has demonstrated that the computational cost of the first step can be significantly reduced. The use of these accelerated methods renders the bipartite matching step as the dominant computational cost. This motivates a critical assessment of the tradeoffs of computational cost and solution quality (matching quality, topological matches, and biological significance) associated with the bipartite matching step. In this paper we utilize the state-of-the-art core diffusion-based step in IsoRank for similarity matrix computation, and couple it with two heuristic bipartite matching algorithms – a matrix-based greedy approach, and a tunable, adaptive, auction-based matching algorithm developed by us. We then compare our implementations against the performance and quality characteristics of the solution produced by the reference IsoRank binary, which also implements an optimal matching algorithm. RESULTS: Using heuristic matching algorithms in the IsoRank pipeline exhibits dramatic speedup improvements; typically ×30 times faster for the total alignment process in most cases of interest. More surprisingly, these improvements in compute times are typically accompanied by better or comparable topological and biological quality for the network alignments generated. These measures are quantified by the number of conserved edges in the alignment graph, the percentage of enriched components, and the total number of covered Gene Ontology (GO) terms. CONCLUSIONS: We have demonstrated significant reductions in global network alignment computation times by coupling heuristic bipartite matching methods with the similarity scoring step of the IsoRank procedure. Our heuristic matching techniques maintain comparable – if not better – quality in resulting alignments. A consequence of our work is that network-alignment based orthologies can be computed within minutes (as compared to hours) on typical protein interaction networks, enabling a more comprehensive tuning of alignment parameters for refined orthologies. BioMed Central 2013-01-31 /pmc/articles/PMC3672019/ /pubmed/23363457 http://dx.doi.org/10.1186/1756-0500-6-35 Text en Copyright © 2013 Kollias et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Kollias, Giorgos
Sathe, Madan
Mohammadi, Shahin
Grama, Ananth
A fast approach to global alignment of protein-protein interaction networks
title A fast approach to global alignment of protein-protein interaction networks
title_full A fast approach to global alignment of protein-protein interaction networks
title_fullStr A fast approach to global alignment of protein-protein interaction networks
title_full_unstemmed A fast approach to global alignment of protein-protein interaction networks
title_short A fast approach to global alignment of protein-protein interaction networks
title_sort fast approach to global alignment of protein-protein interaction networks
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3672019/
https://www.ncbi.nlm.nih.gov/pubmed/23363457
http://dx.doi.org/10.1186/1756-0500-6-35
work_keys_str_mv AT kolliasgiorgos afastapproachtoglobalalignmentofproteinproteininteractionnetworks
AT sathemadan afastapproachtoglobalalignmentofproteinproteininteractionnetworks
AT mohammadishahin afastapproachtoglobalalignmentofproteinproteininteractionnetworks
AT gramaananth afastapproachtoglobalalignmentofproteinproteininteractionnetworks
AT kolliasgiorgos fastapproachtoglobalalignmentofproteinproteininteractionnetworks
AT sathemadan fastapproachtoglobalalignmentofproteinproteininteractionnetworks
AT mohammadishahin fastapproachtoglobalalignmentofproteinproteininteractionnetworks
AT gramaananth fastapproachtoglobalalignmentofproteinproteininteractionnetworks