Cargando…

Discovering large conserved functional components in global network alignment by graph matching

BACKGROUND: Aligning protein-protein interaction (PPI) networks is very important to discover the functionally conserved sub-structures between different species. In recent years, the global PPI network alignment problem has been extensively studied aiming at finding the one-to-one alignment with th...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhu, Yuanyuan, Li, Yuezhi, Liu, Juan, Qin, Lu, Yu, Jeffrey Xu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6157291/
https://www.ncbi.nlm.nih.gov/pubmed/30255780
http://dx.doi.org/10.1186/s12864-018-5027-9
_version_ 1783358250081583104
author Zhu, Yuanyuan
Li, Yuezhi
Liu, Juan
Qin, Lu
Yu, Jeffrey Xu
author_facet Zhu, Yuanyuan
Li, Yuezhi
Liu, Juan
Qin, Lu
Yu, Jeffrey Xu
author_sort Zhu, Yuanyuan
collection PubMed
description BACKGROUND: Aligning protein-protein interaction (PPI) networks is very important to discover the functionally conserved sub-structures between different species. In recent years, the global PPI network alignment problem has been extensively studied aiming at finding the one-to-one alignment with the maximum matching score. However, finding large conserved components remains challenging due to its NP-hardness. RESULTS: We propose a new graph matching method GMAlign for global PPI network alignment. It first selects some pairs of important proteins as seeds, followed by a gradual expansion to obtain an initial matching, and then it refines the current result to obtain an optimal alignment result iteratively based on the vertex cover. We compare GMAlign with the state-of-the-art methods on the PPI network pairs obtained from the largest BioGRID dataset and validate its performance. The results show that our algorithm can produce larger size of alignment, and can find bigger and denser common connected subgraphs as well for the first time. Meanwhile, GMAlign can achieve high quality biological results, as measured by functional consistency and semantic similarity of the Gene Ontology terms. Moreover, we also show that GMAlign can achieve better results which are structurally and biologically meaningful in the detection of large conserved biological pathways between species. CONCLUSIONS: GMAlign is a novel global network alignment tool to discover large conserved functional components between PPI networks. It also has many potential biological applications such as conserved pathway and protein complex discovery across species. The GMAlign software and datasets are avaialbile at https://github.com/yzlwhu/GMAlign. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-018-5027-9) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6157291
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-61572912018-10-01 Discovering large conserved functional components in global network alignment by graph matching Zhu, Yuanyuan Li, Yuezhi Liu, Juan Qin, Lu Yu, Jeffrey Xu BMC Genomics Research BACKGROUND: Aligning protein-protein interaction (PPI) networks is very important to discover the functionally conserved sub-structures between different species. In recent years, the global PPI network alignment problem has been extensively studied aiming at finding the one-to-one alignment with the maximum matching score. However, finding large conserved components remains challenging due to its NP-hardness. RESULTS: We propose a new graph matching method GMAlign for global PPI network alignment. It first selects some pairs of important proteins as seeds, followed by a gradual expansion to obtain an initial matching, and then it refines the current result to obtain an optimal alignment result iteratively based on the vertex cover. We compare GMAlign with the state-of-the-art methods on the PPI network pairs obtained from the largest BioGRID dataset and validate its performance. The results show that our algorithm can produce larger size of alignment, and can find bigger and denser common connected subgraphs as well for the first time. Meanwhile, GMAlign can achieve high quality biological results, as measured by functional consistency and semantic similarity of the Gene Ontology terms. Moreover, we also show that GMAlign can achieve better results which are structurally and biologically meaningful in the detection of large conserved biological pathways between species. CONCLUSIONS: GMAlign is a novel global network alignment tool to discover large conserved functional components between PPI networks. It also has many potential biological applications such as conserved pathway and protein complex discovery across species. The GMAlign software and datasets are avaialbile at https://github.com/yzlwhu/GMAlign. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-018-5027-9) contains supplementary material, which is available to authorized users. BioMed Central 2018-09-24 /pmc/articles/PMC6157291/ /pubmed/30255780 http://dx.doi.org/10.1186/s12864-018-5027-9 Text en © The Author(s) 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Zhu, Yuanyuan
Li, Yuezhi
Liu, Juan
Qin, Lu
Yu, Jeffrey Xu
Discovering large conserved functional components in global network alignment by graph matching
title Discovering large conserved functional components in global network alignment by graph matching
title_full Discovering large conserved functional components in global network alignment by graph matching
title_fullStr Discovering large conserved functional components in global network alignment by graph matching
title_full_unstemmed Discovering large conserved functional components in global network alignment by graph matching
title_short Discovering large conserved functional components in global network alignment by graph matching
title_sort discovering large conserved functional components in global network alignment by graph matching
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6157291/
https://www.ncbi.nlm.nih.gov/pubmed/30255780
http://dx.doi.org/10.1186/s12864-018-5027-9
work_keys_str_mv AT zhuyuanyuan discoveringlargeconservedfunctionalcomponentsinglobalnetworkalignmentbygraphmatching
AT liyuezhi discoveringlargeconservedfunctionalcomponentsinglobalnetworkalignmentbygraphmatching
AT liujuan discoveringlargeconservedfunctionalcomponentsinglobalnetworkalignmentbygraphmatching
AT qinlu discoveringlargeconservedfunctionalcomponentsinglobalnetworkalignmentbygraphmatching
AT yujeffreyxu discoveringlargeconservedfunctionalcomponentsinglobalnetworkalignmentbygraphmatching