Cargando…
Discovering large conserved functional components in global network alignment by graph matching
BACKGROUND: Aligning protein-protein interaction (PPI) networks is very important to discover the functionally conserved sub-structures between different species. In recent years, the global PPI network alignment problem has been extensively studied aiming at finding the one-to-one alignment with th...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6157291/ https://www.ncbi.nlm.nih.gov/pubmed/30255780 http://dx.doi.org/10.1186/s12864-018-5027-9 |
_version_ | 1783358250081583104 |
---|---|
author | Zhu, Yuanyuan Li, Yuezhi Liu, Juan Qin, Lu Yu, Jeffrey Xu |
author_facet | Zhu, Yuanyuan Li, Yuezhi Liu, Juan Qin, Lu Yu, Jeffrey Xu |
author_sort | Zhu, Yuanyuan |
collection | PubMed |
description | BACKGROUND: Aligning protein-protein interaction (PPI) networks is very important to discover the functionally conserved sub-structures between different species. In recent years, the global PPI network alignment problem has been extensively studied aiming at finding the one-to-one alignment with the maximum matching score. However, finding large conserved components remains challenging due to its NP-hardness. RESULTS: We propose a new graph matching method GMAlign for global PPI network alignment. It first selects some pairs of important proteins as seeds, followed by a gradual expansion to obtain an initial matching, and then it refines the current result to obtain an optimal alignment result iteratively based on the vertex cover. We compare GMAlign with the state-of-the-art methods on the PPI network pairs obtained from the largest BioGRID dataset and validate its performance. The results show that our algorithm can produce larger size of alignment, and can find bigger and denser common connected subgraphs as well for the first time. Meanwhile, GMAlign can achieve high quality biological results, as measured by functional consistency and semantic similarity of the Gene Ontology terms. Moreover, we also show that GMAlign can achieve better results which are structurally and biologically meaningful in the detection of large conserved biological pathways between species. CONCLUSIONS: GMAlign is a novel global network alignment tool to discover large conserved functional components between PPI networks. It also has many potential biological applications such as conserved pathway and protein complex discovery across species. The GMAlign software and datasets are avaialbile at https://github.com/yzlwhu/GMAlign. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-018-5027-9) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-6157291 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-61572912018-10-01 Discovering large conserved functional components in global network alignment by graph matching Zhu, Yuanyuan Li, Yuezhi Liu, Juan Qin, Lu Yu, Jeffrey Xu BMC Genomics Research BACKGROUND: Aligning protein-protein interaction (PPI) networks is very important to discover the functionally conserved sub-structures between different species. In recent years, the global PPI network alignment problem has been extensively studied aiming at finding the one-to-one alignment with the maximum matching score. However, finding large conserved components remains challenging due to its NP-hardness. RESULTS: We propose a new graph matching method GMAlign for global PPI network alignment. It first selects some pairs of important proteins as seeds, followed by a gradual expansion to obtain an initial matching, and then it refines the current result to obtain an optimal alignment result iteratively based on the vertex cover. We compare GMAlign with the state-of-the-art methods on the PPI network pairs obtained from the largest BioGRID dataset and validate its performance. The results show that our algorithm can produce larger size of alignment, and can find bigger and denser common connected subgraphs as well for the first time. Meanwhile, GMAlign can achieve high quality biological results, as measured by functional consistency and semantic similarity of the Gene Ontology terms. Moreover, we also show that GMAlign can achieve better results which are structurally and biologically meaningful in the detection of large conserved biological pathways between species. CONCLUSIONS: GMAlign is a novel global network alignment tool to discover large conserved functional components between PPI networks. It also has many potential biological applications such as conserved pathway and protein complex discovery across species. The GMAlign software and datasets are avaialbile at https://github.com/yzlwhu/GMAlign. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-018-5027-9) contains supplementary material, which is available to authorized users. BioMed Central 2018-09-24 /pmc/articles/PMC6157291/ /pubmed/30255780 http://dx.doi.org/10.1186/s12864-018-5027-9 Text en © The Author(s) 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Zhu, Yuanyuan Li, Yuezhi Liu, Juan Qin, Lu Yu, Jeffrey Xu Discovering large conserved functional components in global network alignment by graph matching |
title | Discovering large conserved functional components in global network alignment by graph matching |
title_full | Discovering large conserved functional components in global network alignment by graph matching |
title_fullStr | Discovering large conserved functional components in global network alignment by graph matching |
title_full_unstemmed | Discovering large conserved functional components in global network alignment by graph matching |
title_short | Discovering large conserved functional components in global network alignment by graph matching |
title_sort | discovering large conserved functional components in global network alignment by graph matching |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6157291/ https://www.ncbi.nlm.nih.gov/pubmed/30255780 http://dx.doi.org/10.1186/s12864-018-5027-9 |
work_keys_str_mv | AT zhuyuanyuan discoveringlargeconservedfunctionalcomponentsinglobalnetworkalignmentbygraphmatching AT liyuezhi discoveringlargeconservedfunctionalcomponentsinglobalnetworkalignmentbygraphmatching AT liujuan discoveringlargeconservedfunctionalcomponentsinglobalnetworkalignmentbygraphmatching AT qinlu discoveringlargeconservedfunctionalcomponentsinglobalnetworkalignmentbygraphmatching AT yujeffreyxu discoveringlargeconservedfunctionalcomponentsinglobalnetworkalignmentbygraphmatching |