Cargando…
Improving protein complex prediction by reconstructing a high-confidence protein-protein interaction network of Escherichia coli from different physical interaction data sources
BACKGROUND: Although different protein-protein physical interaction (PPI) datasets exist for Escherichia coli, no common methodology exists to integrate these datasets and extract reliable modules reflecting the existing biological process and protein complexes. Naïve Bayesian formula is the highly...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5209909/ https://www.ncbi.nlm.nih.gov/pubmed/28049415 http://dx.doi.org/10.1186/s12859-016-1422-x |
_version_ | 1782490818715582464 |
---|---|
author | Taghipour, Shirin Zarrineh, Peyman Ganjtabesh, Mohammad Nowzari-Dalini, Abbas |
author_facet | Taghipour, Shirin Zarrineh, Peyman Ganjtabesh, Mohammad Nowzari-Dalini, Abbas |
author_sort | Taghipour, Shirin |
collection | PubMed |
description | BACKGROUND: Although different protein-protein physical interaction (PPI) datasets exist for Escherichia coli, no common methodology exists to integrate these datasets and extract reliable modules reflecting the existing biological process and protein complexes. Naïve Bayesian formula is the highly accepted method to integrate different PPI datasets into a single weighted PPI network, but detecting proper weights in such network is still a major problem. RESULTS: In this paper, we proposed a new methodology to integrate various physical PPI datasets into a single weighted PPI network in a way that the detected modules in PPI network exhibit the highest similarity to available functional modules. We used the co-expression modules as functional modules, and we shown that direct functional modules detected from Gene Ontology terms could be used as an alternative dataset. After running this integrating methodology over six different physical PPI datasets, orthologous high-confidence interactions from a related organism and two AP-MS PPI datasets gained high weights in the integrated networks, while the weights for one AP-MS PPI dataset and two other datasets derived from public databases have converged to zero. The majority of detected modules shaped around one or few hub protein(s). Still, a large number of highly interacting protein modules were detected which are functionally relevant and are likely to construct protein complexes. CONCLUSIONS: We provided a new high confidence protein complex prediction method supported by functional studies and literature mining. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-016-1422-x) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-5209909 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-52099092017-01-04 Improving protein complex prediction by reconstructing a high-confidence protein-protein interaction network of Escherichia coli from different physical interaction data sources Taghipour, Shirin Zarrineh, Peyman Ganjtabesh, Mohammad Nowzari-Dalini, Abbas BMC Bioinformatics Research Article BACKGROUND: Although different protein-protein physical interaction (PPI) datasets exist for Escherichia coli, no common methodology exists to integrate these datasets and extract reliable modules reflecting the existing biological process and protein complexes. Naïve Bayesian formula is the highly accepted method to integrate different PPI datasets into a single weighted PPI network, but detecting proper weights in such network is still a major problem. RESULTS: In this paper, we proposed a new methodology to integrate various physical PPI datasets into a single weighted PPI network in a way that the detected modules in PPI network exhibit the highest similarity to available functional modules. We used the co-expression modules as functional modules, and we shown that direct functional modules detected from Gene Ontology terms could be used as an alternative dataset. After running this integrating methodology over six different physical PPI datasets, orthologous high-confidence interactions from a related organism and two AP-MS PPI datasets gained high weights in the integrated networks, while the weights for one AP-MS PPI dataset and two other datasets derived from public databases have converged to zero. The majority of detected modules shaped around one or few hub protein(s). Still, a large number of highly interacting protein modules were detected which are functionally relevant and are likely to construct protein complexes. CONCLUSIONS: We provided a new high confidence protein complex prediction method supported by functional studies and literature mining. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-016-1422-x) contains supplementary material, which is available to authorized users. BioMed Central 2017-01-03 /pmc/articles/PMC5209909/ /pubmed/28049415 http://dx.doi.org/10.1186/s12859-016-1422-x Text en © The Author(s) 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Taghipour, Shirin Zarrineh, Peyman Ganjtabesh, Mohammad Nowzari-Dalini, Abbas Improving protein complex prediction by reconstructing a high-confidence protein-protein interaction network of Escherichia coli from different physical interaction data sources |
title | Improving protein complex prediction by reconstructing a high-confidence protein-protein interaction network of Escherichia coli from different physical interaction data sources |
title_full | Improving protein complex prediction by reconstructing a high-confidence protein-protein interaction network of Escherichia coli from different physical interaction data sources |
title_fullStr | Improving protein complex prediction by reconstructing a high-confidence protein-protein interaction network of Escherichia coli from different physical interaction data sources |
title_full_unstemmed | Improving protein complex prediction by reconstructing a high-confidence protein-protein interaction network of Escherichia coli from different physical interaction data sources |
title_short | Improving protein complex prediction by reconstructing a high-confidence protein-protein interaction network of Escherichia coli from different physical interaction data sources |
title_sort | improving protein complex prediction by reconstructing a high-confidence protein-protein interaction network of escherichia coli from different physical interaction data sources |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5209909/ https://www.ncbi.nlm.nih.gov/pubmed/28049415 http://dx.doi.org/10.1186/s12859-016-1422-x |
work_keys_str_mv | AT taghipourshirin improvingproteincomplexpredictionbyreconstructingahighconfidenceproteinproteininteractionnetworkofescherichiacolifromdifferentphysicalinteractiondatasources AT zarrinehpeyman improvingproteincomplexpredictionbyreconstructingahighconfidenceproteinproteininteractionnetworkofescherichiacolifromdifferentphysicalinteractiondatasources AT ganjtabeshmohammad improvingproteincomplexpredictionbyreconstructingahighconfidenceproteinproteininteractionnetworkofescherichiacolifromdifferentphysicalinteractiondatasources AT nowzaridaliniabbas improvingproteincomplexpredictionbyreconstructingahighconfidenceproteinproteininteractionnetworkofescherichiacolifromdifferentphysicalinteractiondatasources |