Cargando…

Improving protein complex prediction by reconstructing a high-confidence protein-protein interaction network of Escherichia coli from different physical interaction data sources

BACKGROUND: Although different protein-protein physical interaction (PPI) datasets exist for Escherichia coli, no common methodology exists to integrate these datasets and extract reliable modules reflecting the existing biological process and protein complexes. Naïve Bayesian formula is the highly...

Descripción completa

Detalles Bibliográficos
Autores principales: Taghipour, Shirin, Zarrineh, Peyman, Ganjtabesh, Mohammad, Nowzari-Dalini, Abbas
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5209909/
https://www.ncbi.nlm.nih.gov/pubmed/28049415
http://dx.doi.org/10.1186/s12859-016-1422-x
_version_ 1782490818715582464
author Taghipour, Shirin
Zarrineh, Peyman
Ganjtabesh, Mohammad
Nowzari-Dalini, Abbas
author_facet Taghipour, Shirin
Zarrineh, Peyman
Ganjtabesh, Mohammad
Nowzari-Dalini, Abbas
author_sort Taghipour, Shirin
collection PubMed
description BACKGROUND: Although different protein-protein physical interaction (PPI) datasets exist for Escherichia coli, no common methodology exists to integrate these datasets and extract reliable modules reflecting the existing biological process and protein complexes. Naïve Bayesian formula is the highly accepted method to integrate different PPI datasets into a single weighted PPI network, but detecting proper weights in such network is still a major problem. RESULTS: In this paper, we proposed a new methodology to integrate various physical PPI datasets into a single weighted PPI network in a way that the detected modules in PPI network exhibit the highest similarity to available functional modules. We used the co-expression modules as functional modules, and we shown that direct functional modules detected from Gene Ontology terms could be used as an alternative dataset. After running this integrating methodology over six different physical PPI datasets, orthologous high-confidence interactions from a related organism and two AP-MS PPI datasets gained high weights in the integrated networks, while the weights for one AP-MS PPI dataset and two other datasets derived from public databases have converged to zero. The majority of detected modules shaped around one or few hub protein(s). Still, a large number of highly interacting protein modules were detected which are functionally relevant and are likely to construct protein complexes. CONCLUSIONS: We provided a new high confidence protein complex prediction method supported by functional studies and literature mining. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-016-1422-x) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5209909
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-52099092017-01-04 Improving protein complex prediction by reconstructing a high-confidence protein-protein interaction network of Escherichia coli from different physical interaction data sources Taghipour, Shirin Zarrineh, Peyman Ganjtabesh, Mohammad Nowzari-Dalini, Abbas BMC Bioinformatics Research Article BACKGROUND: Although different protein-protein physical interaction (PPI) datasets exist for Escherichia coli, no common methodology exists to integrate these datasets and extract reliable modules reflecting the existing biological process and protein complexes. Naïve Bayesian formula is the highly accepted method to integrate different PPI datasets into a single weighted PPI network, but detecting proper weights in such network is still a major problem. RESULTS: In this paper, we proposed a new methodology to integrate various physical PPI datasets into a single weighted PPI network in a way that the detected modules in PPI network exhibit the highest similarity to available functional modules. We used the co-expression modules as functional modules, and we shown that direct functional modules detected from Gene Ontology terms could be used as an alternative dataset. After running this integrating methodology over six different physical PPI datasets, orthologous high-confidence interactions from a related organism and two AP-MS PPI datasets gained high weights in the integrated networks, while the weights for one AP-MS PPI dataset and two other datasets derived from public databases have converged to zero. The majority of detected modules shaped around one or few hub protein(s). Still, a large number of highly interacting protein modules were detected which are functionally relevant and are likely to construct protein complexes. CONCLUSIONS: We provided a new high confidence protein complex prediction method supported by functional studies and literature mining. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-016-1422-x) contains supplementary material, which is available to authorized users. BioMed Central 2017-01-03 /pmc/articles/PMC5209909/ /pubmed/28049415 http://dx.doi.org/10.1186/s12859-016-1422-x Text en © The Author(s) 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Taghipour, Shirin
Zarrineh, Peyman
Ganjtabesh, Mohammad
Nowzari-Dalini, Abbas
Improving protein complex prediction by reconstructing a high-confidence protein-protein interaction network of Escherichia coli from different physical interaction data sources
title Improving protein complex prediction by reconstructing a high-confidence protein-protein interaction network of Escherichia coli from different physical interaction data sources
title_full Improving protein complex prediction by reconstructing a high-confidence protein-protein interaction network of Escherichia coli from different physical interaction data sources
title_fullStr Improving protein complex prediction by reconstructing a high-confidence protein-protein interaction network of Escherichia coli from different physical interaction data sources
title_full_unstemmed Improving protein complex prediction by reconstructing a high-confidence protein-protein interaction network of Escherichia coli from different physical interaction data sources
title_short Improving protein complex prediction by reconstructing a high-confidence protein-protein interaction network of Escherichia coli from different physical interaction data sources
title_sort improving protein complex prediction by reconstructing a high-confidence protein-protein interaction network of escherichia coli from different physical interaction data sources
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5209909/
https://www.ncbi.nlm.nih.gov/pubmed/28049415
http://dx.doi.org/10.1186/s12859-016-1422-x
work_keys_str_mv AT taghipourshirin improvingproteincomplexpredictionbyreconstructingahighconfidenceproteinproteininteractionnetworkofescherichiacolifromdifferentphysicalinteractiondatasources
AT zarrinehpeyman improvingproteincomplexpredictionbyreconstructingahighconfidenceproteinproteininteractionnetworkofescherichiacolifromdifferentphysicalinteractiondatasources
AT ganjtabeshmohammad improvingproteincomplexpredictionbyreconstructingahighconfidenceproteinproteininteractionnetworkofescherichiacolifromdifferentphysicalinteractiondatasources
AT nowzaridaliniabbas improvingproteincomplexpredictionbyreconstructingahighconfidenceproteinproteininteractionnetworkofescherichiacolifromdifferentphysicalinteractiondatasources