Cargando…
Determining the minimum number of protein-protein interactions required to support known protein complexes
The prediction of protein complexes from protein-protein interactions (PPIs) is a well-studied problem in bioinformatics. However, the currently available PPI data is not enough to describe all known protein complexes. In this paper, we express the problem of determining the minimum number of (addit...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5919440/ https://www.ncbi.nlm.nih.gov/pubmed/29698482 http://dx.doi.org/10.1371/journal.pone.0195545 |
_version_ | 1783317626950254592 |
---|---|
author | Nakajima, Natsu Hayashida, Morihiro Jansson, Jesper Maruyama, Osamu Akutsu, Tatsuya |
author_facet | Nakajima, Natsu Hayashida, Morihiro Jansson, Jesper Maruyama, Osamu Akutsu, Tatsuya |
author_sort | Nakajima, Natsu |
collection | PubMed |
description | The prediction of protein complexes from protein-protein interactions (PPIs) is a well-studied problem in bioinformatics. However, the currently available PPI data is not enough to describe all known protein complexes. In this paper, we express the problem of determining the minimum number of (additional) required protein-protein interactions as a graph theoretic problem under the constraint that each complex constitutes a connected component in a PPI network. For this problem, we develop two computational methods: one is based on integer linear programming (ILPMinPPI) and the other one is based on an existing greedy-type approximation algorithm (GreedyMinPPI) originally developed in the context of communication and social networks. Since the former method is only applicable to datasets of small size, we apply the latter method to a combination of the CYC2008 protein complex dataset and each of eight PPI datasets (STRING, MINT, BioGRID, IntAct, DIP, BIND, WI-PHI, iRefIndex). The results show that the minimum number of additional required PPIs ranges from 51 (STRING) to 964 (BIND), and that even the four best PPI databases, STRING (51), BioGRID (67), WI-PHI (93) and iRefIndex (85), do not include enough PPIs to form all CYC2008 protein complexes. We also demonstrate that the proposed problem framework and our solutions can enhance the prediction accuracy of existing PPI prediction methods. ILPMinPPI can be freely downloaded from http://sunflower.kuicr.kyoto-u.ac.jp/~nakajima/. |
format | Online Article Text |
id | pubmed-5919440 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-59194402018-05-11 Determining the minimum number of protein-protein interactions required to support known protein complexes Nakajima, Natsu Hayashida, Morihiro Jansson, Jesper Maruyama, Osamu Akutsu, Tatsuya PLoS One Research Article The prediction of protein complexes from protein-protein interactions (PPIs) is a well-studied problem in bioinformatics. However, the currently available PPI data is not enough to describe all known protein complexes. In this paper, we express the problem of determining the minimum number of (additional) required protein-protein interactions as a graph theoretic problem under the constraint that each complex constitutes a connected component in a PPI network. For this problem, we develop two computational methods: one is based on integer linear programming (ILPMinPPI) and the other one is based on an existing greedy-type approximation algorithm (GreedyMinPPI) originally developed in the context of communication and social networks. Since the former method is only applicable to datasets of small size, we apply the latter method to a combination of the CYC2008 protein complex dataset and each of eight PPI datasets (STRING, MINT, BioGRID, IntAct, DIP, BIND, WI-PHI, iRefIndex). The results show that the minimum number of additional required PPIs ranges from 51 (STRING) to 964 (BIND), and that even the four best PPI databases, STRING (51), BioGRID (67), WI-PHI (93) and iRefIndex (85), do not include enough PPIs to form all CYC2008 protein complexes. We also demonstrate that the proposed problem framework and our solutions can enhance the prediction accuracy of existing PPI prediction methods. ILPMinPPI can be freely downloaded from http://sunflower.kuicr.kyoto-u.ac.jp/~nakajima/. Public Library of Science 2018-04-26 /pmc/articles/PMC5919440/ /pubmed/29698482 http://dx.doi.org/10.1371/journal.pone.0195545 Text en © 2018 Nakajima et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Nakajima, Natsu Hayashida, Morihiro Jansson, Jesper Maruyama, Osamu Akutsu, Tatsuya Determining the minimum number of protein-protein interactions required to support known protein complexes |
title | Determining the minimum number of protein-protein interactions required to support known protein complexes |
title_full | Determining the minimum number of protein-protein interactions required to support known protein complexes |
title_fullStr | Determining the minimum number of protein-protein interactions required to support known protein complexes |
title_full_unstemmed | Determining the minimum number of protein-protein interactions required to support known protein complexes |
title_short | Determining the minimum number of protein-protein interactions required to support known protein complexes |
title_sort | determining the minimum number of protein-protein interactions required to support known protein complexes |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5919440/ https://www.ncbi.nlm.nih.gov/pubmed/29698482 http://dx.doi.org/10.1371/journal.pone.0195545 |
work_keys_str_mv | AT nakajimanatsu determiningtheminimumnumberofproteinproteininteractionsrequiredtosupportknownproteincomplexes AT hayashidamorihiro determiningtheminimumnumberofproteinproteininteractionsrequiredtosupportknownproteincomplexes AT janssonjesper determiningtheminimumnumberofproteinproteininteractionsrequiredtosupportknownproteincomplexes AT maruyamaosamu determiningtheminimumnumberofproteinproteininteractionsrequiredtosupportknownproteincomplexes AT akutsutatsuya determiningtheminimumnumberofproteinproteininteractionsrequiredtosupportknownproteincomplexes |