Cargando…

Determining the minimum number of protein-protein interactions required to support known protein complexes

The prediction of protein complexes from protein-protein interactions (PPIs) is a well-studied problem in bioinformatics. However, the currently available PPI data is not enough to describe all known protein complexes. In this paper, we express the problem of determining the minimum number of (addit...

Descripción completa

Detalles Bibliográficos
Autores principales: Nakajima, Natsu, Hayashida, Morihiro, Jansson, Jesper, Maruyama, Osamu, Akutsu, Tatsuya
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5919440/
https://www.ncbi.nlm.nih.gov/pubmed/29698482
http://dx.doi.org/10.1371/journal.pone.0195545
_version_ 1783317626950254592
author Nakajima, Natsu
Hayashida, Morihiro
Jansson, Jesper
Maruyama, Osamu
Akutsu, Tatsuya
author_facet Nakajima, Natsu
Hayashida, Morihiro
Jansson, Jesper
Maruyama, Osamu
Akutsu, Tatsuya
author_sort Nakajima, Natsu
collection PubMed
description The prediction of protein complexes from protein-protein interactions (PPIs) is a well-studied problem in bioinformatics. However, the currently available PPI data is not enough to describe all known protein complexes. In this paper, we express the problem of determining the minimum number of (additional) required protein-protein interactions as a graph theoretic problem under the constraint that each complex constitutes a connected component in a PPI network. For this problem, we develop two computational methods: one is based on integer linear programming (ILPMinPPI) and the other one is based on an existing greedy-type approximation algorithm (GreedyMinPPI) originally developed in the context of communication and social networks. Since the former method is only applicable to datasets of small size, we apply the latter method to a combination of the CYC2008 protein complex dataset and each of eight PPI datasets (STRING, MINT, BioGRID, IntAct, DIP, BIND, WI-PHI, iRefIndex). The results show that the minimum number of additional required PPIs ranges from 51 (STRING) to 964 (BIND), and that even the four best PPI databases, STRING (51), BioGRID (67), WI-PHI (93) and iRefIndex (85), do not include enough PPIs to form all CYC2008 protein complexes. We also demonstrate that the proposed problem framework and our solutions can enhance the prediction accuracy of existing PPI prediction methods. ILPMinPPI can be freely downloaded from http://sunflower.kuicr.kyoto-u.ac.jp/~nakajima/.
format Online
Article
Text
id pubmed-5919440
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-59194402018-05-11 Determining the minimum number of protein-protein interactions required to support known protein complexes Nakajima, Natsu Hayashida, Morihiro Jansson, Jesper Maruyama, Osamu Akutsu, Tatsuya PLoS One Research Article The prediction of protein complexes from protein-protein interactions (PPIs) is a well-studied problem in bioinformatics. However, the currently available PPI data is not enough to describe all known protein complexes. In this paper, we express the problem of determining the minimum number of (additional) required protein-protein interactions as a graph theoretic problem under the constraint that each complex constitutes a connected component in a PPI network. For this problem, we develop two computational methods: one is based on integer linear programming (ILPMinPPI) and the other one is based on an existing greedy-type approximation algorithm (GreedyMinPPI) originally developed in the context of communication and social networks. Since the former method is only applicable to datasets of small size, we apply the latter method to a combination of the CYC2008 protein complex dataset and each of eight PPI datasets (STRING, MINT, BioGRID, IntAct, DIP, BIND, WI-PHI, iRefIndex). The results show that the minimum number of additional required PPIs ranges from 51 (STRING) to 964 (BIND), and that even the four best PPI databases, STRING (51), BioGRID (67), WI-PHI (93) and iRefIndex (85), do not include enough PPIs to form all CYC2008 protein complexes. We also demonstrate that the proposed problem framework and our solutions can enhance the prediction accuracy of existing PPI prediction methods. ILPMinPPI can be freely downloaded from http://sunflower.kuicr.kyoto-u.ac.jp/~nakajima/. Public Library of Science 2018-04-26 /pmc/articles/PMC5919440/ /pubmed/29698482 http://dx.doi.org/10.1371/journal.pone.0195545 Text en © 2018 Nakajima et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Nakajima, Natsu
Hayashida, Morihiro
Jansson, Jesper
Maruyama, Osamu
Akutsu, Tatsuya
Determining the minimum number of protein-protein interactions required to support known protein complexes
title Determining the minimum number of protein-protein interactions required to support known protein complexes
title_full Determining the minimum number of protein-protein interactions required to support known protein complexes
title_fullStr Determining the minimum number of protein-protein interactions required to support known protein complexes
title_full_unstemmed Determining the minimum number of protein-protein interactions required to support known protein complexes
title_short Determining the minimum number of protein-protein interactions required to support known protein complexes
title_sort determining the minimum number of protein-protein interactions required to support known protein complexes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5919440/
https://www.ncbi.nlm.nih.gov/pubmed/29698482
http://dx.doi.org/10.1371/journal.pone.0195545
work_keys_str_mv AT nakajimanatsu determiningtheminimumnumberofproteinproteininteractionsrequiredtosupportknownproteincomplexes
AT hayashidamorihiro determiningtheminimumnumberofproteinproteininteractionsrequiredtosupportknownproteincomplexes
AT janssonjesper determiningtheminimumnumberofproteinproteininteractionsrequiredtosupportknownproteincomplexes
AT maruyamaosamu determiningtheminimumnumberofproteinproteininteractionsrequiredtosupportknownproteincomplexes
AT akutsutatsuya determiningtheminimumnumberofproteinproteininteractionsrequiredtosupportknownproteincomplexes