Cargando…

RocSampler: regularizing overlapping protein complexes in protein-protein interaction networks

BACKGROUND: In recent years, protein-protein interaction (PPI) networks have been well recognized as important resources to elucidate various biological processes and cellular mechanisms. In this paper, we address the problem of predicting protein complexes from a PPI network. This problem has two d...

Descripción completa

Detalles Bibliográficos
Autores principales: Maruyama, Osamu, Kuwahara, Yuki
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5731504/
https://www.ncbi.nlm.nih.gov/pubmed/29244010
http://dx.doi.org/10.1186/s12859-017-1920-5
_version_ 1783286523285733376
author Maruyama, Osamu
Kuwahara, Yuki
author_facet Maruyama, Osamu
Kuwahara, Yuki
author_sort Maruyama, Osamu
collection PubMed
description BACKGROUND: In recent years, protein-protein interaction (PPI) networks have been well recognized as important resources to elucidate various biological processes and cellular mechanisms. In this paper, we address the problem of predicting protein complexes from a PPI network. This problem has two difficulties. One is related to small complexes, which contains two or three components. It is relatively difficult to identify them due to their simpler internal structure, but unfortunately complexes of such sizes are dominant in major protein complex databases, such as CYC2008. Another difficulty is how to model overlaps between predicted complexes, that is, how to evaluate different predicted complexes sharing common proteins because CYC2008 and other databases include such protein complexes. Thus, it is critical how to model overlaps between predicted complexes to identify them simultaneously. RESULTS: In this paper, we propose a sampling-based protein complex prediction method, RocSampler (Regularizing Overlapping Complexes), which exploits, as part of the whole scoring function, a regularization term for the overlaps of predicted complexes and that for the distribution of sizes of predicted complexes. We have implemented RocSampler in MATLAB and its executable file for Windows is available at the site, http://imi.kyushu-u.ac.jp/~om/software/RocSampler/. CONCLUSIONS: We have applied RocSampler to five yeast PPI networks and shown that it is superior to other existing methods. This implies that the design of scoring functions including regularization terms is an effective approach for protein complex prediction.
format Online
Article
Text
id pubmed-5731504
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-57315042017-12-19 RocSampler: regularizing overlapping protein complexes in protein-protein interaction networks Maruyama, Osamu Kuwahara, Yuki BMC Bioinformatics Research BACKGROUND: In recent years, protein-protein interaction (PPI) networks have been well recognized as important resources to elucidate various biological processes and cellular mechanisms. In this paper, we address the problem of predicting protein complexes from a PPI network. This problem has two difficulties. One is related to small complexes, which contains two or three components. It is relatively difficult to identify them due to their simpler internal structure, but unfortunately complexes of such sizes are dominant in major protein complex databases, such as CYC2008. Another difficulty is how to model overlaps between predicted complexes, that is, how to evaluate different predicted complexes sharing common proteins because CYC2008 and other databases include such protein complexes. Thus, it is critical how to model overlaps between predicted complexes to identify them simultaneously. RESULTS: In this paper, we propose a sampling-based protein complex prediction method, RocSampler (Regularizing Overlapping Complexes), which exploits, as part of the whole scoring function, a regularization term for the overlaps of predicted complexes and that for the distribution of sizes of predicted complexes. We have implemented RocSampler in MATLAB and its executable file for Windows is available at the site, http://imi.kyushu-u.ac.jp/~om/software/RocSampler/. CONCLUSIONS: We have applied RocSampler to five yeast PPI networks and shown that it is superior to other existing methods. This implies that the design of scoring functions including regularization terms is an effective approach for protein complex prediction. BioMed Central 2017-12-06 /pmc/articles/PMC5731504/ /pubmed/29244010 http://dx.doi.org/10.1186/s12859-017-1920-5 Text en © The Author(s) 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Maruyama, Osamu
Kuwahara, Yuki
RocSampler: regularizing overlapping protein complexes in protein-protein interaction networks
title RocSampler: regularizing overlapping protein complexes in protein-protein interaction networks
title_full RocSampler: regularizing overlapping protein complexes in protein-protein interaction networks
title_fullStr RocSampler: regularizing overlapping protein complexes in protein-protein interaction networks
title_full_unstemmed RocSampler: regularizing overlapping protein complexes in protein-protein interaction networks
title_short RocSampler: regularizing overlapping protein complexes in protein-protein interaction networks
title_sort rocsampler: regularizing overlapping protein complexes in protein-protein interaction networks
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5731504/
https://www.ncbi.nlm.nih.gov/pubmed/29244010
http://dx.doi.org/10.1186/s12859-017-1920-5
work_keys_str_mv AT maruyamaosamu rocsamplerregularizingoverlappingproteincomplexesinproteinproteininteractionnetworks
AT kuwaharayuki rocsamplerregularizingoverlappingproteincomplexesinproteinproteininteractionnetworks