Cargando…
RocSampler: regularizing overlapping protein complexes in protein-protein interaction networks
BACKGROUND: In recent years, protein-protein interaction (PPI) networks have been well recognized as important resources to elucidate various biological processes and cellular mechanisms. In this paper, we address the problem of predicting protein complexes from a PPI network. This problem has two d...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5731504/ https://www.ncbi.nlm.nih.gov/pubmed/29244010 http://dx.doi.org/10.1186/s12859-017-1920-5 |
_version_ | 1783286523285733376 |
---|---|
author | Maruyama, Osamu Kuwahara, Yuki |
author_facet | Maruyama, Osamu Kuwahara, Yuki |
author_sort | Maruyama, Osamu |
collection | PubMed |
description | BACKGROUND: In recent years, protein-protein interaction (PPI) networks have been well recognized as important resources to elucidate various biological processes and cellular mechanisms. In this paper, we address the problem of predicting protein complexes from a PPI network. This problem has two difficulties. One is related to small complexes, which contains two or three components. It is relatively difficult to identify them due to their simpler internal structure, but unfortunately complexes of such sizes are dominant in major protein complex databases, such as CYC2008. Another difficulty is how to model overlaps between predicted complexes, that is, how to evaluate different predicted complexes sharing common proteins because CYC2008 and other databases include such protein complexes. Thus, it is critical how to model overlaps between predicted complexes to identify them simultaneously. RESULTS: In this paper, we propose a sampling-based protein complex prediction method, RocSampler (Regularizing Overlapping Complexes), which exploits, as part of the whole scoring function, a regularization term for the overlaps of predicted complexes and that for the distribution of sizes of predicted complexes. We have implemented RocSampler in MATLAB and its executable file for Windows is available at the site, http://imi.kyushu-u.ac.jp/~om/software/RocSampler/. CONCLUSIONS: We have applied RocSampler to five yeast PPI networks and shown that it is superior to other existing methods. This implies that the design of scoring functions including regularization terms is an effective approach for protein complex prediction. |
format | Online Article Text |
id | pubmed-5731504 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-57315042017-12-19 RocSampler: regularizing overlapping protein complexes in protein-protein interaction networks Maruyama, Osamu Kuwahara, Yuki BMC Bioinformatics Research BACKGROUND: In recent years, protein-protein interaction (PPI) networks have been well recognized as important resources to elucidate various biological processes and cellular mechanisms. In this paper, we address the problem of predicting protein complexes from a PPI network. This problem has two difficulties. One is related to small complexes, which contains two or three components. It is relatively difficult to identify them due to their simpler internal structure, but unfortunately complexes of such sizes are dominant in major protein complex databases, such as CYC2008. Another difficulty is how to model overlaps between predicted complexes, that is, how to evaluate different predicted complexes sharing common proteins because CYC2008 and other databases include such protein complexes. Thus, it is critical how to model overlaps between predicted complexes to identify them simultaneously. RESULTS: In this paper, we propose a sampling-based protein complex prediction method, RocSampler (Regularizing Overlapping Complexes), which exploits, as part of the whole scoring function, a regularization term for the overlaps of predicted complexes and that for the distribution of sizes of predicted complexes. We have implemented RocSampler in MATLAB and its executable file for Windows is available at the site, http://imi.kyushu-u.ac.jp/~om/software/RocSampler/. CONCLUSIONS: We have applied RocSampler to five yeast PPI networks and shown that it is superior to other existing methods. This implies that the design of scoring functions including regularization terms is an effective approach for protein complex prediction. BioMed Central 2017-12-06 /pmc/articles/PMC5731504/ /pubmed/29244010 http://dx.doi.org/10.1186/s12859-017-1920-5 Text en © The Author(s) 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Maruyama, Osamu Kuwahara, Yuki RocSampler: regularizing overlapping protein complexes in protein-protein interaction networks |
title | RocSampler: regularizing overlapping protein complexes in protein-protein interaction networks |
title_full | RocSampler: regularizing overlapping protein complexes in protein-protein interaction networks |
title_fullStr | RocSampler: regularizing overlapping protein complexes in protein-protein interaction networks |
title_full_unstemmed | RocSampler: regularizing overlapping protein complexes in protein-protein interaction networks |
title_short | RocSampler: regularizing overlapping protein complexes in protein-protein interaction networks |
title_sort | rocsampler: regularizing overlapping protein complexes in protein-protein interaction networks |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5731504/ https://www.ncbi.nlm.nih.gov/pubmed/29244010 http://dx.doi.org/10.1186/s12859-017-1920-5 |
work_keys_str_mv | AT maruyamaosamu rocsamplerregularizingoverlappingproteincomplexesinproteinproteininteractionnetworks AT kuwaharayuki rocsamplerregularizingoverlappingproteincomplexesinproteinproteininteractionnetworks |