Cargando…
PS-MCL: parallel shotgun coarsened Markov clustering of protein interaction networks
BACKGROUND: How can we obtain fast and high-quality clusters in genome scale bio-networks? Graph clustering is a powerful tool applied on bio-networks to solve various biological problems such as protein complexes detection, disease module detection, and gene function prediction. Especially, MCL (Ma...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6652138/ https://www.ncbi.nlm.nih.gov/pubmed/31337329 http://dx.doi.org/10.1186/s12859-019-2856-8 |
_version_ | 1783438507010686976 |
---|---|
author | Lim, Yongsub Yu, Injae Seo, Dongmin Kang, U Sael, Lee |
author_facet | Lim, Yongsub Yu, Injae Seo, Dongmin Kang, U Sael, Lee |
author_sort | Lim, Yongsub |
collection | PubMed |
description | BACKGROUND: How can we obtain fast and high-quality clusters in genome scale bio-networks? Graph clustering is a powerful tool applied on bio-networks to solve various biological problems such as protein complexes detection, disease module detection, and gene function prediction. Especially, MCL (Markov Clustering) has been spotlighted due to its superior performance on bio-networks. MCL, however, is skewed towards finding a large number of very small clusters (size 1-3) and fails to detect many larger clusters (size 10+). To resolve this fragmentation problem, MLR-MCL (Multi-level Regularized MCL) has been developed. MLR-MCL still suffers from the fragmentation and, in cases, unrealistically large clusters are generated. RESULTS: In this paper, we propose PS-MCL (Parallel Shotgun Coarsened MCL), a parallel graph clustering method outperforming MLR-MCL in terms of running time and cluster quality. PS-MCL adopts an efficient coarsening scheme, called SC (Shotgun Coarsening), to improve graph coarsening in MLR-MCL. SC allows merging multiple nodes at a time, which leads to improvement in quality, time and space usage. Also, PS-MCL parallelizes main operations used in MLR-MCL which includes matrix multiplication. CONCLUSIONS: Experiments show that PS-MCL dramatically alleviates the fragmentation problem, and outperforms MLR-MCL in quality and running time. We also show that the running time of PS-MCL is effectively reduced with parallelization. |
format | Online Article Text |
id | pubmed-6652138 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-66521382019-07-31 PS-MCL: parallel shotgun coarsened Markov clustering of protein interaction networks Lim, Yongsub Yu, Injae Seo, Dongmin Kang, U Sael, Lee BMC Bioinformatics Research BACKGROUND: How can we obtain fast and high-quality clusters in genome scale bio-networks? Graph clustering is a powerful tool applied on bio-networks to solve various biological problems such as protein complexes detection, disease module detection, and gene function prediction. Especially, MCL (Markov Clustering) has been spotlighted due to its superior performance on bio-networks. MCL, however, is skewed towards finding a large number of very small clusters (size 1-3) and fails to detect many larger clusters (size 10+). To resolve this fragmentation problem, MLR-MCL (Multi-level Regularized MCL) has been developed. MLR-MCL still suffers from the fragmentation and, in cases, unrealistically large clusters are generated. RESULTS: In this paper, we propose PS-MCL (Parallel Shotgun Coarsened MCL), a parallel graph clustering method outperforming MLR-MCL in terms of running time and cluster quality. PS-MCL adopts an efficient coarsening scheme, called SC (Shotgun Coarsening), to improve graph coarsening in MLR-MCL. SC allows merging multiple nodes at a time, which leads to improvement in quality, time and space usage. Also, PS-MCL parallelizes main operations used in MLR-MCL which includes matrix multiplication. CONCLUSIONS: Experiments show that PS-MCL dramatically alleviates the fragmentation problem, and outperforms MLR-MCL in quality and running time. We also show that the running time of PS-MCL is effectively reduced with parallelization. BioMed Central 2019-07-24 /pmc/articles/PMC6652138/ /pubmed/31337329 http://dx.doi.org/10.1186/s12859-019-2856-8 Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Lim, Yongsub Yu, Injae Seo, Dongmin Kang, U Sael, Lee PS-MCL: parallel shotgun coarsened Markov clustering of protein interaction networks |
title | PS-MCL: parallel shotgun coarsened Markov clustering of protein interaction networks |
title_full | PS-MCL: parallel shotgun coarsened Markov clustering of protein interaction networks |
title_fullStr | PS-MCL: parallel shotgun coarsened Markov clustering of protein interaction networks |
title_full_unstemmed | PS-MCL: parallel shotgun coarsened Markov clustering of protein interaction networks |
title_short | PS-MCL: parallel shotgun coarsened Markov clustering of protein interaction networks |
title_sort | ps-mcl: parallel shotgun coarsened markov clustering of protein interaction networks |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6652138/ https://www.ncbi.nlm.nih.gov/pubmed/31337329 http://dx.doi.org/10.1186/s12859-019-2856-8 |
work_keys_str_mv | AT limyongsub psmclparallelshotguncoarsenedmarkovclusteringofproteininteractionnetworks AT yuinjae psmclparallelshotguncoarsenedmarkovclusteringofproteininteractionnetworks AT seodongmin psmclparallelshotguncoarsenedmarkovclusteringofproteininteractionnetworks AT kangu psmclparallelshotguncoarsenedmarkovclusteringofproteininteractionnetworks AT saellee psmclparallelshotguncoarsenedmarkovclusteringofproteininteractionnetworks |