Cargando…

PS-MCL: parallel shotgun coarsened Markov clustering of protein interaction networks

BACKGROUND: How can we obtain fast and high-quality clusters in genome scale bio-networks? Graph clustering is a powerful tool applied on bio-networks to solve various biological problems such as protein complexes detection, disease module detection, and gene function prediction. Especially, MCL (Ma...

Descripción completa

Detalles Bibliográficos
Autores principales: Lim, Yongsub, Yu, Injae, Seo, Dongmin, Kang, U, Sael, Lee
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6652138/
https://www.ncbi.nlm.nih.gov/pubmed/31337329
http://dx.doi.org/10.1186/s12859-019-2856-8
_version_ 1783438507010686976
author Lim, Yongsub
Yu, Injae
Seo, Dongmin
Kang, U
Sael, Lee
author_facet Lim, Yongsub
Yu, Injae
Seo, Dongmin
Kang, U
Sael, Lee
author_sort Lim, Yongsub
collection PubMed
description BACKGROUND: How can we obtain fast and high-quality clusters in genome scale bio-networks? Graph clustering is a powerful tool applied on bio-networks to solve various biological problems such as protein complexes detection, disease module detection, and gene function prediction. Especially, MCL (Markov Clustering) has been spotlighted due to its superior performance on bio-networks. MCL, however, is skewed towards finding a large number of very small clusters (size 1-3) and fails to detect many larger clusters (size 10+). To resolve this fragmentation problem, MLR-MCL (Multi-level Regularized MCL) has been developed. MLR-MCL still suffers from the fragmentation and, in cases, unrealistically large clusters are generated. RESULTS: In this paper, we propose PS-MCL (Parallel Shotgun Coarsened MCL), a parallel graph clustering method outperforming MLR-MCL in terms of running time and cluster quality. PS-MCL adopts an efficient coarsening scheme, called SC (Shotgun Coarsening), to improve graph coarsening in MLR-MCL. SC allows merging multiple nodes at a time, which leads to improvement in quality, time and space usage. Also, PS-MCL parallelizes main operations used in MLR-MCL which includes matrix multiplication. CONCLUSIONS: Experiments show that PS-MCL dramatically alleviates the fragmentation problem, and outperforms MLR-MCL in quality and running time. We also show that the running time of PS-MCL is effectively reduced with parallelization.
format Online
Article
Text
id pubmed-6652138
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-66521382019-07-31 PS-MCL: parallel shotgun coarsened Markov clustering of protein interaction networks Lim, Yongsub Yu, Injae Seo, Dongmin Kang, U Sael, Lee BMC Bioinformatics Research BACKGROUND: How can we obtain fast and high-quality clusters in genome scale bio-networks? Graph clustering is a powerful tool applied on bio-networks to solve various biological problems such as protein complexes detection, disease module detection, and gene function prediction. Especially, MCL (Markov Clustering) has been spotlighted due to its superior performance on bio-networks. MCL, however, is skewed towards finding a large number of very small clusters (size 1-3) and fails to detect many larger clusters (size 10+). To resolve this fragmentation problem, MLR-MCL (Multi-level Regularized MCL) has been developed. MLR-MCL still suffers from the fragmentation and, in cases, unrealistically large clusters are generated. RESULTS: In this paper, we propose PS-MCL (Parallel Shotgun Coarsened MCL), a parallel graph clustering method outperforming MLR-MCL in terms of running time and cluster quality. PS-MCL adopts an efficient coarsening scheme, called SC (Shotgun Coarsening), to improve graph coarsening in MLR-MCL. SC allows merging multiple nodes at a time, which leads to improvement in quality, time and space usage. Also, PS-MCL parallelizes main operations used in MLR-MCL which includes matrix multiplication. CONCLUSIONS: Experiments show that PS-MCL dramatically alleviates the fragmentation problem, and outperforms MLR-MCL in quality and running time. We also show that the running time of PS-MCL is effectively reduced with parallelization. BioMed Central 2019-07-24 /pmc/articles/PMC6652138/ /pubmed/31337329 http://dx.doi.org/10.1186/s12859-019-2856-8 Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Lim, Yongsub
Yu, Injae
Seo, Dongmin
Kang, U
Sael, Lee
PS-MCL: parallel shotgun coarsened Markov clustering of protein interaction networks
title PS-MCL: parallel shotgun coarsened Markov clustering of protein interaction networks
title_full PS-MCL: parallel shotgun coarsened Markov clustering of protein interaction networks
title_fullStr PS-MCL: parallel shotgun coarsened Markov clustering of protein interaction networks
title_full_unstemmed PS-MCL: parallel shotgun coarsened Markov clustering of protein interaction networks
title_short PS-MCL: parallel shotgun coarsened Markov clustering of protein interaction networks
title_sort ps-mcl: parallel shotgun coarsened markov clustering of protein interaction networks
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6652138/
https://www.ncbi.nlm.nih.gov/pubmed/31337329
http://dx.doi.org/10.1186/s12859-019-2856-8
work_keys_str_mv AT limyongsub psmclparallelshotguncoarsenedmarkovclusteringofproteininteractionnetworks
AT yuinjae psmclparallelshotguncoarsenedmarkovclusteringofproteininteractionnetworks
AT seodongmin psmclparallelshotguncoarsenedmarkovclusteringofproteininteractionnetworks
AT kangu psmclparallelshotguncoarsenedmarkovclusteringofproteininteractionnetworks
AT saellee psmclparallelshotguncoarsenedmarkovclusteringofproteininteractionnetworks