Cargando…

PLW: Probabilistic Local Walks for detecting protein complexes from protein interaction networks

BACKGROUND: Many biological processes are carried out by proteins interacting with each other in the form of protein complexes. However, large-scale detection of protein complexes has remained constrained by experimental limitations. As such, computational detection of protein complexes by applying...

Descripción completa

Detalles Bibliográficos
Autores principales: Wong, Daniel Lin-Kit, Li, Xiao-Li, Wu, Min, Zheng, Jie, Ng, See-Kiong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3852146/
https://www.ncbi.nlm.nih.gov/pubmed/24564427
http://dx.doi.org/10.1186/1471-2164-14-S5-S15
_version_ 1782478619933671424
author Wong, Daniel Lin-Kit
Li, Xiao-Li
Wu, Min
Zheng, Jie
Ng, See-Kiong
author_facet Wong, Daniel Lin-Kit
Li, Xiao-Li
Wu, Min
Zheng, Jie
Ng, See-Kiong
author_sort Wong, Daniel Lin-Kit
collection PubMed
description BACKGROUND: Many biological processes are carried out by proteins interacting with each other in the form of protein complexes. However, large-scale detection of protein complexes has remained constrained by experimental limitations. As such, computational detection of protein complexes by applying clustering algorithms on the abundantly available protein-protein interaction (PPI) networks is an important alternative. However, many current algorithms have overlooked the importance of selecting seeds for expansion into clusters without excluding important proteins and including many noisy ones, while ensuring a high degree of functional homogeneity amongst the proteins detected for the complexes. RESULTS: We designed a novel method called Probabilistic Local Walks (PLW) which clusters regions in a PPI network with high functional similarity to find protein complex cores with high precision and efficiency in [Formula: see text] (|V| log |V| + |E|) time. A seed selection strategy, which prioritises seeds with dense neighbourhoods, was devised. We defined a topological measure, called common neighbour similarity, to estimate the functional similarity of two proteins given the number of their common neighbours. CONCLUSIONS: Our proposed PLW algorithm achieved the highest F-measure (recall and precision) when compared to 11 state-of-the-art methods on yeast protein interaction data, with an improvement of 16.7% over the next highest score. Our experiments also demonstrated that our seed selection strategy is able to increase algorithm precision when applied to three previous protein complex mining techniques. AVAILABILITY: The software, datasets and predicted complexes are available at http://wonglkd.github.io/PLW
format Online
Article
Text
id pubmed-3852146
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-38521462013-12-20 PLW: Probabilistic Local Walks for detecting protein complexes from protein interaction networks Wong, Daniel Lin-Kit Li, Xiao-Li Wu, Min Zheng, Jie Ng, See-Kiong BMC Genomics Research BACKGROUND: Many biological processes are carried out by proteins interacting with each other in the form of protein complexes. However, large-scale detection of protein complexes has remained constrained by experimental limitations. As such, computational detection of protein complexes by applying clustering algorithms on the abundantly available protein-protein interaction (PPI) networks is an important alternative. However, many current algorithms have overlooked the importance of selecting seeds for expansion into clusters without excluding important proteins and including many noisy ones, while ensuring a high degree of functional homogeneity amongst the proteins detected for the complexes. RESULTS: We designed a novel method called Probabilistic Local Walks (PLW) which clusters regions in a PPI network with high functional similarity to find protein complex cores with high precision and efficiency in [Formula: see text] (|V| log |V| + |E|) time. A seed selection strategy, which prioritises seeds with dense neighbourhoods, was devised. We defined a topological measure, called common neighbour similarity, to estimate the functional similarity of two proteins given the number of their common neighbours. CONCLUSIONS: Our proposed PLW algorithm achieved the highest F-measure (recall and precision) when compared to 11 state-of-the-art methods on yeast protein interaction data, with an improvement of 16.7% over the next highest score. Our experiments also demonstrated that our seed selection strategy is able to increase algorithm precision when applied to three previous protein complex mining techniques. AVAILABILITY: The software, datasets and predicted complexes are available at http://wonglkd.github.io/PLW BioMed Central 2013-10-16 /pmc/articles/PMC3852146/ /pubmed/24564427 http://dx.doi.org/10.1186/1471-2164-14-S5-S15 Text en Copyright © 2013 Wong et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Wong, Daniel Lin-Kit
Li, Xiao-Li
Wu, Min
Zheng, Jie
Ng, See-Kiong
PLW: Probabilistic Local Walks for detecting protein complexes from protein interaction networks
title PLW: Probabilistic Local Walks for detecting protein complexes from protein interaction networks
title_full PLW: Probabilistic Local Walks for detecting protein complexes from protein interaction networks
title_fullStr PLW: Probabilistic Local Walks for detecting protein complexes from protein interaction networks
title_full_unstemmed PLW: Probabilistic Local Walks for detecting protein complexes from protein interaction networks
title_short PLW: Probabilistic Local Walks for detecting protein complexes from protein interaction networks
title_sort plw: probabilistic local walks for detecting protein complexes from protein interaction networks
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3852146/
https://www.ncbi.nlm.nih.gov/pubmed/24564427
http://dx.doi.org/10.1186/1471-2164-14-S5-S15
work_keys_str_mv AT wongdaniellinkit plwprobabilisticlocalwalksfordetectingproteincomplexesfromproteininteractionnetworks
AT lixiaoli plwprobabilisticlocalwalksfordetectingproteincomplexesfromproteininteractionnetworks
AT wumin plwprobabilisticlocalwalksfordetectingproteincomplexesfromproteininteractionnetworks
AT zhengjie plwprobabilisticlocalwalksfordetectingproteincomplexesfromproteininteractionnetworks
AT ngseekiong plwprobabilisticlocalwalksfordetectingproteincomplexesfromproteininteractionnetworks