Cargando…
Using contrast patterns between true complexes and random subgraphs in PPI networks to predict unknown protein complexes
Most protein complex detection methods utilize unsupervised techniques to cluster densely connected nodes in a protein-protein interaction (PPI) network, in spite of the fact that many true complexes are not dense subgraphs. Supervised methods have been proposed recently, but they do not answer why...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4751475/ https://www.ncbi.nlm.nih.gov/pubmed/26868667 http://dx.doi.org/10.1038/srep21223 |
_version_ | 1782415589341396992 |
---|---|
author | Liu, Quanzhong Song, Jiangning Li, Jinyan |
author_facet | Liu, Quanzhong Song, Jiangning Li, Jinyan |
author_sort | Liu, Quanzhong |
collection | PubMed |
description | Most protein complex detection methods utilize unsupervised techniques to cluster densely connected nodes in a protein-protein interaction (PPI) network, in spite of the fact that many true complexes are not dense subgraphs. Supervised methods have been proposed recently, but they do not answer why a group of proteins are predicted as a complex, and they have not investigated how to detect new complexes of one species by training the model on the PPI data of another species. We propose a novel supervised method to address these issues. The key idea is to discover emerging patterns (EPs), a type of contrast pattern, which can clearly distinguish true complexes from random subgraphs in a PPI network. An integrative score of EPs is defined to measure how likely a subgraph of proteins can form a complex. New complexes thus can grow from our seed proteins by iteratively updating this score. The performance of our method is tested on eight benchmark PPI datasets and compared with seven unsupervised methods, two supervised and one semi-supervised methods under five standards to assess the quality of the predicted complexes. The results show that in most cases our method achieved a better performance, sometimes significantly. |
format | Online Article Text |
id | pubmed-4751475 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Nature Publishing Group |
record_format | MEDLINE/PubMed |
spelling | pubmed-47514752016-02-22 Using contrast patterns between true complexes and random subgraphs in PPI networks to predict unknown protein complexes Liu, Quanzhong Song, Jiangning Li, Jinyan Sci Rep Article Most protein complex detection methods utilize unsupervised techniques to cluster densely connected nodes in a protein-protein interaction (PPI) network, in spite of the fact that many true complexes are not dense subgraphs. Supervised methods have been proposed recently, but they do not answer why a group of proteins are predicted as a complex, and they have not investigated how to detect new complexes of one species by training the model on the PPI data of another species. We propose a novel supervised method to address these issues. The key idea is to discover emerging patterns (EPs), a type of contrast pattern, which can clearly distinguish true complexes from random subgraphs in a PPI network. An integrative score of EPs is defined to measure how likely a subgraph of proteins can form a complex. New complexes thus can grow from our seed proteins by iteratively updating this score. The performance of our method is tested on eight benchmark PPI datasets and compared with seven unsupervised methods, two supervised and one semi-supervised methods under five standards to assess the quality of the predicted complexes. The results show that in most cases our method achieved a better performance, sometimes significantly. Nature Publishing Group 2016-02-12 /pmc/articles/PMC4751475/ /pubmed/26868667 http://dx.doi.org/10.1038/srep21223 Text en Copyright © 2016, Macmillan Publishers Limited http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ |
spellingShingle | Article Liu, Quanzhong Song, Jiangning Li, Jinyan Using contrast patterns between true complexes and random subgraphs in PPI networks to predict unknown protein complexes |
title | Using contrast patterns between true complexes and random subgraphs in PPI networks to predict unknown protein complexes |
title_full | Using contrast patterns between true complexes and random subgraphs in PPI networks to predict unknown protein complexes |
title_fullStr | Using contrast patterns between true complexes and random subgraphs in PPI networks to predict unknown protein complexes |
title_full_unstemmed | Using contrast patterns between true complexes and random subgraphs in PPI networks to predict unknown protein complexes |
title_short | Using contrast patterns between true complexes and random subgraphs in PPI networks to predict unknown protein complexes |
title_sort | using contrast patterns between true complexes and random subgraphs in ppi networks to predict unknown protein complexes |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4751475/ https://www.ncbi.nlm.nih.gov/pubmed/26868667 http://dx.doi.org/10.1038/srep21223 |
work_keys_str_mv | AT liuquanzhong usingcontrastpatternsbetweentruecomplexesandrandomsubgraphsinppinetworkstopredictunknownproteincomplexes AT songjiangning usingcontrastpatternsbetweentruecomplexesandrandomsubgraphsinppinetworkstopredictunknownproteincomplexes AT lijinyan usingcontrastpatternsbetweentruecomplexesandrandomsubgraphsinppinetworkstopredictunknownproteincomplexes |