Cargando…

Using contrast patterns between true complexes and random subgraphs in PPI networks to predict unknown protein complexes

Most protein complex detection methods utilize unsupervised techniques to cluster densely connected nodes in a protein-protein interaction (PPI) network, in spite of the fact that many true complexes are not dense subgraphs. Supervised methods have been proposed recently, but they do not answer why...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Quanzhong, Song, Jiangning, Li, Jinyan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4751475/
https://www.ncbi.nlm.nih.gov/pubmed/26868667
http://dx.doi.org/10.1038/srep21223
_version_ 1782415589341396992
author Liu, Quanzhong
Song, Jiangning
Li, Jinyan
author_facet Liu, Quanzhong
Song, Jiangning
Li, Jinyan
author_sort Liu, Quanzhong
collection PubMed
description Most protein complex detection methods utilize unsupervised techniques to cluster densely connected nodes in a protein-protein interaction (PPI) network, in spite of the fact that many true complexes are not dense subgraphs. Supervised methods have been proposed recently, but they do not answer why a group of proteins are predicted as a complex, and they have not investigated how to detect new complexes of one species by training the model on the PPI data of another species. We propose a novel supervised method to address these issues. The key idea is to discover emerging patterns (EPs), a type of contrast pattern, which can clearly distinguish true complexes from random subgraphs in a PPI network. An integrative score of EPs is defined to measure how likely a subgraph of proteins can form a complex. New complexes thus can grow from our seed proteins by iteratively updating this score. The performance of our method is tested on eight benchmark PPI datasets and compared with seven unsupervised methods, two supervised and one semi-supervised methods under five standards to assess the quality of the predicted complexes. The results show that in most cases our method achieved a better performance, sometimes significantly.
format Online
Article
Text
id pubmed-4751475
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-47514752016-02-22 Using contrast patterns between true complexes and random subgraphs in PPI networks to predict unknown protein complexes Liu, Quanzhong Song, Jiangning Li, Jinyan Sci Rep Article Most protein complex detection methods utilize unsupervised techniques to cluster densely connected nodes in a protein-protein interaction (PPI) network, in spite of the fact that many true complexes are not dense subgraphs. Supervised methods have been proposed recently, but they do not answer why a group of proteins are predicted as a complex, and they have not investigated how to detect new complexes of one species by training the model on the PPI data of another species. We propose a novel supervised method to address these issues. The key idea is to discover emerging patterns (EPs), a type of contrast pattern, which can clearly distinguish true complexes from random subgraphs in a PPI network. An integrative score of EPs is defined to measure how likely a subgraph of proteins can form a complex. New complexes thus can grow from our seed proteins by iteratively updating this score. The performance of our method is tested on eight benchmark PPI datasets and compared with seven unsupervised methods, two supervised and one semi-supervised methods under five standards to assess the quality of the predicted complexes. The results show that in most cases our method achieved a better performance, sometimes significantly. Nature Publishing Group 2016-02-12 /pmc/articles/PMC4751475/ /pubmed/26868667 http://dx.doi.org/10.1038/srep21223 Text en Copyright © 2016, Macmillan Publishers Limited http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
spellingShingle Article
Liu, Quanzhong
Song, Jiangning
Li, Jinyan
Using contrast patterns between true complexes and random subgraphs in PPI networks to predict unknown protein complexes
title Using contrast patterns between true complexes and random subgraphs in PPI networks to predict unknown protein complexes
title_full Using contrast patterns between true complexes and random subgraphs in PPI networks to predict unknown protein complexes
title_fullStr Using contrast patterns between true complexes and random subgraphs in PPI networks to predict unknown protein complexes
title_full_unstemmed Using contrast patterns between true complexes and random subgraphs in PPI networks to predict unknown protein complexes
title_short Using contrast patterns between true complexes and random subgraphs in PPI networks to predict unknown protein complexes
title_sort using contrast patterns between true complexes and random subgraphs in ppi networks to predict unknown protein complexes
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4751475/
https://www.ncbi.nlm.nih.gov/pubmed/26868667
http://dx.doi.org/10.1038/srep21223
work_keys_str_mv AT liuquanzhong usingcontrastpatternsbetweentruecomplexesandrandomsubgraphsinppinetworkstopredictunknownproteincomplexes
AT songjiangning usingcontrastpatternsbetweentruecomplexesandrandomsubgraphsinppinetworkstopredictunknownproteincomplexes
AT lijinyan usingcontrastpatternsbetweentruecomplexesandrandomsubgraphsinppinetworkstopredictunknownproteincomplexes