Cargando…

Predicting protein complexes using a supervised learning method combined with local structural information

The existing protein complex detection methods can be broadly divided into two categories: unsupervised and supervised learning methods. Most of the unsupervised learning methods assume that protein complexes are in dense regions of protein-protein interaction (PPI) networks even though many true co...

Descripción completa

Detalles Bibliográficos
Autores principales: Dong, Yadong, Sun, Yongqi, Qin, Chao
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5858846/
https://www.ncbi.nlm.nih.gov/pubmed/29554120
http://dx.doi.org/10.1371/journal.pone.0194124
_version_ 1783307721740648448
author Dong, Yadong
Sun, Yongqi
Qin, Chao
author_facet Dong, Yadong
Sun, Yongqi
Qin, Chao
author_sort Dong, Yadong
collection PubMed
description The existing protein complex detection methods can be broadly divided into two categories: unsupervised and supervised learning methods. Most of the unsupervised learning methods assume that protein complexes are in dense regions of protein-protein interaction (PPI) networks even though many true complexes are not dense subgraphs. Supervised learning methods utilize the informative properties of known complexes; they often extract features from existing complexes and then use the features to train a classification model. The trained model is used to guide the search process for new complexes. However, insufficient extracted features, noise in the PPI data and the incompleteness of complex data make the classification model imprecise. Consequently, the classification model is not sufficient for guiding the detection of complexes. Therefore, we propose a new robust score function that combines the classification model with local structural information. Based on the score function, we provide a search method that works both forwards and backwards. The results from experiments on six benchmark PPI datasets and three protein complex datasets show that our approach can achieve better performance compared with the state-of-the-art supervised, semi-supervised and unsupervised methods for protein complex detection, occasionally significantly outperforming such methods.
format Online
Article
Text
id pubmed-5858846
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-58588462018-03-28 Predicting protein complexes using a supervised learning method combined with local structural information Dong, Yadong Sun, Yongqi Qin, Chao PLoS One Research Article The existing protein complex detection methods can be broadly divided into two categories: unsupervised and supervised learning methods. Most of the unsupervised learning methods assume that protein complexes are in dense regions of protein-protein interaction (PPI) networks even though many true complexes are not dense subgraphs. Supervised learning methods utilize the informative properties of known complexes; they often extract features from existing complexes and then use the features to train a classification model. The trained model is used to guide the search process for new complexes. However, insufficient extracted features, noise in the PPI data and the incompleteness of complex data make the classification model imprecise. Consequently, the classification model is not sufficient for guiding the detection of complexes. Therefore, we propose a new robust score function that combines the classification model with local structural information. Based on the score function, we provide a search method that works both forwards and backwards. The results from experiments on six benchmark PPI datasets and three protein complex datasets show that our approach can achieve better performance compared with the state-of-the-art supervised, semi-supervised and unsupervised methods for protein complex detection, occasionally significantly outperforming such methods. Public Library of Science 2018-03-19 /pmc/articles/PMC5858846/ /pubmed/29554120 http://dx.doi.org/10.1371/journal.pone.0194124 Text en © 2018 Dong et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Dong, Yadong
Sun, Yongqi
Qin, Chao
Predicting protein complexes using a supervised learning method combined with local structural information
title Predicting protein complexes using a supervised learning method combined with local structural information
title_full Predicting protein complexes using a supervised learning method combined with local structural information
title_fullStr Predicting protein complexes using a supervised learning method combined with local structural information
title_full_unstemmed Predicting protein complexes using a supervised learning method combined with local structural information
title_short Predicting protein complexes using a supervised learning method combined with local structural information
title_sort predicting protein complexes using a supervised learning method combined with local structural information
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5858846/
https://www.ncbi.nlm.nih.gov/pubmed/29554120
http://dx.doi.org/10.1371/journal.pone.0194124
work_keys_str_mv AT dongyadong predictingproteincomplexesusingasupervisedlearningmethodcombinedwithlocalstructuralinformation
AT sunyongqi predictingproteincomplexesusingasupervisedlearningmethodcombinedwithlocalstructuralinformation
AT qinchao predictingproteincomplexesusingasupervisedlearningmethodcombinedwithlocalstructuralinformation