Cargando…

CPredictor3.0: detecting protein complexes from PPI networks with expression data and functional annotations

BACKGROUND: Effectively predicting protein complexes not only helps to understand the structures and functions of proteins and their complexes, but also is useful for diagnosing disease and developing new drugs. Up to now, many methods have been developed to detect complexes by mining dense subgraph...

Descripción completa

Detalles Bibliográficos
Autores principales: Xu, Ying, Zhou, Jiaogen, Zhou, Shuigeng, Guan, Jihong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5763309/
https://www.ncbi.nlm.nih.gov/pubmed/29322927
http://dx.doi.org/10.1186/s12918-017-0504-3
_version_ 1783291859108364288
author Xu, Ying
Zhou, Jiaogen
Zhou, Shuigeng
Guan, Jihong
author_facet Xu, Ying
Zhou, Jiaogen
Zhou, Shuigeng
Guan, Jihong
author_sort Xu, Ying
collection PubMed
description BACKGROUND: Effectively predicting protein complexes not only helps to understand the structures and functions of proteins and their complexes, but also is useful for diagnosing disease and developing new drugs. Up to now, many methods have been developed to detect complexes by mining dense subgraphs from static protein-protein interaction (PPI) networks, while ignoring the value of other biological information and the dynamic properties of cellular systems. RESULTS: In this paper, based on our previous works CPredictor and CPredictor2.0, we present a new method for predicting complexes from PPI networks with both gene expression data and protein functional annotations, which is called CPredictor3.0. This new method follows the viewpoint that proteins in the same complex should roughly have similar functions and are active at the same time and place in cellular systems. We first detect active proteins by using gene express data of different time points and cluster proteins by using gene ontology (GO) functional annotations, respectively. Then, for each time point, we do set intersections with one set corresponding to active proteins generated from expression data and the other set corresponding to a protein cluster generated from functional annotations. Each resulting unique set indicates a cluster of proteins that have similar function(s) and are active at that time point. Following that, we map each cluster of active proteins of similar function onto a static PPI network, and get a series of induced connected subgraphs. We treat these subgraphs as candidate complexes. Finally, by expanding and merging these candidate complexes, the predicted complexes are obtained. We evaluate CPredictor3.0 and compare it with a number of existing methods on several PPI networks and benchmarking complex datasets. The experimental results show that CPredictor3.0 achieves the highest F1-measure, which indicates that CPredictor3.0 outperforms these existing method in overall. CONCLUSION: CPredictor3.0 can serve as a promising tool of protein complex prediction.
format Online
Article
Text
id pubmed-5763309
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-57633092018-01-17 CPredictor3.0: detecting protein complexes from PPI networks with expression data and functional annotations Xu, Ying Zhou, Jiaogen Zhou, Shuigeng Guan, Jihong BMC Syst Biol Research BACKGROUND: Effectively predicting protein complexes not only helps to understand the structures and functions of proteins and their complexes, but also is useful for diagnosing disease and developing new drugs. Up to now, many methods have been developed to detect complexes by mining dense subgraphs from static protein-protein interaction (PPI) networks, while ignoring the value of other biological information and the dynamic properties of cellular systems. RESULTS: In this paper, based on our previous works CPredictor and CPredictor2.0, we present a new method for predicting complexes from PPI networks with both gene expression data and protein functional annotations, which is called CPredictor3.0. This new method follows the viewpoint that proteins in the same complex should roughly have similar functions and are active at the same time and place in cellular systems. We first detect active proteins by using gene express data of different time points and cluster proteins by using gene ontology (GO) functional annotations, respectively. Then, for each time point, we do set intersections with one set corresponding to active proteins generated from expression data and the other set corresponding to a protein cluster generated from functional annotations. Each resulting unique set indicates a cluster of proteins that have similar function(s) and are active at that time point. Following that, we map each cluster of active proteins of similar function onto a static PPI network, and get a series of induced connected subgraphs. We treat these subgraphs as candidate complexes. Finally, by expanding and merging these candidate complexes, the predicted complexes are obtained. We evaluate CPredictor3.0 and compare it with a number of existing methods on several PPI networks and benchmarking complex datasets. The experimental results show that CPredictor3.0 achieves the highest F1-measure, which indicates that CPredictor3.0 outperforms these existing method in overall. CONCLUSION: CPredictor3.0 can serve as a promising tool of protein complex prediction. BioMed Central 2017-12-21 /pmc/articles/PMC5763309/ /pubmed/29322927 http://dx.doi.org/10.1186/s12918-017-0504-3 Text en © The Author(s) 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Xu, Ying
Zhou, Jiaogen
Zhou, Shuigeng
Guan, Jihong
CPredictor3.0: detecting protein complexes from PPI networks with expression data and functional annotations
title CPredictor3.0: detecting protein complexes from PPI networks with expression data and functional annotations
title_full CPredictor3.0: detecting protein complexes from PPI networks with expression data and functional annotations
title_fullStr CPredictor3.0: detecting protein complexes from PPI networks with expression data and functional annotations
title_full_unstemmed CPredictor3.0: detecting protein complexes from PPI networks with expression data and functional annotations
title_short CPredictor3.0: detecting protein complexes from PPI networks with expression data and functional annotations
title_sort cpredictor3.0: detecting protein complexes from ppi networks with expression data and functional annotations
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5763309/
https://www.ncbi.nlm.nih.gov/pubmed/29322927
http://dx.doi.org/10.1186/s12918-017-0504-3
work_keys_str_mv AT xuying cpredictor30detectingproteincomplexesfromppinetworkswithexpressiondataandfunctionalannotations
AT zhoujiaogen cpredictor30detectingproteincomplexesfromppinetworkswithexpressiondataandfunctionalannotations
AT zhoushuigeng cpredictor30detectingproteincomplexesfromppinetworkswithexpressiondataandfunctionalannotations
AT guanjihong cpredictor30detectingproteincomplexesfromppinetworkswithexpressiondataandfunctionalannotations