Cargando…
CPredictor3.0: detecting protein complexes from PPI networks with expression data and functional annotations
BACKGROUND: Effectively predicting protein complexes not only helps to understand the structures and functions of proteins and their complexes, but also is useful for diagnosing disease and developing new drugs. Up to now, many methods have been developed to detect complexes by mining dense subgraph...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5763309/ https://www.ncbi.nlm.nih.gov/pubmed/29322927 http://dx.doi.org/10.1186/s12918-017-0504-3 |
_version_ | 1783291859108364288 |
---|---|
author | Xu, Ying Zhou, Jiaogen Zhou, Shuigeng Guan, Jihong |
author_facet | Xu, Ying Zhou, Jiaogen Zhou, Shuigeng Guan, Jihong |
author_sort | Xu, Ying |
collection | PubMed |
description | BACKGROUND: Effectively predicting protein complexes not only helps to understand the structures and functions of proteins and their complexes, but also is useful for diagnosing disease and developing new drugs. Up to now, many methods have been developed to detect complexes by mining dense subgraphs from static protein-protein interaction (PPI) networks, while ignoring the value of other biological information and the dynamic properties of cellular systems. RESULTS: In this paper, based on our previous works CPredictor and CPredictor2.0, we present a new method for predicting complexes from PPI networks with both gene expression data and protein functional annotations, which is called CPredictor3.0. This new method follows the viewpoint that proteins in the same complex should roughly have similar functions and are active at the same time and place in cellular systems. We first detect active proteins by using gene express data of different time points and cluster proteins by using gene ontology (GO) functional annotations, respectively. Then, for each time point, we do set intersections with one set corresponding to active proteins generated from expression data and the other set corresponding to a protein cluster generated from functional annotations. Each resulting unique set indicates a cluster of proteins that have similar function(s) and are active at that time point. Following that, we map each cluster of active proteins of similar function onto a static PPI network, and get a series of induced connected subgraphs. We treat these subgraphs as candidate complexes. Finally, by expanding and merging these candidate complexes, the predicted complexes are obtained. We evaluate CPredictor3.0 and compare it with a number of existing methods on several PPI networks and benchmarking complex datasets. The experimental results show that CPredictor3.0 achieves the highest F1-measure, which indicates that CPredictor3.0 outperforms these existing method in overall. CONCLUSION: CPredictor3.0 can serve as a promising tool of protein complex prediction. |
format | Online Article Text |
id | pubmed-5763309 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-57633092018-01-17 CPredictor3.0: detecting protein complexes from PPI networks with expression data and functional annotations Xu, Ying Zhou, Jiaogen Zhou, Shuigeng Guan, Jihong BMC Syst Biol Research BACKGROUND: Effectively predicting protein complexes not only helps to understand the structures and functions of proteins and their complexes, but also is useful for diagnosing disease and developing new drugs. Up to now, many methods have been developed to detect complexes by mining dense subgraphs from static protein-protein interaction (PPI) networks, while ignoring the value of other biological information and the dynamic properties of cellular systems. RESULTS: In this paper, based on our previous works CPredictor and CPredictor2.0, we present a new method for predicting complexes from PPI networks with both gene expression data and protein functional annotations, which is called CPredictor3.0. This new method follows the viewpoint that proteins in the same complex should roughly have similar functions and are active at the same time and place in cellular systems. We first detect active proteins by using gene express data of different time points and cluster proteins by using gene ontology (GO) functional annotations, respectively. Then, for each time point, we do set intersections with one set corresponding to active proteins generated from expression data and the other set corresponding to a protein cluster generated from functional annotations. Each resulting unique set indicates a cluster of proteins that have similar function(s) and are active at that time point. Following that, we map each cluster of active proteins of similar function onto a static PPI network, and get a series of induced connected subgraphs. We treat these subgraphs as candidate complexes. Finally, by expanding and merging these candidate complexes, the predicted complexes are obtained. We evaluate CPredictor3.0 and compare it with a number of existing methods on several PPI networks and benchmarking complex datasets. The experimental results show that CPredictor3.0 achieves the highest F1-measure, which indicates that CPredictor3.0 outperforms these existing method in overall. CONCLUSION: CPredictor3.0 can serve as a promising tool of protein complex prediction. BioMed Central 2017-12-21 /pmc/articles/PMC5763309/ /pubmed/29322927 http://dx.doi.org/10.1186/s12918-017-0504-3 Text en © The Author(s) 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Xu, Ying Zhou, Jiaogen Zhou, Shuigeng Guan, Jihong CPredictor3.0: detecting protein complexes from PPI networks with expression data and functional annotations |
title | CPredictor3.0: detecting protein complexes from PPI networks with expression data and functional annotations |
title_full | CPredictor3.0: detecting protein complexes from PPI networks with expression data and functional annotations |
title_fullStr | CPredictor3.0: detecting protein complexes from PPI networks with expression data and functional annotations |
title_full_unstemmed | CPredictor3.0: detecting protein complexes from PPI networks with expression data and functional annotations |
title_short | CPredictor3.0: detecting protein complexes from PPI networks with expression data and functional annotations |
title_sort | cpredictor3.0: detecting protein complexes from ppi networks with expression data and functional annotations |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5763309/ https://www.ncbi.nlm.nih.gov/pubmed/29322927 http://dx.doi.org/10.1186/s12918-017-0504-3 |
work_keys_str_mv | AT xuying cpredictor30detectingproteincomplexesfromppinetworkswithexpressiondataandfunctionalannotations AT zhoujiaogen cpredictor30detectingproteincomplexesfromppinetworkswithexpressiondataandfunctionalannotations AT zhoushuigeng cpredictor30detectingproteincomplexesfromppinetworkswithexpressiondataandfunctionalannotations AT guanjihong cpredictor30detectingproteincomplexesfromppinetworkswithexpressiondataandfunctionalannotations |