Cargando…

A two-layer integration framework for protein complex detection

BACKGROUND: Protein complexes carry out nearly all signaling and functional processes within cells. The study of protein complexes is an effective strategy to analyze cellular functions and biological processes. With the increasing availability of proteomics data, various computational methods have...

Descripción completa

Detalles Bibliográficos
Autores principales: Ou-Yang, Le, Wu, Min, Zhang, Xiao-Fei, Dai, Dao-Qing, Li, Xiao-Li, Yan, Hong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4765032/
https://www.ncbi.nlm.nih.gov/pubmed/26911324
http://dx.doi.org/10.1186/s12859-016-0939-3
_version_ 1782417486864449536
author Ou-Yang, Le
Wu, Min
Zhang, Xiao-Fei
Dai, Dao-Qing
Li, Xiao-Li
Yan, Hong
author_facet Ou-Yang, Le
Wu, Min
Zhang, Xiao-Fei
Dai, Dao-Qing
Li, Xiao-Li
Yan, Hong
author_sort Ou-Yang, Le
collection PubMed
description BACKGROUND: Protein complexes carry out nearly all signaling and functional processes within cells. The study of protein complexes is an effective strategy to analyze cellular functions and biological processes. With the increasing availability of proteomics data, various computational methods have recently been developed to predict protein complexes. However, different computational methods are based on their own assumptions and designed to work on different data sources, and various biological screening methods have their unique experiment conditions, and are often different in scale and noise level. Therefore, a single computational method on a specific data source is generally not able to generate comprehensive and reliable prediction results. RESULTS: In this paper, we develop a novel Two-layer INtegrative Complex Detection (TINCD) model to detect protein complexes, leveraging the information from both clustering results and raw data sources. In particular, we first integrate various clustering results to construct consensus matrices for proteins to measure their overall co-complex propensity. Second, we combine these consensus matrices with the co-complex score matrix derived from Tandem Affinity Purification/Mass Spectrometry (TAP) data and obtain an integrated co-complex similarity network via an unsupervised metric fusion method. Finally, a novel graph regularized doubly stochastic matrix decomposition model is proposed to detect overlapping protein complexes from the integrated similarity network. CONCLUSIONS: Extensive experimental results demonstrate that TINCD performs much better than 21 state-of-the-art complex detection techniques, including ensemble clustering and data integration techniques. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-016-0939-3) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4765032
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-47650322016-02-25 A two-layer integration framework for protein complex detection Ou-Yang, Le Wu, Min Zhang, Xiao-Fei Dai, Dao-Qing Li, Xiao-Li Yan, Hong BMC Bioinformatics Research Article BACKGROUND: Protein complexes carry out nearly all signaling and functional processes within cells. The study of protein complexes is an effective strategy to analyze cellular functions and biological processes. With the increasing availability of proteomics data, various computational methods have recently been developed to predict protein complexes. However, different computational methods are based on their own assumptions and designed to work on different data sources, and various biological screening methods have their unique experiment conditions, and are often different in scale and noise level. Therefore, a single computational method on a specific data source is generally not able to generate comprehensive and reliable prediction results. RESULTS: In this paper, we develop a novel Two-layer INtegrative Complex Detection (TINCD) model to detect protein complexes, leveraging the information from both clustering results and raw data sources. In particular, we first integrate various clustering results to construct consensus matrices for proteins to measure their overall co-complex propensity. Second, we combine these consensus matrices with the co-complex score matrix derived from Tandem Affinity Purification/Mass Spectrometry (TAP) data and obtain an integrated co-complex similarity network via an unsupervised metric fusion method. Finally, a novel graph regularized doubly stochastic matrix decomposition model is proposed to detect overlapping protein complexes from the integrated similarity network. CONCLUSIONS: Extensive experimental results demonstrate that TINCD performs much better than 21 state-of-the-art complex detection techniques, including ensemble clustering and data integration techniques. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-016-0939-3) contains supplementary material, which is available to authorized users. BioMed Central 2016-02-24 /pmc/articles/PMC4765032/ /pubmed/26911324 http://dx.doi.org/10.1186/s12859-016-0939-3 Text en © Ou-Yang et al. 2016 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Ou-Yang, Le
Wu, Min
Zhang, Xiao-Fei
Dai, Dao-Qing
Li, Xiao-Li
Yan, Hong
A two-layer integration framework for protein complex detection
title A two-layer integration framework for protein complex detection
title_full A two-layer integration framework for protein complex detection
title_fullStr A two-layer integration framework for protein complex detection
title_full_unstemmed A two-layer integration framework for protein complex detection
title_short A two-layer integration framework for protein complex detection
title_sort two-layer integration framework for protein complex detection
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4765032/
https://www.ncbi.nlm.nih.gov/pubmed/26911324
http://dx.doi.org/10.1186/s12859-016-0939-3
work_keys_str_mv AT ouyangle atwolayerintegrationframeworkforproteincomplexdetection
AT wumin atwolayerintegrationframeworkforproteincomplexdetection
AT zhangxiaofei atwolayerintegrationframeworkforproteincomplexdetection
AT daidaoqing atwolayerintegrationframeworkforproteincomplexdetection
AT lixiaoli atwolayerintegrationframeworkforproteincomplexdetection
AT yanhong atwolayerintegrationframeworkforproteincomplexdetection
AT ouyangle twolayerintegrationframeworkforproteincomplexdetection
AT wumin twolayerintegrationframeworkforproteincomplexdetection
AT zhangxiaofei twolayerintegrationframeworkforproteincomplexdetection
AT daidaoqing twolayerintegrationframeworkforproteincomplexdetection
AT lixiaoli twolayerintegrationframeworkforproteincomplexdetection
AT yanhong twolayerintegrationframeworkforproteincomplexdetection