Cargando…
Protein complex detection based on partially shared multi-view clustering
BACKGROUND: Protein complexes are the key molecular entities to perform many essential biological functions. In recent years, high-throughput experimental techniques have generated a large amount of protein interaction data. As a consequence, computational analysis of such data for protein complex d...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5022186/ https://www.ncbi.nlm.nih.gov/pubmed/27623844 http://dx.doi.org/10.1186/s12859-016-1164-9 |
_version_ | 1782453473739014144 |
---|---|
author | Ou-Yang, Le Zhang, Xiao-Fei Dai, Dao-Qing Wu, Meng-Yun Zhu, Yuan Liu, Zhiyong Yan, Hong |
author_facet | Ou-Yang, Le Zhang, Xiao-Fei Dai, Dao-Qing Wu, Meng-Yun Zhu, Yuan Liu, Zhiyong Yan, Hong |
author_sort | Ou-Yang, Le |
collection | PubMed |
description | BACKGROUND: Protein complexes are the key molecular entities to perform many essential biological functions. In recent years, high-throughput experimental techniques have generated a large amount of protein interaction data. As a consequence, computational analysis of such data for protein complex detection has received increased attention in the literature. However, most existing works focus on predicting protein complexes from a single type of data, either physical interaction data or co-complex interaction data. These two types of data provide compatible and complementary information, so it is necessary to integrate them to discover the underlying structures and obtain better performance in complex detection. RESULTS: In this study, we propose a novel multi-view clustering algorithm, called the Partially Shared Multi-View Clustering model (PSMVC), to carry out such an integrated analysis. Unlike traditional multi-view learning algorithms that focus on mining either consistent or complementary information embedded in the multi-view data, PSMVC can jointly explore the shared and specific information inherent in different views. In our experiments, we compare the complexes detected by PSMVC from single data source with those detected from multiple data sources. We observe that jointly analyzing multi-view data benefits the detection of protein complexes. Furthermore, extensive experiment results demonstrate that PSMVC performs much better than 16 state-of-the-art complex detection techniques, including ensemble clustering and data integration techniques. CONCLUSIONS: In this work, we demonstrate that when integrating multiple data sources, using partially shared multi-view clustering model can help to identify protein complexes which are not readily identifiable by conventional single-view-based methods and other integrative analysis methods. All the results and source codes are available on https://github.com/Oyl-CityU/PSMVC. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-016-1164-9) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-5022186 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-50221862016-09-20 Protein complex detection based on partially shared multi-view clustering Ou-Yang, Le Zhang, Xiao-Fei Dai, Dao-Qing Wu, Meng-Yun Zhu, Yuan Liu, Zhiyong Yan, Hong BMC Bioinformatics Research Article BACKGROUND: Protein complexes are the key molecular entities to perform many essential biological functions. In recent years, high-throughput experimental techniques have generated a large amount of protein interaction data. As a consequence, computational analysis of such data for protein complex detection has received increased attention in the literature. However, most existing works focus on predicting protein complexes from a single type of data, either physical interaction data or co-complex interaction data. These two types of data provide compatible and complementary information, so it is necessary to integrate them to discover the underlying structures and obtain better performance in complex detection. RESULTS: In this study, we propose a novel multi-view clustering algorithm, called the Partially Shared Multi-View Clustering model (PSMVC), to carry out such an integrated analysis. Unlike traditional multi-view learning algorithms that focus on mining either consistent or complementary information embedded in the multi-view data, PSMVC can jointly explore the shared and specific information inherent in different views. In our experiments, we compare the complexes detected by PSMVC from single data source with those detected from multiple data sources. We observe that jointly analyzing multi-view data benefits the detection of protein complexes. Furthermore, extensive experiment results demonstrate that PSMVC performs much better than 16 state-of-the-art complex detection techniques, including ensemble clustering and data integration techniques. CONCLUSIONS: In this work, we demonstrate that when integrating multiple data sources, using partially shared multi-view clustering model can help to identify protein complexes which are not readily identifiable by conventional single-view-based methods and other integrative analysis methods. All the results and source codes are available on https://github.com/Oyl-CityU/PSMVC. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-016-1164-9) contains supplementary material, which is available to authorized users. BioMed Central 2016-09-13 /pmc/articles/PMC5022186/ /pubmed/27623844 http://dx.doi.org/10.1186/s12859-016-1164-9 Text en © The Author(s) 2016 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Ou-Yang, Le Zhang, Xiao-Fei Dai, Dao-Qing Wu, Meng-Yun Zhu, Yuan Liu, Zhiyong Yan, Hong Protein complex detection based on partially shared multi-view clustering |
title | Protein complex detection based on partially shared multi-view clustering |
title_full | Protein complex detection based on partially shared multi-view clustering |
title_fullStr | Protein complex detection based on partially shared multi-view clustering |
title_full_unstemmed | Protein complex detection based on partially shared multi-view clustering |
title_short | Protein complex detection based on partially shared multi-view clustering |
title_sort | protein complex detection based on partially shared multi-view clustering |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5022186/ https://www.ncbi.nlm.nih.gov/pubmed/27623844 http://dx.doi.org/10.1186/s12859-016-1164-9 |
work_keys_str_mv | AT ouyangle proteincomplexdetectionbasedonpartiallysharedmultiviewclustering AT zhangxiaofei proteincomplexdetectionbasedonpartiallysharedmultiviewclustering AT daidaoqing proteincomplexdetectionbasedonpartiallysharedmultiviewclustering AT wumengyun proteincomplexdetectionbasedonpartiallysharedmultiviewclustering AT zhuyuan proteincomplexdetectionbasedonpartiallysharedmultiviewclustering AT liuzhiyong proteincomplexdetectionbasedonpartiallysharedmultiviewclustering AT yanhong proteincomplexdetectionbasedonpartiallysharedmultiviewclustering |