Cargando…

Protein complex detection based on partially shared multi-view clustering

BACKGROUND: Protein complexes are the key molecular entities to perform many essential biological functions. In recent years, high-throughput experimental techniques have generated a large amount of protein interaction data. As a consequence, computational analysis of such data for protein complex d...

Descripción completa

Detalles Bibliográficos
Autores principales: Ou-Yang, Le, Zhang, Xiao-Fei, Dai, Dao-Qing, Wu, Meng-Yun, Zhu, Yuan, Liu, Zhiyong, Yan, Hong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5022186/
https://www.ncbi.nlm.nih.gov/pubmed/27623844
http://dx.doi.org/10.1186/s12859-016-1164-9
_version_ 1782453473739014144
author Ou-Yang, Le
Zhang, Xiao-Fei
Dai, Dao-Qing
Wu, Meng-Yun
Zhu, Yuan
Liu, Zhiyong
Yan, Hong
author_facet Ou-Yang, Le
Zhang, Xiao-Fei
Dai, Dao-Qing
Wu, Meng-Yun
Zhu, Yuan
Liu, Zhiyong
Yan, Hong
author_sort Ou-Yang, Le
collection PubMed
description BACKGROUND: Protein complexes are the key molecular entities to perform many essential biological functions. In recent years, high-throughput experimental techniques have generated a large amount of protein interaction data. As a consequence, computational analysis of such data for protein complex detection has received increased attention in the literature. However, most existing works focus on predicting protein complexes from a single type of data, either physical interaction data or co-complex interaction data. These two types of data provide compatible and complementary information, so it is necessary to integrate them to discover the underlying structures and obtain better performance in complex detection. RESULTS: In this study, we propose a novel multi-view clustering algorithm, called the Partially Shared Multi-View Clustering model (PSMVC), to carry out such an integrated analysis. Unlike traditional multi-view learning algorithms that focus on mining either consistent or complementary information embedded in the multi-view data, PSMVC can jointly explore the shared and specific information inherent in different views. In our experiments, we compare the complexes detected by PSMVC from single data source with those detected from multiple data sources. We observe that jointly analyzing multi-view data benefits the detection of protein complexes. Furthermore, extensive experiment results demonstrate that PSMVC performs much better than 16 state-of-the-art complex detection techniques, including ensemble clustering and data integration techniques. CONCLUSIONS: In this work, we demonstrate that when integrating multiple data sources, using partially shared multi-view clustering model can help to identify protein complexes which are not readily identifiable by conventional single-view-based methods and other integrative analysis methods. All the results and source codes are available on https://github.com/Oyl-CityU/PSMVC. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-016-1164-9) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5022186
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-50221862016-09-20 Protein complex detection based on partially shared multi-view clustering Ou-Yang, Le Zhang, Xiao-Fei Dai, Dao-Qing Wu, Meng-Yun Zhu, Yuan Liu, Zhiyong Yan, Hong BMC Bioinformatics Research Article BACKGROUND: Protein complexes are the key molecular entities to perform many essential biological functions. In recent years, high-throughput experimental techniques have generated a large amount of protein interaction data. As a consequence, computational analysis of such data for protein complex detection has received increased attention in the literature. However, most existing works focus on predicting protein complexes from a single type of data, either physical interaction data or co-complex interaction data. These two types of data provide compatible and complementary information, so it is necessary to integrate them to discover the underlying structures and obtain better performance in complex detection. RESULTS: In this study, we propose a novel multi-view clustering algorithm, called the Partially Shared Multi-View Clustering model (PSMVC), to carry out such an integrated analysis. Unlike traditional multi-view learning algorithms that focus on mining either consistent or complementary information embedded in the multi-view data, PSMVC can jointly explore the shared and specific information inherent in different views. In our experiments, we compare the complexes detected by PSMVC from single data source with those detected from multiple data sources. We observe that jointly analyzing multi-view data benefits the detection of protein complexes. Furthermore, extensive experiment results demonstrate that PSMVC performs much better than 16 state-of-the-art complex detection techniques, including ensemble clustering and data integration techniques. CONCLUSIONS: In this work, we demonstrate that when integrating multiple data sources, using partially shared multi-view clustering model can help to identify protein complexes which are not readily identifiable by conventional single-view-based methods and other integrative analysis methods. All the results and source codes are available on https://github.com/Oyl-CityU/PSMVC. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-016-1164-9) contains supplementary material, which is available to authorized users. BioMed Central 2016-09-13 /pmc/articles/PMC5022186/ /pubmed/27623844 http://dx.doi.org/10.1186/s12859-016-1164-9 Text en © The Author(s) 2016 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Ou-Yang, Le
Zhang, Xiao-Fei
Dai, Dao-Qing
Wu, Meng-Yun
Zhu, Yuan
Liu, Zhiyong
Yan, Hong
Protein complex detection based on partially shared multi-view clustering
title Protein complex detection based on partially shared multi-view clustering
title_full Protein complex detection based on partially shared multi-view clustering
title_fullStr Protein complex detection based on partially shared multi-view clustering
title_full_unstemmed Protein complex detection based on partially shared multi-view clustering
title_short Protein complex detection based on partially shared multi-view clustering
title_sort protein complex detection based on partially shared multi-view clustering
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5022186/
https://www.ncbi.nlm.nih.gov/pubmed/27623844
http://dx.doi.org/10.1186/s12859-016-1164-9
work_keys_str_mv AT ouyangle proteincomplexdetectionbasedonpartiallysharedmultiviewclustering
AT zhangxiaofei proteincomplexdetectionbasedonpartiallysharedmultiviewclustering
AT daidaoqing proteincomplexdetectionbasedonpartiallysharedmultiviewclustering
AT wumengyun proteincomplexdetectionbasedonpartiallysharedmultiviewclustering
AT zhuyuan proteincomplexdetectionbasedonpartiallysharedmultiviewclustering
AT liuzhiyong proteincomplexdetectionbasedonpartiallysharedmultiviewclustering
AT yanhong proteincomplexdetectionbasedonpartiallysharedmultiviewclustering