Cargando…

Detection of protein complexes from affinity purification/mass spectrometry data

BACKGROUND: Recent advances in molecular biology have led to the accumulation of large amounts of data on protein-protein interaction networks in different species. An important challenge for the analysis of these data is to extract functional modules such as protein complexes and biological process...

Descripción completa

Detalles Bibliográficos
Autores principales:	Cai, Bingjing, Wang, Haiying, Zheng, Huiru, Wang, Hui
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2012
Materias:	Research
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3524315/ https://www.ncbi.nlm.nih.gov/pubmed/23282282 http://dx.doi.org/10.1186/1752-0509-6-S3-S4

_version_	1782253311358926848
author	Cai, Bingjing Wang, Haiying Zheng, Huiru Wang, Hui
author_facet	Cai, Bingjing Wang, Haiying Zheng, Huiru Wang, Hui
author_sort	Cai, Bingjing
collection	PubMed
description	BACKGROUND: Recent advances in molecular biology have led to the accumulation of large amounts of data on protein-protein interaction networks in different species. An important challenge for the analysis of these data is to extract functional modules such as protein complexes and biological processes from networks which are characterised by the present of a significant number of false positives. Various computational techniques have been applied in recent years. However, most of them treat protein interaction as binary. Co-complex relations derived from affinity purification/mass spectrometry (AP-MS) experiments have been largely ignored. METHODS: This paper presents a new algorithm for detecting protein complexes from AP-MS data. The algorithm intends to detect groups of prey proteins that are significantly co-associated with the same set of bait proteins. We first construct AP-MS data as a bipartite network, where one set of nodes consists of bait proteins and the other set is composed of prey proteins. We then calculate pair-wise similarities of bait proteins based on the number of their commonly shared neighbours. A hierarchical clustering algorithm is employed to cluster bait proteins based on the similarities and thus a set of 'seed' clusters is obtained. Starting from these 'seed' clusters, an expansion process is developed to identify prey proteins which are significantly associated with the same set of bait proteins. Then, a set of complete protein complexes is derived. In application to two real AP-MS datasets, we validate biological significance of predicted protein complexes by using curated protein complexes and well-characterized cellular component annotation from Gene Ontology (GO). Several statistical metrics have been applied for evaluation. RESULTS: Experimental results show that, the proposed algorithm achieves significant improvement in detecting protein complexes from AP-MS data. In comparison to the well-known MCL algorithm, our algorithm improves the accuracy rate by about 20% in detecting protein complexes in both networks and increases the F-Measure value by about 50% in Krogan_2006 network. Greater precision and better accuracy have been achieved and the identified complexes are demonstrated to match well with existing curated protein complexes. CONCLUSIONS: Our study highlights the significance of taking co-complex relations into account when extracting protein complexes from AP-MS data. The algorithm proposed in this paper can be easily extended to the analysis of other biological networks which can be conveniently represented by bipartite graphs such as drug-target networks.
format	Online Article Text
id	pubmed-3524315
institution	National Center for Biotechnology Information
language	English
publishDate	2012
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-35243152012-12-21 Detection of protein complexes from affinity purification/mass spectrometry data Cai, Bingjing Wang, Haiying Zheng, Huiru Wang, Hui BMC Syst Biol Research BACKGROUND: Recent advances in molecular biology have led to the accumulation of large amounts of data on protein-protein interaction networks in different species. An important challenge for the analysis of these data is to extract functional modules such as protein complexes and biological processes from networks which are characterised by the present of a significant number of false positives. Various computational techniques have been applied in recent years. However, most of them treat protein interaction as binary. Co-complex relations derived from affinity purification/mass spectrometry (AP-MS) experiments have been largely ignored. METHODS: This paper presents a new algorithm for detecting protein complexes from AP-MS data. The algorithm intends to detect groups of prey proteins that are significantly co-associated with the same set of bait proteins. We first construct AP-MS data as a bipartite network, where one set of nodes consists of bait proteins and the other set is composed of prey proteins. We then calculate pair-wise similarities of bait proteins based on the number of their commonly shared neighbours. A hierarchical clustering algorithm is employed to cluster bait proteins based on the similarities and thus a set of 'seed' clusters is obtained. Starting from these 'seed' clusters, an expansion process is developed to identify prey proteins which are significantly associated with the same set of bait proteins. Then, a set of complete protein complexes is derived. In application to two real AP-MS datasets, we validate biological significance of predicted protein complexes by using curated protein complexes and well-characterized cellular component annotation from Gene Ontology (GO). Several statistical metrics have been applied for evaluation. RESULTS: Experimental results show that, the proposed algorithm achieves significant improvement in detecting protein complexes from AP-MS data. In comparison to the well-known MCL algorithm, our algorithm improves the accuracy rate by about 20% in detecting protein complexes in both networks and increases the F-Measure value by about 50% in Krogan_2006 network. Greater precision and better accuracy have been achieved and the identified complexes are demonstrated to match well with existing curated protein complexes. CONCLUSIONS: Our study highlights the significance of taking co-complex relations into account when extracting protein complexes from AP-MS data. The algorithm proposed in this paper can be easily extended to the analysis of other biological networks which can be conveniently represented by bipartite graphs such as drug-target networks. BioMed Central 2012-12-17 /pmc/articles/PMC3524315/ /pubmed/23282282 http://dx.doi.org/10.1186/1752-0509-6-S3-S4 Text en Copyright ©2012 Cai et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Research Cai, Bingjing Wang, Haiying Zheng, Huiru Wang, Hui Detection of protein complexes from affinity purification/mass spectrometry data
title	Detection of protein complexes from affinity purification/mass spectrometry data
title_full	Detection of protein complexes from affinity purification/mass spectrometry data
title_fullStr	Detection of protein complexes from affinity purification/mass spectrometry data
title_full_unstemmed	Detection of protein complexes from affinity purification/mass spectrometry data
title_short	Detection of protein complexes from affinity purification/mass spectrometry data
title_sort	detection of protein complexes from affinity purification/mass spectrometry data
topic	Research
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3524315/ https://www.ncbi.nlm.nih.gov/pubmed/23282282 http://dx.doi.org/10.1186/1752-0509-6-S3-S4
work_keys_str_mv	AT caibingjing detectionofproteincomplexesfromaffinitypurificationmassspectrometrydata AT wanghaiying detectionofproteincomplexesfromaffinitypurificationmassspectrometrydata AT zhenghuiru detectionofproteincomplexesfromaffinitypurificationmassspectrometrydata AT wanghui detectionofproteincomplexesfromaffinitypurificationmassspectrometrydata

Detection of protein complexes from affinity purification/mass spectrometry data

Ejemplares similares