Cargando…
Accuracy improvement in protein complex prediction from protein interaction networks by refining cluster overlaps
BACKGROUND: Recent computational techniques have facilitated analyzing genome-wide protein-protein interaction data for several model organisms. Various graph-clustering algorithms have been applied to protein interaction networks on the genomic scale for predicting the entire set of potential prote...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3380738/ https://www.ncbi.nlm.nih.gov/pubmed/22759580 http://dx.doi.org/10.1186/1477-5956-10-S1-S3 |
_version_ | 1782236337907171328 |
---|---|
author | Chiam, Tak Chien Cho, Young-Rae |
author_facet | Chiam, Tak Chien Cho, Young-Rae |
author_sort | Chiam, Tak Chien |
collection | PubMed |
description | BACKGROUND: Recent computational techniques have facilitated analyzing genome-wide protein-protein interaction data for several model organisms. Various graph-clustering algorithms have been applied to protein interaction networks on the genomic scale for predicting the entire set of potential protein complexes. In particular, the density-based clustering algorithms which are able to generate overlapping clusters, i.e. the clusters sharing a set of nodes, are well-suited to protein complex detection because each protein could be a member of multiple complexes. However, their accuracy is still limited because of complex overlap patterns of their output clusters. RESULTS: We present a systematic approach of refining the overlapping clusters identified from protein interaction networks. We have designed novel metrics to assess cluster overlaps: overlap coverage and overlapping consistency. We then propose an overlap refinement algorithm. It takes as input the clusters produced by existing density-based graph-clustering methods and generates a set of refined clusters by parameterizing the metrics. To evaluate protein complex prediction accuracy, we used the f-measure by comparing each refined cluster to known protein complexes. The experimental results with the yeast protein-protein interaction data sets from BioGRID and DIP demonstrate that accuracy on protein complex prediction has increased significantly after refining cluster overlaps. CONCLUSIONS: The effectiveness of the proposed cluster overlap refinement approach for protein complex detection has been validated in this study. Analyzing overlaps of the clusters from protein interaction networks is a crucial task for understanding of functional roles of proteins and topological characteristics of the functional systems. |
format | Online Article Text |
id | pubmed-3380738 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-33807382012-06-25 Accuracy improvement in protein complex prediction from protein interaction networks by refining cluster overlaps Chiam, Tak Chien Cho, Young-Rae Proteome Sci Proceedings BACKGROUND: Recent computational techniques have facilitated analyzing genome-wide protein-protein interaction data for several model organisms. Various graph-clustering algorithms have been applied to protein interaction networks on the genomic scale for predicting the entire set of potential protein complexes. In particular, the density-based clustering algorithms which are able to generate overlapping clusters, i.e. the clusters sharing a set of nodes, are well-suited to protein complex detection because each protein could be a member of multiple complexes. However, their accuracy is still limited because of complex overlap patterns of their output clusters. RESULTS: We present a systematic approach of refining the overlapping clusters identified from protein interaction networks. We have designed novel metrics to assess cluster overlaps: overlap coverage and overlapping consistency. We then propose an overlap refinement algorithm. It takes as input the clusters produced by existing density-based graph-clustering methods and generates a set of refined clusters by parameterizing the metrics. To evaluate protein complex prediction accuracy, we used the f-measure by comparing each refined cluster to known protein complexes. The experimental results with the yeast protein-protein interaction data sets from BioGRID and DIP demonstrate that accuracy on protein complex prediction has increased significantly after refining cluster overlaps. CONCLUSIONS: The effectiveness of the proposed cluster overlap refinement approach for protein complex detection has been validated in this study. Analyzing overlaps of the clusters from protein interaction networks is a crucial task for understanding of functional roles of proteins and topological characteristics of the functional systems. BioMed Central 2012-06-21 /pmc/articles/PMC3380738/ /pubmed/22759580 http://dx.doi.org/10.1186/1477-5956-10-S1-S3 Text en Copyright ©2012 Chiam and Cho; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Proceedings Chiam, Tak Chien Cho, Young-Rae Accuracy improvement in protein complex prediction from protein interaction networks by refining cluster overlaps |
title | Accuracy improvement in protein complex prediction from protein interaction networks by refining cluster overlaps |
title_full | Accuracy improvement in protein complex prediction from protein interaction networks by refining cluster overlaps |
title_fullStr | Accuracy improvement in protein complex prediction from protein interaction networks by refining cluster overlaps |
title_full_unstemmed | Accuracy improvement in protein complex prediction from protein interaction networks by refining cluster overlaps |
title_short | Accuracy improvement in protein complex prediction from protein interaction networks by refining cluster overlaps |
title_sort | accuracy improvement in protein complex prediction from protein interaction networks by refining cluster overlaps |
topic | Proceedings |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3380738/ https://www.ncbi.nlm.nih.gov/pubmed/22759580 http://dx.doi.org/10.1186/1477-5956-10-S1-S3 |
work_keys_str_mv | AT chiamtakchien accuracyimprovementinproteincomplexpredictionfromproteininteractionnetworksbyrefiningclusteroverlaps AT choyoungrae accuracyimprovementinproteincomplexpredictionfromproteininteractionnetworksbyrefiningclusteroverlaps |