Cargando…

Accuracy improvement in protein complex prediction from protein interaction networks by refining cluster overlaps

BACKGROUND: Recent computational techniques have facilitated analyzing genome-wide protein-protein interaction data for several model organisms. Various graph-clustering algorithms have been applied to protein interaction networks on the genomic scale for predicting the entire set of potential prote...

Descripción completa

Detalles Bibliográficos
Autores principales: Chiam, Tak Chien, Cho, Young-Rae
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3380738/
https://www.ncbi.nlm.nih.gov/pubmed/22759580
http://dx.doi.org/10.1186/1477-5956-10-S1-S3
_version_ 1782236337907171328
author Chiam, Tak Chien
Cho, Young-Rae
author_facet Chiam, Tak Chien
Cho, Young-Rae
author_sort Chiam, Tak Chien
collection PubMed
description BACKGROUND: Recent computational techniques have facilitated analyzing genome-wide protein-protein interaction data for several model organisms. Various graph-clustering algorithms have been applied to protein interaction networks on the genomic scale for predicting the entire set of potential protein complexes. In particular, the density-based clustering algorithms which are able to generate overlapping clusters, i.e. the clusters sharing a set of nodes, are well-suited to protein complex detection because each protein could be a member of multiple complexes. However, their accuracy is still limited because of complex overlap patterns of their output clusters. RESULTS: We present a systematic approach of refining the overlapping clusters identified from protein interaction networks. We have designed novel metrics to assess cluster overlaps: overlap coverage and overlapping consistency. We then propose an overlap refinement algorithm. It takes as input the clusters produced by existing density-based graph-clustering methods and generates a set of refined clusters by parameterizing the metrics. To evaluate protein complex prediction accuracy, we used the f-measure by comparing each refined cluster to known protein complexes. The experimental results with the yeast protein-protein interaction data sets from BioGRID and DIP demonstrate that accuracy on protein complex prediction has increased significantly after refining cluster overlaps. CONCLUSIONS: The effectiveness of the proposed cluster overlap refinement approach for protein complex detection has been validated in this study. Analyzing overlaps of the clusters from protein interaction networks is a crucial task for understanding of functional roles of proteins and topological characteristics of the functional systems.
format Online
Article
Text
id pubmed-3380738
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-33807382012-06-25 Accuracy improvement in protein complex prediction from protein interaction networks by refining cluster overlaps Chiam, Tak Chien Cho, Young-Rae Proteome Sci Proceedings BACKGROUND: Recent computational techniques have facilitated analyzing genome-wide protein-protein interaction data for several model organisms. Various graph-clustering algorithms have been applied to protein interaction networks on the genomic scale for predicting the entire set of potential protein complexes. In particular, the density-based clustering algorithms which are able to generate overlapping clusters, i.e. the clusters sharing a set of nodes, are well-suited to protein complex detection because each protein could be a member of multiple complexes. However, their accuracy is still limited because of complex overlap patterns of their output clusters. RESULTS: We present a systematic approach of refining the overlapping clusters identified from protein interaction networks. We have designed novel metrics to assess cluster overlaps: overlap coverage and overlapping consistency. We then propose an overlap refinement algorithm. It takes as input the clusters produced by existing density-based graph-clustering methods and generates a set of refined clusters by parameterizing the metrics. To evaluate protein complex prediction accuracy, we used the f-measure by comparing each refined cluster to known protein complexes. The experimental results with the yeast protein-protein interaction data sets from BioGRID and DIP demonstrate that accuracy on protein complex prediction has increased significantly after refining cluster overlaps. CONCLUSIONS: The effectiveness of the proposed cluster overlap refinement approach for protein complex detection has been validated in this study. Analyzing overlaps of the clusters from protein interaction networks is a crucial task for understanding of functional roles of proteins and topological characteristics of the functional systems. BioMed Central 2012-06-21 /pmc/articles/PMC3380738/ /pubmed/22759580 http://dx.doi.org/10.1186/1477-5956-10-S1-S3 Text en Copyright ©2012 Chiam and Cho; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Chiam, Tak Chien
Cho, Young-Rae
Accuracy improvement in protein complex prediction from protein interaction networks by refining cluster overlaps
title Accuracy improvement in protein complex prediction from protein interaction networks by refining cluster overlaps
title_full Accuracy improvement in protein complex prediction from protein interaction networks by refining cluster overlaps
title_fullStr Accuracy improvement in protein complex prediction from protein interaction networks by refining cluster overlaps
title_full_unstemmed Accuracy improvement in protein complex prediction from protein interaction networks by refining cluster overlaps
title_short Accuracy improvement in protein complex prediction from protein interaction networks by refining cluster overlaps
title_sort accuracy improvement in protein complex prediction from protein interaction networks by refining cluster overlaps
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3380738/
https://www.ncbi.nlm.nih.gov/pubmed/22759580
http://dx.doi.org/10.1186/1477-5956-10-S1-S3
work_keys_str_mv AT chiamtakchien accuracyimprovementinproteincomplexpredictionfromproteininteractionnetworksbyrefiningclusteroverlaps
AT choyoungrae accuracyimprovementinproteincomplexpredictionfromproteininteractionnetworksbyrefiningclusteroverlaps