Cargando…

Filtering Gene Ontology semantic similarity for identifying protein complexes in large protein interaction networks

BACKGROUND: Many biological processes recognize in particular the importance of protein complexes, and various computational approaches have been developed to identify complexes from protein-protein interaction (PPI) networks. However, high false-positive rate of PPIs leads to challenging identifica...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Jian, Xie, Dong, Lin, Hongfei, Yang, Zhihao, Zhang, Yijia
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3380758/
https://www.ncbi.nlm.nih.gov/pubmed/22759576
http://dx.doi.org/10.1186/1477-5956-10-S1-S18
_version_ 1782236339055362048
author Wang, Jian
Xie, Dong
Lin, Hongfei
Yang, Zhihao
Zhang, Yijia
author_facet Wang, Jian
Xie, Dong
Lin, Hongfei
Yang, Zhihao
Zhang, Yijia
author_sort Wang, Jian
collection PubMed
description BACKGROUND: Many biological processes recognize in particular the importance of protein complexes, and various computational approaches have been developed to identify complexes from protein-protein interaction (PPI) networks. However, high false-positive rate of PPIs leads to challenging identification. RESULTS: A protein semantic similarity measure is proposed in this study, based on the ontology structure of Gene Ontology (GO) terms and GO annotations to estimate the reliability of interactions in PPI networks. Interaction pairs with low GO semantic similarity are removed from the network as unreliable interactions. Then, a cluster-expanding algorithm is used to detect complexes with core-attachment structure on filtered network. Our method is applied to three different yeast PPI networks. The effectiveness of our method is examined on two benchmark complex datasets. Experimental results show that our method performed better than other state-of-the-art approaches in most evaluation metrics. CONCLUSIONS: The method detects protein complexes from large scale PPI networks by filtering GO semantic similarity. Removing interactions with low GO similarity significantly improves the performance of complex identification. The expanding strategy is also effective to identify attachment proteins of complexes.
format Online
Article
Text
id pubmed-3380758
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-33807582012-06-25 Filtering Gene Ontology semantic similarity for identifying protein complexes in large protein interaction networks Wang, Jian Xie, Dong Lin, Hongfei Yang, Zhihao Zhang, Yijia Proteome Sci Proceedings BACKGROUND: Many biological processes recognize in particular the importance of protein complexes, and various computational approaches have been developed to identify complexes from protein-protein interaction (PPI) networks. However, high false-positive rate of PPIs leads to challenging identification. RESULTS: A protein semantic similarity measure is proposed in this study, based on the ontology structure of Gene Ontology (GO) terms and GO annotations to estimate the reliability of interactions in PPI networks. Interaction pairs with low GO semantic similarity are removed from the network as unreliable interactions. Then, a cluster-expanding algorithm is used to detect complexes with core-attachment structure on filtered network. Our method is applied to three different yeast PPI networks. The effectiveness of our method is examined on two benchmark complex datasets. Experimental results show that our method performed better than other state-of-the-art approaches in most evaluation metrics. CONCLUSIONS: The method detects protein complexes from large scale PPI networks by filtering GO semantic similarity. Removing interactions with low GO similarity significantly improves the performance of complex identification. The expanding strategy is also effective to identify attachment proteins of complexes. BioMed Central 2012-06-21 /pmc/articles/PMC3380758/ /pubmed/22759576 http://dx.doi.org/10.1186/1477-5956-10-S1-S18 Text en Copyright ©2012 Wang et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Wang, Jian
Xie, Dong
Lin, Hongfei
Yang, Zhihao
Zhang, Yijia
Filtering Gene Ontology semantic similarity for identifying protein complexes in large protein interaction networks
title Filtering Gene Ontology semantic similarity for identifying protein complexes in large protein interaction networks
title_full Filtering Gene Ontology semantic similarity for identifying protein complexes in large protein interaction networks
title_fullStr Filtering Gene Ontology semantic similarity for identifying protein complexes in large protein interaction networks
title_full_unstemmed Filtering Gene Ontology semantic similarity for identifying protein complexes in large protein interaction networks
title_short Filtering Gene Ontology semantic similarity for identifying protein complexes in large protein interaction networks
title_sort filtering gene ontology semantic similarity for identifying protein complexes in large protein interaction networks
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3380758/
https://www.ncbi.nlm.nih.gov/pubmed/22759576
http://dx.doi.org/10.1186/1477-5956-10-S1-S18
work_keys_str_mv AT wangjian filteringgeneontologysemanticsimilarityforidentifyingproteincomplexesinlargeproteininteractionnetworks
AT xiedong filteringgeneontologysemanticsimilarityforidentifyingproteincomplexesinlargeproteininteractionnetworks
AT linhongfei filteringgeneontologysemanticsimilarityforidentifyingproteincomplexesinlargeproteininteractionnetworks
AT yangzhihao filteringgeneontologysemanticsimilarityforidentifyingproteincomplexesinlargeproteininteractionnetworks
AT zhangyijia filteringgeneontologysemanticsimilarityforidentifyingproteincomplexesinlargeproteininteractionnetworks