Cargando…

A probabilistic framework to predict protein function from interaction data integrated with semantic knowledge

BACKGROUND: The functional characterization of newly discovered proteins has been a challenge in the post-genomic era. Protein-protein interactions provide insights into the functional analysis because the function of unknown proteins can be postulated on the basis of their interaction evidence with...

Descripción completa

Detalles Bibliográficos
Autores principales: Cho, Young-Rae, Shi, Lei, Ramanathan, Murali, Zhang, Aidong
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2570367/
https://www.ncbi.nlm.nih.gov/pubmed/18801191
http://dx.doi.org/10.1186/1471-2105-9-382
_version_ 1782160117346598912
author Cho, Young-Rae
Shi, Lei
Ramanathan, Murali
Zhang, Aidong
author_facet Cho, Young-Rae
Shi, Lei
Ramanathan, Murali
Zhang, Aidong
author_sort Cho, Young-Rae
collection PubMed
description BACKGROUND: The functional characterization of newly discovered proteins has been a challenge in the post-genomic era. Protein-protein interactions provide insights into the functional analysis because the function of unknown proteins can be postulated on the basis of their interaction evidence with known proteins. The protein-protein interaction data sets have been enriched by high-throughput experimental methods. However, the functional analysis using the interaction data has a limitation in accuracy because of the presence of the false positive data experimentally generated and the interactions that are a lack of functional linkage. RESULTS: Protein-protein interaction data can be integrated with the functional knowledge existing in the Gene Ontology (GO) database. We apply similarity measures to assess the functional similarity between interacting proteins. We present a probabilistic framework for predicting functions of unknown proteins based on the functional similarity. We use the leave-one-out cross validation to compare the performance. The experimental results demonstrate that our algorithm performs better than other competing methods in terms of prediction accuracy. In particular, it handles the high false positive rates of current interaction data well. CONCLUSION: The experimentally determined protein-protein interactions are erroneous to uncover the functional associations among proteins. The performance of function prediction for uncharacterized proteins can be enhanced by the integration of multiple data sources available.
format Text
id pubmed-2570367
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-25703672008-10-21 A probabilistic framework to predict protein function from interaction data integrated with semantic knowledge Cho, Young-Rae Shi, Lei Ramanathan, Murali Zhang, Aidong BMC Bioinformatics Methodology Article BACKGROUND: The functional characterization of newly discovered proteins has been a challenge in the post-genomic era. Protein-protein interactions provide insights into the functional analysis because the function of unknown proteins can be postulated on the basis of their interaction evidence with known proteins. The protein-protein interaction data sets have been enriched by high-throughput experimental methods. However, the functional analysis using the interaction data has a limitation in accuracy because of the presence of the false positive data experimentally generated and the interactions that are a lack of functional linkage. RESULTS: Protein-protein interaction data can be integrated with the functional knowledge existing in the Gene Ontology (GO) database. We apply similarity measures to assess the functional similarity between interacting proteins. We present a probabilistic framework for predicting functions of unknown proteins based on the functional similarity. We use the leave-one-out cross validation to compare the performance. The experimental results demonstrate that our algorithm performs better than other competing methods in terms of prediction accuracy. In particular, it handles the high false positive rates of current interaction data well. CONCLUSION: The experimentally determined protein-protein interactions are erroneous to uncover the functional associations among proteins. The performance of function prediction for uncharacterized proteins can be enhanced by the integration of multiple data sources available. BioMed Central 2008-09-18 /pmc/articles/PMC2570367/ /pubmed/18801191 http://dx.doi.org/10.1186/1471-2105-9-382 Text en Copyright © 2008 Cho et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Cho, Young-Rae
Shi, Lei
Ramanathan, Murali
Zhang, Aidong
A probabilistic framework to predict protein function from interaction data integrated with semantic knowledge
title A probabilistic framework to predict protein function from interaction data integrated with semantic knowledge
title_full A probabilistic framework to predict protein function from interaction data integrated with semantic knowledge
title_fullStr A probabilistic framework to predict protein function from interaction data integrated with semantic knowledge
title_full_unstemmed A probabilistic framework to predict protein function from interaction data integrated with semantic knowledge
title_short A probabilistic framework to predict protein function from interaction data integrated with semantic knowledge
title_sort probabilistic framework to predict protein function from interaction data integrated with semantic knowledge
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2570367/
https://www.ncbi.nlm.nih.gov/pubmed/18801191
http://dx.doi.org/10.1186/1471-2105-9-382
work_keys_str_mv AT choyoungrae aprobabilisticframeworktopredictproteinfunctionfrominteractiondataintegratedwithsemanticknowledge
AT shilei aprobabilisticframeworktopredictproteinfunctionfrominteractiondataintegratedwithsemanticknowledge
AT ramanathanmurali aprobabilisticframeworktopredictproteinfunctionfrominteractiondataintegratedwithsemanticknowledge
AT zhangaidong aprobabilisticframeworktopredictproteinfunctionfrominteractiondataintegratedwithsemanticknowledge
AT choyoungrae probabilisticframeworktopredictproteinfunctionfrominteractiondataintegratedwithsemanticknowledge
AT shilei probabilisticframeworktopredictproteinfunctionfrominteractiondataintegratedwithsemanticknowledge
AT ramanathanmurali probabilisticframeworktopredictproteinfunctionfrominteractiondataintegratedwithsemanticknowledge
AT zhangaidong probabilisticframeworktopredictproteinfunctionfrominteractiondataintegratedwithsemanticknowledge