Cargando…

Information-theoretic evaluation of predicted ontological annotations

Motivation: The development of effective methods for the prediction of ontological annotations is an important goal in computational biology, with protein function prediction and disease gene prioritization gaining wide recognition. Although various algorithms have been proposed for these tasks, eva...

Descripción completa

Detalles Bibliográficos
Autores principales: Clark, Wyatt T., Radivojac, Predrag
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3694662/
https://www.ncbi.nlm.nih.gov/pubmed/23813009
http://dx.doi.org/10.1093/bioinformatics/btt228
_version_ 1782274884079976448
author Clark, Wyatt T.
Radivojac, Predrag
author_facet Clark, Wyatt T.
Radivojac, Predrag
author_sort Clark, Wyatt T.
collection PubMed
description Motivation: The development of effective methods for the prediction of ontological annotations is an important goal in computational biology, with protein function prediction and disease gene prioritization gaining wide recognition. Although various algorithms have been proposed for these tasks, evaluating their performance is difficult owing to problems caused both by the structure of biomedical ontologies and biased or incomplete experimental annotations of genes and gene products. Results: We propose an information-theoretic framework to evaluate the performance of computational protein function prediction. We use a Bayesian network, structured according to the underlying ontology, to model the prior probability of a protein’s function. We then define two concepts, misinformation and remaining uncertainty, that can be seen as information-theoretic analogs of precision and recall. Finally, we propose a single statistic, referred to as semantic distance, that can be used to rank classification models. We evaluate our approach by analyzing the performance of three protein function predictors of Gene Ontology terms and provide evidence that it addresses several weaknesses of currently used metrics. We believe this framework provides useful insights into the performance of protein function prediction tools. Contact: predrag@indiana.edu Supplementary information: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-3694662
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-36946622013-06-27 Information-theoretic evaluation of predicted ontological annotations Clark, Wyatt T. Radivojac, Predrag Bioinformatics Ismb/Eccb 2013 Proceedings Papers Committee July 21 to July 23, 2013, Berlin, Germany Motivation: The development of effective methods for the prediction of ontological annotations is an important goal in computational biology, with protein function prediction and disease gene prioritization gaining wide recognition. Although various algorithms have been proposed for these tasks, evaluating their performance is difficult owing to problems caused both by the structure of biomedical ontologies and biased or incomplete experimental annotations of genes and gene products. Results: We propose an information-theoretic framework to evaluate the performance of computational protein function prediction. We use a Bayesian network, structured according to the underlying ontology, to model the prior probability of a protein’s function. We then define two concepts, misinformation and remaining uncertainty, that can be seen as information-theoretic analogs of precision and recall. Finally, we propose a single statistic, referred to as semantic distance, that can be used to rank classification models. We evaluate our approach by analyzing the performance of three protein function predictors of Gene Ontology terms and provide evidence that it addresses several weaknesses of currently used metrics. We believe this framework provides useful insights into the performance of protein function prediction tools. Contact: predrag@indiana.edu Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2013-07-01 2013-06-19 /pmc/articles/PMC3694662/ /pubmed/23813009 http://dx.doi.org/10.1093/bioinformatics/btt228 Text en © The Author 2013. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Ismb/Eccb 2013 Proceedings Papers Committee July 21 to July 23, 2013, Berlin, Germany
Clark, Wyatt T.
Radivojac, Predrag
Information-theoretic evaluation of predicted ontological annotations
title Information-theoretic evaluation of predicted ontological annotations
title_full Information-theoretic evaluation of predicted ontological annotations
title_fullStr Information-theoretic evaluation of predicted ontological annotations
title_full_unstemmed Information-theoretic evaluation of predicted ontological annotations
title_short Information-theoretic evaluation of predicted ontological annotations
title_sort information-theoretic evaluation of predicted ontological annotations
topic Ismb/Eccb 2013 Proceedings Papers Committee July 21 to July 23, 2013, Berlin, Germany
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3694662/
https://www.ncbi.nlm.nih.gov/pubmed/23813009
http://dx.doi.org/10.1093/bioinformatics/btt228
work_keys_str_mv AT clarkwyattt informationtheoreticevaluationofpredictedontologicalannotations
AT radivojacpredrag informationtheoreticevaluationofpredictedontologicalannotations