Threshold Average Precision (TAP-k): a measure of retrieval designed for bioinformatics

Motivation: Since database retrieval is a fundamental operation, the measurement of retrieval efficacy is critical to progress in bioinformatics. This article points out some issues with current methods of measuring retrieval efficacy and suggests some improvements. In particular, many studies have...

Bibliographic Details
Main Authors:	Carroll, Hyrum D.; Kann, Maricel G.; Sheetlin, Sergey L.; Spouge, John L.
Format:	Text
Language:	English
Published:	Oxford University Press 2010
Subjects:	Original Papers
Online Access:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2894514/ https://www.ncbi.nlm.nih.gov/pubmed/20505002 http://dx.doi.org/10.1093/bioinformatics/btq270

_version_	1782183197162864640
author	Carroll, Hyrum D.; Kann, Maricel G.; Sheetlin, Sergey L.; Spouge, John L.
author_facet	Carroll, Hyrum D.; Kann, Maricel G.; Sheetlin, Sergey L.; Spouge, John L.
author_sort	Carroll, Hyrum D.
collection	PubMed
description	Motivation: Since database retrieval is a fundamental operation, the measurement of retrieval efficacy is critical to progress in bioinformatics. This article points out some issues with current methods of measuring retrieval efficacy and suggests some improvements. In particular, many studies have used the pooled receiver operating characteristic for n irrelevant records (ROC(n)) score, the area under the ROC curve (AUC) of a ‘pooled’ ROC curve, truncated at n irrelevant records. Unfortunately, the pooled ROC(n) score does not faithfully reflect actual usage of retrieval algorithms. Additionally, a pooled ROC(n) score can be very sensitive to retrieval results from as little as a single query. Methods: To replace the pooled ROC(n) score, we propose the Threshold Average Precision (TAP-k), a measure closely related to the well-known average precision in information retrieval, but reflecting the usage of E-values in bioinformatics. Furthermore, in addition to conditions previously given in the literature, we introduce three new criteria that an ideal measure of retrieval efficacy should satisfy. Results: PSI-BLAST, GLOBAL, HMMER and RPS-BLAST provided examples of using the TAP-k and pooled ROC(n) scores to evaluate sequence retrieval algorithms. In particular, compelling examples using real data highlight the drawbacks of the pooled ROC(n) score, showing that it can produce evaluations skewing far from intuitive expectations. In contrast, the TAP-k satisfies most of the criteria desired in an ideal measure of retrieval efficacy. Availability and Implementation: The TAP-k web server and downloadable Perl script are freely available at http://www.ncbi.nlm.nih.gov/CBBresearch/Spouge/html.ncbi/tap/ Contact: spouge@ncbi.nlm.nih.gov Supplementary Information: Supplementary data are available at Bioinformatics online.
format	Text
id	pubmed-2894514
institution	National Center for Biotechnology Information
language	English
publishDate	2010
publisher	Oxford University Press
record_format	MEDLINE/PubMed
spelling	pubmed-28945142010-07-01 Threshold Average Precision (TAP-k): a measure of retrieval designed for bioinformatics Carroll, Hyrum D. Kann, Maricel G. Sheetlin, Sergey L. Spouge, John L. Bioinformatics Original Papers Motivation: Since database retrieval is a fundamental operation, the measurement of retrieval efficacy is critical to progress in bioinformatics. This article points out some issues with current methods of measuring retrieval efficacy and suggests some improvements. In particular, many studies have used the pooled receiver operating characteristic for n irrelevant records (ROC(n)) score, the area under the ROC curve (AUC) of a ‘pooled’ ROC curve, truncated at n irrelevant records. Unfortunately, the pooled ROC(n) score does not faithfully reflect actual usage of retrieval algorithms. Additionally, a pooled ROC(n) score can be very sensitive to retrieval results from as little as a single query. Methods: To replace the pooled ROC(n) score, we propose the Threshold Average Precision (TAP-k), a measure closely related to the well-known average precision in information retrieval, but reflecting the usage of E-values in bioinformatics. Furthermore, in addition to conditions previously given in the literature, we introduce three new criteria that an ideal measure of retrieval efficacy should satisfy. Results: PSI-BLAST, GLOBAL, HMMER and RPS-BLAST provided examples of using the TAP-k and pooled ROC(n) scores to evaluate sequence retrieval algorithms. In particular, compelling examples using real data highlight the drawbacks of the pooled ROC(n) score, showing that it can produce evaluations skewing far from intuitive expectations. In contrast, the TAP-k satisfies most of the criteria desired in an ideal measure of retrieval efficacy. 
Availability and Implementation: The TAP-k web server and downloadable Perl script are freely available at http://www.ncbi.nlm.nih.gov/CBBresearch/Spouge/html.ncbi/tap/ Contact: spouge@ncbi.nlm.nih.gov Supplementary Information: Supplementary data are available at Bioinformatics online. Oxford University Press 2010-07-15 2010-05-26 /pmc/articles/PMC2894514/ /pubmed/20505002 http://dx.doi.org/10.1093/bioinformatics/btq270 Text en Published by Oxford University Press on behalf of the US Government 2010. http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Original Papers Carroll, Hyrum D. Kann, Maricel G. Sheetlin, Sergey L. Spouge, John L. Threshold Average Precision (TAP-k): a measure of retrieval designed for bioinformatics
title	Threshold Average Precision (TAP-k): a measure of retrieval designed for bioinformatics
title_full	Threshold Average Precision (TAP-k): a measure of retrieval designed for bioinformatics
title_fullStr	Threshold Average Precision (TAP-k): a measure of retrieval designed for bioinformatics
title_full_unstemmed	Threshold Average Precision (TAP-k): a measure of retrieval designed for bioinformatics
title_short	Threshold Average Precision (TAP-k): a measure of retrieval designed for bioinformatics
title_sort	threshold average precision (tap-k): a measure of retrieval designed for bioinformatics
topic	Original Papers
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2894514/ https://www.ncbi.nlm.nih.gov/pubmed/20505002 http://dx.doi.org/10.1093/bioinformatics/btq270
work_keys_str_mv	AT carrollhyrumd thresholdaverageprecisiontapkameasureofretrievaldesignedforbioinformatics AT kannmaricelg thresholdaverageprecisiontapkameasureofretrievaldesignedforbioinformatics AT sheetlinsergeyl thresholdaverageprecisiontapkameasureofretrievaldesignedforbioinformatics AT spougejohnl thresholdaverageprecisiontapkameasureofretrievaldesignedforbioinformatics
