Cargando…

StAR: a simple tool for the statistical comparison of ROC curves

BACKGROUND: As in many different areas of science and technology, most important problems in bioinformatics rely on the proper development and assessment of binary classifiers. A generalized assessment of the performance of binary classifiers is typically carried out through the analysis of their re...

Descripción completa

Detalles Bibliográficos
Autores principales:	Vergara, Ismael A, Norambuena, Tomás, Ferrada, Evandro, Slater, Alex W, Melo, Francisco
Formato:	Texto
Lenguaje:	English
Publicado:	BioMed Central 2008
Materias:	Software
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2435548/ https://www.ncbi.nlm.nih.gov/pubmed/18534022 http://dx.doi.org/10.1186/1471-2105-9-265

_version_	1782156492516884480
author	Vergara, Ismael A Norambuena, Tomás Ferrada, Evandro Slater, Alex W Melo, Francisco
author_facet	Vergara, Ismael A Norambuena, Tomás Ferrada, Evandro Slater, Alex W Melo, Francisco
author_sort	Vergara, Ismael A
collection	PubMed
description	BACKGROUND: As in many different areas of science and technology, most important problems in bioinformatics rely on the proper development and assessment of binary classifiers. A generalized assessment of the performance of binary classifiers is typically carried out through the analysis of their receiver operating characteristic (ROC) curves. The area under the ROC curve (AUC) constitutes a popular indicator of the performance of a binary classifier. However, the assessment of the statistical significance of the difference between any two classifiers based on this measure is not a straightforward task, since not many freely available tools exist. Most existing software is either not free, difficult to use or not easy to automate when a comparative assessment of the performance of many binary classifiers is intended. This constitutes the typical scenario for the optimization of parameters when developing new classifiers and also for their performance validation through the comparison to previous art. RESULTS: In this work we describe and release new software to assess the statistical significance of the observed difference between the AUCs of any two classifiers for a common task estimated from paired data or unpaired balanced data. The software is able to perform a pairwise comparison of many classifiers in a single run, without requiring any expert or advanced knowledge to use it. The software relies on a non-parametric test for the difference of the AUCs that accounts for the correlation of the ROC curves. The results are displayed graphically and can be easily customized by the user. A human-readable report is generated and the complete data resulting from the analysis are also available for download, which can be used for further analysis with other software. The software is released as a web server that can be used in any client platform and also as a standalone application for the Linux operating system. CONCLUSION: A new software for the statistical comparison of ROC curves is released here as a web server and also as standalone software for the LINUX operating system.
format	Text
id	pubmed-2435548
institution	National Center for Biotechnology Information
language	English
publishDate	2008
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-24355482008-06-24 StAR: a simple tool for the statistical comparison of ROC curves Vergara, Ismael A Norambuena, Tomás Ferrada, Evandro Slater, Alex W Melo, Francisco BMC Bioinformatics Software BACKGROUND: As in many different areas of science and technology, most important problems in bioinformatics rely on the proper development and assessment of binary classifiers. A generalized assessment of the performance of binary classifiers is typically carried out through the analysis of their receiver operating characteristic (ROC) curves. The area under the ROC curve (AUC) constitutes a popular indicator of the performance of a binary classifier. However, the assessment of the statistical significance of the difference between any two classifiers based on this measure is not a straightforward task, since not many freely available tools exist. Most existing software is either not free, difficult to use or not easy to automate when a comparative assessment of the performance of many binary classifiers is intended. This constitutes the typical scenario for the optimization of parameters when developing new classifiers and also for their performance validation through the comparison to previous art. RESULTS: In this work we describe and release new software to assess the statistical significance of the observed difference between the AUCs of any two classifiers for a common task estimated from paired data or unpaired balanced data. The software is able to perform a pairwise comparison of many classifiers in a single run, without requiring any expert or advanced knowledge to use it. The software relies on a non-parametric test for the difference of the AUCs that accounts for the correlation of the ROC curves. The results are displayed graphically and can be easily customized by the user. A human-readable report is generated and the complete data resulting from the analysis are also available for download, which can be used for further analysis with other software. The software is released as a web server that can be used in any client platform and also as a standalone application for the Linux operating system. CONCLUSION: A new software for the statistical comparison of ROC curves is released here as a web server and also as standalone software for the LINUX operating system. BioMed Central 2008-06-05 /pmc/articles/PMC2435548/ /pubmed/18534022 http://dx.doi.org/10.1186/1471-2105-9-265 Text en Copyright © 2008 Vergara et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Software Vergara, Ismael A Norambuena, Tomás Ferrada, Evandro Slater, Alex W Melo, Francisco StAR: a simple tool for the statistical comparison of ROC curves
title	StAR: a simple tool for the statistical comparison of ROC curves
title_full	StAR: a simple tool for the statistical comparison of ROC curves
title_fullStr	StAR: a simple tool for the statistical comparison of ROC curves
title_full_unstemmed	StAR: a simple tool for the statistical comparison of ROC curves
title_short	StAR: a simple tool for the statistical comparison of ROC curves
title_sort	star: a simple tool for the statistical comparison of roc curves
topic	Software
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2435548/ https://www.ncbi.nlm.nih.gov/pubmed/18534022 http://dx.doi.org/10.1186/1471-2105-9-265
work_keys_str_mv	AT vergaraismaela starasimpletoolforthestatisticalcomparisonofroccurves AT norambuenatomas starasimpletoolforthestatisticalcomparisonofroccurves AT ferradaevandro starasimpletoolforthestatisticalcomparisonofroccurves AT slateralexw starasimpletoolforthestatisticalcomparisonofroccurves AT melofrancisco starasimpletoolforthestatisticalcomparisonofroccurves

StAR: a simple tool for the statistical comparison of ROC curves

Ejemplares similares