
Tournament leave-pair-out cross-validation for receiver operating characteristic analysis

Receiver operating characteristic analysis is widely used for evaluating diagnostic systems. Recent studies have shown that estimating an area under receiver operating characteristic curve with standard cross-validation methods suffers from a large bias. The leave-pair-out cross-validation has been...


Bibliographic Details
Main authors: Montoya Perez, Ileana, Airola, Antti, Boström, Peter J, Jambor, Ivan, Pahikkala, Tapio
Format: Online Article Text
Language: English
Published: SAGE Publications 2018
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6745617/
https://www.ncbi.nlm.nih.gov/pubmed/30126322
http://dx.doi.org/10.1177/0962280218795190
author Montoya Perez, Ileana
Airola, Antti
Boström, Peter J
Jambor, Ivan
Pahikkala, Tapio
collection PubMed
description Receiver operating characteristic analysis is widely used for evaluating diagnostic systems. Recent studies have shown that estimating an area under receiver operating characteristic curve with standard cross-validation methods suffers from a large bias. The leave-pair-out cross-validation has been shown to correct this bias. However, while leave-pair-out produces an almost unbiased estimate of area under receiver operating characteristic curve, it does not provide a ranking of the data needed for plotting and analyzing the receiver operating characteristic curve. In this study, we propose a new method called tournament leave-pair-out cross-validation. This method extends leave-pair-out by creating a tournament from pair comparisons to produce a ranking for the data. Tournament leave-pair-out preserves the advantage of leave-pair-out for estimating area under receiver operating characteristic curve, while it also allows performing receiver operating characteristic analyses. We have shown using both synthetic and real-world data that tournament leave-pair-out is as reliable as leave-pair-out for area under receiver operating characteristic curve estimation and confirmed the bias in leave-one-out cross-validation on low-dimensional data. As a case study on receiver operating characteristic analysis, we also evaluate how reliably sensitivity and specificity can be estimated from tournament leave-pair-out receiver operating characteristic curves.
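The procedure outlined in the description above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' implementation: the ridge-regression scorer, the function names (`fit_ridge`, `tournament_lpo`, `auc_from_scores`, `roc_points`), the half-point tie rule, and the synthetic data are all hypothetical choices made for the example. Each pair of examples is held out in turn, a model is trained on the rest, and the held-out example with the higher predicted score wins the round. The win counts then serve as a ranking from which an AUC estimate and ROC points can be computed.

```python
import itertools

import numpy as np


def fit_ridge(X, y, alpha=1.0):
    """Fit a ridge-regression scorer (illustrative stand-in for any learner)."""
    Xb = np.hstack([X, np.ones((X.shape[0], 1))])  # append a bias column
    A = Xb.T @ Xb + alpha * np.eye(Xb.shape[1])
    return np.linalg.solve(A, Xb.T @ y)


def predict(w, X):
    """Score examples with the fitted weight vector."""
    Xb = np.hstack([X, np.ones((X.shape[0], 1))])
    return Xb @ w


def tournament_lpo(X, y, alpha=1.0):
    """Return per-example win counts from all leave-pair-out rounds."""
    n = len(y)
    wins = np.zeros(n)
    idx = np.arange(n)
    for i, j in itertools.combinations(range(n), 2):
        train = (idx != i) & (idx != j)  # hold out the pair (i, j)
        w = fit_ridge(X[train], y[train], alpha)
        s_i, s_j = predict(w, X[[i, j]])
        if s_i > s_j:
            wins[i] += 1.0
        elif s_j > s_i:
            wins[j] += 1.0
        else:  # assumed tie rule: split the point
            wins[i] += 0.5
            wins[j] += 0.5
    return wins


def auc_from_scores(scores, y):
    """AUC as the fraction of positive-negative pairs ranked correctly."""
    pos = scores[y == 1]
    neg = scores[y == 0]
    diff = pos[:, None] - neg[None, :]
    return float(np.mean((diff > 0) + 0.5 * (diff == 0)))


def roc_points(scores, y):
    """ROC points (FPR, TPR) from thresholding at each distinct score."""
    n_pos = np.sum(y == 1)
    n_neg = np.sum(y == 0)
    fpr, tpr = [0.0], [0.0]
    for t in np.sort(np.unique(scores))[::-1]:
        pred = scores >= t
        tpr.append(np.sum(pred & (y == 1)) / n_pos)
        fpr.append(np.sum(pred & (y == 0)) / n_neg)
    return np.array(fpr), np.array(tpr)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    y = np.array([0] * 10 + [1] * 10)
    X = rng.normal(size=(20, 2))
    X[y == 1] += 3.0  # shift the positive class (synthetic, assumed data)
    wins = tournament_lpo(X, y)
    print("AUC from tournament scores:", auc_from_scores(wins, y))
    fpr, tpr = roc_points(wins, y)
    print("sensitivity:", tpr)
    print("specificity:", 1.0 - fpr)
```

In this sketch, sensitivity at a threshold is the true positive rate and specificity is one minus the false positive rate, so both can be read directly from the ROC points. The cost grows with the number of pairs, n(n-1)/2 model fits, which is why a cheap closed-form learner is used here.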
format Online
Article
Text
id pubmed-6745617
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher SAGE Publications
record_format MEDLINE/PubMed
spelling pubmed-6745617 2019-10-03 Tournament leave-pair-out cross-validation for receiver operating characteristic analysis Montoya Perez, Ileana Airola, Antti Boström, Peter J Jambor, Ivan Pahikkala, Tapio Stat Methods Med Res Articles
SAGE Publications 2018-08-20 2019-11 /pmc/articles/PMC6745617/ /pubmed/30126322 http://dx.doi.org/10.1177/0962280218795190 Text en © The Author(s) 2018 http://creativecommons.org/licenses/by-nc/4.0/ This article is distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 License (http://www.creativecommons.org/licenses/by-nc/4.0/) which permits non-commercial use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access pages (https://us.sagepub.com/en-us/nam/open-access-at-sage).
title Tournament leave-pair-out cross-validation for receiver operating characteristic analysis
topic Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6745617/
https://www.ncbi.nlm.nih.gov/pubmed/30126322
http://dx.doi.org/10.1177/0962280218795190