Cargando…

The clinical meaning of the area under a receiver operating characteristic curve for the evaluation of the performance of disease markers

OBJECTIVES: The area under a receiver operating characteristic (ROC) curve (AUC) is a popular measure of pure diagnostic accuracy that is independent from the proportion of diseased subjects in the analysed sample. However, its actual usefulness in the clinical context has been questioned, because i...

Descripción completa

Detalles Bibliográficos
Autores principales: Parodi, Stefano, Verda, Damiano, Bagnasco, Francesca, Muselli, Marco
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Korean Society of Epidemiology 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10089712/
https://www.ncbi.nlm.nih.gov/pubmed/36265519
http://dx.doi.org/10.4178/epih.e2022088
_version_ 1785022817576681472
author Parodi, Stefano
Verda, Damiano
Bagnasco, Francesca
Muselli, Marco
author_facet Parodi, Stefano
Verda, Damiano
Bagnasco, Francesca
Muselli, Marco
author_sort Parodi, Stefano
collection PubMed
description OBJECTIVES: The area under a receiver operating characteristic (ROC) curve (AUC) is a popular measure of pure diagnostic accuracy that is independent from the proportion of diseased subjects in the analysed sample. However, its actual usefulness in the clinical context has been questioned, because it does not seem to be directly related to the actual performance of a diagnostic marker in identifying diseased and non-diseased subjects in real clinical settings. This study evaluates the relationship between the AUC and the proportion of correct classifications (global diagnostic accuracy, GDA) in relation to the shape of the corresponding ROC curves. METHODS: We demonstrate that AUC represents an upward-biased measure of GDA at an optimal accuracy cut-off for balanced groups. The magnitude of bias depends on the shape of the ROC plot and on the proportion of diseased and non-diseased subjects. In proper curves, the bias is independent from the diseased/non-diseased ratio and can be easily estimated and removed. Moreover, a comparison between 2 partial AUCs can be replaced by a more powerful test for the corresponding whole AUCs. RESULTS: Applications to 3 real datasets are provided: a marker for a hormone deficit in children, 2 tumour markers for malignant mesothelioma, and 2 gene expression profiles in ovarian cancer patients. CONCLUSIONS: The AUC is a measure of accuracy with potential clinical relevance for the evaluation of disease markers. The clinical meaning of ROC parameters should always be evaluated with an analysis of the shape of the corresponding ROC curve.
format Online
Article
Text
id pubmed-10089712
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Korean Society of Epidemiology
record_format MEDLINE/PubMed
spelling pubmed-100897122023-04-12 The clinical meaning of the area under a receiver operating characteristic curve for the evaluation of the performance of disease markers Parodi, Stefano Verda, Damiano Bagnasco, Francesca Muselli, Marco Epidemiol Health Methods OBJECTIVES: The area under a receiver operating characteristic (ROC) curve (AUC) is a popular measure of pure diagnostic accuracy that is independent from the proportion of diseased subjects in the analysed sample. However, its actual usefulness in the clinical context has been questioned, because it does not seem to be directly related to the actual performance of a diagnostic marker in identifying diseased and non-diseased subjects in real clinical settings. This study evaluates the relationship between the AUC and the proportion of correct classifications (global diagnostic accuracy, GDA) in relation to the shape of the corresponding ROC curves. METHODS: We demonstrate that AUC represents an upward-biased measure of GDA at an optimal accuracy cut-off for balanced groups. The magnitude of bias depends on the shape of the ROC plot and on the proportion of diseased and non-diseased subjects. In proper curves, the bias is independent from the diseased/non-diseased ratio and can be easily estimated and removed. Moreover, a comparison between 2 partial AUCs can be replaced by a more powerful test for the corresponding whole AUCs. RESULTS: Applications to 3 real datasets are provided: a marker for a hormone deficit in children, 2 tumour markers for malignant mesothelioma, and 2 gene expression profiles in ovarian cancer patients. CONCLUSIONS: The AUC is a measure of accuracy with potential clinical relevance for the evaluation of disease markers. The clinical meaning of ROC parameters should always be evaluated with an analysis of the shape of the corresponding ROC curve. Korean Society of Epidemiology 2022-10-17 /pmc/articles/PMC10089712/ /pubmed/36265519 http://dx.doi.org/10.4178/epih.e2022088 Text en © 2022, Korean Society of Epidemiology https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods
Parodi, Stefano
Verda, Damiano
Bagnasco, Francesca
Muselli, Marco
The clinical meaning of the area under a receiver operating characteristic curve for the evaluation of the performance of disease markers
title The clinical meaning of the area under a receiver operating characteristic curve for the evaluation of the performance of disease markers
title_full The clinical meaning of the area under a receiver operating characteristic curve for the evaluation of the performance of disease markers
title_fullStr The clinical meaning of the area under a receiver operating characteristic curve for the evaluation of the performance of disease markers
title_full_unstemmed The clinical meaning of the area under a receiver operating characteristic curve for the evaluation of the performance of disease markers
title_short The clinical meaning of the area under a receiver operating characteristic curve for the evaluation of the performance of disease markers
title_sort clinical meaning of the area under a receiver operating characteristic curve for the evaluation of the performance of disease markers
topic Methods
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10089712/
https://www.ncbi.nlm.nih.gov/pubmed/36265519
http://dx.doi.org/10.4178/epih.e2022088
work_keys_str_mv AT parodistefano theclinicalmeaningoftheareaunderareceiveroperatingcharacteristiccurvefortheevaluationoftheperformanceofdiseasemarkers
AT verdadamiano theclinicalmeaningoftheareaunderareceiveroperatingcharacteristiccurvefortheevaluationoftheperformanceofdiseasemarkers
AT bagnascofrancesca theclinicalmeaningoftheareaunderareceiveroperatingcharacteristiccurvefortheevaluationoftheperformanceofdiseasemarkers
AT musellimarco theclinicalmeaningoftheareaunderareceiveroperatingcharacteristiccurvefortheevaluationoftheperformanceofdiseasemarkers
AT parodistefano clinicalmeaningoftheareaunderareceiveroperatingcharacteristiccurvefortheevaluationoftheperformanceofdiseasemarkers
AT verdadamiano clinicalmeaningoftheareaunderareceiveroperatingcharacteristiccurvefortheevaluationoftheperformanceofdiseasemarkers
AT bagnascofrancesca clinicalmeaningoftheareaunderareceiveroperatingcharacteristiccurvefortheevaluationoftheperformanceofdiseasemarkers
AT musellimarco clinicalmeaningoftheareaunderareceiveroperatingcharacteristiccurvefortheevaluationoftheperformanceofdiseasemarkers