Cargando…
The clinical meaning of the area under a receiver operating characteristic curve for the evaluation of the performance of disease markers
OBJECTIVES: The area under a receiver operating characteristic (ROC) curve (AUC) is a popular measure of pure diagnostic accuracy that is independent from the proportion of diseased subjects in the analysed sample. However, its actual usefulness in the clinical context has been questioned, because i...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Korean Society of Epidemiology
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10089712/ https://www.ncbi.nlm.nih.gov/pubmed/36265519 http://dx.doi.org/10.4178/epih.e2022088 |
_version_ | 1785022817576681472 |
---|---|
author | Parodi, Stefano Verda, Damiano Bagnasco, Francesca Muselli, Marco |
author_facet | Parodi, Stefano Verda, Damiano Bagnasco, Francesca Muselli, Marco |
author_sort | Parodi, Stefano |
collection | PubMed |
description | OBJECTIVES: The area under a receiver operating characteristic (ROC) curve (AUC) is a popular measure of pure diagnostic accuracy that is independent from the proportion of diseased subjects in the analysed sample. However, its actual usefulness in the clinical context has been questioned, because it does not seem to be directly related to the actual performance of a diagnostic marker in identifying diseased and non-diseased subjects in real clinical settings. This study evaluates the relationship between the AUC and the proportion of correct classifications (global diagnostic accuracy, GDA) in relation to the shape of the corresponding ROC curves. METHODS: We demonstrate that AUC represents an upward-biased measure of GDA at an optimal accuracy cut-off for balanced groups. The magnitude of bias depends on the shape of the ROC plot and on the proportion of diseased and non-diseased subjects. In proper curves, the bias is independent from the diseased/non-diseased ratio and can be easily estimated and removed. Moreover, a comparison between 2 partial AUCs can be replaced by a more powerful test for the corresponding whole AUCs. RESULTS: Applications to 3 real datasets are provided: a marker for a hormone deficit in children, 2 tumour markers for malignant mesothelioma, and 2 gene expression profiles in ovarian cancer patients. CONCLUSIONS: The AUC is a measure of accuracy with potential clinical relevance for the evaluation of disease markers. The clinical meaning of ROC parameters should always be evaluated with an analysis of the shape of the corresponding ROC curve. |
format | Online Article Text |
id | pubmed-10089712 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Korean Society of Epidemiology |
record_format | MEDLINE/PubMed |
spelling | pubmed-100897122023-04-12 The clinical meaning of the area under a receiver operating characteristic curve for the evaluation of the performance of disease markers Parodi, Stefano Verda, Damiano Bagnasco, Francesca Muselli, Marco Epidemiol Health Methods OBJECTIVES: The area under a receiver operating characteristic (ROC) curve (AUC) is a popular measure of pure diagnostic accuracy that is independent from the proportion of diseased subjects in the analysed sample. However, its actual usefulness in the clinical context has been questioned, because it does not seem to be directly related to the actual performance of a diagnostic marker in identifying diseased and non-diseased subjects in real clinical settings. This study evaluates the relationship between the AUC and the proportion of correct classifications (global diagnostic accuracy, GDA) in relation to the shape of the corresponding ROC curves. METHODS: We demonstrate that AUC represents an upward-biased measure of GDA at an optimal accuracy cut-off for balanced groups. The magnitude of bias depends on the shape of the ROC plot and on the proportion of diseased and non-diseased subjects. In proper curves, the bias is independent from the diseased/non-diseased ratio and can be easily estimated and removed. Moreover, a comparison between 2 partial AUCs can be replaced by a more powerful test for the corresponding whole AUCs. RESULTS: Applications to 3 real datasets are provided: a marker for a hormone deficit in children, 2 tumour markers for malignant mesothelioma, and 2 gene expression profiles in ovarian cancer patients. CONCLUSIONS: The AUC is a measure of accuracy with potential clinical relevance for the evaluation of disease markers. The clinical meaning of ROC parameters should always be evaluated with an analysis of the shape of the corresponding ROC curve. Korean Society of Epidemiology 2022-10-17 /pmc/articles/PMC10089712/ /pubmed/36265519 http://dx.doi.org/10.4178/epih.e2022088 Text en © 2022, Korean Society of Epidemiology https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methods Parodi, Stefano Verda, Damiano Bagnasco, Francesca Muselli, Marco The clinical meaning of the area under a receiver operating characteristic curve for the evaluation of the performance of disease markers |
title | The clinical meaning of the area under a receiver operating characteristic curve for the evaluation of the performance of disease markers |
title_full | The clinical meaning of the area under a receiver operating characteristic curve for the evaluation of the performance of disease markers |
title_fullStr | The clinical meaning of the area under a receiver operating characteristic curve for the evaluation of the performance of disease markers |
title_full_unstemmed | The clinical meaning of the area under a receiver operating characteristic curve for the evaluation of the performance of disease markers |
title_short | The clinical meaning of the area under a receiver operating characteristic curve for the evaluation of the performance of disease markers |
title_sort | clinical meaning of the area under a receiver operating characteristic curve for the evaluation of the performance of disease markers |
topic | Methods |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10089712/ https://www.ncbi.nlm.nih.gov/pubmed/36265519 http://dx.doi.org/10.4178/epih.e2022088 |
work_keys_str_mv | AT parodistefano theclinicalmeaningoftheareaunderareceiveroperatingcharacteristiccurvefortheevaluationoftheperformanceofdiseasemarkers AT verdadamiano theclinicalmeaningoftheareaunderareceiveroperatingcharacteristiccurvefortheevaluationoftheperformanceofdiseasemarkers AT bagnascofrancesca theclinicalmeaningoftheareaunderareceiveroperatingcharacteristiccurvefortheevaluationoftheperformanceofdiseasemarkers AT musellimarco theclinicalmeaningoftheareaunderareceiveroperatingcharacteristiccurvefortheevaluationoftheperformanceofdiseasemarkers AT parodistefano clinicalmeaningoftheareaunderareceiveroperatingcharacteristiccurvefortheevaluationoftheperformanceofdiseasemarkers AT verdadamiano clinicalmeaningoftheareaunderareceiveroperatingcharacteristiccurvefortheevaluationoftheperformanceofdiseasemarkers AT bagnascofrancesca clinicalmeaningoftheareaunderareceiveroperatingcharacteristiccurvefortheevaluationoftheperformanceofdiseasemarkers AT musellimarco clinicalmeaningoftheareaunderareceiveroperatingcharacteristiccurvefortheevaluationoftheperformanceofdiseasemarkers |