Cargando…

A relationship between the incremental values of area under the ROC curve and of area under the precision-recall curve

BACKGROUND: Incremental value (IncV) evaluates the performance change between an existing risk model and a new model. Different IncV metrics do not always agree with each other. For example, compared with a prescribed-dose model, an ovarian-dose model for predicting acute ovarian failure has a sligh...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhou, Qian M., Zhe, Lu, Brooke, Russell J., Hudson, Melissa M., Yuan, Yan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8278775/
https://www.ncbi.nlm.nih.gov/pubmed/34261544
http://dx.doi.org/10.1186/s41512-021-00102-w
_version_ 1783722329958776832
author Zhou, Qian M.
Zhe, Lu
Brooke, Russell J.
Hudson, Melissa M.
Yuan, Yan
author_facet Zhou, Qian M.
Zhe, Lu
Brooke, Russell J.
Hudson, Melissa M.
Yuan, Yan
author_sort Zhou, Qian M.
collection PubMed
description BACKGROUND: Incremental value (IncV) evaluates the performance change between an existing risk model and a new model. Different IncV metrics do not always agree with each other. For example, compared with a prescribed-dose model, an ovarian-dose model for predicting acute ovarian failure has a slightly lower area under the receiver operating characteristic curve (AUC) but increases the area under the precision-recall curve (AP) by 48%. This phenomenon of disagreement is not uncommon, and can create confusion when assessing whether the added information improves the model prediction accuracy. METHODS: In this article, we examine the analytical connections and differences between the AUC IncV (ΔAUC) and AP IncV (ΔAP). We also compare the true values of these two IncV metrics in a numerical study. Additionally, as both are semi-proper scoring rules, we compare them with a strictly proper scoring rule: the IncV of the scaled Brier score (ΔsBrS) in the numerical study. RESULTS: We demonstrate that ΔAUC and ΔAP are both weighted averages of the changes (from the existing model to the new one) in separating the risk score distributions between events and non-events. However, ΔAP assigns heavier weights to the changes in higher-risk regions, whereas ΔAUC weights the changes equally. Due to this difference, the two IncV metrics can disagree, and the numerical study shows that their disagreement becomes more pronounced as the event rate decreases. In the numerical study, we also find that ΔAP has a wide range, from negative to positive, but the range of ΔAUC is much smaller. In addition, ΔAP and ΔsBrS are highly consistent, but ΔAUC is negatively correlated with ΔsBrS and ΔAP when the event rate is low. CONCLUSIONS: ΔAUC treats the wins and losses of a new risk model equally across different risk regions. When neither the existing or new model is the true model, this equality could attenuate a superior performance of the new model for a sub-region. In contrast, ΔAP accentuates the change in the prediction accuracy for higher-risk regions. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at (10.1186/s41512-021-00102-w).
format Online
Article
Text
id pubmed-8278775
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-82787752021-07-15 A relationship between the incremental values of area under the ROC curve and of area under the precision-recall curve Zhou, Qian M. Zhe, Lu Brooke, Russell J. Hudson, Melissa M. Yuan, Yan Diagn Progn Res Research BACKGROUND: Incremental value (IncV) evaluates the performance change between an existing risk model and a new model. Different IncV metrics do not always agree with each other. For example, compared with a prescribed-dose model, an ovarian-dose model for predicting acute ovarian failure has a slightly lower area under the receiver operating characteristic curve (AUC) but increases the area under the precision-recall curve (AP) by 48%. This phenomenon of disagreement is not uncommon, and can create confusion when assessing whether the added information improves the model prediction accuracy. METHODS: In this article, we examine the analytical connections and differences between the AUC IncV (ΔAUC) and AP IncV (ΔAP). We also compare the true values of these two IncV metrics in a numerical study. Additionally, as both are semi-proper scoring rules, we compare them with a strictly proper scoring rule: the IncV of the scaled Brier score (ΔsBrS) in the numerical study. RESULTS: We demonstrate that ΔAUC and ΔAP are both weighted averages of the changes (from the existing model to the new one) in separating the risk score distributions between events and non-events. However, ΔAP assigns heavier weights to the changes in higher-risk regions, whereas ΔAUC weights the changes equally. Due to this difference, the two IncV metrics can disagree, and the numerical study shows that their disagreement becomes more pronounced as the event rate decreases. In the numerical study, we also find that ΔAP has a wide range, from negative to positive, but the range of ΔAUC is much smaller. In addition, ΔAP and ΔsBrS are highly consistent, but ΔAUC is negatively correlated with ΔsBrS and ΔAP when the event rate is low. CONCLUSIONS: ΔAUC treats the wins and losses of a new risk model equally across different risk regions. When neither the existing or new model is the true model, this equality could attenuate a superior performance of the new model for a sub-region. In contrast, ΔAP accentuates the change in the prediction accuracy for higher-risk regions. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at (10.1186/s41512-021-00102-w). BioMed Central 2021-07-14 /pmc/articles/PMC8278775/ /pubmed/34261544 http://dx.doi.org/10.1186/s41512-021-00102-w Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Research
Zhou, Qian M.
Zhe, Lu
Brooke, Russell J.
Hudson, Melissa M.
Yuan, Yan
A relationship between the incremental values of area under the ROC curve and of area under the precision-recall curve
title A relationship between the incremental values of area under the ROC curve and of area under the precision-recall curve
title_full A relationship between the incremental values of area under the ROC curve and of area under the precision-recall curve
title_fullStr A relationship between the incremental values of area under the ROC curve and of area under the precision-recall curve
title_full_unstemmed A relationship between the incremental values of area under the ROC curve and of area under the precision-recall curve
title_short A relationship between the incremental values of area under the ROC curve and of area under the precision-recall curve
title_sort relationship between the incremental values of area under the roc curve and of area under the precision-recall curve
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8278775/
https://www.ncbi.nlm.nih.gov/pubmed/34261544
http://dx.doi.org/10.1186/s41512-021-00102-w
work_keys_str_mv AT zhouqianm arelationshipbetweentheincrementalvaluesofareaundertheroccurveandofareaundertheprecisionrecallcurve
AT zhelu arelationshipbetweentheincrementalvaluesofareaundertheroccurveandofareaundertheprecisionrecallcurve
AT brookerussellj arelationshipbetweentheincrementalvaluesofareaundertheroccurveandofareaundertheprecisionrecallcurve
AT hudsonmelissam arelationshipbetweentheincrementalvaluesofareaundertheroccurveandofareaundertheprecisionrecallcurve
AT yuanyan arelationshipbetweentheincrementalvaluesofareaundertheroccurveandofareaundertheprecisionrecallcurve
AT zhouqianm relationshipbetweentheincrementalvaluesofareaundertheroccurveandofareaundertheprecisionrecallcurve
AT zhelu relationshipbetweentheincrementalvaluesofareaundertheroccurveandofareaundertheprecisionrecallcurve
AT brookerussellj relationshipbetweentheincrementalvaluesofareaundertheroccurveandofareaundertheprecisionrecallcurve
AT hudsonmelissam relationshipbetweentheincrementalvaluesofareaundertheroccurveandofareaundertheprecisionrecallcurve
AT yuanyan relationshipbetweentheincrementalvaluesofareaundertheroccurveandofareaundertheprecisionrecallcurve