Cargando…
A relationship between the incremental values of area under the ROC curve and of area under the precision-recall curve
BACKGROUND: Incremental value (IncV) evaluates the performance change between an existing risk model and a new model. Different IncV metrics do not always agree with each other. For example, compared with a prescribed-dose model, an ovarian-dose model for predicting acute ovarian failure has a sligh...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8278775/ https://www.ncbi.nlm.nih.gov/pubmed/34261544 http://dx.doi.org/10.1186/s41512-021-00102-w |
_version_ | 1783722329958776832 |
---|---|
author | Zhou, Qian M. Zhe, Lu Brooke, Russell J. Hudson, Melissa M. Yuan, Yan |
author_facet | Zhou, Qian M. Zhe, Lu Brooke, Russell J. Hudson, Melissa M. Yuan, Yan |
author_sort | Zhou, Qian M. |
collection | PubMed |
description | BACKGROUND: Incremental value (IncV) evaluates the performance change between an existing risk model and a new model. Different IncV metrics do not always agree with each other. For example, compared with a prescribed-dose model, an ovarian-dose model for predicting acute ovarian failure has a slightly lower area under the receiver operating characteristic curve (AUC) but increases the area under the precision-recall curve (AP) by 48%. This phenomenon of disagreement is not uncommon, and can create confusion when assessing whether the added information improves the model prediction accuracy. METHODS: In this article, we examine the analytical connections and differences between the AUC IncV (ΔAUC) and AP IncV (ΔAP). We also compare the true values of these two IncV metrics in a numerical study. Additionally, as both are semi-proper scoring rules, we compare them with a strictly proper scoring rule: the IncV of the scaled Brier score (ΔsBrS) in the numerical study. RESULTS: We demonstrate that ΔAUC and ΔAP are both weighted averages of the changes (from the existing model to the new one) in separating the risk score distributions between events and non-events. However, ΔAP assigns heavier weights to the changes in higher-risk regions, whereas ΔAUC weights the changes equally. Due to this difference, the two IncV metrics can disagree, and the numerical study shows that their disagreement becomes more pronounced as the event rate decreases. In the numerical study, we also find that ΔAP has a wide range, from negative to positive, but the range of ΔAUC is much smaller. In addition, ΔAP and ΔsBrS are highly consistent, but ΔAUC is negatively correlated with ΔsBrS and ΔAP when the event rate is low. CONCLUSIONS: ΔAUC treats the wins and losses of a new risk model equally across different risk regions. When neither the existing or new model is the true model, this equality could attenuate a superior performance of the new model for a sub-region. In contrast, ΔAP accentuates the change in the prediction accuracy for higher-risk regions. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at (10.1186/s41512-021-00102-w). |
format | Online Article Text |
id | pubmed-8278775 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-82787752021-07-15 A relationship between the incremental values of area under the ROC curve and of area under the precision-recall curve Zhou, Qian M. Zhe, Lu Brooke, Russell J. Hudson, Melissa M. Yuan, Yan Diagn Progn Res Research BACKGROUND: Incremental value (IncV) evaluates the performance change between an existing risk model and a new model. Different IncV metrics do not always agree with each other. For example, compared with a prescribed-dose model, an ovarian-dose model for predicting acute ovarian failure has a slightly lower area under the receiver operating characteristic curve (AUC) but increases the area under the precision-recall curve (AP) by 48%. This phenomenon of disagreement is not uncommon, and can create confusion when assessing whether the added information improves the model prediction accuracy. METHODS: In this article, we examine the analytical connections and differences between the AUC IncV (ΔAUC) and AP IncV (ΔAP). We also compare the true values of these two IncV metrics in a numerical study. Additionally, as both are semi-proper scoring rules, we compare them with a strictly proper scoring rule: the IncV of the scaled Brier score (ΔsBrS) in the numerical study. RESULTS: We demonstrate that ΔAUC and ΔAP are both weighted averages of the changes (from the existing model to the new one) in separating the risk score distributions between events and non-events. However, ΔAP assigns heavier weights to the changes in higher-risk regions, whereas ΔAUC weights the changes equally. Due to this difference, the two IncV metrics can disagree, and the numerical study shows that their disagreement becomes more pronounced as the event rate decreases. In the numerical study, we also find that ΔAP has a wide range, from negative to positive, but the range of ΔAUC is much smaller. In addition, ΔAP and ΔsBrS are highly consistent, but ΔAUC is negatively correlated with ΔsBrS and ΔAP when the event rate is low. CONCLUSIONS: ΔAUC treats the wins and losses of a new risk model equally across different risk regions. When neither the existing or new model is the true model, this equality could attenuate a superior performance of the new model for a sub-region. In contrast, ΔAP accentuates the change in the prediction accuracy for higher-risk regions. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at (10.1186/s41512-021-00102-w). BioMed Central 2021-07-14 /pmc/articles/PMC8278775/ /pubmed/34261544 http://dx.doi.org/10.1186/s41512-021-00102-w Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Research Zhou, Qian M. Zhe, Lu Brooke, Russell J. Hudson, Melissa M. Yuan, Yan A relationship between the incremental values of area under the ROC curve and of area under the precision-recall curve |
title | A relationship between the incremental values of area under the ROC curve and of area under the precision-recall curve |
title_full | A relationship between the incremental values of area under the ROC curve and of area under the precision-recall curve |
title_fullStr | A relationship between the incremental values of area under the ROC curve and of area under the precision-recall curve |
title_full_unstemmed | A relationship between the incremental values of area under the ROC curve and of area under the precision-recall curve |
title_short | A relationship between the incremental values of area under the ROC curve and of area under the precision-recall curve |
title_sort | relationship between the incremental values of area under the roc curve and of area under the precision-recall curve |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8278775/ https://www.ncbi.nlm.nih.gov/pubmed/34261544 http://dx.doi.org/10.1186/s41512-021-00102-w |
work_keys_str_mv | AT zhouqianm arelationshipbetweentheincrementalvaluesofareaundertheroccurveandofareaundertheprecisionrecallcurve AT zhelu arelationshipbetweentheincrementalvaluesofareaundertheroccurveandofareaundertheprecisionrecallcurve AT brookerussellj arelationshipbetweentheincrementalvaluesofareaundertheroccurveandofareaundertheprecisionrecallcurve AT hudsonmelissam arelationshipbetweentheincrementalvaluesofareaundertheroccurveandofareaundertheprecisionrecallcurve AT yuanyan arelationshipbetweentheincrementalvaluesofareaundertheroccurveandofareaundertheprecisionrecallcurve AT zhouqianm relationshipbetweentheincrementalvaluesofareaundertheroccurveandofareaundertheprecisionrecallcurve AT zhelu relationshipbetweentheincrementalvaluesofareaundertheroccurveandofareaundertheprecisionrecallcurve AT brookerussellj relationshipbetweentheincrementalvaluesofareaundertheroccurveandofareaundertheprecisionrecallcurve AT hudsonmelissam relationshipbetweentheincrementalvaluesofareaundertheroccurveandofareaundertheprecisionrecallcurve AT yuanyan relationshipbetweentheincrementalvaluesofareaundertheroccurveandofareaundertheprecisionrecallcurve |