Cargando…

A relationship between the incremental values of area under the ROC curve and of area under the precision-recall curve

BACKGROUND: Incremental value (IncV) evaluates the performance change between an existing risk model and a new model. Different IncV metrics do not always agree with each other. For example, compared with a prescribed-dose model, an ovarian-dose model for predicting acute ovarian failure has a sligh...

Descripción completa

Detalles Bibliográficos
Autores principales:	Zhou, Qian M., Zhe, Lu, Brooke, Russell J., Hudson, Melissa M., Yuan, Yan
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2021
Materias:	Research
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8278775/ https://www.ncbi.nlm.nih.gov/pubmed/34261544 http://dx.doi.org/10.1186/s41512-021-00102-w

_version_	1783722329958776832
author	Zhou, Qian M. Zhe, Lu Brooke, Russell J. Hudson, Melissa M. Yuan, Yan
author_facet	Zhou, Qian M. Zhe, Lu Brooke, Russell J. Hudson, Melissa M. Yuan, Yan
author_sort	Zhou, Qian M.
collection	PubMed
description	BACKGROUND: Incremental value (IncV) evaluates the performance change between an existing risk model and a new model. Different IncV metrics do not always agree with each other. For example, compared with a prescribed-dose model, an ovarian-dose model for predicting acute ovarian failure has a slightly lower area under the receiver operating characteristic curve (AUC) but increases the area under the precision-recall curve (AP) by 48%. This phenomenon of disagreement is not uncommon, and can create confusion when assessing whether the added information improves the model prediction accuracy. METHODS: In this article, we examine the analytical connections and differences between the AUC IncV (ΔAUC) and AP IncV (ΔAP). We also compare the true values of these two IncV metrics in a numerical study. Additionally, as both are semi-proper scoring rules, we compare them with a strictly proper scoring rule: the IncV of the scaled Brier score (ΔsBrS) in the numerical study. RESULTS: We demonstrate that ΔAUC and ΔAP are both weighted averages of the changes (from the existing model to the new one) in separating the risk score distributions between events and non-events. However, ΔAP assigns heavier weights to the changes in higher-risk regions, whereas ΔAUC weights the changes equally. Due to this difference, the two IncV metrics can disagree, and the numerical study shows that their disagreement becomes more pronounced as the event rate decreases. In the numerical study, we also find that ΔAP has a wide range, from negative to positive, but the range of ΔAUC is much smaller. In addition, ΔAP and ΔsBrS are highly consistent, but ΔAUC is negatively correlated with ΔsBrS and ΔAP when the event rate is low. CONCLUSIONS: ΔAUC treats the wins and losses of a new risk model equally across different risk regions. When neither the existing or new model is the true model, this equality could attenuate a superior performance of the new model for a sub-region. In contrast, ΔAP accentuates the change in the prediction accuracy for higher-risk regions. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at (10.1186/s41512-021-00102-w).
format	Online Article Text
id	pubmed-8278775
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-82787752021-07-15 A relationship between the incremental values of area under the ROC curve and of area under the precision-recall curve Zhou, Qian M. Zhe, Lu Brooke, Russell J. Hudson, Melissa M. Yuan, Yan Diagn Progn Res Research BACKGROUND: Incremental value (IncV) evaluates the performance change between an existing risk model and a new model. Different IncV metrics do not always agree with each other. For example, compared with a prescribed-dose model, an ovarian-dose model for predicting acute ovarian failure has a slightly lower area under the receiver operating characteristic curve (AUC) but increases the area under the precision-recall curve (AP) by 48%. This phenomenon of disagreement is not uncommon, and can create confusion when assessing whether the added information improves the model prediction accuracy. METHODS: In this article, we examine the analytical connections and differences between the AUC IncV (ΔAUC) and AP IncV (ΔAP). We also compare the true values of these two IncV metrics in a numerical study. Additionally, as both are semi-proper scoring rules, we compare them with a strictly proper scoring rule: the IncV of the scaled Brier score (ΔsBrS) in the numerical study. RESULTS: We demonstrate that ΔAUC and ΔAP are both weighted averages of the changes (from the existing model to the new one) in separating the risk score distributions between events and non-events. However, ΔAP assigns heavier weights to the changes in higher-risk regions, whereas ΔAUC weights the changes equally. Due to this difference, the two IncV metrics can disagree, and the numerical study shows that their disagreement becomes more pronounced as the event rate decreases. In the numerical study, we also find that ΔAP has a wide range, from negative to positive, but the range of ΔAUC is much smaller. In addition, ΔAP and ΔsBrS are highly consistent, but ΔAUC is negatively correlated with ΔsBrS and ΔAP when the event rate is low. CONCLUSIONS: ΔAUC treats the wins and losses of a new risk model equally across different risk regions. When neither the existing or new model is the true model, this equality could attenuate a superior performance of the new model for a sub-region. In contrast, ΔAP accentuates the change in the prediction accuracy for higher-risk regions. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at (10.1186/s41512-021-00102-w). BioMed Central 2021-07-14 /pmc/articles/PMC8278775/ /pubmed/34261544 http://dx.doi.org/10.1186/s41512-021-00102-w Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle	Research Zhou, Qian M. Zhe, Lu Brooke, Russell J. Hudson, Melissa M. Yuan, Yan A relationship between the incremental values of area under the ROC curve and of area under the precision-recall curve
title	A relationship between the incremental values of area under the ROC curve and of area under the precision-recall curve
title_full	A relationship between the incremental values of area under the ROC curve and of area under the precision-recall curve
title_fullStr	A relationship between the incremental values of area under the ROC curve and of area under the precision-recall curve
title_full_unstemmed	A relationship between the incremental values of area under the ROC curve and of area under the precision-recall curve
title_short	A relationship between the incremental values of area under the ROC curve and of area under the precision-recall curve
title_sort	relationship between the incremental values of area under the roc curve and of area under the precision-recall curve
topic	Research
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8278775/ https://www.ncbi.nlm.nih.gov/pubmed/34261544 http://dx.doi.org/10.1186/s41512-021-00102-w
work_keys_str_mv	AT zhouqianm arelationshipbetweentheincrementalvaluesofareaundertheroccurveandofareaundertheprecisionrecallcurve AT zhelu arelationshipbetweentheincrementalvaluesofareaundertheroccurveandofareaundertheprecisionrecallcurve AT brookerussellj arelationshipbetweentheincrementalvaluesofareaundertheroccurveandofareaundertheprecisionrecallcurve AT hudsonmelissam arelationshipbetweentheincrementalvaluesofareaundertheroccurveandofareaundertheprecisionrecallcurve AT yuanyan arelationshipbetweentheincrementalvaluesofareaundertheroccurveandofareaundertheprecisionrecallcurve AT zhouqianm relationshipbetweentheincrementalvaluesofareaundertheroccurveandofareaundertheprecisionrecallcurve AT zhelu relationshipbetweentheincrementalvaluesofareaundertheroccurveandofareaundertheprecisionrecallcurve AT brookerussellj relationshipbetweentheincrementalvaluesofareaundertheroccurveandofareaundertheprecisionrecallcurve AT hudsonmelissam relationshipbetweentheincrementalvaluesofareaundertheroccurveandofareaundertheprecisionrecallcurve AT yuanyan relationshipbetweentheincrementalvaluesofareaundertheroccurveandofareaundertheprecisionrecallcurve

A relationship between the incremental values of area under the ROC curve and of area under the precision-recall curve

Ejemplares similares