Cargando…
Alternative Performance Measures for Prediction Models
As a performance measure for a prediction model, the area under the receiver operating characteristic curve (AUC) is insensitive to the addition of strong markers. A number of measures sensitive to performance change have recently been proposed; however, these relative-performance measures may lead...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3946724/ https://www.ncbi.nlm.nih.gov/pubmed/24608868 http://dx.doi.org/10.1371/journal.pone.0091249 |
_version_ | 1782306682921025536 |
---|---|
author | Wu, Yun-Chun Lee, Wen-Chung |
author_facet | Wu, Yun-Chun Lee, Wen-Chung |
author_sort | Wu, Yun-Chun |
collection | PubMed |
description | As a performance measure for a prediction model, the area under the receiver operating characteristic curve (AUC) is insensitive to the addition of strong markers. A number of measures sensitive to performance change have recently been proposed; however, these relative-performance measures may lead to self-contradictory conclusions. This paper examines alternative performance measures for prediction models: the Lorenz curve-based Gini and Pietra indices, and a standardized version of the Brier score, the scaled Brier. Computer simulations are performed in order to study the sensitivity of these measures to performance change when a new marker is added to a baseline model. When the discrimination power of the added marker is concentrated in the gray zone of the baseline model, the AUC and the Gini show minimal performance improvements. The Pietra and the scaled Brier show more significant improvements in the same situation, comparatively. The Pietra and the scaled Brier indices are therefore recommended for prediction model performance measurement, in light of their ease of interpretation, clinical relevance and sensitivity to gray-zone resolving markers. |
format | Online Article Text |
id | pubmed-3946724 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-39467242014-03-10 Alternative Performance Measures for Prediction Models Wu, Yun-Chun Lee, Wen-Chung PLoS One Research Article As a performance measure for a prediction model, the area under the receiver operating characteristic curve (AUC) is insensitive to the addition of strong markers. A number of measures sensitive to performance change have recently been proposed; however, these relative-performance measures may lead to self-contradictory conclusions. This paper examines alternative performance measures for prediction models: the Lorenz curve-based Gini and Pietra indices, and a standardized version of the Brier score, the scaled Brier. Computer simulations are performed in order to study the sensitivity of these measures to performance change when a new marker is added to a baseline model. When the discrimination power of the added marker is concentrated in the gray zone of the baseline model, the AUC and the Gini show minimal performance improvements. The Pietra and the scaled Brier show more significant improvements in the same situation, comparatively. The Pietra and the scaled Brier indices are therefore recommended for prediction model performance measurement, in light of their ease of interpretation, clinical relevance and sensitivity to gray-zone resolving markers. Public Library of Science 2014-03-07 /pmc/articles/PMC3946724/ /pubmed/24608868 http://dx.doi.org/10.1371/journal.pone.0091249 Text en © 2014 Wu, Lee http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Wu, Yun-Chun Lee, Wen-Chung Alternative Performance Measures for Prediction Models |
title | Alternative Performance Measures for Prediction Models |
title_full | Alternative Performance Measures for Prediction Models |
title_fullStr | Alternative Performance Measures for Prediction Models |
title_full_unstemmed | Alternative Performance Measures for Prediction Models |
title_short | Alternative Performance Measures for Prediction Models |
title_sort | alternative performance measures for prediction models |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3946724/ https://www.ncbi.nlm.nih.gov/pubmed/24608868 http://dx.doi.org/10.1371/journal.pone.0091249 |
work_keys_str_mv | AT wuyunchun alternativeperformancemeasuresforpredictionmodels AT leewenchung alternativeperformancemeasuresforpredictionmodels |