Cargando…

The Brier score does not evaluate the clinical utility of diagnostic tests or prediction models

BACKGROUND: A variety of statistics have been proposed as tools to help investigators assess the value of diagnostic tests or prediction models. The Brier score has been recommended on the grounds that it is a proper scoring rule that is affected by both discrimination and calibration. However, the...

Descripción completa

Detalles Bibliográficos
Autores principales: Assel, Melissa, Sjoberg, Daniel D., Vickers, Andrew J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6460786/
https://www.ncbi.nlm.nih.gov/pubmed/31093548
http://dx.doi.org/10.1186/s41512-017-0020-3
_version_ 1783410381847265280
author Assel, Melissa
Sjoberg, Daniel D.
Vickers, Andrew J.
author_facet Assel, Melissa
Sjoberg, Daniel D.
Vickers, Andrew J.
author_sort Assel, Melissa
collection PubMed
description BACKGROUND: A variety of statistics have been proposed as tools to help investigators assess the value of diagnostic tests or prediction models. The Brier score has been recommended on the grounds that it is a proper scoring rule that is affected by both discrimination and calibration. However, the Brier score is prevalence dependent in such a way that the rank ordering of tests or models may inappropriately vary by prevalence. METHODS: We explored four common clinical scenarios: comparison of a highly accurate binary test with a continuous prediction model of moderate predictiveness; comparison of two binary tests where the importance of sensitivity versus specificity is inversely associated with prevalence; comparison of models and tests to default strategies of assuming that all or no patients are positive; and comparison of two models with miscalibration in opposite directions. RESULTS: In each case, we found that the Brier score gave an inappropriate rank ordering of the tests and models. Conversely, net benefit, a decision-analytic measure, gave results that always favored the preferable test or model. CONCLUSIONS: Brier score does not evaluate clinical value of diagnostic tests or prediction models. We advocate, as an alternative, the use of decision-analytic measures such as net benefit. TRIAL REGISTRATION: Not applicable.
format Online
Article
Text
id pubmed-6460786
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-64607862019-05-15 The Brier score does not evaluate the clinical utility of diagnostic tests or prediction models Assel, Melissa Sjoberg, Daniel D. Vickers, Andrew J. Diagn Progn Res Research BACKGROUND: A variety of statistics have been proposed as tools to help investigators assess the value of diagnostic tests or prediction models. The Brier score has been recommended on the grounds that it is a proper scoring rule that is affected by both discrimination and calibration. However, the Brier score is prevalence dependent in such a way that the rank ordering of tests or models may inappropriately vary by prevalence. METHODS: We explored four common clinical scenarios: comparison of a highly accurate binary test with a continuous prediction model of moderate predictiveness; comparison of two binary tests where the importance of sensitivity versus specificity is inversely associated with prevalence; comparison of models and tests to default strategies of assuming that all or no patients are positive; and comparison of two models with miscalibration in opposite directions. RESULTS: In each case, we found that the Brier score gave an inappropriate rank ordering of the tests and models. Conversely, net benefit, a decision-analytic measure, gave results that always favored the preferable test or model. CONCLUSIONS: Brier score does not evaluate clinical value of diagnostic tests or prediction models. We advocate, as an alternative, the use of decision-analytic measures such as net benefit. TRIAL REGISTRATION: Not applicable. BioMed Central 2017-12-02 /pmc/articles/PMC6460786/ /pubmed/31093548 http://dx.doi.org/10.1186/s41512-017-0020-3 Text en © The Author(s) 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Assel, Melissa
Sjoberg, Daniel D.
Vickers, Andrew J.
The Brier score does not evaluate the clinical utility of diagnostic tests or prediction models
title The Brier score does not evaluate the clinical utility of diagnostic tests or prediction models
title_full The Brier score does not evaluate the clinical utility of diagnostic tests or prediction models
title_fullStr The Brier score does not evaluate the clinical utility of diagnostic tests or prediction models
title_full_unstemmed The Brier score does not evaluate the clinical utility of diagnostic tests or prediction models
title_short The Brier score does not evaluate the clinical utility of diagnostic tests or prediction models
title_sort brier score does not evaluate the clinical utility of diagnostic tests or prediction models
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6460786/
https://www.ncbi.nlm.nih.gov/pubmed/31093548
http://dx.doi.org/10.1186/s41512-017-0020-3
work_keys_str_mv AT asselmelissa thebrierscoredoesnotevaluatetheclinicalutilityofdiagnostictestsorpredictionmodels
AT sjobergdanield thebrierscoredoesnotevaluatetheclinicalutilityofdiagnostictestsorpredictionmodels
AT vickersandrewj thebrierscoredoesnotevaluatetheclinicalutilityofdiagnostictestsorpredictionmodels
AT asselmelissa brierscoredoesnotevaluatetheclinicalutilityofdiagnostictestsorpredictionmodels
AT sjobergdanield brierscoredoesnotevaluatetheclinicalutilityofdiagnostictestsorpredictionmodels
AT vickersandrewj brierscoredoesnotevaluatetheclinicalutilityofdiagnostictestsorpredictionmodels