Cargando…
Uncertainty and the Value of Information in Risk Prediction Modeling
BACKGROUND: Because of the finite size of the development sample, predicted probabilities from a risk prediction model are inevitably uncertain. We apply value-of-information methodology to evaluate the decision-theoretic implications of prediction uncertainty. METHODS: Adopting a Bayesian perspecti...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
SAGE Publications
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9194963/ https://www.ncbi.nlm.nih.gov/pubmed/35209762 http://dx.doi.org/10.1177/0272989X221078789 |
_version_ | 1784726860553256960 |
---|---|
author | Sadatsafavi, Mohsen Yoon Lee, Tae Gustafson, Paul |
author_facet | Sadatsafavi, Mohsen Yoon Lee, Tae Gustafson, Paul |
author_sort | Sadatsafavi, Mohsen |
collection | PubMed |
description | BACKGROUND: Because of the finite size of the development sample, predicted probabilities from a risk prediction model are inevitably uncertain. We apply value-of-information methodology to evaluate the decision-theoretic implications of prediction uncertainty. METHODS: Adopting a Bayesian perspective, we extend the definition of the expected value of perfect information (EVPI) from decision analysis to net benefit calculations in risk prediction. In the context of model development, EVPI is the expected gain in net benefit by using the correct predictions as opposed to predictions from a proposed model. We suggest bootstrap methods for sampling from the posterior distribution of predictions for EVPI calculation using Monte Carlo simulations. We used subsets of data of various sizes from a clinical trial for predicting mortality after myocardial infarction to show how EVPI changes with sample size. RESULTS: With a sample size of 1000 and at the prespecified threshold of 2% on predicted risks, the gains in net benefit using the proposed and the correct models were 0.0006 and 0.0011, respectively, resulting in an EVPI of 0.0005 and a relative EVPI of 87%. EVPI was zero only at unrealistically high thresholds (>85%). As expected, EVPI declined with larger samples. We summarize an algorithm for incorporating EVPI calculations into the commonly used bootstrap method for optimism correction. CONCLUSION: The development EVPI can be used to decide whether a model can advance to validation, whether it should be abandoned, or whether a larger development sample is needed. Value-of-information methods can be applied to explore decision-theoretic consequences of uncertainty in risk prediction and can complement inferential methods in predictive analytics. R code for implementing this method is provided. |
format | Online Article Text |
id | pubmed-9194963 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | SAGE Publications |
record_format | MEDLINE/PubMed |
spelling | pubmed-91949632022-06-15 Uncertainty and the Value of Information in Risk Prediction Modeling Sadatsafavi, Mohsen Yoon Lee, Tae Gustafson, Paul Med Decis Making Original Research Articles BACKGROUND: Because of the finite size of the development sample, predicted probabilities from a risk prediction model are inevitably uncertain. We apply value-of-information methodology to evaluate the decision-theoretic implications of prediction uncertainty. METHODS: Adopting a Bayesian perspective, we extend the definition of the expected value of perfect information (EVPI) from decision analysis to net benefit calculations in risk prediction. In the context of model development, EVPI is the expected gain in net benefit by using the correct predictions as opposed to predictions from a proposed model. We suggest bootstrap methods for sampling from the posterior distribution of predictions for EVPI calculation using Monte Carlo simulations. We used subsets of data of various sizes from a clinical trial for predicting mortality after myocardial infarction to show how EVPI changes with sample size. RESULTS: With a sample size of 1000 and at the prespecified threshold of 2% on predicted risks, the gains in net benefit using the proposed and the correct models were 0.0006 and 0.0011, respectively, resulting in an EVPI of 0.0005 and a relative EVPI of 87%. EVPI was zero only at unrealistically high thresholds (>85%). As expected, EVPI declined with larger samples. We summarize an algorithm for incorporating EVPI calculations into the commonly used bootstrap method for optimism correction. CONCLUSION: The development EVPI can be used to decide whether a model can advance to validation, whether it should be abandoned, or whether a larger development sample is needed. Value-of-information methods can be applied to explore decision-theoretic consequences of uncertainty in risk prediction and can complement inferential methods in predictive analytics. R code for implementing this method is provided. SAGE Publications 2022-02-25 2022-07 /pmc/articles/PMC9194963/ /pubmed/35209762 http://dx.doi.org/10.1177/0272989X221078789 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/This article is distributed under the terms of the Creative Commons Attribution 4.0 License (https://creativecommons.org/licenses/by/4.0/) which permits any use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access pages (https://us.sagepub.com/en-us/nam/open-access-at-sage). |
spellingShingle | Original Research Articles Sadatsafavi, Mohsen Yoon Lee, Tae Gustafson, Paul Uncertainty and the Value of Information in Risk Prediction Modeling |
title | Uncertainty and the Value of Information in Risk Prediction Modeling |
title_full | Uncertainty and the Value of Information in Risk Prediction Modeling |
title_fullStr | Uncertainty and the Value of Information in Risk Prediction Modeling |
title_full_unstemmed | Uncertainty and the Value of Information in Risk Prediction Modeling |
title_short | Uncertainty and the Value of Information in Risk Prediction Modeling |
title_sort | uncertainty and the value of information in risk prediction modeling |
topic | Original Research Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9194963/ https://www.ncbi.nlm.nih.gov/pubmed/35209762 http://dx.doi.org/10.1177/0272989X221078789 |
work_keys_str_mv | AT sadatsafavimohsen uncertaintyandthevalueofinformationinriskpredictionmodeling AT yoonleetae uncertaintyandthevalueofinformationinriskpredictionmodeling AT gustafsonpaul uncertaintyandthevalueofinformationinriskpredictionmodeling |