Cargando…

Uncertainty and the Value of Information in Risk Prediction Modeling

BACKGROUND: Because of the finite size of the development sample, predicted probabilities from a risk prediction model are inevitably uncertain. We apply value-of-information methodology to evaluate the decision-theoretic implications of prediction uncertainty. METHODS: Adopting a Bayesian perspecti...

Descripción completa

Detalles Bibliográficos
Autores principales: Sadatsafavi, Mohsen, Yoon Lee, Tae, Gustafson, Paul
Formato: Online Artículo Texto
Lenguaje:English
Publicado: SAGE Publications 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9194963/
https://www.ncbi.nlm.nih.gov/pubmed/35209762
http://dx.doi.org/10.1177/0272989X221078789
_version_ 1784726860553256960
author Sadatsafavi, Mohsen
Yoon Lee, Tae
Gustafson, Paul
author_facet Sadatsafavi, Mohsen
Yoon Lee, Tae
Gustafson, Paul
author_sort Sadatsafavi, Mohsen
collection PubMed
description BACKGROUND: Because of the finite size of the development sample, predicted probabilities from a risk prediction model are inevitably uncertain. We apply value-of-information methodology to evaluate the decision-theoretic implications of prediction uncertainty. METHODS: Adopting a Bayesian perspective, we extend the definition of the expected value of perfect information (EVPI) from decision analysis to net benefit calculations in risk prediction. In the context of model development, EVPI is the expected gain in net benefit by using the correct predictions as opposed to predictions from a proposed model. We suggest bootstrap methods for sampling from the posterior distribution of predictions for EVPI calculation using Monte Carlo simulations. We used subsets of data of various sizes from a clinical trial for predicting mortality after myocardial infarction to show how EVPI changes with sample size. RESULTS: With a sample size of 1000 and at the prespecified threshold of 2% on predicted risks, the gains in net benefit using the proposed and the correct models were 0.0006 and 0.0011, respectively, resulting in an EVPI of 0.0005 and a relative EVPI of 87%. EVPI was zero only at unrealistically high thresholds (>85%). As expected, EVPI declined with larger samples. We summarize an algorithm for incorporating EVPI calculations into the commonly used bootstrap method for optimism correction. CONCLUSION: The development EVPI can be used to decide whether a model can advance to validation, whether it should be abandoned, or whether a larger development sample is needed. Value-of-information methods can be applied to explore decision-theoretic consequences of uncertainty in risk prediction and can complement inferential methods in predictive analytics. R code for implementing this method is provided.
format Online
Article
Text
id pubmed-9194963
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher SAGE Publications
record_format MEDLINE/PubMed
spelling pubmed-91949632022-06-15 Uncertainty and the Value of Information in Risk Prediction Modeling Sadatsafavi, Mohsen Yoon Lee, Tae Gustafson, Paul Med Decis Making Original Research Articles BACKGROUND: Because of the finite size of the development sample, predicted probabilities from a risk prediction model are inevitably uncertain. We apply value-of-information methodology to evaluate the decision-theoretic implications of prediction uncertainty. METHODS: Adopting a Bayesian perspective, we extend the definition of the expected value of perfect information (EVPI) from decision analysis to net benefit calculations in risk prediction. In the context of model development, EVPI is the expected gain in net benefit by using the correct predictions as opposed to predictions from a proposed model. We suggest bootstrap methods for sampling from the posterior distribution of predictions for EVPI calculation using Monte Carlo simulations. We used subsets of data of various sizes from a clinical trial for predicting mortality after myocardial infarction to show how EVPI changes with sample size. RESULTS: With a sample size of 1000 and at the prespecified threshold of 2% on predicted risks, the gains in net benefit using the proposed and the correct models were 0.0006 and 0.0011, respectively, resulting in an EVPI of 0.0005 and a relative EVPI of 87%. EVPI was zero only at unrealistically high thresholds (>85%). As expected, EVPI declined with larger samples. We summarize an algorithm for incorporating EVPI calculations into the commonly used bootstrap method for optimism correction. CONCLUSION: The development EVPI can be used to decide whether a model can advance to validation, whether it should be abandoned, or whether a larger development sample is needed. Value-of-information methods can be applied to explore decision-theoretic consequences of uncertainty in risk prediction and can complement inferential methods in predictive analytics. R code for implementing this method is provided. SAGE Publications 2022-02-25 2022-07 /pmc/articles/PMC9194963/ /pubmed/35209762 http://dx.doi.org/10.1177/0272989X221078789 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/This article is distributed under the terms of the Creative Commons Attribution 4.0 License (https://creativecommons.org/licenses/by/4.0/) which permits any use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access pages (https://us.sagepub.com/en-us/nam/open-access-at-sage).
spellingShingle Original Research Articles
Sadatsafavi, Mohsen
Yoon Lee, Tae
Gustafson, Paul
Uncertainty and the Value of Information in Risk Prediction Modeling
title Uncertainty and the Value of Information in Risk Prediction Modeling
title_full Uncertainty and the Value of Information in Risk Prediction Modeling
title_fullStr Uncertainty and the Value of Information in Risk Prediction Modeling
title_full_unstemmed Uncertainty and the Value of Information in Risk Prediction Modeling
title_short Uncertainty and the Value of Information in Risk Prediction Modeling
title_sort uncertainty and the value of information in risk prediction modeling
topic Original Research Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9194963/
https://www.ncbi.nlm.nih.gov/pubmed/35209762
http://dx.doi.org/10.1177/0272989X221078789
work_keys_str_mv AT sadatsafavimohsen uncertaintyandthevalueofinformationinriskpredictionmodeling
AT yoonleetae uncertaintyandthevalueofinformationinriskpredictionmodeling
AT gustafsonpaul uncertaintyandthevalueofinformationinriskpredictionmodeling