Cargando…

Using Iterative Pairwise External Validation to Contextualize Prediction Model Performance: A Use Case Predicting 1-Year Heart Failure Risk in Patients with Diabetes Across Five Data Sources

INTRODUCTION: External validation of prediction models is increasingly being seen as a minimum requirement for acceptance in clinical practice. However, the lack of interoperability of healthcare databases has been the biggest barrier to this occurring on a large scale. Recent improvements in databa...

Descripción completa

Detalles Bibliográficos
Autores principales: Williams, Ross D., Reps, Jenna M., Kors, Jan A., Ryan, Patrick B., Steyerberg, Ewout, Verhamme, Katia M., Rijnbeek, Peter R.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9114056/
https://www.ncbi.nlm.nih.gov/pubmed/35579818
http://dx.doi.org/10.1007/s40264-022-01161-8
_version_ 1784709701139693568
author Williams, Ross D.
Reps, Jenna M.
Kors, Jan A.
Ryan, Patrick B.
Steyerberg, Ewout
Verhamme, Katia M.
Rijnbeek, Peter R.
author_facet Williams, Ross D.
Reps, Jenna M.
Kors, Jan A.
Ryan, Patrick B.
Steyerberg, Ewout
Verhamme, Katia M.
Rijnbeek, Peter R.
author_sort Williams, Ross D.
collection PubMed
description INTRODUCTION: External validation of prediction models is increasingly being seen as a minimum requirement for acceptance in clinical practice. However, the lack of interoperability of healthcare databases has been the biggest barrier to this occurring on a large scale. Recent improvements in database interoperability enable a standardized analytical framework for model development and external validation. External validation of a model in a new database lacks context, whereby the external validation can be compared with a benchmark in this database. Iterative pairwise external validation (IPEV) is a framework that uses a rotating model development and validation approach to contextualize the assessment of performance across a network of databases. As a use case, we predicted 1-year risk of heart failure in patients with type 2 diabetes mellitus. METHODS: The method follows a two-step process involving (1) development of baseline and data-driven models in each database according to best practices and (2) validation of these models across the remaining databases. We introduce a heatmap visualization that supports the assessment of the internal and external model performance in all available databases. As a use case, we developed and validated models to predict 1-year risk of heart failure in patients initializing a second pharmacological intervention for type 2 diabetes mellitus. We leveraged the power of the Observational Medical Outcomes Partnership common data model to create an open-source software package to increase the consistency, speed, and transparency of this process. RESULTS: A total of 403,187 patients from five databases were included in the study. We developed five models that, when assessed internally, had a discriminative performance ranging from 0.73 to 0.81 area under the receiver operating characteristic curve with acceptable calibration. When we externally validated these models in a new database, three models achieved consistent performance and in context often performed similarly to models developed in the database itself. The visualization of IPEV provided valuable insights. From this, we identified the model developed in the Commercial Claims and Encounters (CCAE) database as the best performing model overall. CONCLUSION: Using IPEV lends weight to the model development process. The rotation of development through multiple databases provides context to model assessment, leading to improved understanding of transportability and generalizability. The inclusion of a baseline model in all modelling steps provides further context to the performance gains of increasing model complexity. The CCAE model was identified as a candidate for clinical use. The use case demonstrates that IPEV provides a huge opportunity in a new era of standardised data and analytics to improve insight into and trust in prediction models at an unprecedented scale. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s40264-022-01161-8.
format Online
Article
Text
id pubmed-9114056
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-91140562022-05-19 Using Iterative Pairwise External Validation to Contextualize Prediction Model Performance: A Use Case Predicting 1-Year Heart Failure Risk in Patients with Diabetes Across Five Data Sources Williams, Ross D. Reps, Jenna M. Kors, Jan A. Ryan, Patrick B. Steyerberg, Ewout Verhamme, Katia M. Rijnbeek, Peter R. Drug Saf Original Research Article INTRODUCTION: External validation of prediction models is increasingly being seen as a minimum requirement for acceptance in clinical practice. However, the lack of interoperability of healthcare databases has been the biggest barrier to this occurring on a large scale. Recent improvements in database interoperability enable a standardized analytical framework for model development and external validation. External validation of a model in a new database lacks context, whereby the external validation can be compared with a benchmark in this database. Iterative pairwise external validation (IPEV) is a framework that uses a rotating model development and validation approach to contextualize the assessment of performance across a network of databases. As a use case, we predicted 1-year risk of heart failure in patients with type 2 diabetes mellitus. METHODS: The method follows a two-step process involving (1) development of baseline and data-driven models in each database according to best practices and (2) validation of these models across the remaining databases. We introduce a heatmap visualization that supports the assessment of the internal and external model performance in all available databases. As a use case, we developed and validated models to predict 1-year risk of heart failure in patients initializing a second pharmacological intervention for type 2 diabetes mellitus. We leveraged the power of the Observational Medical Outcomes Partnership common data model to create an open-source software package to increase the consistency, speed, and transparency of this process. RESULTS: A total of 403,187 patients from five databases were included in the study. We developed five models that, when assessed internally, had a discriminative performance ranging from 0.73 to 0.81 area under the receiver operating characteristic curve with acceptable calibration. When we externally validated these models in a new database, three models achieved consistent performance and in context often performed similarly to models developed in the database itself. The visualization of IPEV provided valuable insights. From this, we identified the model developed in the Commercial Claims and Encounters (CCAE) database as the best performing model overall. CONCLUSION: Using IPEV lends weight to the model development process. The rotation of development through multiple databases provides context to model assessment, leading to improved understanding of transportability and generalizability. The inclusion of a baseline model in all modelling steps provides further context to the performance gains of increasing model complexity. The CCAE model was identified as a candidate for clinical use. The use case demonstrates that IPEV provides a huge opportunity in a new era of standardised data and analytics to improve insight into and trust in prediction models at an unprecedented scale. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s40264-022-01161-8. Springer International Publishing 2022-05-17 2022 /pmc/articles/PMC9114056/ /pubmed/35579818 http://dx.doi.org/10.1007/s40264-022-01161-8 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by-nc/4.0/Open AccessThis article is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License, which permits any non-commercial use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) .
spellingShingle Original Research Article
Williams, Ross D.
Reps, Jenna M.
Kors, Jan A.
Ryan, Patrick B.
Steyerberg, Ewout
Verhamme, Katia M.
Rijnbeek, Peter R.
Using Iterative Pairwise External Validation to Contextualize Prediction Model Performance: A Use Case Predicting 1-Year Heart Failure Risk in Patients with Diabetes Across Five Data Sources
title Using Iterative Pairwise External Validation to Contextualize Prediction Model Performance: A Use Case Predicting 1-Year Heart Failure Risk in Patients with Diabetes Across Five Data Sources
title_full Using Iterative Pairwise External Validation to Contextualize Prediction Model Performance: A Use Case Predicting 1-Year Heart Failure Risk in Patients with Diabetes Across Five Data Sources
title_fullStr Using Iterative Pairwise External Validation to Contextualize Prediction Model Performance: A Use Case Predicting 1-Year Heart Failure Risk in Patients with Diabetes Across Five Data Sources
title_full_unstemmed Using Iterative Pairwise External Validation to Contextualize Prediction Model Performance: A Use Case Predicting 1-Year Heart Failure Risk in Patients with Diabetes Across Five Data Sources
title_short Using Iterative Pairwise External Validation to Contextualize Prediction Model Performance: A Use Case Predicting 1-Year Heart Failure Risk in Patients with Diabetes Across Five Data Sources
title_sort using iterative pairwise external validation to contextualize prediction model performance: a use case predicting 1-year heart failure risk in patients with diabetes across five data sources
topic Original Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9114056/
https://www.ncbi.nlm.nih.gov/pubmed/35579818
http://dx.doi.org/10.1007/s40264-022-01161-8
work_keys_str_mv AT williamsrossd usingiterativepairwiseexternalvalidationtocontextualizepredictionmodelperformanceausecasepredicting1yearheartfailureriskinpatientswithdiabetesacrossfivedatasources
AT repsjennam usingiterativepairwiseexternalvalidationtocontextualizepredictionmodelperformanceausecasepredicting1yearheartfailureriskinpatientswithdiabetesacrossfivedatasources
AT korsjana usingiterativepairwiseexternalvalidationtocontextualizepredictionmodelperformanceausecasepredicting1yearheartfailureriskinpatientswithdiabetesacrossfivedatasources
AT ryanpatrickb usingiterativepairwiseexternalvalidationtocontextualizepredictionmodelperformanceausecasepredicting1yearheartfailureriskinpatientswithdiabetesacrossfivedatasources
AT steyerbergewout usingiterativepairwiseexternalvalidationtocontextualizepredictionmodelperformanceausecasepredicting1yearheartfailureriskinpatientswithdiabetesacrossfivedatasources
AT verhammekatiam usingiterativepairwiseexternalvalidationtocontextualizepredictionmodelperformanceausecasepredicting1yearheartfailureriskinpatientswithdiabetesacrossfivedatasources
AT rijnbeekpeterr usingiterativepairwiseexternalvalidationtocontextualizepredictionmodelperformanceausecasepredicting1yearheartfailureriskinpatientswithdiabetesacrossfivedatasources