Cargando…

Handling missing predictor values when validating and applying a prediction model to new patients

Missing data present challenges for development and real‐world application of clinical prediction models. While these challenges have received considerable attention in the development setting, there is only sparse research on the handling of missing data in applied settings. The main unique feature...

Descripción completa

Detalles Bibliográficos
Autores principales:	Hoogland, Jeroen, van Barreveld, Marit, Debray, Thomas P. A., Reitsma, Johannes B., Verstraelen, Tom E., Dijkgraaf, Marcel G. W., Zwinderman, Aeilko H.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	John Wiley & Sons, Inc. 2020
Materias:	Research Articles
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7586995/ https://www.ncbi.nlm.nih.gov/pubmed/32687233 http://dx.doi.org/10.1002/sim.8682

_version_	1783600102268469248
author	Hoogland, Jeroen van Barreveld, Marit Debray, Thomas P. A. Reitsma, Johannes B. Verstraelen, Tom E. Dijkgraaf, Marcel G. W. Zwinderman, Aeilko H.
author_facet	Hoogland, Jeroen van Barreveld, Marit Debray, Thomas P. A. Reitsma, Johannes B. Verstraelen, Tom E. Dijkgraaf, Marcel G. W. Zwinderman, Aeilko H.
author_sort	Hoogland, Jeroen
collection	PubMed
description	Missing data present challenges for development and real‐world application of clinical prediction models. While these challenges have received considerable attention in the development setting, there is only sparse research on the handling of missing data in applied settings. The main unique feature of handling missing data in these settings is that missing data methods have to be performed for a single new individual, precluding direct application of mainstay methods used during model development. Correspondingly, we propose that it is desirable to perform model validation using missing data methods that transfer to practice in single new patients. This article compares existing and new methods to account for missing data for a new individual in the context of prediction. These methods are based on (i) submodels based on observed data only, (ii) marginalization over the missing variables, or (iii) imputation based on fully conditional specification (also known as chained equations). They were compared in an internal validation setting to highlight the use of missing data methods that transfer to practice while validating a model. As a reference, they were compared to the use of multiple imputation by chained equations in a set of test patients, because this has been used in validation studies in the past. The methods were evaluated in a simulation study where performance was measured by means of optimism corrected C‐statistic and mean squared prediction error. Furthermore, they were applied in data from a large Dutch cohort of prophylactic implantable cardioverter defibrillator patients.
format	Online Article Text
id	pubmed-7586995
institution	National Center for Biotechnology Information
language	English
publishDate	2020
publisher	John Wiley & Sons, Inc.
record_format	MEDLINE/PubMed
spelling	pubmed-75869952020-10-30 Handling missing predictor values when validating and applying a prediction model to new patients Hoogland, Jeroen van Barreveld, Marit Debray, Thomas P. A. Reitsma, Johannes B. Verstraelen, Tom E. Dijkgraaf, Marcel G. W. Zwinderman, Aeilko H. Stat Med Research Articles Missing data present challenges for development and real‐world application of clinical prediction models. While these challenges have received considerable attention in the development setting, there is only sparse research on the handling of missing data in applied settings. The main unique feature of handling missing data in these settings is that missing data methods have to be performed for a single new individual, precluding direct application of mainstay methods used during model development. Correspondingly, we propose that it is desirable to perform model validation using missing data methods that transfer to practice in single new patients. This article compares existing and new methods to account for missing data for a new individual in the context of prediction. These methods are based on (i) submodels based on observed data only, (ii) marginalization over the missing variables, or (iii) imputation based on fully conditional specification (also known as chained equations). They were compared in an internal validation setting to highlight the use of missing data methods that transfer to practice while validating a model. As a reference, they were compared to the use of multiple imputation by chained equations in a set of test patients, because this has been used in validation studies in the past. The methods were evaluated in a simulation study where performance was measured by means of optimism corrected C‐statistic and mean squared prediction error. Furthermore, they were applied in data from a large Dutch cohort of prophylactic implantable cardioverter defibrillator patients. John Wiley & Sons, Inc. 2020-07-20 2020-11-10 /pmc/articles/PMC7586995/ /pubmed/32687233 http://dx.doi.org/10.1002/sim.8682 Text en © 2020 The Authors. Statistics in Medicine published by John Wiley & Sons, Ltd. This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited and is not used for commercial purposes.
spellingShingle	Research Articles Hoogland, Jeroen van Barreveld, Marit Debray, Thomas P. A. Reitsma, Johannes B. Verstraelen, Tom E. Dijkgraaf, Marcel G. W. Zwinderman, Aeilko H. Handling missing predictor values when validating and applying a prediction model to new patients
title	Handling missing predictor values when validating and applying a prediction model to new patients
title_full	Handling missing predictor values when validating and applying a prediction model to new patients
title_fullStr	Handling missing predictor values when validating and applying a prediction model to new patients
title_full_unstemmed	Handling missing predictor values when validating and applying a prediction model to new patients
title_short	Handling missing predictor values when validating and applying a prediction model to new patients
title_sort	handling missing predictor values when validating and applying a prediction model to new patients
topic	Research Articles
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7586995/ https://www.ncbi.nlm.nih.gov/pubmed/32687233 http://dx.doi.org/10.1002/sim.8682
work_keys_str_mv	AT hooglandjeroen handlingmissingpredictorvalueswhenvalidatingandapplyingapredictionmodeltonewpatients AT vanbarreveldmarit handlingmissingpredictorvalueswhenvalidatingandapplyingapredictionmodeltonewpatients AT debraythomaspa handlingmissingpredictorvalueswhenvalidatingandapplyingapredictionmodeltonewpatients AT reitsmajohannesb handlingmissingpredictorvalueswhenvalidatingandapplyingapredictionmodeltonewpatients AT verstraelentome handlingmissingpredictorvalueswhenvalidatingandapplyingapredictionmodeltonewpatients AT dijkgraafmarcelgw handlingmissingpredictorvalueswhenvalidatingandapplyingapredictionmodeltonewpatients AT zwindermanaeilkoh handlingmissingpredictorvalueswhenvalidatingandapplyingapredictionmodeltonewpatients

Handling missing predictor values when validating and applying a prediction model to new patients

Ejemplares similares