Cargando…

External validation of clinical prediction models: simulation-based sample size calculations were more reliable than rules-of-thumb

INTRODUCTION: Sample size “rules-of-thumb” for external validation of clinical prediction models suggest at least 100 events and 100 non-events. Such blanket guidance is imprecise, and not specific to the model or validation setting. We investigate factors affecting precision of model performance es...

Descripción completa

Detalles Bibliográficos
Autores principales: Snell, Kym I.E., Archer, Lucinda, Ensor, Joie, Bonnett, Laura J., Debray, Thomas P.A., Phillips, Bob, Collins, Gary S., Riley, Richard D.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8352630/
https://www.ncbi.nlm.nih.gov/pubmed/33596458
http://dx.doi.org/10.1016/j.jclinepi.2021.02.011
_version_ 1783736224009158656
author Snell, Kym I.E.
Archer, Lucinda
Ensor, Joie
Bonnett, Laura J.
Debray, Thomas P.A.
Phillips, Bob
Collins, Gary S.
Riley, Richard D.
author_facet Snell, Kym I.E.
Archer, Lucinda
Ensor, Joie
Bonnett, Laura J.
Debray, Thomas P.A.
Phillips, Bob
Collins, Gary S.
Riley, Richard D.
author_sort Snell, Kym I.E.
collection PubMed
description INTRODUCTION: Sample size “rules-of-thumb” for external validation of clinical prediction models suggest at least 100 events and 100 non-events. Such blanket guidance is imprecise, and not specific to the model or validation setting. We investigate factors affecting precision of model performance estimates upon external validation, and propose a more tailored sample size approach. METHODS: Simulation of logistic regression prediction models to investigate factors associated with precision of performance estimates. Then, explanation and illustration of a simulation-based approach to calculate the minimum sample size required to precisely estimate a model's calibration, discrimination and clinical utility. RESULTS: Precision is affected by the model's linear predictor (LP) distribution, in addition to number of events and total sample size. Sample sizes of 100 (or even 200) events and non-events can give imprecise estimates, especially for calibration. The simulation-based calculation accounts for the LP distribution and (mis)calibration in the validation sample. Application identifies 2430 required participants (531 events) for external validation of a deep vein thrombosis diagnostic model. CONCLUSION: Where researchers can anticipate the distribution of the model's LP (eg, based on development sample, or a pilot study), a simulation-based approach for calculating sample size for external validation offers more flexibility and reliability than rules-of-thumb.
format Online
Article
Text
id pubmed-8352630
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-83526302021-08-15 External validation of clinical prediction models: simulation-based sample size calculations were more reliable than rules-of-thumb Snell, Kym I.E. Archer, Lucinda Ensor, Joie Bonnett, Laura J. Debray, Thomas P.A. Phillips, Bob Collins, Gary S. Riley, Richard D. J Clin Epidemiol Original Article INTRODUCTION: Sample size “rules-of-thumb” for external validation of clinical prediction models suggest at least 100 events and 100 non-events. Such blanket guidance is imprecise, and not specific to the model or validation setting. We investigate factors affecting precision of model performance estimates upon external validation, and propose a more tailored sample size approach. METHODS: Simulation of logistic regression prediction models to investigate factors associated with precision of performance estimates. Then, explanation and illustration of a simulation-based approach to calculate the minimum sample size required to precisely estimate a model's calibration, discrimination and clinical utility. RESULTS: Precision is affected by the model's linear predictor (LP) distribution, in addition to number of events and total sample size. Sample sizes of 100 (or even 200) events and non-events can give imprecise estimates, especially for calibration. The simulation-based calculation accounts for the LP distribution and (mis)calibration in the validation sample. Application identifies 2430 required participants (531 events) for external validation of a deep vein thrombosis diagnostic model. CONCLUSION: Where researchers can anticipate the distribution of the model's LP (eg, based on development sample, or a pilot study), a simulation-based approach for calculating sample size for external validation offers more flexibility and reliability than rules-of-thumb. Elsevier 2021-07 /pmc/articles/PMC8352630/ /pubmed/33596458 http://dx.doi.org/10.1016/j.jclinepi.2021.02.011 Text en © 2021 The Authors https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Original Article
Snell, Kym I.E.
Archer, Lucinda
Ensor, Joie
Bonnett, Laura J.
Debray, Thomas P.A.
Phillips, Bob
Collins, Gary S.
Riley, Richard D.
External validation of clinical prediction models: simulation-based sample size calculations were more reliable than rules-of-thumb
title External validation of clinical prediction models: simulation-based sample size calculations were more reliable than rules-of-thumb
title_full External validation of clinical prediction models: simulation-based sample size calculations were more reliable than rules-of-thumb
title_fullStr External validation of clinical prediction models: simulation-based sample size calculations were more reliable than rules-of-thumb
title_full_unstemmed External validation of clinical prediction models: simulation-based sample size calculations were more reliable than rules-of-thumb
title_short External validation of clinical prediction models: simulation-based sample size calculations were more reliable than rules-of-thumb
title_sort external validation of clinical prediction models: simulation-based sample size calculations were more reliable than rules-of-thumb
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8352630/
https://www.ncbi.nlm.nih.gov/pubmed/33596458
http://dx.doi.org/10.1016/j.jclinepi.2021.02.011
work_keys_str_mv AT snellkymie externalvalidationofclinicalpredictionmodelssimulationbasedsamplesizecalculationsweremorereliablethanrulesofthumb
AT archerlucinda externalvalidationofclinicalpredictionmodelssimulationbasedsamplesizecalculationsweremorereliablethanrulesofthumb
AT ensorjoie externalvalidationofclinicalpredictionmodelssimulationbasedsamplesizecalculationsweremorereliablethanrulesofthumb
AT bonnettlauraj externalvalidationofclinicalpredictionmodelssimulationbasedsamplesizecalculationsweremorereliablethanrulesofthumb
AT debraythomaspa externalvalidationofclinicalpredictionmodelssimulationbasedsamplesizecalculationsweremorereliablethanrulesofthumb
AT phillipsbob externalvalidationofclinicalpredictionmodelssimulationbasedsamplesizecalculationsweremorereliablethanrulesofthumb
AT collinsgarys externalvalidationofclinicalpredictionmodelssimulationbasedsamplesizecalculationsweremorereliablethanrulesofthumb
AT rileyrichardd externalvalidationofclinicalpredictionmodelssimulationbasedsamplesizecalculationsweremorereliablethanrulesofthumb