Cargando…
External validation of clinical prediction models: simulation-based sample size calculations were more reliable than rules-of-thumb
INTRODUCTION: Sample size “rules-of-thumb” for external validation of clinical prediction models suggest at least 100 events and 100 non-events. Such blanket guidance is imprecise, and not specific to the model or validation setting. We investigate factors affecting precision of model performance es...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8352630/ https://www.ncbi.nlm.nih.gov/pubmed/33596458 http://dx.doi.org/10.1016/j.jclinepi.2021.02.011 |
_version_ | 1783736224009158656 |
---|---|
author | Snell, Kym I.E. Archer, Lucinda Ensor, Joie Bonnett, Laura J. Debray, Thomas P.A. Phillips, Bob Collins, Gary S. Riley, Richard D. |
author_facet | Snell, Kym I.E. Archer, Lucinda Ensor, Joie Bonnett, Laura J. Debray, Thomas P.A. Phillips, Bob Collins, Gary S. Riley, Richard D. |
author_sort | Snell, Kym I.E. |
collection | PubMed |
description | INTRODUCTION: Sample size “rules-of-thumb” for external validation of clinical prediction models suggest at least 100 events and 100 non-events. Such blanket guidance is imprecise, and not specific to the model or validation setting. We investigate factors affecting precision of model performance estimates upon external validation, and propose a more tailored sample size approach. METHODS: Simulation of logistic regression prediction models to investigate factors associated with precision of performance estimates. Then, explanation and illustration of a simulation-based approach to calculate the minimum sample size required to precisely estimate a model's calibration, discrimination and clinical utility. RESULTS: Precision is affected by the model's linear predictor (LP) distribution, in addition to number of events and total sample size. Sample sizes of 100 (or even 200) events and non-events can give imprecise estimates, especially for calibration. The simulation-based calculation accounts for the LP distribution and (mis)calibration in the validation sample. Application identifies 2430 required participants (531 events) for external validation of a deep vein thrombosis diagnostic model. CONCLUSION: Where researchers can anticipate the distribution of the model's LP (eg, based on development sample, or a pilot study), a simulation-based approach for calculating sample size for external validation offers more flexibility and reliability than rules-of-thumb. |
format | Online Article Text |
id | pubmed-8352630 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Elsevier |
record_format | MEDLINE/PubMed |
spelling | pubmed-83526302021-08-15 External validation of clinical prediction models: simulation-based sample size calculations were more reliable than rules-of-thumb Snell, Kym I.E. Archer, Lucinda Ensor, Joie Bonnett, Laura J. Debray, Thomas P.A. Phillips, Bob Collins, Gary S. Riley, Richard D. J Clin Epidemiol Original Article INTRODUCTION: Sample size “rules-of-thumb” for external validation of clinical prediction models suggest at least 100 events and 100 non-events. Such blanket guidance is imprecise, and not specific to the model or validation setting. We investigate factors affecting precision of model performance estimates upon external validation, and propose a more tailored sample size approach. METHODS: Simulation of logistic regression prediction models to investigate factors associated with precision of performance estimates. Then, explanation and illustration of a simulation-based approach to calculate the minimum sample size required to precisely estimate a model's calibration, discrimination and clinical utility. RESULTS: Precision is affected by the model's linear predictor (LP) distribution, in addition to number of events and total sample size. Sample sizes of 100 (or even 200) events and non-events can give imprecise estimates, especially for calibration. The simulation-based calculation accounts for the LP distribution and (mis)calibration in the validation sample. Application identifies 2430 required participants (531 events) for external validation of a deep vein thrombosis diagnostic model. CONCLUSION: Where researchers can anticipate the distribution of the model's LP (eg, based on development sample, or a pilot study), a simulation-based approach for calculating sample size for external validation offers more flexibility and reliability than rules-of-thumb. Elsevier 2021-07 /pmc/articles/PMC8352630/ /pubmed/33596458 http://dx.doi.org/10.1016/j.jclinepi.2021.02.011 Text en © 2021 The Authors https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Original Article Snell, Kym I.E. Archer, Lucinda Ensor, Joie Bonnett, Laura J. Debray, Thomas P.A. Phillips, Bob Collins, Gary S. Riley, Richard D. External validation of clinical prediction models: simulation-based sample size calculations were more reliable than rules-of-thumb |
title | External validation of clinical prediction models: simulation-based sample size calculations were more reliable than rules-of-thumb |
title_full | External validation of clinical prediction models: simulation-based sample size calculations were more reliable than rules-of-thumb |
title_fullStr | External validation of clinical prediction models: simulation-based sample size calculations were more reliable than rules-of-thumb |
title_full_unstemmed | External validation of clinical prediction models: simulation-based sample size calculations were more reliable than rules-of-thumb |
title_short | External validation of clinical prediction models: simulation-based sample size calculations were more reliable than rules-of-thumb |
title_sort | external validation of clinical prediction models: simulation-based sample size calculations were more reliable than rules-of-thumb |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8352630/ https://www.ncbi.nlm.nih.gov/pubmed/33596458 http://dx.doi.org/10.1016/j.jclinepi.2021.02.011 |
work_keys_str_mv | AT snellkymie externalvalidationofclinicalpredictionmodelssimulationbasedsamplesizecalculationsweremorereliablethanrulesofthumb AT archerlucinda externalvalidationofclinicalpredictionmodelssimulationbasedsamplesizecalculationsweremorereliablethanrulesofthumb AT ensorjoie externalvalidationofclinicalpredictionmodelssimulationbasedsamplesizecalculationsweremorereliablethanrulesofthumb AT bonnettlauraj externalvalidationofclinicalpredictionmodelssimulationbasedsamplesizecalculationsweremorereliablethanrulesofthumb AT debraythomaspa externalvalidationofclinicalpredictionmodelssimulationbasedsamplesizecalculationsweremorereliablethanrulesofthumb AT phillipsbob externalvalidationofclinicalpredictionmodelssimulationbasedsamplesizecalculationsweremorereliablethanrulesofthumb AT collinsgarys externalvalidationofclinicalpredictionmodelssimulationbasedsamplesizecalculationsweremorereliablethanrulesofthumb AT rileyrichardd externalvalidationofclinicalpredictionmodelssimulationbasedsamplesizecalculationsweremorereliablethanrulesofthumb |