Cargando…

Calculating Power by Bootstrap, with an Application to Cluster-Randomized Trials

BACKGROUND: A key requirement for a useful power calculation is that the calculation mimic the data analysis that will be performed on the actual data, once that data is observed. Close approximations may be difficult to achieve using analytic solutions, however, and thus Monte Carlo approaches, inc...

Descripción completa

Detalles Bibliográficos
Autores principales: Kleinman, Ken, Huang, Susan S.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: AcademyHealth 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5340517/
https://www.ncbi.nlm.nih.gov/pubmed/28303254
http://dx.doi.org/10.13063/2327-9214.1202
_version_ 1782512843792318464
author Kleinman, Ken
Huang, Susan S.
author_facet Kleinman, Ken
Huang, Susan S.
author_sort Kleinman, Ken
collection PubMed
description BACKGROUND: A key requirement for a useful power calculation is that the calculation mimic the data analysis that will be performed on the actual data, once that data is observed. Close approximations may be difficult to achieve using analytic solutions, however, and thus Monte Carlo approaches, including both simulation and bootstrap resampling, are often attractive. One setting in which this is particularly true is cluster-randomized trial designs. However, Monte Carlo approaches are useful in many additional settings as well. Calculating power for cluster-randomized trials using analytic or simulation-based methods is frequently unsatisfactory due to the complexity of the data analysis methods to be employed and to the sparseness of data to inform the choice of important parameters in these methods. METHODS: We propose that among Monte Carlo methods, bootstrap approaches are most likely to generate data similar to the observed data. In bootstrap approaches, real data are resampled to build complete data sets based on real data that resemble the data for the intended analyses. In contrast, simulation methods would use the real data to estimate parameters for the data, and would then simulate data using these parameters. We describe means of implementing bootstrap power calculation. RESULTS: We demonstrate bootstrap power calculation for a cluster-randomized trial with a censored survival outcome and a baseline observation period. CONCLUSIONS: Bootstrap power calculation is a natural application of resampling methods. It provides a relatively simple solution to power calculation that is likely to be more accurate than analytic solutions or simulation-based calculations, in the sense that the bootstrap approach does not rely on the assumptions inherent in analytic calculations. This method of calculation has several important strengths. Notably, it is simple to achieve great fidelity to the proposed data analysis method and there is no requirement for parameter estimates, or estimates of their variability, from outside settings. So, for example, for cluster-randomized trials, power calculations need not depend on intracluster correlation coefficient estimates from outside studies. In contrast, bootstrap power calculation requires initial data that resemble data that are to be used in the planned study. We are not aware of bootstrap power calculation being previously proposed or explored for cluster-randomized trials, but it can also be applied for other study designs. We show with a simulation study that bootstrap power calculation can replicate analytic power in cases where analytic power can be accurately calculated. We also demonstrate power calculations for correlated censored survival outcomes in a cluster-randomized trial setting, for which we are unaware of an analytic alternative. The method can easily be used when preliminary data are available, as is likely to be the case when research is performed in health delivery systems or other settings where electronic medical records can be obtained.
format Online
Article
Text
id pubmed-5340517
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher AcademyHealth
record_format MEDLINE/PubMed
spelling pubmed-53405172017-03-16 Calculating Power by Bootstrap, with an Application to Cluster-Randomized Trials Kleinman, Ken Huang, Susan S. EGEMS (Wash DC) Articles BACKGROUND: A key requirement for a useful power calculation is that the calculation mimic the data analysis that will be performed on the actual data, once that data is observed. Close approximations may be difficult to achieve using analytic solutions, however, and thus Monte Carlo approaches, including both simulation and bootstrap resampling, are often attractive. One setting in which this is particularly true is cluster-randomized trial designs. However, Monte Carlo approaches are useful in many additional settings as well. Calculating power for cluster-randomized trials using analytic or simulation-based methods is frequently unsatisfactory due to the complexity of the data analysis methods to be employed and to the sparseness of data to inform the choice of important parameters in these methods. METHODS: We propose that among Monte Carlo methods, bootstrap approaches are most likely to generate data similar to the observed data. In bootstrap approaches, real data are resampled to build complete data sets based on real data that resemble the data for the intended analyses. In contrast, simulation methods would use the real data to estimate parameters for the data, and would then simulate data using these parameters. We describe means of implementing bootstrap power calculation. RESULTS: We demonstrate bootstrap power calculation for a cluster-randomized trial with a censored survival outcome and a baseline observation period. CONCLUSIONS: Bootstrap power calculation is a natural application of resampling methods. It provides a relatively simple solution to power calculation that is likely to be more accurate than analytic solutions or simulation-based calculations, in the sense that the bootstrap approach does not rely on the assumptions inherent in analytic calculations. This method of calculation has several important strengths. Notably, it is simple to achieve great fidelity to the proposed data analysis method and there is no requirement for parameter estimates, or estimates of their variability, from outside settings. So, for example, for cluster-randomized trials, power calculations need not depend on intracluster correlation coefficient estimates from outside studies. In contrast, bootstrap power calculation requires initial data that resemble data that are to be used in the planned study. We are not aware of bootstrap power calculation being previously proposed or explored for cluster-randomized trials, but it can also be applied for other study designs. We show with a simulation study that bootstrap power calculation can replicate analytic power in cases where analytic power can be accurately calculated. We also demonstrate power calculations for correlated censored survival outcomes in a cluster-randomized trial setting, for which we are unaware of an analytic alternative. The method can easily be used when preliminary data are available, as is likely to be the case when research is performed in health delivery systems or other settings where electronic medical records can be obtained. AcademyHealth 2017-02-09 /pmc/articles/PMC5340517/ /pubmed/28303254 http://dx.doi.org/10.13063/2327-9214.1202 Text en All eGEMs publications are licensed under a Creative Commons Attribution-Noncommercial-No Derivative Works 3.0 License http://creativecommons.org/licenses/by-nc-nd/3.0/
spellingShingle Articles
Kleinman, Ken
Huang, Susan S.
Calculating Power by Bootstrap, with an Application to Cluster-Randomized Trials
title Calculating Power by Bootstrap, with an Application to Cluster-Randomized Trials
title_full Calculating Power by Bootstrap, with an Application to Cluster-Randomized Trials
title_fullStr Calculating Power by Bootstrap, with an Application to Cluster-Randomized Trials
title_full_unstemmed Calculating Power by Bootstrap, with an Application to Cluster-Randomized Trials
title_short Calculating Power by Bootstrap, with an Application to Cluster-Randomized Trials
title_sort calculating power by bootstrap, with an application to cluster-randomized trials
topic Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5340517/
https://www.ncbi.nlm.nih.gov/pubmed/28303254
http://dx.doi.org/10.13063/2327-9214.1202
work_keys_str_mv AT kleinmanken calculatingpowerbybootstrapwithanapplicationtoclusterrandomizedtrials
AT huangsusans calculatingpowerbybootstrapwithanapplicationtoclusterrandomizedtrials