Cargando…

Sample size calculation for a stepped wedge trial

BACKGROUND: Stepped wedge trials (SWTs) can be considered as a variant of a clustered randomised trial, although in many ways they embed additional complications from the point of view of statistical design and analysis. While the literature is rich for standard parallel or clustered randomised clin...

Descripción completa

Detalles Bibliográficos
Autores principales: Baio, Gianluca, Copas, Andrew, Ambler, Gareth, Hargreaves, James, Beard, Emma, Omar, Rumana Z
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4538764/
https://www.ncbi.nlm.nih.gov/pubmed/26282553
http://dx.doi.org/10.1186/s13063-015-0840-9
_version_ 1782386029614858240
author Baio, Gianluca
Copas, Andrew
Ambler, Gareth
Hargreaves, James
Beard, Emma
Omar, Rumana Z
author_facet Baio, Gianluca
Copas, Andrew
Ambler, Gareth
Hargreaves, James
Beard, Emma
Omar, Rumana Z
author_sort Baio, Gianluca
collection PubMed
description BACKGROUND: Stepped wedge trials (SWTs) can be considered as a variant of a clustered randomised trial, although in many ways they embed additional complications from the point of view of statistical design and analysis. While the literature is rich for standard parallel or clustered randomised clinical trials (CRTs), it is much less so for SWTs. The specific features of SWTs need to be addressed properly in the sample size calculations to ensure valid estimates of the intervention effect. METHODS: We critically review the available literature on analytical methods to perform sample size and power calculations in a SWT. In particular, we highlight the specific assumptions underlying currently used methods and comment on their validity and potential for extensions. Finally, we propose the use of simulation-based methods to overcome some of the limitations of analytical formulae. We performed a simulation exercise in which we compared simulation-based sample size computations with analytical methods and assessed the impact of varying the basic parameters to the resulting sample size/power, in the case of continuous and binary outcomes and assuming both cross-sectional data and the closed cohort design. RESULTS: We compared the sample size requirements for a SWT in comparison to CRTs based on comparable number of measurements in each cluster. In line with the existing literature, we found that when the level of correlation within the clusters is relatively high (for example, greater than 0.1), the SWT requires a smaller number of clusters. For low values of the intracluster correlation, the two designs produce more similar requirements in terms of total number of clusters. We validated our simulation-based approach and compared the results of sample size calculations to analytical methods; the simulation-based procedures perform well, producing results that are extremely similar to the analytical methods. We found that usually the SWT is relatively insensitive to variations in the intracluster correlation, and that failure to account for a potential time effect will artificially and grossly overestimate the power of a study. CONCLUSIONS: We provide a framework for handling the sample size and power calculations of a SWT and suggest that simulation-based procedures may be more effective, especially in dealing with the specific features of the study at hand. In selected situations and depending on the level of intracluster correlation and the cluster size, SWTs may be more efficient than comparable CRTs. However, the decision about the design to be implemented will be based on a wide range of considerations, including the cost associated with the number of clusters, number of measurements and the trial duration.
format Online
Article
Text
id pubmed-4538764
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-45387642015-08-18 Sample size calculation for a stepped wedge trial Baio, Gianluca Copas, Andrew Ambler, Gareth Hargreaves, James Beard, Emma Omar, Rumana Z Trials Research BACKGROUND: Stepped wedge trials (SWTs) can be considered as a variant of a clustered randomised trial, although in many ways they embed additional complications from the point of view of statistical design and analysis. While the literature is rich for standard parallel or clustered randomised clinical trials (CRTs), it is much less so for SWTs. The specific features of SWTs need to be addressed properly in the sample size calculations to ensure valid estimates of the intervention effect. METHODS: We critically review the available literature on analytical methods to perform sample size and power calculations in a SWT. In particular, we highlight the specific assumptions underlying currently used methods and comment on their validity and potential for extensions. Finally, we propose the use of simulation-based methods to overcome some of the limitations of analytical formulae. We performed a simulation exercise in which we compared simulation-based sample size computations with analytical methods and assessed the impact of varying the basic parameters to the resulting sample size/power, in the case of continuous and binary outcomes and assuming both cross-sectional data and the closed cohort design. RESULTS: We compared the sample size requirements for a SWT in comparison to CRTs based on comparable number of measurements in each cluster. In line with the existing literature, we found that when the level of correlation within the clusters is relatively high (for example, greater than 0.1), the SWT requires a smaller number of clusters. For low values of the intracluster correlation, the two designs produce more similar requirements in terms of total number of clusters. We validated our simulation-based approach and compared the results of sample size calculations to analytical methods; the simulation-based procedures perform well, producing results that are extremely similar to the analytical methods. We found that usually the SWT is relatively insensitive to variations in the intracluster correlation, and that failure to account for a potential time effect will artificially and grossly overestimate the power of a study. CONCLUSIONS: We provide a framework for handling the sample size and power calculations of a SWT and suggest that simulation-based procedures may be more effective, especially in dealing with the specific features of the study at hand. In selected situations and depending on the level of intracluster correlation and the cluster size, SWTs may be more efficient than comparable CRTs. However, the decision about the design to be implemented will be based on a wide range of considerations, including the cost associated with the number of clusters, number of measurements and the trial duration. BioMed Central 2015-08-17 /pmc/articles/PMC4538764/ /pubmed/26282553 http://dx.doi.org/10.1186/s13063-015-0840-9 Text en © Baio et al. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Baio, Gianluca
Copas, Andrew
Ambler, Gareth
Hargreaves, James
Beard, Emma
Omar, Rumana Z
Sample size calculation for a stepped wedge trial
title Sample size calculation for a stepped wedge trial
title_full Sample size calculation for a stepped wedge trial
title_fullStr Sample size calculation for a stepped wedge trial
title_full_unstemmed Sample size calculation for a stepped wedge trial
title_short Sample size calculation for a stepped wedge trial
title_sort sample size calculation for a stepped wedge trial
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4538764/
https://www.ncbi.nlm.nih.gov/pubmed/26282553
http://dx.doi.org/10.1186/s13063-015-0840-9
work_keys_str_mv AT baiogianluca samplesizecalculationforasteppedwedgetrial
AT copasandrew samplesizecalculationforasteppedwedgetrial
AT amblergareth samplesizecalculationforasteppedwedgetrial
AT hargreavesjames samplesizecalculationforasteppedwedgetrial
AT beardemma samplesizecalculationforasteppedwedgetrial
AT omarrumanaz samplesizecalculationforasteppedwedgetrial