Cargando…

Is using multiple imputation better than complete case analysis for estimating a prevalence (risk) difference in randomized controlled trials when binary outcome observations are missing?

BACKGROUND: Missing outcomes can seriously impair the ability to make correct inferences from randomized controlled trials (RCTs). Complete case (CC) analysis is commonly used, but it reduces sample size and is perceived to lead to reduced statistical efficiency of estimates while increasing the pot...

Descripción completa

Detalles Bibliográficos
Autores principales: Mukaka, Mavuto, White, Sarah A., Terlouw, Dianne J., Mwapasa, Victor, Kalilani-Phiri, Linda, Faragher, E. Brian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4957845/
https://www.ncbi.nlm.nih.gov/pubmed/27450066
http://dx.doi.org/10.1186/s13063-016-1473-3
_version_ 1782444235745656832
author Mukaka, Mavuto
White, Sarah A.
Terlouw, Dianne J.
Mwapasa, Victor
Kalilani-Phiri, Linda
Faragher, E. Brian
author_facet Mukaka, Mavuto
White, Sarah A.
Terlouw, Dianne J.
Mwapasa, Victor
Kalilani-Phiri, Linda
Faragher, E. Brian
author_sort Mukaka, Mavuto
collection PubMed
description BACKGROUND: Missing outcomes can seriously impair the ability to make correct inferences from randomized controlled trials (RCTs). Complete case (CC) analysis is commonly used, but it reduces sample size and is perceived to lead to reduced statistical efficiency of estimates while increasing the potential for bias. As multiple imputation (MI) methods preserve sample size, they are generally viewed as the preferred analytical approach. We examined this assumption, comparing the performance of CC and MI methods to determine risk difference (RD) estimates in the presence of missing binary outcomes. We conducted simulation studies of 5000 simulated data sets with 50 imputations of RCTs with one primary follow-up endpoint at different underlying levels of RD (3–25 %) and missing outcomes (5–30 %). RESULTS: For missing at random (MAR) or missing completely at random (MCAR) outcomes, CC method estimates generally remained unbiased and achieved precision similar to or better than MI methods, and high statistical coverage. Missing not at random (MNAR) scenarios yielded invalid inferences with both methods. Effect size estimate bias was reduced in MI methods by always including group membership even if this was unrelated to missingness. Surprisingly, under MAR and MCAR conditions in the assessed scenarios, MI offered no statistical advantage over CC methods. CONCLUSION: While MI must inherently accompany CC methods for intention-to-treat analyses, these findings endorse CC methods for per protocol risk difference analyses in these conditions. These findings provide an argument for the use of the CC approach to always complement MI analyses, with the usual caveat that the validity of the mechanism for missingness be thoroughly discussed. More importantly, researchers should strive to collect as much data as possible.
format Online
Article
Text
id pubmed-4957845
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-49578452016-07-23 Is using multiple imputation better than complete case analysis for estimating a prevalence (risk) difference in randomized controlled trials when binary outcome observations are missing? Mukaka, Mavuto White, Sarah A. Terlouw, Dianne J. Mwapasa, Victor Kalilani-Phiri, Linda Faragher, E. Brian Trials Methodology BACKGROUND: Missing outcomes can seriously impair the ability to make correct inferences from randomized controlled trials (RCTs). Complete case (CC) analysis is commonly used, but it reduces sample size and is perceived to lead to reduced statistical efficiency of estimates while increasing the potential for bias. As multiple imputation (MI) methods preserve sample size, they are generally viewed as the preferred analytical approach. We examined this assumption, comparing the performance of CC and MI methods to determine risk difference (RD) estimates in the presence of missing binary outcomes. We conducted simulation studies of 5000 simulated data sets with 50 imputations of RCTs with one primary follow-up endpoint at different underlying levels of RD (3–25 %) and missing outcomes (5–30 %). RESULTS: For missing at random (MAR) or missing completely at random (MCAR) outcomes, CC method estimates generally remained unbiased and achieved precision similar to or better than MI methods, and high statistical coverage. Missing not at random (MNAR) scenarios yielded invalid inferences with both methods. Effect size estimate bias was reduced in MI methods by always including group membership even if this was unrelated to missingness. Surprisingly, under MAR and MCAR conditions in the assessed scenarios, MI offered no statistical advantage over CC methods. CONCLUSION: While MI must inherently accompany CC methods for intention-to-treat analyses, these findings endorse CC methods for per protocol risk difference analyses in these conditions. These findings provide an argument for the use of the CC approach to always complement MI analyses, with the usual caveat that the validity of the mechanism for missingness be thoroughly discussed. More importantly, researchers should strive to collect as much data as possible. BioMed Central 2016-07-22 /pmc/articles/PMC4957845/ /pubmed/27450066 http://dx.doi.org/10.1186/s13063-016-1473-3 Text en © Mukaka et al. 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology
Mukaka, Mavuto
White, Sarah A.
Terlouw, Dianne J.
Mwapasa, Victor
Kalilani-Phiri, Linda
Faragher, E. Brian
Is using multiple imputation better than complete case analysis for estimating a prevalence (risk) difference in randomized controlled trials when binary outcome observations are missing?
title Is using multiple imputation better than complete case analysis for estimating a prevalence (risk) difference in randomized controlled trials when binary outcome observations are missing?
title_full Is using multiple imputation better than complete case analysis for estimating a prevalence (risk) difference in randomized controlled trials when binary outcome observations are missing?
title_fullStr Is using multiple imputation better than complete case analysis for estimating a prevalence (risk) difference in randomized controlled trials when binary outcome observations are missing?
title_full_unstemmed Is using multiple imputation better than complete case analysis for estimating a prevalence (risk) difference in randomized controlled trials when binary outcome observations are missing?
title_short Is using multiple imputation better than complete case analysis for estimating a prevalence (risk) difference in randomized controlled trials when binary outcome observations are missing?
title_sort is using multiple imputation better than complete case analysis for estimating a prevalence (risk) difference in randomized controlled trials when binary outcome observations are missing?
topic Methodology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4957845/
https://www.ncbi.nlm.nih.gov/pubmed/27450066
http://dx.doi.org/10.1186/s13063-016-1473-3
work_keys_str_mv AT mukakamavuto isusingmultipleimputationbetterthancompletecaseanalysisforestimatingaprevalenceriskdifferenceinrandomizedcontrolledtrialswhenbinaryoutcomeobservationsaremissing
AT whitesaraha isusingmultipleimputationbetterthancompletecaseanalysisforestimatingaprevalenceriskdifferenceinrandomizedcontrolledtrialswhenbinaryoutcomeobservationsaremissing
AT terlouwdiannej isusingmultipleimputationbetterthancompletecaseanalysisforestimatingaprevalenceriskdifferenceinrandomizedcontrolledtrialswhenbinaryoutcomeobservationsaremissing
AT mwapasavictor isusingmultipleimputationbetterthancompletecaseanalysisforestimatingaprevalenceriskdifferenceinrandomizedcontrolledtrialswhenbinaryoutcomeobservationsaremissing
AT kalilaniphirilinda isusingmultipleimputationbetterthancompletecaseanalysisforestimatingaprevalenceriskdifferenceinrandomizedcontrolledtrialswhenbinaryoutcomeobservationsaremissing
AT faragherebrian isusingmultipleimputationbetterthancompletecaseanalysisforestimatingaprevalenceriskdifferenceinrandomizedcontrolledtrialswhenbinaryoutcomeobservationsaremissing