Cargando…

An exploration of the missing data mechanism in an Internet based smoking cessation trial

BACKGROUND: Missing outcome data are very common in smoking cessation trials. It is often assumed that all such missing data are from participants who have been unsuccessful in giving up smoking (“missing=smoking”). Here we use data from a recent Internet based smoking cessation trial in order to in...

Descripción completa

Detalles Bibliográficos
Autores principales: Jackson, Dan, Mason, Dan, White, Ian R, Sutton, Stephen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3507670/
https://www.ncbi.nlm.nih.gov/pubmed/23067272
http://dx.doi.org/10.1186/1471-2288-12-157
_version_ 1782251105608007680
author Jackson, Dan
Mason, Dan
White, Ian R
Sutton, Stephen
author_facet Jackson, Dan
Mason, Dan
White, Ian R
Sutton, Stephen
author_sort Jackson, Dan
collection PubMed
description BACKGROUND: Missing outcome data are very common in smoking cessation trials. It is often assumed that all such missing data are from participants who have been unsuccessful in giving up smoking (“missing=smoking”). Here we use data from a recent Internet based smoking cessation trial in order to investigate which of a set of a priori chosen baseline variables are predictive of missingness, and the evidence for and against the “missing=smoking” assumption. METHODS: We use a selection model, which models the probability that the outcome is observed given the outcome and other variables. The selection model includes a parameter for which zero indicates that the data are Missing at Random (MAR) and large values indicate “missing=smoking”. We examine the evidence for the predictive power of baseline variables in the context of a sensitivity analysis. We use data on the number and type of attempts made to obtain outcome data in order to estimate the association between smoking status and the missing data indicator. RESULTS: We apply our methods to the iQuit smoking cessation trial data. From the sensitivity analysis, we obtain strong evidence that older participants are more likely to provide outcome data. The model for the number and type of attempts to obtain outcome data confirms that age is a good predictor of missing data. There is weak evidence from this model that participants who have successfully given up smoking are more likely to provide outcome data but this evidence does not support the “missing=smoking” assumption. The probability that participants with missing outcome data are not smoking at the end of the trial is estimated to be between 0.14 and 0.19. CONCLUSIONS: Those conducting smoking cessation trials, and wishing to perform an analysis that assumes the data are MAR, should collect and incorporate baseline variables into their models that are thought to be good predictors of missing data in order to make this assumption more plausible. However they should also consider the possibility of Missing Not at Random (MNAR) models that make or allow for less extreme assumptions than “missing=smoking”.
format Online
Article
Text
id pubmed-3507670
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-35076702012-12-03 An exploration of the missing data mechanism in an Internet based smoking cessation trial Jackson, Dan Mason, Dan White, Ian R Sutton, Stephen BMC Med Res Methodol Research Article BACKGROUND: Missing outcome data are very common in smoking cessation trials. It is often assumed that all such missing data are from participants who have been unsuccessful in giving up smoking (“missing=smoking”). Here we use data from a recent Internet based smoking cessation trial in order to investigate which of a set of a priori chosen baseline variables are predictive of missingness, and the evidence for and against the “missing=smoking” assumption. METHODS: We use a selection model, which models the probability that the outcome is observed given the outcome and other variables. The selection model includes a parameter for which zero indicates that the data are Missing at Random (MAR) and large values indicate “missing=smoking”. We examine the evidence for the predictive power of baseline variables in the context of a sensitivity analysis. We use data on the number and type of attempts made to obtain outcome data in order to estimate the association between smoking status and the missing data indicator. RESULTS: We apply our methods to the iQuit smoking cessation trial data. From the sensitivity analysis, we obtain strong evidence that older participants are more likely to provide outcome data. The model for the number and type of attempts to obtain outcome data confirms that age is a good predictor of missing data. There is weak evidence from this model that participants who have successfully given up smoking are more likely to provide outcome data but this evidence does not support the “missing=smoking” assumption. The probability that participants with missing outcome data are not smoking at the end of the trial is estimated to be between 0.14 and 0.19. CONCLUSIONS: Those conducting smoking cessation trials, and wishing to perform an analysis that assumes the data are MAR, should collect and incorporate baseline variables into their models that are thought to be good predictors of missing data in order to make this assumption more plausible. However they should also consider the possibility of Missing Not at Random (MNAR) models that make or allow for less extreme assumptions than “missing=smoking”. BioMed Central 2012-10-15 /pmc/articles/PMC3507670/ /pubmed/23067272 http://dx.doi.org/10.1186/1471-2288-12-157 Text en Copyright ©2012 Jackson et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Jackson, Dan
Mason, Dan
White, Ian R
Sutton, Stephen
An exploration of the missing data mechanism in an Internet based smoking cessation trial
title An exploration of the missing data mechanism in an Internet based smoking cessation trial
title_full An exploration of the missing data mechanism in an Internet based smoking cessation trial
title_fullStr An exploration of the missing data mechanism in an Internet based smoking cessation trial
title_full_unstemmed An exploration of the missing data mechanism in an Internet based smoking cessation trial
title_short An exploration of the missing data mechanism in an Internet based smoking cessation trial
title_sort exploration of the missing data mechanism in an internet based smoking cessation trial
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3507670/
https://www.ncbi.nlm.nih.gov/pubmed/23067272
http://dx.doi.org/10.1186/1471-2288-12-157
work_keys_str_mv AT jacksondan anexplorationofthemissingdatamechanisminaninternetbasedsmokingcessationtrial
AT masondan anexplorationofthemissingdatamechanisminaninternetbasedsmokingcessationtrial
AT whiteianr anexplorationofthemissingdatamechanisminaninternetbasedsmokingcessationtrial
AT suttonstephen anexplorationofthemissingdatamechanisminaninternetbasedsmokingcessationtrial
AT jacksondan explorationofthemissingdatamechanisminaninternetbasedsmokingcessationtrial
AT masondan explorationofthemissingdatamechanisminaninternetbasedsmokingcessationtrial
AT whiteianr explorationofthemissingdatamechanisminaninternetbasedsmokingcessationtrial
AT suttonstephen explorationofthemissingdatamechanisminaninternetbasedsmokingcessationtrial