Cargando…
An exploration of the missing data mechanism in an Internet based smoking cessation trial
BACKGROUND: Missing outcome data are very common in smoking cessation trials. It is often assumed that all such missing data are from participants who have been unsuccessful in giving up smoking (“missing=smoking”). Here we use data from a recent Internet based smoking cessation trial in order to in...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3507670/ https://www.ncbi.nlm.nih.gov/pubmed/23067272 http://dx.doi.org/10.1186/1471-2288-12-157 |
_version_ | 1782251105608007680 |
---|---|
author | Jackson, Dan Mason, Dan White, Ian R Sutton, Stephen |
author_facet | Jackson, Dan Mason, Dan White, Ian R Sutton, Stephen |
author_sort | Jackson, Dan |
collection | PubMed |
description | BACKGROUND: Missing outcome data are very common in smoking cessation trials. It is often assumed that all such missing data are from participants who have been unsuccessful in giving up smoking (“missing=smoking”). Here we use data from a recent Internet based smoking cessation trial in order to investigate which of a set of a priori chosen baseline variables are predictive of missingness, and the evidence for and against the “missing=smoking” assumption. METHODS: We use a selection model, which models the probability that the outcome is observed given the outcome and other variables. The selection model includes a parameter for which zero indicates that the data are Missing at Random (MAR) and large values indicate “missing=smoking”. We examine the evidence for the predictive power of baseline variables in the context of a sensitivity analysis. We use data on the number and type of attempts made to obtain outcome data in order to estimate the association between smoking status and the missing data indicator. RESULTS: We apply our methods to the iQuit smoking cessation trial data. From the sensitivity analysis, we obtain strong evidence that older participants are more likely to provide outcome data. The model for the number and type of attempts to obtain outcome data confirms that age is a good predictor of missing data. There is weak evidence from this model that participants who have successfully given up smoking are more likely to provide outcome data but this evidence does not support the “missing=smoking” assumption. The probability that participants with missing outcome data are not smoking at the end of the trial is estimated to be between 0.14 and 0.19. CONCLUSIONS: Those conducting smoking cessation trials, and wishing to perform an analysis that assumes the data are MAR, should collect and incorporate baseline variables into their models that are thought to be good predictors of missing data in order to make this assumption more plausible. However they should also consider the possibility of Missing Not at Random (MNAR) models that make or allow for less extreme assumptions than “missing=smoking”. |
format | Online Article Text |
id | pubmed-3507670 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-35076702012-12-03 An exploration of the missing data mechanism in an Internet based smoking cessation trial Jackson, Dan Mason, Dan White, Ian R Sutton, Stephen BMC Med Res Methodol Research Article BACKGROUND: Missing outcome data are very common in smoking cessation trials. It is often assumed that all such missing data are from participants who have been unsuccessful in giving up smoking (“missing=smoking”). Here we use data from a recent Internet based smoking cessation trial in order to investigate which of a set of a priori chosen baseline variables are predictive of missingness, and the evidence for and against the “missing=smoking” assumption. METHODS: We use a selection model, which models the probability that the outcome is observed given the outcome and other variables. The selection model includes a parameter for which zero indicates that the data are Missing at Random (MAR) and large values indicate “missing=smoking”. We examine the evidence for the predictive power of baseline variables in the context of a sensitivity analysis. We use data on the number and type of attempts made to obtain outcome data in order to estimate the association between smoking status and the missing data indicator. RESULTS: We apply our methods to the iQuit smoking cessation trial data. From the sensitivity analysis, we obtain strong evidence that older participants are more likely to provide outcome data. The model for the number and type of attempts to obtain outcome data confirms that age is a good predictor of missing data. There is weak evidence from this model that participants who have successfully given up smoking are more likely to provide outcome data but this evidence does not support the “missing=smoking” assumption. The probability that participants with missing outcome data are not smoking at the end of the trial is estimated to be between 0.14 and 0.19. CONCLUSIONS: Those conducting smoking cessation trials, and wishing to perform an analysis that assumes the data are MAR, should collect and incorporate baseline variables into their models that are thought to be good predictors of missing data in order to make this assumption more plausible. However they should also consider the possibility of Missing Not at Random (MNAR) models that make or allow for less extreme assumptions than “missing=smoking”. BioMed Central 2012-10-15 /pmc/articles/PMC3507670/ /pubmed/23067272 http://dx.doi.org/10.1186/1471-2288-12-157 Text en Copyright ©2012 Jackson et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Jackson, Dan Mason, Dan White, Ian R Sutton, Stephen An exploration of the missing data mechanism in an Internet based smoking cessation trial |
title | An exploration of the missing data mechanism in an Internet based smoking cessation trial |
title_full | An exploration of the missing data mechanism in an Internet based smoking cessation trial |
title_fullStr | An exploration of the missing data mechanism in an Internet based smoking cessation trial |
title_full_unstemmed | An exploration of the missing data mechanism in an Internet based smoking cessation trial |
title_short | An exploration of the missing data mechanism in an Internet based smoking cessation trial |
title_sort | exploration of the missing data mechanism in an internet based smoking cessation trial |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3507670/ https://www.ncbi.nlm.nih.gov/pubmed/23067272 http://dx.doi.org/10.1186/1471-2288-12-157 |
work_keys_str_mv | AT jacksondan anexplorationofthemissingdatamechanisminaninternetbasedsmokingcessationtrial AT masondan anexplorationofthemissingdatamechanisminaninternetbasedsmokingcessationtrial AT whiteianr anexplorationofthemissingdatamechanisminaninternetbasedsmokingcessationtrial AT suttonstephen anexplorationofthemissingdatamechanisminaninternetbasedsmokingcessationtrial AT jacksondan explorationofthemissingdatamechanisminaninternetbasedsmokingcessationtrial AT masondan explorationofthemissingdatamechanisminaninternetbasedsmokingcessationtrial AT whiteianr explorationofthemissingdatamechanisminaninternetbasedsmokingcessationtrial AT suttonstephen explorationofthemissingdatamechanisminaninternetbasedsmokingcessationtrial |