Cargando…
What impact do assumptions about missing data have on conclusions? A practical sensitivity analysis for a cancer survival registry
BACKGROUND: Within epidemiological and clinical research, missing data are a common issue and often over looked in publications. When the issue of missing observations is addressed it is usually assumed that the missing data are ‘missing at random’ (MAR). This assumption should be checked for plausi...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5294884/ https://www.ncbi.nlm.nih.gov/pubmed/28166735 http://dx.doi.org/10.1186/s12874-017-0301-0 |
_version_ | 1782505325879885824 |
---|---|
author | Smuk, M. Carpenter, J. R. Morris, T. P. |
author_facet | Smuk, M. Carpenter, J. R. Morris, T. P. |
author_sort | Smuk, M. |
collection | PubMed |
description | BACKGROUND: Within epidemiological and clinical research, missing data are a common issue and often over looked in publications. When the issue of missing observations is addressed it is usually assumed that the missing data are ‘missing at random’ (MAR). This assumption should be checked for plausibility, however it is untestable, thus inferences should be assessed for robustness to departures from missing at random. METHODS: We highlight the method of pattern mixture sensitivity analysis after multiple imputation using colorectal cancer data as an example. We focus on the Dukes’ stage variable which has the highest proportion of missing observations. First, we find the probability of being in each Dukes’ stage given the MAR imputed dataset. We use these probabilities in a questionnaire to elicit prior beliefs from experts on what they believe the probability would be in the missing data. The questionnaire responses are then used in a Dirichlet draw to create a Bayesian ‘missing not at random’ (MNAR) prior to impute the missing observations. The model of interest is applied and inferences are compared to those from the MAR imputed data. RESULTS: The inferences were largely insensitive to departure from MAR. Inferences under MNAR suggested a smaller association between Dukes’ stage and death, though the association remained positive and with similarly low p values. CONCLUSIONS: We conclude by discussing the positives and negatives of our method and highlight the importance of making people aware of the need to test the MAR assumption. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12874-017-0301-0) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-5294884 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-52948842017-02-09 What impact do assumptions about missing data have on conclusions? A practical sensitivity analysis for a cancer survival registry Smuk, M. Carpenter, J. R. Morris, T. P. BMC Med Res Methodol Research Article BACKGROUND: Within epidemiological and clinical research, missing data are a common issue and often over looked in publications. When the issue of missing observations is addressed it is usually assumed that the missing data are ‘missing at random’ (MAR). This assumption should be checked for plausibility, however it is untestable, thus inferences should be assessed for robustness to departures from missing at random. METHODS: We highlight the method of pattern mixture sensitivity analysis after multiple imputation using colorectal cancer data as an example. We focus on the Dukes’ stage variable which has the highest proportion of missing observations. First, we find the probability of being in each Dukes’ stage given the MAR imputed dataset. We use these probabilities in a questionnaire to elicit prior beliefs from experts on what they believe the probability would be in the missing data. The questionnaire responses are then used in a Dirichlet draw to create a Bayesian ‘missing not at random’ (MNAR) prior to impute the missing observations. The model of interest is applied and inferences are compared to those from the MAR imputed data. RESULTS: The inferences were largely insensitive to departure from MAR. Inferences under MNAR suggested a smaller association between Dukes’ stage and death, though the association remained positive and with similarly low p values. CONCLUSIONS: We conclude by discussing the positives and negatives of our method and highlight the importance of making people aware of the need to test the MAR assumption. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12874-017-0301-0) contains supplementary material, which is available to authorized users. BioMed Central 2017-02-06 /pmc/articles/PMC5294884/ /pubmed/28166735 http://dx.doi.org/10.1186/s12874-017-0301-0 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Smuk, M. Carpenter, J. R. Morris, T. P. What impact do assumptions about missing data have on conclusions? A practical sensitivity analysis for a cancer survival registry |
title | What impact do assumptions about missing data have on conclusions? A practical sensitivity analysis for a cancer survival registry |
title_full | What impact do assumptions about missing data have on conclusions? A practical sensitivity analysis for a cancer survival registry |
title_fullStr | What impact do assumptions about missing data have on conclusions? A practical sensitivity analysis for a cancer survival registry |
title_full_unstemmed | What impact do assumptions about missing data have on conclusions? A practical sensitivity analysis for a cancer survival registry |
title_short | What impact do assumptions about missing data have on conclusions? A practical sensitivity analysis for a cancer survival registry |
title_sort | what impact do assumptions about missing data have on conclusions? a practical sensitivity analysis for a cancer survival registry |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5294884/ https://www.ncbi.nlm.nih.gov/pubmed/28166735 http://dx.doi.org/10.1186/s12874-017-0301-0 |
work_keys_str_mv | AT smukm whatimpactdoassumptionsaboutmissingdatahaveonconclusionsapracticalsensitivityanalysisforacancersurvivalregistry AT carpenterjr whatimpactdoassumptionsaboutmissingdatahaveonconclusionsapracticalsensitivityanalysisforacancersurvivalregistry AT morristp whatimpactdoassumptionsaboutmissingdatahaveonconclusionsapracticalsensitivityanalysisforacancersurvivalregistry |