Cargando…
The rise of multiple imputation: a review of the reporting and implementation of the method in medical research
BACKGROUND: Missing data are common in medical research, which can lead to a loss in statistical power and potentially biased results if not handled appropriately. Multiple imputation (MI) is a statistical method, widely adopted in practice, for dealing with missing data. Many academic journals now...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4396150/ https://www.ncbi.nlm.nih.gov/pubmed/25880850 http://dx.doi.org/10.1186/s12874-015-0022-1 |
_version_ | 1782366552933269504 |
---|---|
author | Hayati Rezvan, Panteha Lee, Katherine J Simpson, Julie A |
author_facet | Hayati Rezvan, Panteha Lee, Katherine J Simpson, Julie A |
author_sort | Hayati Rezvan, Panteha |
collection | PubMed |
description | BACKGROUND: Missing data are common in medical research, which can lead to a loss in statistical power and potentially biased results if not handled appropriately. Multiple imputation (MI) is a statistical method, widely adopted in practice, for dealing with missing data. Many academic journals now emphasise the importance of reporting information regarding missing data and proposed guidelines for documenting the application of MI have been published. This review evaluated the reporting of missing data, the application of MI including the details provided regarding the imputation model, and the frequency of sensitivity analyses within the MI framework in medical research articles. METHODS: A systematic review of articles published in the Lancet and New England Journal of Medicine between January 2008 and December 2013 in which MI was implemented was carried out. RESULTS: We identified 103 papers that used MI, with the number of papers increasing from 11 in 2008 to 26 in 2013. Nearly half of the papers specified the proportion of complete cases or the proportion with missing data by each variable. In the majority of the articles (86%) the imputed variables were specified. Of the 38 papers (37%) that stated the method of imputation, 20 used chained equations, 8 used multivariate normal imputation, and 10 used alternative methods. Very few articles (9%) detailed how they handled non-normally distributed variables during imputation. Thirty-nine papers (38%) stated the variables included in the imputation model. Less than half of the papers (46%) reported the number of imputations, and only two papers compared the distribution of imputed and observed data. Sixty-six papers presented the results from MI as a secondary analysis. Only three articles carried out a sensitivity analysis following MI to assess departures from the missing at random assumption, with details of the sensitivity analyses only provided by one article. CONCLUSIONS: This review outlined deficiencies in the documenting of missing data and the details provided about imputation. Furthermore, only a few articles performed sensitivity analyses following MI even though this is strongly recommended in guidelines. Authors are encouraged to follow the available guidelines and provide information on missing data and the imputation process. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12874-015-0022-1) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4396150 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-43961502015-04-14 The rise of multiple imputation: a review of the reporting and implementation of the method in medical research Hayati Rezvan, Panteha Lee, Katherine J Simpson, Julie A BMC Med Res Methodol Research Article BACKGROUND: Missing data are common in medical research, which can lead to a loss in statistical power and potentially biased results if not handled appropriately. Multiple imputation (MI) is a statistical method, widely adopted in practice, for dealing with missing data. Many academic journals now emphasise the importance of reporting information regarding missing data and proposed guidelines for documenting the application of MI have been published. This review evaluated the reporting of missing data, the application of MI including the details provided regarding the imputation model, and the frequency of sensitivity analyses within the MI framework in medical research articles. METHODS: A systematic review of articles published in the Lancet and New England Journal of Medicine between January 2008 and December 2013 in which MI was implemented was carried out. RESULTS: We identified 103 papers that used MI, with the number of papers increasing from 11 in 2008 to 26 in 2013. Nearly half of the papers specified the proportion of complete cases or the proportion with missing data by each variable. In the majority of the articles (86%) the imputed variables were specified. Of the 38 papers (37%) that stated the method of imputation, 20 used chained equations, 8 used multivariate normal imputation, and 10 used alternative methods. Very few articles (9%) detailed how they handled non-normally distributed variables during imputation. Thirty-nine papers (38%) stated the variables included in the imputation model. Less than half of the papers (46%) reported the number of imputations, and only two papers compared the distribution of imputed and observed data. Sixty-six papers presented the results from MI as a secondary analysis. Only three articles carried out a sensitivity analysis following MI to assess departures from the missing at random assumption, with details of the sensitivity analyses only provided by one article. CONCLUSIONS: This review outlined deficiencies in the documenting of missing data and the details provided about imputation. Furthermore, only a few articles performed sensitivity analyses following MI even though this is strongly recommended in guidelines. Authors are encouraged to follow the available guidelines and provide information on missing data and the imputation process. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12874-015-0022-1) contains supplementary material, which is available to authorized users. BioMed Central 2015-04-07 /pmc/articles/PMC4396150/ /pubmed/25880850 http://dx.doi.org/10.1186/s12874-015-0022-1 Text en © Hayati Rezvan et al.; licensee BioMed Central. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Hayati Rezvan, Panteha Lee, Katherine J Simpson, Julie A The rise of multiple imputation: a review of the reporting and implementation of the method in medical research |
title | The rise of multiple imputation: a review of the reporting and implementation of the method in medical research |
title_full | The rise of multiple imputation: a review of the reporting and implementation of the method in medical research |
title_fullStr | The rise of multiple imputation: a review of the reporting and implementation of the method in medical research |
title_full_unstemmed | The rise of multiple imputation: a review of the reporting and implementation of the method in medical research |
title_short | The rise of multiple imputation: a review of the reporting and implementation of the method in medical research |
title_sort | rise of multiple imputation: a review of the reporting and implementation of the method in medical research |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4396150/ https://www.ncbi.nlm.nih.gov/pubmed/25880850 http://dx.doi.org/10.1186/s12874-015-0022-1 |
work_keys_str_mv | AT hayatirezvanpanteha theriseofmultipleimputationareviewofthereportingandimplementationofthemethodinmedicalresearch AT leekatherinej theriseofmultipleimputationareviewofthereportingandimplementationofthemethodinmedicalresearch AT simpsonjuliea theriseofmultipleimputationareviewofthereportingandimplementationofthemethodinmedicalresearch AT hayatirezvanpanteha riseofmultipleimputationareviewofthereportingandimplementationofthemethodinmedicalresearch AT leekatherinej riseofmultipleimputationareviewofthereportingandimplementationofthemethodinmedicalresearch AT simpsonjuliea riseofmultipleimputationareviewofthereportingandimplementationofthemethodinmedicalresearch |