Cargando…
Is there a role for expectation maximization imputation in addressing missing data in research using WOMAC questionnaire? Comparison to the standard mean approach and a tutorial
BACKGROUND: Standard mean imputation for missing values in the Western Ontario and Mc Master (WOMAC) Osteoarthritis Index limits the use of collected data and may lead to bias. Probability model-based imputation methods overcome such limitations but were never before applied to the WOMAC. In this st...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3124424/ https://www.ncbi.nlm.nih.gov/pubmed/21605458 http://dx.doi.org/10.1186/1471-2474-12-109 |
_version_ | 1782207085732167680 |
---|---|
author | Ghomrawi, Hassan MK Mandl, Lisa A Rutledge, John Alexiades, Michael M Mazumdar, Madhu |
author_facet | Ghomrawi, Hassan MK Mandl, Lisa A Rutledge, John Alexiades, Michael M Mazumdar, Madhu |
author_sort | Ghomrawi, Hassan MK |
collection | PubMed |
description | BACKGROUND: Standard mean imputation for missing values in the Western Ontario and Mc Master (WOMAC) Osteoarthritis Index limits the use of collected data and may lead to bias. Probability model-based imputation methods overcome such limitations but were never before applied to the WOMAC. In this study, we compare imputation results for the Expectation Maximization method (EM) and the mean imputation method for WOMAC in a cohort of total hip replacement patients. METHODS: WOMAC data on a consecutive cohort of 2062 patients scheduled for surgery were analyzed. Rates of missing values in each of the WOMAC items from this large cohort were used to create missing patterns in the subset of patients with complete data. EM and the WOMAC's method of imputation are then applied to fill the missing values. Summary score statistics for both methods are then described through box-plot and contrasted with the complete case (CC) analysis and the true score (TS). This process is repeated using a smaller sample size of 200 randomly drawn patients with higher missing rate (5 times the rates of missing values observed in the 2062 patients capped at 45%). RESULTS: Rate of missing values per item ranged from 2.9% to 14.5% and 1339 patients had complete data. Probability model-based EM imputed a score for all subjects while WOMAC's imputation method did not. Mean subscale scores were very similar for both imputation methods and were similar to the true score; however, the EM method results were more consistent with the TS after simulation. This difference became more pronounced as the number of items in a subscale increased and the sample size decreased. CONCLUSIONS: The EM method provides a better alternative to the WOMAC imputation method. The EM method is more accurate and imputes data to create a complete data set. These features are very valuable for patient-reported outcomes research in which resources are limited and the WOMAC score is used in a multivariate analysis. |
format | Online Article Text |
id | pubmed-3124424 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-31244242011-06-28 Is there a role for expectation maximization imputation in addressing missing data in research using WOMAC questionnaire? Comparison to the standard mean approach and a tutorial Ghomrawi, Hassan MK Mandl, Lisa A Rutledge, John Alexiades, Michael M Mazumdar, Madhu BMC Musculoskelet Disord Research Article BACKGROUND: Standard mean imputation for missing values in the Western Ontario and Mc Master (WOMAC) Osteoarthritis Index limits the use of collected data and may lead to bias. Probability model-based imputation methods overcome such limitations but were never before applied to the WOMAC. In this study, we compare imputation results for the Expectation Maximization method (EM) and the mean imputation method for WOMAC in a cohort of total hip replacement patients. METHODS: WOMAC data on a consecutive cohort of 2062 patients scheduled for surgery were analyzed. Rates of missing values in each of the WOMAC items from this large cohort were used to create missing patterns in the subset of patients with complete data. EM and the WOMAC's method of imputation are then applied to fill the missing values. Summary score statistics for both methods are then described through box-plot and contrasted with the complete case (CC) analysis and the true score (TS). This process is repeated using a smaller sample size of 200 randomly drawn patients with higher missing rate (5 times the rates of missing values observed in the 2062 patients capped at 45%). RESULTS: Rate of missing values per item ranged from 2.9% to 14.5% and 1339 patients had complete data. Probability model-based EM imputed a score for all subjects while WOMAC's imputation method did not. Mean subscale scores were very similar for both imputation methods and were similar to the true score; however, the EM method results were more consistent with the TS after simulation. This difference became more pronounced as the number of items in a subscale increased and the sample size decreased. CONCLUSIONS: The EM method provides a better alternative to the WOMAC imputation method. The EM method is more accurate and imputes data to create a complete data set. These features are very valuable for patient-reported outcomes research in which resources are limited and the WOMAC score is used in a multivariate analysis. BioMed Central 2011-05-23 /pmc/articles/PMC3124424/ /pubmed/21605458 http://dx.doi.org/10.1186/1471-2474-12-109 Text en Copyright ©2011 Ghomrawi et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Ghomrawi, Hassan MK Mandl, Lisa A Rutledge, John Alexiades, Michael M Mazumdar, Madhu Is there a role for expectation maximization imputation in addressing missing data in research using WOMAC questionnaire? Comparison to the standard mean approach and a tutorial |
title | Is there a role for expectation maximization imputation in addressing missing data in research using WOMAC questionnaire? Comparison to the standard mean approach and a tutorial |
title_full | Is there a role for expectation maximization imputation in addressing missing data in research using WOMAC questionnaire? Comparison to the standard mean approach and a tutorial |
title_fullStr | Is there a role for expectation maximization imputation in addressing missing data in research using WOMAC questionnaire? Comparison to the standard mean approach and a tutorial |
title_full_unstemmed | Is there a role for expectation maximization imputation in addressing missing data in research using WOMAC questionnaire? Comparison to the standard mean approach and a tutorial |
title_short | Is there a role for expectation maximization imputation in addressing missing data in research using WOMAC questionnaire? Comparison to the standard mean approach and a tutorial |
title_sort | is there a role for expectation maximization imputation in addressing missing data in research using womac questionnaire? comparison to the standard mean approach and a tutorial |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3124424/ https://www.ncbi.nlm.nih.gov/pubmed/21605458 http://dx.doi.org/10.1186/1471-2474-12-109 |
work_keys_str_mv | AT ghomrawihassanmk istherearoleforexpectationmaximizationimputationinaddressingmissingdatainresearchusingwomacquestionnairecomparisontothestandardmeanapproachandatutorial AT mandllisaa istherearoleforexpectationmaximizationimputationinaddressingmissingdatainresearchusingwomacquestionnairecomparisontothestandardmeanapproachandatutorial AT rutledgejohn istherearoleforexpectationmaximizationimputationinaddressingmissingdatainresearchusingwomacquestionnairecomparisontothestandardmeanapproachandatutorial AT alexiadesmichaelm istherearoleforexpectationmaximizationimputationinaddressingmissingdatainresearchusingwomacquestionnairecomparisontothestandardmeanapproachandatutorial AT mazumdarmadhu istherearoleforexpectationmaximizationimputationinaddressingmissingdatainresearchusingwomacquestionnairecomparisontothestandardmeanapproachandatutorial |