Cargando…

Multiple imputation using chained equations for missing data in survival models: applied to multidrug-resistant tuberculosis and HIV data

Background. Missing data are a prevalent problem in almost all types of data analyses, such as survival data analysis. Objective. To evaluate the performance of multivariable imputation via chained equations in determining the factors that affect the survival of multidrug-resistant-tuberculosis (MDR...

Descripción completa

Detalles Bibliográficos
Autores principales: Mbona, Sizwe Vincent, Ndlovu, Principal, Mwambi, Henry, Ramroop, Shaun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PAGEPress Publications, Pavia, Italy 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10519120/
https://www.ncbi.nlm.nih.gov/pubmed/37753435
http://dx.doi.org/10.4081/jphia.2023.2388
_version_ 1785109643550261248
author Mbona, Sizwe Vincent
Ndlovu, Principal
Mwambi, Henry
Ramroop, Shaun
author_facet Mbona, Sizwe Vincent
Ndlovu, Principal
Mwambi, Henry
Ramroop, Shaun
author_sort Mbona, Sizwe Vincent
collection PubMed
description Background. Missing data are a prevalent problem in almost all types of data analyses, such as survival data analysis. Objective. To evaluate the performance of multivariable imputation via chained equations in determining the factors that affect the survival of multidrug-resistant-tuberculosis (MDR-TB) and HIV-coinfected patients in KwaZulu-Natal. Materials and Methods. Secondary data from 1542 multidrug-resistant tuberculosis patients were used in this study. First, data from patients with some missing observations were deleted from the original data set to obtain the complete case (CC) data set. Second, missing observations in the original data set were imputed 15 times to obtain complete data sets using a multivariable imputation case (MIC). The Cox regression model was fitted to both the CC and MIC data, and the results were compared using the model goodness of fit criteria [likelihood ratio tests, Akaike information criterion (AIC), and Bayesian Information Criterion (BIC)]. Results. The Cox regression model fitted the MIC data set better (likelihood ratio test statistic =76.88 on 10 df with P<0.01, AIC =1040.90, and BIC =1099.65) than the CC data set (likelihood ratio test statistic =42.68 on 10 df with P<0.01, AIC =1186.05 and BIC =1228.47). Variables that were insignificant when the model was fitted to the CC data set became significant when the model was fitted to the MIC data set. Conclusion. Correcting missing data using multiple imputation techniques for the MDR-TB problem is recommended. This approach led to better estimates and more power in the model.
format Online
Article
Text
id pubmed-10519120
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher PAGEPress Publications, Pavia, Italy
record_format MEDLINE/PubMed
spelling pubmed-105191202023-09-26 Multiple imputation using chained equations for missing data in survival models: applied to multidrug-resistant tuberculosis and HIV data Mbona, Sizwe Vincent Ndlovu, Principal Mwambi, Henry Ramroop, Shaun J Public Health Afr Article Background. Missing data are a prevalent problem in almost all types of data analyses, such as survival data analysis. Objective. To evaluate the performance of multivariable imputation via chained equations in determining the factors that affect the survival of multidrug-resistant-tuberculosis (MDR-TB) and HIV-coinfected patients in KwaZulu-Natal. Materials and Methods. Secondary data from 1542 multidrug-resistant tuberculosis patients were used in this study. First, data from patients with some missing observations were deleted from the original data set to obtain the complete case (CC) data set. Second, missing observations in the original data set were imputed 15 times to obtain complete data sets using a multivariable imputation case (MIC). The Cox regression model was fitted to both the CC and MIC data, and the results were compared using the model goodness of fit criteria [likelihood ratio tests, Akaike information criterion (AIC), and Bayesian Information Criterion (BIC)]. Results. The Cox regression model fitted the MIC data set better (likelihood ratio test statistic =76.88 on 10 df with P<0.01, AIC =1040.90, and BIC =1099.65) than the CC data set (likelihood ratio test statistic =42.68 on 10 df with P<0.01, AIC =1186.05 and BIC =1228.47). Variables that were insignificant when the model was fitted to the CC data set became significant when the model was fitted to the MIC data set. Conclusion. Correcting missing data using multiple imputation techniques for the MDR-TB problem is recommended. This approach led to better estimates and more power in the model. PAGEPress Publications, Pavia, Italy 2023-06-05 /pmc/articles/PMC10519120/ /pubmed/37753435 http://dx.doi.org/10.4081/jphia.2023.2388 Text en Copyright © 2023, the Author(s) https://creativecommons.org/licenses/by-nc/4.0/This work is licensed under a Creative Commons Attribution NonCommercial 4.0 License (CC BY-NC 4.0).
spellingShingle Article
Mbona, Sizwe Vincent
Ndlovu, Principal
Mwambi, Henry
Ramroop, Shaun
Multiple imputation using chained equations for missing data in survival models: applied to multidrug-resistant tuberculosis and HIV data
title Multiple imputation using chained equations for missing data in survival models: applied to multidrug-resistant tuberculosis and HIV data
title_full Multiple imputation using chained equations for missing data in survival models: applied to multidrug-resistant tuberculosis and HIV data
title_fullStr Multiple imputation using chained equations for missing data in survival models: applied to multidrug-resistant tuberculosis and HIV data
title_full_unstemmed Multiple imputation using chained equations for missing data in survival models: applied to multidrug-resistant tuberculosis and HIV data
title_short Multiple imputation using chained equations for missing data in survival models: applied to multidrug-resistant tuberculosis and HIV data
title_sort multiple imputation using chained equations for missing data in survival models: applied to multidrug-resistant tuberculosis and hiv data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10519120/
https://www.ncbi.nlm.nih.gov/pubmed/37753435
http://dx.doi.org/10.4081/jphia.2023.2388
work_keys_str_mv AT mbonasizwevincent multipleimputationusingchainedequationsformissingdatainsurvivalmodelsappliedtomultidrugresistanttuberculosisandhivdata
AT ndlovuprincipal multipleimputationusingchainedequationsformissingdatainsurvivalmodelsappliedtomultidrugresistanttuberculosisandhivdata
AT mwambihenry multipleimputationusingchainedequationsformissingdatainsurvivalmodelsappliedtomultidrugresistanttuberculosisandhivdata
AT ramroopshaun multipleimputationusingchainedequationsformissingdatainsurvivalmodelsappliedtomultidrugresistanttuberculosisandhivdata