Cargando…

Methods for handling missing data in serially sampled sputum specimens for mycobacterial culture conversion calculation

BACKGROUND: The occurrence and timing of mycobacterial culture conversion is used as a proxy for tuberculosis treatment response. When researchers serially sample sputum during tuberculosis studies, contamination or missed visits leads to missing data points. Traditionally, this is managed by ignori...

Descripción completa

Detalles Bibliográficos
Autores principales: Malatesta, Samantha, Weir, Isabelle R., Weber, Sarah E., Bouton, Tara C., Carney, Tara, Theron, Danie, Myers, Bronwyn, Horsburgh, C. Robert, Warren, Robin M., Jacobson, Karen R., White, Laura F.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9675206/
https://www.ncbi.nlm.nih.gov/pubmed/36402979
http://dx.doi.org/10.1186/s12874-022-01782-8
_version_ 1784833320958296064
author Malatesta, Samantha
Weir, Isabelle R.
Weber, Sarah E.
Bouton, Tara C.
Carney, Tara
Theron, Danie
Myers, Bronwyn
Horsburgh, C. Robert
Warren, Robin M.
Jacobson, Karen R.
White, Laura F.
author_facet Malatesta, Samantha
Weir, Isabelle R.
Weber, Sarah E.
Bouton, Tara C.
Carney, Tara
Theron, Danie
Myers, Bronwyn
Horsburgh, C. Robert
Warren, Robin M.
Jacobson, Karen R.
White, Laura F.
author_sort Malatesta, Samantha
collection PubMed
description BACKGROUND: The occurrence and timing of mycobacterial culture conversion is used as a proxy for tuberculosis treatment response. When researchers serially sample sputum during tuberculosis studies, contamination or missed visits leads to missing data points. Traditionally, this is managed by ignoring missing data or simple carry-forward techniques. Statistically advanced multiple imputation methods potentially decrease bias and retain sample size and statistical power. METHODS: We analyzed data from 261 participants who provided weekly sputa for the first 12 weeks of tuberculosis treatment. We compared methods for handling missing data points in a longitudinal study with a time-to-event outcome. Our primary outcome was time to culture conversion, defined as two consecutive weeks with no Mycobacterium tuberculosis growth. Methods used to address missing data included: 1) available case analysis, 2) last observation carried forward, and 3) multiple imputation by fully conditional specification. For each method, we calculated the proportion culture converted and used survival analysis to estimate Kaplan-Meier curves, hazard ratios, and restricted mean survival times. We compared methods based on point estimates, confidence intervals, and conclusions to specific research questions. RESULTS: The three missing data methods lead to differences in the number of participants achieving conversion; 78 (32.8%) participants converted with available case analysis, 154 (64.7%) converted with last observation carried forward, and 184 (77.1%) converted with multiple imputation. Multiple imputation resulted in smaller point estimates than simple approaches with narrower confidence intervals. The adjusted hazard ratio for smear negative participants was 3.4 (95% CI 2.3, 5.1) using multiple imputation compared to 5.2 (95% CI 3.1, 8.7) using last observation carried forward and 5.0 (95% CI 2.4, 10.6) using available case analysis. CONCLUSION: We showed that accounting for missing sputum data through multiple imputation, a statistically valid approach under certain conditions, can lead to different conclusions than naïve methods. Careful consideration for how to handle missing data must be taken and be pre-specified prior to analysis. We used data from a TB study to demonstrate these concepts, however, the methods we described are broadly applicable to longitudinal missing data. We provide valuable statistical guidance and code for researchers to appropriately handle missing data in longitudinal studies. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12874-022-01782-8.
format Online
Article
Text
id pubmed-9675206
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-96752062022-11-20 Methods for handling missing data in serially sampled sputum specimens for mycobacterial culture conversion calculation Malatesta, Samantha Weir, Isabelle R. Weber, Sarah E. Bouton, Tara C. Carney, Tara Theron, Danie Myers, Bronwyn Horsburgh, C. Robert Warren, Robin M. Jacobson, Karen R. White, Laura F. BMC Med Res Methodol Research BACKGROUND: The occurrence and timing of mycobacterial culture conversion is used as a proxy for tuberculosis treatment response. When researchers serially sample sputum during tuberculosis studies, contamination or missed visits leads to missing data points. Traditionally, this is managed by ignoring missing data or simple carry-forward techniques. Statistically advanced multiple imputation methods potentially decrease bias and retain sample size and statistical power. METHODS: We analyzed data from 261 participants who provided weekly sputa for the first 12 weeks of tuberculosis treatment. We compared methods for handling missing data points in a longitudinal study with a time-to-event outcome. Our primary outcome was time to culture conversion, defined as two consecutive weeks with no Mycobacterium tuberculosis growth. Methods used to address missing data included: 1) available case analysis, 2) last observation carried forward, and 3) multiple imputation by fully conditional specification. For each method, we calculated the proportion culture converted and used survival analysis to estimate Kaplan-Meier curves, hazard ratios, and restricted mean survival times. We compared methods based on point estimates, confidence intervals, and conclusions to specific research questions. RESULTS: The three missing data methods lead to differences in the number of participants achieving conversion; 78 (32.8%) participants converted with available case analysis, 154 (64.7%) converted with last observation carried forward, and 184 (77.1%) converted with multiple imputation. Multiple imputation resulted in smaller point estimates than simple approaches with narrower confidence intervals. The adjusted hazard ratio for smear negative participants was 3.4 (95% CI 2.3, 5.1) using multiple imputation compared to 5.2 (95% CI 3.1, 8.7) using last observation carried forward and 5.0 (95% CI 2.4, 10.6) using available case analysis. CONCLUSION: We showed that accounting for missing sputum data through multiple imputation, a statistically valid approach under certain conditions, can lead to different conclusions than naïve methods. Careful consideration for how to handle missing data must be taken and be pre-specified prior to analysis. We used data from a TB study to demonstrate these concepts, however, the methods we described are broadly applicable to longitudinal missing data. We provide valuable statistical guidance and code for researchers to appropriately handle missing data in longitudinal studies. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12874-022-01782-8. BioMed Central 2022-11-19 /pmc/articles/PMC9675206/ /pubmed/36402979 http://dx.doi.org/10.1186/s12874-022-01782-8 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Malatesta, Samantha
Weir, Isabelle R.
Weber, Sarah E.
Bouton, Tara C.
Carney, Tara
Theron, Danie
Myers, Bronwyn
Horsburgh, C. Robert
Warren, Robin M.
Jacobson, Karen R.
White, Laura F.
Methods for handling missing data in serially sampled sputum specimens for mycobacterial culture conversion calculation
title Methods for handling missing data in serially sampled sputum specimens for mycobacterial culture conversion calculation
title_full Methods for handling missing data in serially sampled sputum specimens for mycobacterial culture conversion calculation
title_fullStr Methods for handling missing data in serially sampled sputum specimens for mycobacterial culture conversion calculation
title_full_unstemmed Methods for handling missing data in serially sampled sputum specimens for mycobacterial culture conversion calculation
title_short Methods for handling missing data in serially sampled sputum specimens for mycobacterial culture conversion calculation
title_sort methods for handling missing data in serially sampled sputum specimens for mycobacterial culture conversion calculation
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9675206/
https://www.ncbi.nlm.nih.gov/pubmed/36402979
http://dx.doi.org/10.1186/s12874-022-01782-8
work_keys_str_mv AT malatestasamantha methodsforhandlingmissingdatainseriallysampledsputumspecimensformycobacterialcultureconversioncalculation
AT weirisabeller methodsforhandlingmissingdatainseriallysampledsputumspecimensformycobacterialcultureconversioncalculation
AT webersarahe methodsforhandlingmissingdatainseriallysampledsputumspecimensformycobacterialcultureconversioncalculation
AT boutontarac methodsforhandlingmissingdatainseriallysampledsputumspecimensformycobacterialcultureconversioncalculation
AT carneytara methodsforhandlingmissingdatainseriallysampledsputumspecimensformycobacterialcultureconversioncalculation
AT therondanie methodsforhandlingmissingdatainseriallysampledsputumspecimensformycobacterialcultureconversioncalculation
AT myersbronwyn methodsforhandlingmissingdatainseriallysampledsputumspecimensformycobacterialcultureconversioncalculation
AT horsburghcrobert methodsforhandlingmissingdatainseriallysampledsputumspecimensformycobacterialcultureconversioncalculation
AT warrenrobinm methodsforhandlingmissingdatainseriallysampledsputumspecimensformycobacterialcultureconversioncalculation
AT jacobsonkarenr methodsforhandlingmissingdatainseriallysampledsputumspecimensformycobacterialcultureconversioncalculation
AT whitelauraf methodsforhandlingmissingdatainseriallysampledsputumspecimensformycobacterialcultureconversioncalculation