Cargando…

Multiple imputation approaches for handling incomplete three‐level data with time‐varying cluster‐memberships

Three‐level data arising from repeated measures on individuals clustered within higher‐level units are common in medical research. A complexity arises when individuals change clusters over time, resulting in a cross‐classified data structure. Missing values in these studies are commonly handled via...

Descripción completa

Detalles Bibliográficos
Autores principales: Wijesuriya, Rushani, Moreno‐Betancur, Margarita, Carlin, John, De Silva, Anurika Priyanjali, Lee, Katherine Jane
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley & Sons, Inc. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9540355/
https://www.ncbi.nlm.nih.gov/pubmed/35893317
http://dx.doi.org/10.1002/sim.9515
_version_ 1784803688586412032
author Wijesuriya, Rushani
Moreno‐Betancur, Margarita
Carlin, John
De Silva, Anurika Priyanjali
Lee, Katherine Jane
author_facet Wijesuriya, Rushani
Moreno‐Betancur, Margarita
Carlin, John
De Silva, Anurika Priyanjali
Lee, Katherine Jane
author_sort Wijesuriya, Rushani
collection PubMed
description Three‐level data arising from repeated measures on individuals clustered within higher‐level units are common in medical research. A complexity arises when individuals change clusters over time, resulting in a cross‐classified data structure. Missing values in these studies are commonly handled via multiple imputation (MI). If the three‐level, cross‐classified structure is modeled in the analysis, it also needs to be accommodated in the imputation model to ensure valid results. While incomplete three‐level data can be handled using various approaches within MI, the performance of these in the cross‐classified data setting remains unclear. We conducted simulations under a range of scenarios to compare these approaches in the context of an acute‐effects cross‐classified random effects substantive model, which models the time‐varying cluster membership via simple additive random effects. The simulation study was based on a case study in a longitudinal cohort of students clustered within schools. We evaluated methods that ignore the time‐varying cluster memberships by taking the first or most common cluster for each individual; pragmatic extensions of single‐ and two‐level MI approaches within the joint modeling (JM) and the fully conditional specification (FCS) frameworks, using dummy indicators (DI) and/or imputing repeated measures in wide format to account for the cross‐classified structure; and a three‐level FCS MI approach developed specifically for cross‐classified data. Results indicated that the FCS implementations performed well in terms of bias and precision while JM approaches performed poorly. Under both frameworks approaches using the DI extension should be used with caution in the presence of sparse data.
format Online
Article
Text
id pubmed-9540355
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher John Wiley & Sons, Inc.
record_format MEDLINE/PubMed
spelling pubmed-95403552022-10-14 Multiple imputation approaches for handling incomplete three‐level data with time‐varying cluster‐memberships Wijesuriya, Rushani Moreno‐Betancur, Margarita Carlin, John De Silva, Anurika Priyanjali Lee, Katherine Jane Stat Med Research Articles Three‐level data arising from repeated measures on individuals clustered within higher‐level units are common in medical research. A complexity arises when individuals change clusters over time, resulting in a cross‐classified data structure. Missing values in these studies are commonly handled via multiple imputation (MI). If the three‐level, cross‐classified structure is modeled in the analysis, it also needs to be accommodated in the imputation model to ensure valid results. While incomplete three‐level data can be handled using various approaches within MI, the performance of these in the cross‐classified data setting remains unclear. We conducted simulations under a range of scenarios to compare these approaches in the context of an acute‐effects cross‐classified random effects substantive model, which models the time‐varying cluster membership via simple additive random effects. The simulation study was based on a case study in a longitudinal cohort of students clustered within schools. We evaluated methods that ignore the time‐varying cluster memberships by taking the first or most common cluster for each individual; pragmatic extensions of single‐ and two‐level MI approaches within the joint modeling (JM) and the fully conditional specification (FCS) frameworks, using dummy indicators (DI) and/or imputing repeated measures in wide format to account for the cross‐classified structure; and a three‐level FCS MI approach developed specifically for cross‐classified data. Results indicated that the FCS implementations performed well in terms of bias and precision while JM approaches performed poorly. Under both frameworks approaches using the DI extension should be used with caution in the presence of sparse data. John Wiley & Sons, Inc. 2022-07-27 2022-09-30 /pmc/articles/PMC9540355/ /pubmed/35893317 http://dx.doi.org/10.1002/sim.9515 Text en © 2022 The Authors. Statistics in Medicine published by John Wiley & Sons Ltd. https://creativecommons.org/licenses/by-nc/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited and is not used for commercial purposes.
spellingShingle Research Articles
Wijesuriya, Rushani
Moreno‐Betancur, Margarita
Carlin, John
De Silva, Anurika Priyanjali
Lee, Katherine Jane
Multiple imputation approaches for handling incomplete three‐level data with time‐varying cluster‐memberships
title Multiple imputation approaches for handling incomplete three‐level data with time‐varying cluster‐memberships
title_full Multiple imputation approaches for handling incomplete three‐level data with time‐varying cluster‐memberships
title_fullStr Multiple imputation approaches for handling incomplete three‐level data with time‐varying cluster‐memberships
title_full_unstemmed Multiple imputation approaches for handling incomplete three‐level data with time‐varying cluster‐memberships
title_short Multiple imputation approaches for handling incomplete three‐level data with time‐varying cluster‐memberships
title_sort multiple imputation approaches for handling incomplete three‐level data with time‐varying cluster‐memberships
topic Research Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9540355/
https://www.ncbi.nlm.nih.gov/pubmed/35893317
http://dx.doi.org/10.1002/sim.9515
work_keys_str_mv AT wijesuriyarushani multipleimputationapproachesforhandlingincompletethreeleveldatawithtimevaryingclustermemberships
AT morenobetancurmargarita multipleimputationapproachesforhandlingincompletethreeleveldatawithtimevaryingclustermemberships
AT carlinjohn multipleimputationapproachesforhandlingincompletethreeleveldatawithtimevaryingclustermemberships
AT desilvaanurikapriyanjali multipleimputationapproachesforhandlingincompletethreeleveldatawithtimevaryingclustermemberships
AT leekatherinejane multipleimputationapproachesforhandlingincompletethreeleveldatawithtimevaryingclustermemberships