Cargando…
Multiple imputation approaches for handling incomplete three‐level data with time‐varying cluster‐memberships
Three‐level data arising from repeated measures on individuals clustered within higher‐level units are common in medical research. A complexity arises when individuals change clusters over time, resulting in a cross‐classified data structure. Missing values in these studies are commonly handled via...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
John Wiley & Sons, Inc.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9540355/ https://www.ncbi.nlm.nih.gov/pubmed/35893317 http://dx.doi.org/10.1002/sim.9515 |
_version_ | 1784803688586412032 |
---|---|
author | Wijesuriya, Rushani Moreno‐Betancur, Margarita Carlin, John De Silva, Anurika Priyanjali Lee, Katherine Jane |
author_facet | Wijesuriya, Rushani Moreno‐Betancur, Margarita Carlin, John De Silva, Anurika Priyanjali Lee, Katherine Jane |
author_sort | Wijesuriya, Rushani |
collection | PubMed |
description | Three‐level data arising from repeated measures on individuals clustered within higher‐level units are common in medical research. A complexity arises when individuals change clusters over time, resulting in a cross‐classified data structure. Missing values in these studies are commonly handled via multiple imputation (MI). If the three‐level, cross‐classified structure is modeled in the analysis, it also needs to be accommodated in the imputation model to ensure valid results. While incomplete three‐level data can be handled using various approaches within MI, the performance of these in the cross‐classified data setting remains unclear. We conducted simulations under a range of scenarios to compare these approaches in the context of an acute‐effects cross‐classified random effects substantive model, which models the time‐varying cluster membership via simple additive random effects. The simulation study was based on a case study in a longitudinal cohort of students clustered within schools. We evaluated methods that ignore the time‐varying cluster memberships by taking the first or most common cluster for each individual; pragmatic extensions of single‐ and two‐level MI approaches within the joint modeling (JM) and the fully conditional specification (FCS) frameworks, using dummy indicators (DI) and/or imputing repeated measures in wide format to account for the cross‐classified structure; and a three‐level FCS MI approach developed specifically for cross‐classified data. Results indicated that the FCS implementations performed well in terms of bias and precision while JM approaches performed poorly. Under both frameworks approaches using the DI extension should be used with caution in the presence of sparse data. |
format | Online Article Text |
id | pubmed-9540355 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | John Wiley & Sons, Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-95403552022-10-14 Multiple imputation approaches for handling incomplete three‐level data with time‐varying cluster‐memberships Wijesuriya, Rushani Moreno‐Betancur, Margarita Carlin, John De Silva, Anurika Priyanjali Lee, Katherine Jane Stat Med Research Articles Three‐level data arising from repeated measures on individuals clustered within higher‐level units are common in medical research. A complexity arises when individuals change clusters over time, resulting in a cross‐classified data structure. Missing values in these studies are commonly handled via multiple imputation (MI). If the three‐level, cross‐classified structure is modeled in the analysis, it also needs to be accommodated in the imputation model to ensure valid results. While incomplete three‐level data can be handled using various approaches within MI, the performance of these in the cross‐classified data setting remains unclear. We conducted simulations under a range of scenarios to compare these approaches in the context of an acute‐effects cross‐classified random effects substantive model, which models the time‐varying cluster membership via simple additive random effects. The simulation study was based on a case study in a longitudinal cohort of students clustered within schools. We evaluated methods that ignore the time‐varying cluster memberships by taking the first or most common cluster for each individual; pragmatic extensions of single‐ and two‐level MI approaches within the joint modeling (JM) and the fully conditional specification (FCS) frameworks, using dummy indicators (DI) and/or imputing repeated measures in wide format to account for the cross‐classified structure; and a three‐level FCS MI approach developed specifically for cross‐classified data. Results indicated that the FCS implementations performed well in terms of bias and precision while JM approaches performed poorly. Under both frameworks approaches using the DI extension should be used with caution in the presence of sparse data. John Wiley & Sons, Inc. 2022-07-27 2022-09-30 /pmc/articles/PMC9540355/ /pubmed/35893317 http://dx.doi.org/10.1002/sim.9515 Text en © 2022 The Authors. Statistics in Medicine published by John Wiley & Sons Ltd. https://creativecommons.org/licenses/by-nc/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited and is not used for commercial purposes. |
spellingShingle | Research Articles Wijesuriya, Rushani Moreno‐Betancur, Margarita Carlin, John De Silva, Anurika Priyanjali Lee, Katherine Jane Multiple imputation approaches for handling incomplete three‐level data with time‐varying cluster‐memberships |
title | Multiple imputation approaches for handling incomplete three‐level data with time‐varying cluster‐memberships |
title_full | Multiple imputation approaches for handling incomplete three‐level data with time‐varying cluster‐memberships |
title_fullStr | Multiple imputation approaches for handling incomplete three‐level data with time‐varying cluster‐memberships |
title_full_unstemmed | Multiple imputation approaches for handling incomplete three‐level data with time‐varying cluster‐memberships |
title_short | Multiple imputation approaches for handling incomplete three‐level data with time‐varying cluster‐memberships |
title_sort | multiple imputation approaches for handling incomplete three‐level data with time‐varying cluster‐memberships |
topic | Research Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9540355/ https://www.ncbi.nlm.nih.gov/pubmed/35893317 http://dx.doi.org/10.1002/sim.9515 |
work_keys_str_mv | AT wijesuriyarushani multipleimputationapproachesforhandlingincompletethreeleveldatawithtimevaryingclustermemberships AT morenobetancurmargarita multipleimputationapproachesforhandlingincompletethreeleveldatawithtimevaryingclustermemberships AT carlinjohn multipleimputationapproachesforhandlingincompletethreeleveldatawithtimevaryingclustermemberships AT desilvaanurikapriyanjali multipleimputationapproachesforhandlingincompletethreeleveldatawithtimevaryingclustermemberships AT leekatherinejane multipleimputationapproachesforhandlingincompletethreeleveldatawithtimevaryingclustermemberships |