Cargando…

The Impact of Missing Data and Imputation Methods on the Analysis of 24-Hour Activity Patterns

The purpose of this study is to characterize the impact of the timing and duration of missing actigraphy data on interdaily stability (IS) and intradaily variability (IV) calculation. The performance of three missing data imputation methods (linear interpolation, mean time of day (ToD), and median T...

Descripción completa

Detalles Bibliográficos
Autores principales: Weed, Lara, Lok, Renske, Chawra, Dwijen, Zeitzer, Jamie
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9590093/
https://www.ncbi.nlm.nih.gov/pubmed/36278532
http://dx.doi.org/10.3390/clockssleep4040039
_version_ 1784814441475342336
author Weed, Lara
Lok, Renske
Chawra, Dwijen
Zeitzer, Jamie
author_facet Weed, Lara
Lok, Renske
Chawra, Dwijen
Zeitzer, Jamie
author_sort Weed, Lara
collection PubMed
description The purpose of this study is to characterize the impact of the timing and duration of missing actigraphy data on interdaily stability (IS) and intradaily variability (IV) calculation. The performance of three missing data imputation methods (linear interpolation, mean time of day (ToD), and median ToD imputation) for estimating IV and IS was also tested. Week-long actigraphy records with no non-wear or missing timeseries data were masked with zeros or ‘Not a Number’ (NaN) across a range of timings and durations for single and multiple missing data bouts. IV and IS were calculated for true, masked, and imputed (i.e., linear interpolation, mean ToD and, median ToD imputation) timeseries data and used to generate Bland–Alman plots for each condition. Heatmaps were used to analyze the impact of timings and durations of and between bouts. Simulated missing data produced deviations in IV and IS for longer durations, midday crossings, and during similar timing on consecutive days. Median ToD imputation produced the least deviation among the imputation methods. Median ToD imputation is recommended to recapitulate IV and IS under missing data conditions for less than 24 h.
format Online
Article
Text
id pubmed-9590093
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-95900932022-10-25 The Impact of Missing Data and Imputation Methods on the Analysis of 24-Hour Activity Patterns Weed, Lara Lok, Renske Chawra, Dwijen Zeitzer, Jamie Clocks Sleep Article The purpose of this study is to characterize the impact of the timing and duration of missing actigraphy data on interdaily stability (IS) and intradaily variability (IV) calculation. The performance of three missing data imputation methods (linear interpolation, mean time of day (ToD), and median ToD imputation) for estimating IV and IS was also tested. Week-long actigraphy records with no non-wear or missing timeseries data were masked with zeros or ‘Not a Number’ (NaN) across a range of timings and durations for single and multiple missing data bouts. IV and IS were calculated for true, masked, and imputed (i.e., linear interpolation, mean ToD and, median ToD imputation) timeseries data and used to generate Bland–Alman plots for each condition. Heatmaps were used to analyze the impact of timings and durations of and between bouts. Simulated missing data produced deviations in IV and IS for longer durations, midday crossings, and during similar timing on consecutive days. Median ToD imputation produced the least deviation among the imputation methods. Median ToD imputation is recommended to recapitulate IV and IS under missing data conditions for less than 24 h. MDPI 2022-09-27 /pmc/articles/PMC9590093/ /pubmed/36278532 http://dx.doi.org/10.3390/clockssleep4040039 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Weed, Lara
Lok, Renske
Chawra, Dwijen
Zeitzer, Jamie
The Impact of Missing Data and Imputation Methods on the Analysis of 24-Hour Activity Patterns
title The Impact of Missing Data and Imputation Methods on the Analysis of 24-Hour Activity Patterns
title_full The Impact of Missing Data and Imputation Methods on the Analysis of 24-Hour Activity Patterns
title_fullStr The Impact of Missing Data and Imputation Methods on the Analysis of 24-Hour Activity Patterns
title_full_unstemmed The Impact of Missing Data and Imputation Methods on the Analysis of 24-Hour Activity Patterns
title_short The Impact of Missing Data and Imputation Methods on the Analysis of 24-Hour Activity Patterns
title_sort impact of missing data and imputation methods on the analysis of 24-hour activity patterns
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9590093/
https://www.ncbi.nlm.nih.gov/pubmed/36278532
http://dx.doi.org/10.3390/clockssleep4040039
work_keys_str_mv AT weedlara theimpactofmissingdataandimputationmethodsontheanalysisof24houractivitypatterns
AT lokrenske theimpactofmissingdataandimputationmethodsontheanalysisof24houractivitypatterns
AT chawradwijen theimpactofmissingdataandimputationmethodsontheanalysisof24houractivitypatterns
AT zeitzerjamie theimpactofmissingdataandimputationmethodsontheanalysisof24houractivitypatterns
AT weedlara impactofmissingdataandimputationmethodsontheanalysisof24houractivitypatterns
AT lokrenske impactofmissingdataandimputationmethodsontheanalysisof24houractivitypatterns
AT chawradwijen impactofmissingdataandimputationmethodsontheanalysisof24houractivitypatterns
AT zeitzerjamie impactofmissingdataandimputationmethodsontheanalysisof24houractivitypatterns