Cargando…
The Impact of Missing Data and Imputation Methods on the Analysis of 24-Hour Activity Patterns
The purpose of this study is to characterize the impact of the timing and duration of missing actigraphy data on interdaily stability (IS) and intradaily variability (IV) calculation. The performance of three missing data imputation methods (linear interpolation, mean time of day (ToD), and median T...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9590093/ https://www.ncbi.nlm.nih.gov/pubmed/36278532 http://dx.doi.org/10.3390/clockssleep4040039 |
_version_ | 1784814441475342336 |
---|---|
author | Weed, Lara Lok, Renske Chawra, Dwijen Zeitzer, Jamie |
author_facet | Weed, Lara Lok, Renske Chawra, Dwijen Zeitzer, Jamie |
author_sort | Weed, Lara |
collection | PubMed |
description | The purpose of this study is to characterize the impact of the timing and duration of missing actigraphy data on interdaily stability (IS) and intradaily variability (IV) calculation. The performance of three missing data imputation methods (linear interpolation, mean time of day (ToD), and median ToD imputation) for estimating IV and IS was also tested. Week-long actigraphy records with no non-wear or missing timeseries data were masked with zeros or ‘Not a Number’ (NaN) across a range of timings and durations for single and multiple missing data bouts. IV and IS were calculated for true, masked, and imputed (i.e., linear interpolation, mean ToD and, median ToD imputation) timeseries data and used to generate Bland–Alman plots for each condition. Heatmaps were used to analyze the impact of timings and durations of and between bouts. Simulated missing data produced deviations in IV and IS for longer durations, midday crossings, and during similar timing on consecutive days. Median ToD imputation produced the least deviation among the imputation methods. Median ToD imputation is recommended to recapitulate IV and IS under missing data conditions for less than 24 h. |
format | Online Article Text |
id | pubmed-9590093 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-95900932022-10-25 The Impact of Missing Data and Imputation Methods on the Analysis of 24-Hour Activity Patterns Weed, Lara Lok, Renske Chawra, Dwijen Zeitzer, Jamie Clocks Sleep Article The purpose of this study is to characterize the impact of the timing and duration of missing actigraphy data on interdaily stability (IS) and intradaily variability (IV) calculation. The performance of three missing data imputation methods (linear interpolation, mean time of day (ToD), and median ToD imputation) for estimating IV and IS was also tested. Week-long actigraphy records with no non-wear or missing timeseries data were masked with zeros or ‘Not a Number’ (NaN) across a range of timings and durations for single and multiple missing data bouts. IV and IS were calculated for true, masked, and imputed (i.e., linear interpolation, mean ToD and, median ToD imputation) timeseries data and used to generate Bland–Alman plots for each condition. Heatmaps were used to analyze the impact of timings and durations of and between bouts. Simulated missing data produced deviations in IV and IS for longer durations, midday crossings, and during similar timing on consecutive days. Median ToD imputation produced the least deviation among the imputation methods. Median ToD imputation is recommended to recapitulate IV and IS under missing data conditions for less than 24 h. MDPI 2022-09-27 /pmc/articles/PMC9590093/ /pubmed/36278532 http://dx.doi.org/10.3390/clockssleep4040039 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Weed, Lara Lok, Renske Chawra, Dwijen Zeitzer, Jamie The Impact of Missing Data and Imputation Methods on the Analysis of 24-Hour Activity Patterns |
title | The Impact of Missing Data and Imputation Methods on the Analysis of 24-Hour Activity Patterns |
title_full | The Impact of Missing Data and Imputation Methods on the Analysis of 24-Hour Activity Patterns |
title_fullStr | The Impact of Missing Data and Imputation Methods on the Analysis of 24-Hour Activity Patterns |
title_full_unstemmed | The Impact of Missing Data and Imputation Methods on the Analysis of 24-Hour Activity Patterns |
title_short | The Impact of Missing Data and Imputation Methods on the Analysis of 24-Hour Activity Patterns |
title_sort | impact of missing data and imputation methods on the analysis of 24-hour activity patterns |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9590093/ https://www.ncbi.nlm.nih.gov/pubmed/36278532 http://dx.doi.org/10.3390/clockssleep4040039 |
work_keys_str_mv | AT weedlara theimpactofmissingdataandimputationmethodsontheanalysisof24houractivitypatterns AT lokrenske theimpactofmissingdataandimputationmethodsontheanalysisof24houractivitypatterns AT chawradwijen theimpactofmissingdataandimputationmethodsontheanalysisof24houractivitypatterns AT zeitzerjamie theimpactofmissingdataandimputationmethodsontheanalysisof24houractivitypatterns AT weedlara impactofmissingdataandimputationmethodsontheanalysisof24houractivitypatterns AT lokrenske impactofmissingdataandimputationmethodsontheanalysisof24houractivitypatterns AT chawradwijen impactofmissingdataandimputationmethodsontheanalysisof24houractivitypatterns AT zeitzerjamie impactofmissingdataandimputationmethodsontheanalysisof24houractivitypatterns |