Cargando…

Effects of Data Aggregation on Time Series Analysis of Seasonal Infections

Time series analysis in epidemiological studies is typically conducted on aggregated counts, although data tend to be collected at finer temporal resolutions. The decision to aggregate data is rarely discussed in epidemiological literature although it has been shown to impact model results. We prese...

Descripción completa

Detalles Bibliográficos
Autores principales: Alarcon Falconi, Tania M., Estrella, Bertha, Sempértegui, Fernando, Naumova, Elena N.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7460497/
https://www.ncbi.nlm.nih.gov/pubmed/32823719
http://dx.doi.org/10.3390/ijerph17165887
_version_ 1783576615389757440
author Alarcon Falconi, Tania M.
Estrella, Bertha
Sempértegui, Fernando
Naumova, Elena N.
author_facet Alarcon Falconi, Tania M.
Estrella, Bertha
Sempértegui, Fernando
Naumova, Elena N.
author_sort Alarcon Falconi, Tania M.
collection PubMed
description Time series analysis in epidemiological studies is typically conducted on aggregated counts, although data tend to be collected at finer temporal resolutions. The decision to aggregate data is rarely discussed in epidemiological literature although it has been shown to impact model results. We present a critical thinking process for making decisions about data aggregation in time series analysis of seasonal infections. We systematically build a harmonic regression model to characterize peak timing and amplitude of three respiratory and enteric infections that have different seasonal patterns and incidence. We show that irregularities introduced when aggregating data must be controlled during modeling to prevent erroneous results. Aggregation irregularities had a minimal impact on the estimates of trend, amplitude, and peak timing for daily and weekly data regardless of the disease. However, estimates of peak timing of the more common infections changed by as much as 2.5 months when controlling for monthly data irregularities. Building a systematic model that controls for data irregularities is essential to accurately characterize temporal patterns of infections. With the urgent need to characterize temporal patterns of novel infections, such as COVID-19, this tutorial is timely and highly valuable for experts in many disciplines.
format Online
Article
Text
id pubmed-7460497
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-74604972020-09-03 Effects of Data Aggregation on Time Series Analysis of Seasonal Infections Alarcon Falconi, Tania M. Estrella, Bertha Sempértegui, Fernando Naumova, Elena N. Int J Environ Res Public Health Article Time series analysis in epidemiological studies is typically conducted on aggregated counts, although data tend to be collected at finer temporal resolutions. The decision to aggregate data is rarely discussed in epidemiological literature although it has been shown to impact model results. We present a critical thinking process for making decisions about data aggregation in time series analysis of seasonal infections. We systematically build a harmonic regression model to characterize peak timing and amplitude of three respiratory and enteric infections that have different seasonal patterns and incidence. We show that irregularities introduced when aggregating data must be controlled during modeling to prevent erroneous results. Aggregation irregularities had a minimal impact on the estimates of trend, amplitude, and peak timing for daily and weekly data regardless of the disease. However, estimates of peak timing of the more common infections changed by as much as 2.5 months when controlling for monthly data irregularities. Building a systematic model that controls for data irregularities is essential to accurately characterize temporal patterns of infections. With the urgent need to characterize temporal patterns of novel infections, such as COVID-19, this tutorial is timely and highly valuable for experts in many disciplines. MDPI 2020-08-13 2020-08 /pmc/articles/PMC7460497/ /pubmed/32823719 http://dx.doi.org/10.3390/ijerph17165887 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Alarcon Falconi, Tania M.
Estrella, Bertha
Sempértegui, Fernando
Naumova, Elena N.
Effects of Data Aggregation on Time Series Analysis of Seasonal Infections
title Effects of Data Aggregation on Time Series Analysis of Seasonal Infections
title_full Effects of Data Aggregation on Time Series Analysis of Seasonal Infections
title_fullStr Effects of Data Aggregation on Time Series Analysis of Seasonal Infections
title_full_unstemmed Effects of Data Aggregation on Time Series Analysis of Seasonal Infections
title_short Effects of Data Aggregation on Time Series Analysis of Seasonal Infections
title_sort effects of data aggregation on time series analysis of seasonal infections
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7460497/
https://www.ncbi.nlm.nih.gov/pubmed/32823719
http://dx.doi.org/10.3390/ijerph17165887
work_keys_str_mv AT alarconfalconitaniam effectsofdataaggregationontimeseriesanalysisofseasonalinfections
AT estrellabertha effectsofdataaggregationontimeseriesanalysisofseasonalinfections
AT semperteguifernando effectsofdataaggregationontimeseriesanalysisofseasonalinfections
AT naumovaelenan effectsofdataaggregationontimeseriesanalysisofseasonalinfections