Cargando…
Effects of Data Aggregation on Time Series Analysis of Seasonal Infections
Time series analysis in epidemiological studies is typically conducted on aggregated counts, although data tend to be collected at finer temporal resolutions. The decision to aggregate data is rarely discussed in epidemiological literature although it has been shown to impact model results. We prese...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7460497/ https://www.ncbi.nlm.nih.gov/pubmed/32823719 http://dx.doi.org/10.3390/ijerph17165887 |
_version_ | 1783576615389757440 |
---|---|
author | Alarcon Falconi, Tania M. Estrella, Bertha Sempértegui, Fernando Naumova, Elena N. |
author_facet | Alarcon Falconi, Tania M. Estrella, Bertha Sempértegui, Fernando Naumova, Elena N. |
author_sort | Alarcon Falconi, Tania M. |
collection | PubMed |
description | Time series analysis in epidemiological studies is typically conducted on aggregated counts, although data tend to be collected at finer temporal resolutions. The decision to aggregate data is rarely discussed in epidemiological literature although it has been shown to impact model results. We present a critical thinking process for making decisions about data aggregation in time series analysis of seasonal infections. We systematically build a harmonic regression model to characterize peak timing and amplitude of three respiratory and enteric infections that have different seasonal patterns and incidence. We show that irregularities introduced when aggregating data must be controlled during modeling to prevent erroneous results. Aggregation irregularities had a minimal impact on the estimates of trend, amplitude, and peak timing for daily and weekly data regardless of the disease. However, estimates of peak timing of the more common infections changed by as much as 2.5 months when controlling for monthly data irregularities. Building a systematic model that controls for data irregularities is essential to accurately characterize temporal patterns of infections. With the urgent need to characterize temporal patterns of novel infections, such as COVID-19, this tutorial is timely and highly valuable for experts in many disciplines. |
format | Online Article Text |
id | pubmed-7460497 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-74604972020-09-03 Effects of Data Aggregation on Time Series Analysis of Seasonal Infections Alarcon Falconi, Tania M. Estrella, Bertha Sempértegui, Fernando Naumova, Elena N. Int J Environ Res Public Health Article Time series analysis in epidemiological studies is typically conducted on aggregated counts, although data tend to be collected at finer temporal resolutions. The decision to aggregate data is rarely discussed in epidemiological literature although it has been shown to impact model results. We present a critical thinking process for making decisions about data aggregation in time series analysis of seasonal infections. We systematically build a harmonic regression model to characterize peak timing and amplitude of three respiratory and enteric infections that have different seasonal patterns and incidence. We show that irregularities introduced when aggregating data must be controlled during modeling to prevent erroneous results. Aggregation irregularities had a minimal impact on the estimates of trend, amplitude, and peak timing for daily and weekly data regardless of the disease. However, estimates of peak timing of the more common infections changed by as much as 2.5 months when controlling for monthly data irregularities. Building a systematic model that controls for data irregularities is essential to accurately characterize temporal patterns of infections. With the urgent need to characterize temporal patterns of novel infections, such as COVID-19, this tutorial is timely and highly valuable for experts in many disciplines. MDPI 2020-08-13 2020-08 /pmc/articles/PMC7460497/ /pubmed/32823719 http://dx.doi.org/10.3390/ijerph17165887 Text en © 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Alarcon Falconi, Tania M. Estrella, Bertha Sempértegui, Fernando Naumova, Elena N. Effects of Data Aggregation on Time Series Analysis of Seasonal Infections |
title | Effects of Data Aggregation on Time Series Analysis of Seasonal Infections |
title_full | Effects of Data Aggregation on Time Series Analysis of Seasonal Infections |
title_fullStr | Effects of Data Aggregation on Time Series Analysis of Seasonal Infections |
title_full_unstemmed | Effects of Data Aggregation on Time Series Analysis of Seasonal Infections |
title_short | Effects of Data Aggregation on Time Series Analysis of Seasonal Infections |
title_sort | effects of data aggregation on time series analysis of seasonal infections |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7460497/ https://www.ncbi.nlm.nih.gov/pubmed/32823719 http://dx.doi.org/10.3390/ijerph17165887 |
work_keys_str_mv | AT alarconfalconitaniam effectsofdataaggregationontimeseriesanalysisofseasonalinfections AT estrellabertha effectsofdataaggregationontimeseriesanalysisofseasonalinfections AT semperteguifernando effectsofdataaggregationontimeseriesanalysisofseasonalinfections AT naumovaelenan effectsofdataaggregationontimeseriesanalysisofseasonalinfections |