Cargando…

Spatial aggregation choice in the era of digital and administrative surveillance data

Traditional disease surveillance is increasingly being complemented by data from non-traditional sources like medical claims, electronic health records, and participatory syndromic data platforms. As non-traditional data are often collected at the individual-level and are convenience samples from a...

Descripción completa

Detalles Bibliográficos
Autores principales: Lee, Elizabeth C., Arab, Ali, Colizza, Vittoria, Bansal, Shweta
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9931313/
https://www.ncbi.nlm.nih.gov/pubmed/36812505
http://dx.doi.org/10.1371/journal.pdig.0000039
_version_ 1784889221893324800
author Lee, Elizabeth C.
Arab, Ali
Colizza, Vittoria
Bansal, Shweta
author_facet Lee, Elizabeth C.
Arab, Ali
Colizza, Vittoria
Bansal, Shweta
author_sort Lee, Elizabeth C.
collection PubMed
description Traditional disease surveillance is increasingly being complemented by data from non-traditional sources like medical claims, electronic health records, and participatory syndromic data platforms. As non-traditional data are often collected at the individual-level and are convenience samples from a population, choices must be made on the aggregation of these data for epidemiological inference. Our study seeks to understand the influence of spatial aggregation choice on our understanding of disease spread with a case study of influenza-like illness in the United States. Using U.S. medical claims data from 2002 to 2009, we examined the epidemic source location, onset and peak season timing, and epidemic duration of influenza seasons for data aggregated to the county and state scales. We also compared spatial autocorrelation and tested the relative magnitude of spatial aggregation differences between onset and peak measures of disease burden. We found discrepancies in the inferred epidemic source locations and estimated influenza season onsets and peaks when comparing county and state-level data. Spatial autocorrelation was detected across more expansive geographic ranges during the peak season as compared to the early flu season, and there were greater spatial aggregation differences in early season measures as well. Epidemiological inferences are more sensitive to spatial scale early on during U.S. influenza seasons, when there is greater heterogeneity in timing, intensity, and geographic spread of the epidemics. Users of non-traditional disease surveillance should carefully consider how to extract accurate disease signals from finer-scaled data for early use in disease outbreaks.
format Online
Article
Text
id pubmed-9931313
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-99313132023-02-16 Spatial aggregation choice in the era of digital and administrative surveillance data Lee, Elizabeth C. Arab, Ali Colizza, Vittoria Bansal, Shweta PLOS Digit Health Research Article Traditional disease surveillance is increasingly being complemented by data from non-traditional sources like medical claims, electronic health records, and participatory syndromic data platforms. As non-traditional data are often collected at the individual-level and are convenience samples from a population, choices must be made on the aggregation of these data for epidemiological inference. Our study seeks to understand the influence of spatial aggregation choice on our understanding of disease spread with a case study of influenza-like illness in the United States. Using U.S. medical claims data from 2002 to 2009, we examined the epidemic source location, onset and peak season timing, and epidemic duration of influenza seasons for data aggregated to the county and state scales. We also compared spatial autocorrelation and tested the relative magnitude of spatial aggregation differences between onset and peak measures of disease burden. We found discrepancies in the inferred epidemic source locations and estimated influenza season onsets and peaks when comparing county and state-level data. Spatial autocorrelation was detected across more expansive geographic ranges during the peak season as compared to the early flu season, and there were greater spatial aggregation differences in early season measures as well. Epidemiological inferences are more sensitive to spatial scale early on during U.S. influenza seasons, when there is greater heterogeneity in timing, intensity, and geographic spread of the epidemics. Users of non-traditional disease surveillance should carefully consider how to extract accurate disease signals from finer-scaled data for early use in disease outbreaks. Public Library of Science 2022-06-03 /pmc/articles/PMC9931313/ /pubmed/36812505 http://dx.doi.org/10.1371/journal.pdig.0000039 Text en © 2022 Lee et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Lee, Elizabeth C.
Arab, Ali
Colizza, Vittoria
Bansal, Shweta
Spatial aggregation choice in the era of digital and administrative surveillance data
title Spatial aggregation choice in the era of digital and administrative surveillance data
title_full Spatial aggregation choice in the era of digital and administrative surveillance data
title_fullStr Spatial aggregation choice in the era of digital and administrative surveillance data
title_full_unstemmed Spatial aggregation choice in the era of digital and administrative surveillance data
title_short Spatial aggregation choice in the era of digital and administrative surveillance data
title_sort spatial aggregation choice in the era of digital and administrative surveillance data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9931313/
https://www.ncbi.nlm.nih.gov/pubmed/36812505
http://dx.doi.org/10.1371/journal.pdig.0000039
work_keys_str_mv AT leeelizabethc spatialaggregationchoiceintheeraofdigitalandadministrativesurveillancedata
AT arabali spatialaggregationchoiceintheeraofdigitalandadministrativesurveillancedata
AT colizzavittoria spatialaggregationchoiceintheeraofdigitalandadministrativesurveillancedata
AT bansalshweta spatialaggregationchoiceintheeraofdigitalandadministrativesurveillancedata