Cargando…

Evaluation of sliding baseline methods for spatial estimation for cluster detection in the biosurveillance system

BACKGROUND: The Centers for Disease Control and Prevention's (CDC's) BioSense system provides near-real time situational awareness for public health monitoring through analysis of electronic health data. Determination of anomalous spatial and temporal disease clusters is a crucial part of...

Descripción completa

Detalles Bibliográficos
Autores principales: Xing, Jian, Burkom, Howard, Moniz, Linda, Edgerton, James, Leuze, Michael, Tokars, Jerome
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2716331/
https://www.ncbi.nlm.nih.gov/pubmed/19615075
http://dx.doi.org/10.1186/1476-072X-8-45
_version_ 1782169806438400000
author Xing, Jian
Burkom, Howard
Moniz, Linda
Edgerton, James
Leuze, Michael
Tokars, Jerome
author_facet Xing, Jian
Burkom, Howard
Moniz, Linda
Edgerton, James
Leuze, Michael
Tokars, Jerome
author_sort Xing, Jian
collection PubMed
description BACKGROUND: The Centers for Disease Control and Prevention's (CDC's) BioSense system provides near-real time situational awareness for public health monitoring through analysis of electronic health data. Determination of anomalous spatial and temporal disease clusters is a crucial part of the daily disease monitoring task. Our study focused on finding useful anomalies at manageable alert rates according to available BioSense data history. METHODS: The study dataset included more than 3 years of daily counts of military outpatient clinic visits for respiratory and rash syndrome groupings. We applied four spatial estimation methods in implementations of space-time scan statistics cross-checked in Matlab and C. We compared the utility of these methods according to the resultant background cluster rate (a false alarm surrogate) and sensitivity to injected cluster signals. The comparison runs used a spatial resolution based on the facility zip code in the patient record and a finer resolution based on the residence zip code. RESULTS: Simple estimation methods that account for day-of-week (DOW) data patterns yielded a clear advantage both in background cluster rate and in signal sensitivity. A 28-day baseline gave the most robust results for this estimation; the preferred baseline is long enough to remove daily fluctuations but short enough to reflect recent disease trends and data representation. Background cluster rates were lower for the rash syndrome counts than for the respiratory counts, likely because of seasonality and the large scale of the respiratory counts. CONCLUSION: The spatial estimation method should be chosen according to characteristics of the selected data streams. In this dataset with strong day-of-week effects, the overall best detection performance was achieved using subregion averages over a 28-day baseline stratified by weekday or weekend/holiday behavior. Changing the estimation method for particular scenarios involving different spatial resolution or other syndromes can yield further improvement.
format Text
id pubmed-2716331
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-27163312009-07-28 Evaluation of sliding baseline methods for spatial estimation for cluster detection in the biosurveillance system Xing, Jian Burkom, Howard Moniz, Linda Edgerton, James Leuze, Michael Tokars, Jerome Int J Health Geogr Methodology BACKGROUND: The Centers for Disease Control and Prevention's (CDC's) BioSense system provides near-real time situational awareness for public health monitoring through analysis of electronic health data. Determination of anomalous spatial and temporal disease clusters is a crucial part of the daily disease monitoring task. Our study focused on finding useful anomalies at manageable alert rates according to available BioSense data history. METHODS: The study dataset included more than 3 years of daily counts of military outpatient clinic visits for respiratory and rash syndrome groupings. We applied four spatial estimation methods in implementations of space-time scan statistics cross-checked in Matlab and C. We compared the utility of these methods according to the resultant background cluster rate (a false alarm surrogate) and sensitivity to injected cluster signals. The comparison runs used a spatial resolution based on the facility zip code in the patient record and a finer resolution based on the residence zip code. RESULTS: Simple estimation methods that account for day-of-week (DOW) data patterns yielded a clear advantage both in background cluster rate and in signal sensitivity. A 28-day baseline gave the most robust results for this estimation; the preferred baseline is long enough to remove daily fluctuations but short enough to reflect recent disease trends and data representation. Background cluster rates were lower for the rash syndrome counts than for the respiratory counts, likely because of seasonality and the large scale of the respiratory counts. CONCLUSION: The spatial estimation method should be chosen according to characteristics of the selected data streams. In this dataset with strong day-of-week effects, the overall best detection performance was achieved using subregion averages over a 28-day baseline stratified by weekday or weekend/holiday behavior. Changing the estimation method for particular scenarios involving different spatial resolution or other syndromes can yield further improvement. BioMed Central 2009-07-17 /pmc/articles/PMC2716331/ /pubmed/19615075 http://dx.doi.org/10.1186/1476-072X-8-45 Text en Copyright © 2009 Xing et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology
Xing, Jian
Burkom, Howard
Moniz, Linda
Edgerton, James
Leuze, Michael
Tokars, Jerome
Evaluation of sliding baseline methods for spatial estimation for cluster detection in the biosurveillance system
title Evaluation of sliding baseline methods for spatial estimation for cluster detection in the biosurveillance system
title_full Evaluation of sliding baseline methods for spatial estimation for cluster detection in the biosurveillance system
title_fullStr Evaluation of sliding baseline methods for spatial estimation for cluster detection in the biosurveillance system
title_full_unstemmed Evaluation of sliding baseline methods for spatial estimation for cluster detection in the biosurveillance system
title_short Evaluation of sliding baseline methods for spatial estimation for cluster detection in the biosurveillance system
title_sort evaluation of sliding baseline methods for spatial estimation for cluster detection in the biosurveillance system
topic Methodology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2716331/
https://www.ncbi.nlm.nih.gov/pubmed/19615075
http://dx.doi.org/10.1186/1476-072X-8-45
work_keys_str_mv AT xingjian evaluationofslidingbaselinemethodsforspatialestimationforclusterdetectioninthebiosurveillancesystem
AT burkomhoward evaluationofslidingbaselinemethodsforspatialestimationforclusterdetectioninthebiosurveillancesystem
AT monizlinda evaluationofslidingbaselinemethodsforspatialestimationforclusterdetectioninthebiosurveillancesystem
AT edgertonjames evaluationofslidingbaselinemethodsforspatialestimationforclusterdetectioninthebiosurveillancesystem
AT leuzemichael evaluationofslidingbaselinemethodsforspatialestimationforclusterdetectioninthebiosurveillancesystem
AT tokarsjerome evaluationofslidingbaselinemethodsforspatialestimationforclusterdetectioninthebiosurveillancesystem