Cargando…
Evaluation of sliding baseline methods for spatial estimation for cluster detection in the biosurveillance system
BACKGROUND: The Centers for Disease Control and Prevention's (CDC's) BioSense system provides near-real time situational awareness for public health monitoring through analysis of electronic health data. Determination of anomalous spatial and temporal disease clusters is a crucial part of...
Autores principales: | , , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2009
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2716331/ https://www.ncbi.nlm.nih.gov/pubmed/19615075 http://dx.doi.org/10.1186/1476-072X-8-45 |
_version_ | 1782169806438400000 |
---|---|
author | Xing, Jian Burkom, Howard Moniz, Linda Edgerton, James Leuze, Michael Tokars, Jerome |
author_facet | Xing, Jian Burkom, Howard Moniz, Linda Edgerton, James Leuze, Michael Tokars, Jerome |
author_sort | Xing, Jian |
collection | PubMed |
description | BACKGROUND: The Centers for Disease Control and Prevention's (CDC's) BioSense system provides near-real time situational awareness for public health monitoring through analysis of electronic health data. Determination of anomalous spatial and temporal disease clusters is a crucial part of the daily disease monitoring task. Our study focused on finding useful anomalies at manageable alert rates according to available BioSense data history. METHODS: The study dataset included more than 3 years of daily counts of military outpatient clinic visits for respiratory and rash syndrome groupings. We applied four spatial estimation methods in implementations of space-time scan statistics cross-checked in Matlab and C. We compared the utility of these methods according to the resultant background cluster rate (a false alarm surrogate) and sensitivity to injected cluster signals. The comparison runs used a spatial resolution based on the facility zip code in the patient record and a finer resolution based on the residence zip code. RESULTS: Simple estimation methods that account for day-of-week (DOW) data patterns yielded a clear advantage both in background cluster rate and in signal sensitivity. A 28-day baseline gave the most robust results for this estimation; the preferred baseline is long enough to remove daily fluctuations but short enough to reflect recent disease trends and data representation. Background cluster rates were lower for the rash syndrome counts than for the respiratory counts, likely because of seasonality and the large scale of the respiratory counts. CONCLUSION: The spatial estimation method should be chosen according to characteristics of the selected data streams. In this dataset with strong day-of-week effects, the overall best detection performance was achieved using subregion averages over a 28-day baseline stratified by weekday or weekend/holiday behavior. Changing the estimation method for particular scenarios involving different spatial resolution or other syndromes can yield further improvement. |
format | Text |
id | pubmed-2716331 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2009 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-27163312009-07-28 Evaluation of sliding baseline methods for spatial estimation for cluster detection in the biosurveillance system Xing, Jian Burkom, Howard Moniz, Linda Edgerton, James Leuze, Michael Tokars, Jerome Int J Health Geogr Methodology BACKGROUND: The Centers for Disease Control and Prevention's (CDC's) BioSense system provides near-real time situational awareness for public health monitoring through analysis of electronic health data. Determination of anomalous spatial and temporal disease clusters is a crucial part of the daily disease monitoring task. Our study focused on finding useful anomalies at manageable alert rates according to available BioSense data history. METHODS: The study dataset included more than 3 years of daily counts of military outpatient clinic visits for respiratory and rash syndrome groupings. We applied four spatial estimation methods in implementations of space-time scan statistics cross-checked in Matlab and C. We compared the utility of these methods according to the resultant background cluster rate (a false alarm surrogate) and sensitivity to injected cluster signals. The comparison runs used a spatial resolution based on the facility zip code in the patient record and a finer resolution based on the residence zip code. RESULTS: Simple estimation methods that account for day-of-week (DOW) data patterns yielded a clear advantage both in background cluster rate and in signal sensitivity. A 28-day baseline gave the most robust results for this estimation; the preferred baseline is long enough to remove daily fluctuations but short enough to reflect recent disease trends and data representation. Background cluster rates were lower for the rash syndrome counts than for the respiratory counts, likely because of seasonality and the large scale of the respiratory counts. CONCLUSION: The spatial estimation method should be chosen according to characteristics of the selected data streams. In this dataset with strong day-of-week effects, the overall best detection performance was achieved using subregion averages over a 28-day baseline stratified by weekday or weekend/holiday behavior. Changing the estimation method for particular scenarios involving different spatial resolution or other syndromes can yield further improvement. BioMed Central 2009-07-17 /pmc/articles/PMC2716331/ /pubmed/19615075 http://dx.doi.org/10.1186/1476-072X-8-45 Text en Copyright © 2009 Xing et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methodology Xing, Jian Burkom, Howard Moniz, Linda Edgerton, James Leuze, Michael Tokars, Jerome Evaluation of sliding baseline methods for spatial estimation for cluster detection in the biosurveillance system |
title | Evaluation of sliding baseline methods for spatial estimation for cluster detection in the biosurveillance system |
title_full | Evaluation of sliding baseline methods for spatial estimation for cluster detection in the biosurveillance system |
title_fullStr | Evaluation of sliding baseline methods for spatial estimation for cluster detection in the biosurveillance system |
title_full_unstemmed | Evaluation of sliding baseline methods for spatial estimation for cluster detection in the biosurveillance system |
title_short | Evaluation of sliding baseline methods for spatial estimation for cluster detection in the biosurveillance system |
title_sort | evaluation of sliding baseline methods for spatial estimation for cluster detection in the biosurveillance system |
topic | Methodology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2716331/ https://www.ncbi.nlm.nih.gov/pubmed/19615075 http://dx.doi.org/10.1186/1476-072X-8-45 |
work_keys_str_mv | AT xingjian evaluationofslidingbaselinemethodsforspatialestimationforclusterdetectioninthebiosurveillancesystem AT burkomhoward evaluationofslidingbaselinemethodsforspatialestimationforclusterdetectioninthebiosurveillancesystem AT monizlinda evaluationofslidingbaselinemethodsforspatialestimationforclusterdetectioninthebiosurveillancesystem AT edgertonjames evaluationofslidingbaselinemethodsforspatialestimationforclusterdetectioninthebiosurveillancesystem AT leuzemichael evaluationofslidingbaselinemethodsforspatialestimationforclusterdetectioninthebiosurveillancesystem AT tokarsjerome evaluationofslidingbaselinemethodsforspatialestimationforclusterdetectioninthebiosurveillancesystem |