Cargando…

Statistical Models for Biosurveillance of Multiple Organisms

OBJECTIVE: To look at the diversity of the patterns displayed by a range of organisms, and to seek a simple family of models that adequately describes all organisms, rather than a well-fitting model for any particular organism. INTRODUCTION: There has been much research on statistical methods of pro...

Descripción completa

Detalles Bibliográficos
Autores principales: Enki, Doyo G., Noufaily, Angela, Farrington, C. P., Garthwaite, Paul H., Andrews, Nick, Charlett, André, Lane, Chris
Formato: Online Artículo Texto
Lenguaje:English
Publicado: University of Illinois at Chicago Library 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3692751/
_version_ 1782274646664544256
author Enki, Doyo G.
Noufaily, Angela
Farrington, C. P.
Garthwaite, Paul H.
Andrews, Nick
Charlett, André
Lane, Chris
author_facet Enki, Doyo G.
Noufaily, Angela
Farrington, C. P.
Garthwaite, Paul H.
Andrews, Nick
Charlett, André
Lane, Chris
author_sort Enki, Doyo G.
collection PubMed
description OBJECTIVE: To look at the diversity of the patterns displayed by a range of organisms, and to seek a simple family of models that adequately describes all organisms, rather than a well-fitting model for any particular organism. INTRODUCTION: There has been much research on statistical methods of prospective outbreak detection that are aimed at identifying unusual clusters of one syndrome or disease, and some work on multivariate surveillance methods (1). In England and Wales, automated laboratory surveillance of infectious diseases has been undertaken since the early 1990’s. The statistical methodology of this automated system is described in (2). However, there has been little research on outbreak detection methods that are suited to large, multiple surveillance systems involving thousands of different organisms. METHODS: We obtained twenty years’ data on weekly counts of all infectious disease organisms reported to the UK’s Health Protection Agency. We summarized the mean frequencies, trends and seasonality of each organism using log-linear models. To identify a simple family of models which adequately represents all organisms, the Poisson model, the quasi-Poisson model and the negative binomial model were investigated (3,4). Formal goodness-of-fit tests were not used as they can be unreliable with sparse data. Adequacy of the models was empirically studied using the relationships between the mean, variance and skewness. For this purpose, each data series was first subdivided into 41 half-years and de-seasonalized. RESULTS: Trends and seasonality were summarized by plotting the distribution of estimated linear trend parameters for 2250 organisms, and modal seasonal period for 2254 organisms, including those organisms for which the seasonal effect is statistically significant. Relationships between mean and variance were summarized as given in Figure 1. Similar plots were used to summarize the relationships between mean and skewness. CONCLUSIONS: Statistical outbreak detection models must be able to cope with seasonality and trends. The data analyses suggest that the great majority of organisms can adequately – though far from perfectly – be represented by a statistical model in which the variance is proportional to the mean, such as the quasi-Poisson or negative binomial models.
format Online
Article
Text
id pubmed-3692751
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher University of Illinois at Chicago Library
record_format MEDLINE/PubMed
spelling pubmed-36927512013-06-26 Statistical Models for Biosurveillance of Multiple Organisms Enki, Doyo G. Noufaily, Angela Farrington, C. P. Garthwaite, Paul H. Andrews, Nick Charlett, André Lane, Chris Online J Public Health Inform ISDS 2012 Conference Abstracts OBJECTIVE: To look at the diversity of the patterns displayed by a range of organisms, and to seek a simple family of models that adequately describes all organisms, rather than a well-fitting model for any particular organism. INTRODUCTION: There has been much research on statistical methods of prospective outbreak detection that are aimed at identifying unusual clusters of one syndrome or disease, and some work on multivariate surveillance methods (1). In England and Wales, automated laboratory surveillance of infectious diseases has been undertaken since the early 1990’s. The statistical methodology of this automated system is described in (2). However, there has been little research on outbreak detection methods that are suited to large, multiple surveillance systems involving thousands of different organisms. METHODS: We obtained twenty years’ data on weekly counts of all infectious disease organisms reported to the UK’s Health Protection Agency. We summarized the mean frequencies, trends and seasonality of each organism using log-linear models. To identify a simple family of models which adequately represents all organisms, the Poisson model, the quasi-Poisson model and the negative binomial model were investigated (3,4). Formal goodness-of-fit tests were not used as they can be unreliable with sparse data. Adequacy of the models was empirically studied using the relationships between the mean, variance and skewness. For this purpose, each data series was first subdivided into 41 half-years and de-seasonalized. RESULTS: Trends and seasonality were summarized by plotting the distribution of estimated linear trend parameters for 2250 organisms, and modal seasonal period for 2254 organisms, including those organisms for which the seasonal effect is statistically significant. Relationships between mean and variance were summarized as given in Figure 1. Similar plots were used to summarize the relationships between mean and skewness. CONCLUSIONS: Statistical outbreak detection models must be able to cope with seasonality and trends. The data analyses suggest that the great majority of organisms can adequately – though far from perfectly – be represented by a statistical model in which the variance is proportional to the mean, such as the quasi-Poisson or negative binomial models. University of Illinois at Chicago Library 2013-04-04 /pmc/articles/PMC3692751/ Text en ©2013 the author(s) http://www.uic.edu/htbin/cgiwrap/bin/ojs/index.php/ojphi/about/submissions#copyrightNotice This is an Open Access article. Authors own copyright of their articles appearing in the Online Journal of Public Health Informatics. Readers may copy articles without permission of the copyright owner(s), as long as the author and OJPHI are acknowledged in the copy and the copy is used for educational, not-for-profit purposes.
spellingShingle ISDS 2012 Conference Abstracts
Enki, Doyo G.
Noufaily, Angela
Farrington, C. P.
Garthwaite, Paul H.
Andrews, Nick
Charlett, André
Lane, Chris
Statistical Models for Biosurveillance of Multiple Organisms
title Statistical Models for Biosurveillance of Multiple Organisms
title_full Statistical Models for Biosurveillance of Multiple Organisms
title_fullStr Statistical Models for Biosurveillance of Multiple Organisms
title_full_unstemmed Statistical Models for Biosurveillance of Multiple Organisms
title_short Statistical Models for Biosurveillance of Multiple Organisms
title_sort statistical models for biosurveillance of multiple organisms
topic ISDS 2012 Conference Abstracts
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3692751/
work_keys_str_mv AT enkidoyog statisticalmodelsforbiosurveillanceofmultipleorganisms
AT noufailyangela statisticalmodelsforbiosurveillanceofmultipleorganisms
AT farringtoncp statisticalmodelsforbiosurveillanceofmultipleorganisms
AT garthwaitepaulh statisticalmodelsforbiosurveillanceofmultipleorganisms
AT andrewsnick statisticalmodelsforbiosurveillanceofmultipleorganisms
AT charlettandre statisticalmodelsforbiosurveillanceofmultipleorganisms
AT lanechris statisticalmodelsforbiosurveillanceofmultipleorganisms