Cargando…

Machine learning to refine decision making within a syndromic surveillance service

BACKGROUND: Worldwide, syndromic surveillance is increasingly used for improved and timely situational awareness and early identification of public health threats. Syndromic data streams are fed into detection algorithms, which produce statistical alarms highlighting potential activity of public hea...

Descripción completa

Detalles Bibliográficos
Autores principales: Lake, I. R., Colón-González, F. J., Barker, G. C., Morbey, R. A., Smith, G. E., Elliot, A. J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6515660/
https://www.ncbi.nlm.nih.gov/pubmed/31088446
http://dx.doi.org/10.1186/s12889-019-6916-9
_version_ 1783418126563540992
author Lake, I. R.
Colón-González, F. J.
Barker, G. C.
Morbey, R. A.
Smith, G. E.
Elliot, A. J.
author_facet Lake, I. R.
Colón-González, F. J.
Barker, G. C.
Morbey, R. A.
Smith, G. E.
Elliot, A. J.
author_sort Lake, I. R.
collection PubMed
description BACKGROUND: Worldwide, syndromic surveillance is increasingly used for improved and timely situational awareness and early identification of public health threats. Syndromic data streams are fed into detection algorithms, which produce statistical alarms highlighting potential activity of public health importance. All alarms must be assessed to confirm whether they are of public health importance. In England, approximately 100 alarms are generated daily and, although their analysis is formalised through a risk assessment process, the process requires notable time, training, and maintenance of an expertise base to determine which alarms are of public health importance. The process is made more complicated by the observation that only 0.1% of statistical alarms are deemed to be of public health importance. Therefore, the aims of this study were to evaluate machine learning as a tool for computer-assisted human decision-making when assessing statistical alarms. METHODS: A record of the risk assessment process was obtained from Public Health England for all 67,505 statistical alarms between August 2013 and October 2015. This record contained information on the characteristics of the alarm (e.g. size, location). We used three Bayesian classifiers- naïve Bayes, tree-augmented naïve Bayes and Multinets - to examine the risk assessment record in England with respect to the final ‘Decision’ outcome made by an epidemiologist of ‘Alert’, ‘Monitor’ or ‘No-action’. Two further classifications based upon tree-augmented naïve Bayes and Multinets were implemented to account for the predominance of ‘No-action’ outcomes. RESULTS: The attributes of each individual risk assessment were linked to the final decision made by an epidemiologist, providing confidence in the current process. The naïve Bayesian classifier performed best, correctly classifying 51.5% of ‘Alert’ outcomes. If the ‘Alert’ and ‘Monitor’ actions are combined then performance increases to 82.6% correctly classified. We demonstrate how a decision support system based upon a naïve Bayes classifier could be operationalised within an operational syndromic surveillance system. CONCLUSIONS: Within syndromic surveillance systems, machine learning techniques have the potential to make risk assessment following statistical alarms more automated, robust, and rigorous. However, our results also highlight the importance of specialist human input to the process.
format Online
Article
Text
id pubmed-6515660
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-65156602019-05-21 Machine learning to refine decision making within a syndromic surveillance service Lake, I. R. Colón-González, F. J. Barker, G. C. Morbey, R. A. Smith, G. E. Elliot, A. J. BMC Public Health Research Article BACKGROUND: Worldwide, syndromic surveillance is increasingly used for improved and timely situational awareness and early identification of public health threats. Syndromic data streams are fed into detection algorithms, which produce statistical alarms highlighting potential activity of public health importance. All alarms must be assessed to confirm whether they are of public health importance. In England, approximately 100 alarms are generated daily and, although their analysis is formalised through a risk assessment process, the process requires notable time, training, and maintenance of an expertise base to determine which alarms are of public health importance. The process is made more complicated by the observation that only 0.1% of statistical alarms are deemed to be of public health importance. Therefore, the aims of this study were to evaluate machine learning as a tool for computer-assisted human decision-making when assessing statistical alarms. METHODS: A record of the risk assessment process was obtained from Public Health England for all 67,505 statistical alarms between August 2013 and October 2015. This record contained information on the characteristics of the alarm (e.g. size, location). We used three Bayesian classifiers- naïve Bayes, tree-augmented naïve Bayes and Multinets - to examine the risk assessment record in England with respect to the final ‘Decision’ outcome made by an epidemiologist of ‘Alert’, ‘Monitor’ or ‘No-action’. Two further classifications based upon tree-augmented naïve Bayes and Multinets were implemented to account for the predominance of ‘No-action’ outcomes. RESULTS: The attributes of each individual risk assessment were linked to the final decision made by an epidemiologist, providing confidence in the current process. The naïve Bayesian classifier performed best, correctly classifying 51.5% of ‘Alert’ outcomes. If the ‘Alert’ and ‘Monitor’ actions are combined then performance increases to 82.6% correctly classified. We demonstrate how a decision support system based upon a naïve Bayes classifier could be operationalised within an operational syndromic surveillance system. CONCLUSIONS: Within syndromic surveillance systems, machine learning techniques have the potential to make risk assessment following statistical alarms more automated, robust, and rigorous. However, our results also highlight the importance of specialist human input to the process. BioMed Central 2019-05-14 /pmc/articles/PMC6515660/ /pubmed/31088446 http://dx.doi.org/10.1186/s12889-019-6916-9 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Lake, I. R.
Colón-González, F. J.
Barker, G. C.
Morbey, R. A.
Smith, G. E.
Elliot, A. J.
Machine learning to refine decision making within a syndromic surveillance service
title Machine learning to refine decision making within a syndromic surveillance service
title_full Machine learning to refine decision making within a syndromic surveillance service
title_fullStr Machine learning to refine decision making within a syndromic surveillance service
title_full_unstemmed Machine learning to refine decision making within a syndromic surveillance service
title_short Machine learning to refine decision making within a syndromic surveillance service
title_sort machine learning to refine decision making within a syndromic surveillance service
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6515660/
https://www.ncbi.nlm.nih.gov/pubmed/31088446
http://dx.doi.org/10.1186/s12889-019-6916-9
work_keys_str_mv AT lakeir machinelearningtorefinedecisionmakingwithinasyndromicsurveillanceservice
AT colongonzalezfj machinelearningtorefinedecisionmakingwithinasyndromicsurveillanceservice
AT barkergc machinelearningtorefinedecisionmakingwithinasyndromicsurveillanceservice
AT morbeyra machinelearningtorefinedecisionmakingwithinasyndromicsurveillanceservice
AT smithge machinelearningtorefinedecisionmakingwithinasyndromicsurveillanceservice
AT elliotaj machinelearningtorefinedecisionmakingwithinasyndromicsurveillanceservice