Cargando…

Machine learning for syndromic surveillance using veterinary necropsy reports

The use of natural language data for animal population surveillance represents a valuable opportunity to gather information about potential disease outbreaks, emerging zoonotic diseases, or bioterrorism threats. In this study, we evaluate machine learning methods for conducting syndromic surveillanc...

Descripción completa

Detalles Bibliográficos
Autores principales: Bollig, Nathan, Clarke, Lorelei, Elsmo, Elizabeth, Craven, Mark
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7001958/
https://www.ncbi.nlm.nih.gov/pubmed/32023271
http://dx.doi.org/10.1371/journal.pone.0228105
_version_ 1783494319935586304
author Bollig, Nathan
Clarke, Lorelei
Elsmo, Elizabeth
Craven, Mark
author_facet Bollig, Nathan
Clarke, Lorelei
Elsmo, Elizabeth
Craven, Mark
author_sort Bollig, Nathan
collection PubMed
description The use of natural language data for animal population surveillance represents a valuable opportunity to gather information about potential disease outbreaks, emerging zoonotic diseases, or bioterrorism threats. In this study, we evaluate machine learning methods for conducting syndromic surveillance using free-text veterinary necropsy reports. We train a system to detect if a necropsy report from the Wisconsin Veterinary Diagnostic Laboratory contains evidence of gastrointestinal, respiratory, or urinary pathology. We evaluate the performance of several machine learning algorithms including deep learning with a long short-term memory network. Although no single algorithm was superior, random forest using feature vectors of TF-IDF statistics ranked among the top-performing models with F1 scores of 0.923 (gastrointestinal), 0.960 (respiratory), and 0.888 (urinary). This model was applied to over 33,000 necropsy reports and was used to describe temporal and spatial features of diseases within a 14-year period, exposing epidemiological trends and detecting a potential focus of gastrointestinal disease from a single submitting producer in the fall of 2016.
format Online
Article
Text
id pubmed-7001958
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-70019582020-02-18 Machine learning for syndromic surveillance using veterinary necropsy reports Bollig, Nathan Clarke, Lorelei Elsmo, Elizabeth Craven, Mark PLoS One Research Article The use of natural language data for animal population surveillance represents a valuable opportunity to gather information about potential disease outbreaks, emerging zoonotic diseases, or bioterrorism threats. In this study, we evaluate machine learning methods for conducting syndromic surveillance using free-text veterinary necropsy reports. We train a system to detect if a necropsy report from the Wisconsin Veterinary Diagnostic Laboratory contains evidence of gastrointestinal, respiratory, or urinary pathology. We evaluate the performance of several machine learning algorithms including deep learning with a long short-term memory network. Although no single algorithm was superior, random forest using feature vectors of TF-IDF statistics ranked among the top-performing models with F1 scores of 0.923 (gastrointestinal), 0.960 (respiratory), and 0.888 (urinary). This model was applied to over 33,000 necropsy reports and was used to describe temporal and spatial features of diseases within a 14-year period, exposing epidemiological trends and detecting a potential focus of gastrointestinal disease from a single submitting producer in the fall of 2016. Public Library of Science 2020-02-05 /pmc/articles/PMC7001958/ /pubmed/32023271 http://dx.doi.org/10.1371/journal.pone.0228105 Text en © 2020 Bollig et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Bollig, Nathan
Clarke, Lorelei
Elsmo, Elizabeth
Craven, Mark
Machine learning for syndromic surveillance using veterinary necropsy reports
title Machine learning for syndromic surveillance using veterinary necropsy reports
title_full Machine learning for syndromic surveillance using veterinary necropsy reports
title_fullStr Machine learning for syndromic surveillance using veterinary necropsy reports
title_full_unstemmed Machine learning for syndromic surveillance using veterinary necropsy reports
title_short Machine learning for syndromic surveillance using veterinary necropsy reports
title_sort machine learning for syndromic surveillance using veterinary necropsy reports
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7001958/
https://www.ncbi.nlm.nih.gov/pubmed/32023271
http://dx.doi.org/10.1371/journal.pone.0228105
work_keys_str_mv AT bollignathan machinelearningforsyndromicsurveillanceusingveterinarynecropsyreports
AT clarkelorelei machinelearningforsyndromicsurveillanceusingveterinarynecropsyreports
AT elsmoelizabeth machinelearningforsyndromicsurveillanceusingveterinarynecropsyreports
AT cravenmark machinelearningforsyndromicsurveillanceusingveterinarynecropsyreports