Cargando…
Machine learning for syndromic surveillance using veterinary necropsy reports
The use of natural language data for animal population surveillance represents a valuable opportunity to gather information about potential disease outbreaks, emerging zoonotic diseases, or bioterrorism threats. In this study, we evaluate machine learning methods for conducting syndromic surveillanc...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7001958/ https://www.ncbi.nlm.nih.gov/pubmed/32023271 http://dx.doi.org/10.1371/journal.pone.0228105 |
_version_ | 1783494319935586304 |
---|---|
author | Bollig, Nathan Clarke, Lorelei Elsmo, Elizabeth Craven, Mark |
author_facet | Bollig, Nathan Clarke, Lorelei Elsmo, Elizabeth Craven, Mark |
author_sort | Bollig, Nathan |
collection | PubMed |
description | The use of natural language data for animal population surveillance represents a valuable opportunity to gather information about potential disease outbreaks, emerging zoonotic diseases, or bioterrorism threats. In this study, we evaluate machine learning methods for conducting syndromic surveillance using free-text veterinary necropsy reports. We train a system to detect if a necropsy report from the Wisconsin Veterinary Diagnostic Laboratory contains evidence of gastrointestinal, respiratory, or urinary pathology. We evaluate the performance of several machine learning algorithms including deep learning with a long short-term memory network. Although no single algorithm was superior, random forest using feature vectors of TF-IDF statistics ranked among the top-performing models with F1 scores of 0.923 (gastrointestinal), 0.960 (respiratory), and 0.888 (urinary). This model was applied to over 33,000 necropsy reports and was used to describe temporal and spatial features of diseases within a 14-year period, exposing epidemiological trends and detecting a potential focus of gastrointestinal disease from a single submitting producer in the fall of 2016. |
format | Online Article Text |
id | pubmed-7001958 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-70019582020-02-18 Machine learning for syndromic surveillance using veterinary necropsy reports Bollig, Nathan Clarke, Lorelei Elsmo, Elizabeth Craven, Mark PLoS One Research Article The use of natural language data for animal population surveillance represents a valuable opportunity to gather information about potential disease outbreaks, emerging zoonotic diseases, or bioterrorism threats. In this study, we evaluate machine learning methods for conducting syndromic surveillance using free-text veterinary necropsy reports. We train a system to detect if a necropsy report from the Wisconsin Veterinary Diagnostic Laboratory contains evidence of gastrointestinal, respiratory, or urinary pathology. We evaluate the performance of several machine learning algorithms including deep learning with a long short-term memory network. Although no single algorithm was superior, random forest using feature vectors of TF-IDF statistics ranked among the top-performing models with F1 scores of 0.923 (gastrointestinal), 0.960 (respiratory), and 0.888 (urinary). This model was applied to over 33,000 necropsy reports and was used to describe temporal and spatial features of diseases within a 14-year period, exposing epidemiological trends and detecting a potential focus of gastrointestinal disease from a single submitting producer in the fall of 2016. Public Library of Science 2020-02-05 /pmc/articles/PMC7001958/ /pubmed/32023271 http://dx.doi.org/10.1371/journal.pone.0228105 Text en © 2020 Bollig et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Bollig, Nathan Clarke, Lorelei Elsmo, Elizabeth Craven, Mark Machine learning for syndromic surveillance using veterinary necropsy reports |
title | Machine learning for syndromic surveillance using veterinary necropsy reports |
title_full | Machine learning for syndromic surveillance using veterinary necropsy reports |
title_fullStr | Machine learning for syndromic surveillance using veterinary necropsy reports |
title_full_unstemmed | Machine learning for syndromic surveillance using veterinary necropsy reports |
title_short | Machine learning for syndromic surveillance using veterinary necropsy reports |
title_sort | machine learning for syndromic surveillance using veterinary necropsy reports |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7001958/ https://www.ncbi.nlm.nih.gov/pubmed/32023271 http://dx.doi.org/10.1371/journal.pone.0228105 |
work_keys_str_mv | AT bollignathan machinelearningforsyndromicsurveillanceusingveterinarynecropsyreports AT clarkelorelei machinelearningforsyndromicsurveillanceusingveterinarynecropsyreports AT elsmoelizabeth machinelearningforsyndromicsurveillanceusingveterinarynecropsyreports AT cravenmark machinelearningforsyndromicsurveillanceusingveterinarynecropsyreports |