Cargando…

Crowdsourcing and machine learning approaches for extracting entities indicating potential foodborne outbreaks from social media

Foodborne outbreaks are a serious but preventable threat to public health that often lead to illness, loss of life, significant economic loss, and the erosion of consumer confidence. Understanding how consumers respond when interacting with foods, as well as extracting information from posts on soci...

Descripción completa

Detalles Bibliográficos
Autores principales: Tao, Dandan, Zhang, Dongyu, Hu, Ruofan, Rundensteiner, Elke, Feng, Hao
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8568976/
https://www.ncbi.nlm.nih.gov/pubmed/34737325
http://dx.doi.org/10.1038/s41598-021-00766-w
_version_ 1784594548624719872
author Tao, Dandan
Zhang, Dongyu
Hu, Ruofan
Rundensteiner, Elke
Feng, Hao
author_facet Tao, Dandan
Zhang, Dongyu
Hu, Ruofan
Rundensteiner, Elke
Feng, Hao
author_sort Tao, Dandan
collection PubMed
description Foodborne outbreaks are a serious but preventable threat to public health that often lead to illness, loss of life, significant economic loss, and the erosion of consumer confidence. Understanding how consumers respond when interacting with foods, as well as extracting information from posts on social media may provide new means of reducing the risks and curtailing the outbreaks. In recent years, Twitter has been employed as a new tool for identifying unreported foodborne illnesses. However, there is a huge gap between the identification of sporadic illnesses and the early detection of a potential outbreak. In this work, the dual-task BERTweet model was developed to identify unreported foodborne illnesses and extract foodborne-illness-related entities from Twitter. Unlike previous methods, our model leveraged the mutually beneficial relationships between the two tasks. The results showed that the F1-score of relevance prediction was 0.87, and the F1-score of entity extraction was 0.61. Key elements such as time, location, and food detected from sentences indicating foodborne illnesses were used to analyze potential foodborne outbreaks in massive historical tweets. A case study on tweets indicating foodborne illnesses showed that the discovered trend is consistent with the true outbreaks that occurred during the same period.
format Online
Article
Text
id pubmed-8568976
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-85689762021-11-05 Crowdsourcing and machine learning approaches for extracting entities indicating potential foodborne outbreaks from social media Tao, Dandan Zhang, Dongyu Hu, Ruofan Rundensteiner, Elke Feng, Hao Sci Rep Article Foodborne outbreaks are a serious but preventable threat to public health that often lead to illness, loss of life, significant economic loss, and the erosion of consumer confidence. Understanding how consumers respond when interacting with foods, as well as extracting information from posts on social media may provide new means of reducing the risks and curtailing the outbreaks. In recent years, Twitter has been employed as a new tool for identifying unreported foodborne illnesses. However, there is a huge gap between the identification of sporadic illnesses and the early detection of a potential outbreak. In this work, the dual-task BERTweet model was developed to identify unreported foodborne illnesses and extract foodborne-illness-related entities from Twitter. Unlike previous methods, our model leveraged the mutually beneficial relationships between the two tasks. The results showed that the F1-score of relevance prediction was 0.87, and the F1-score of entity extraction was 0.61. Key elements such as time, location, and food detected from sentences indicating foodborne illnesses were used to analyze potential foodborne outbreaks in massive historical tweets. A case study on tweets indicating foodborne illnesses showed that the discovered trend is consistent with the true outbreaks that occurred during the same period. Nature Publishing Group UK 2021-11-04 /pmc/articles/PMC8568976/ /pubmed/34737325 http://dx.doi.org/10.1038/s41598-021-00766-w Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Tao, Dandan
Zhang, Dongyu
Hu, Ruofan
Rundensteiner, Elke
Feng, Hao
Crowdsourcing and machine learning approaches for extracting entities indicating potential foodborne outbreaks from social media
title Crowdsourcing and machine learning approaches for extracting entities indicating potential foodborne outbreaks from social media
title_full Crowdsourcing and machine learning approaches for extracting entities indicating potential foodborne outbreaks from social media
title_fullStr Crowdsourcing and machine learning approaches for extracting entities indicating potential foodborne outbreaks from social media
title_full_unstemmed Crowdsourcing and machine learning approaches for extracting entities indicating potential foodborne outbreaks from social media
title_short Crowdsourcing and machine learning approaches for extracting entities indicating potential foodborne outbreaks from social media
title_sort crowdsourcing and machine learning approaches for extracting entities indicating potential foodborne outbreaks from social media
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8568976/
https://www.ncbi.nlm.nih.gov/pubmed/34737325
http://dx.doi.org/10.1038/s41598-021-00766-w
work_keys_str_mv AT taodandan crowdsourcingandmachinelearningapproachesforextractingentitiesindicatingpotentialfoodborneoutbreaksfromsocialmedia
AT zhangdongyu crowdsourcingandmachinelearningapproachesforextractingentitiesindicatingpotentialfoodborneoutbreaksfromsocialmedia
AT huruofan crowdsourcingandmachinelearningapproachesforextractingentitiesindicatingpotentialfoodborneoutbreaksfromsocialmedia
AT rundensteinerelke crowdsourcingandmachinelearningapproachesforextractingentitiesindicatingpotentialfoodborneoutbreaksfromsocialmedia
AT fenghao crowdsourcingandmachinelearningapproachesforextractingentitiesindicatingpotentialfoodborneoutbreaksfromsocialmedia