Cargando…

Neural attention with character embeddings for hay fever detection from twitter

The paper aims to leverage the highly unstructured user-generated content in the context of pollen allergy surveillance using neural networks with character embeddings and the attention mechanism. Currently, there is no accurate representation of hay fever prevalence, particularly in real-time scena...

Descripción completa

Detalles Bibliográficos
Autores principales: Du, Jiahua, Michalska, Sandra, Subramani, Sudha, Wang, Hua, Zhang, Yanchun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6790203/
https://www.ncbi.nlm.nih.gov/pubmed/31656594
http://dx.doi.org/10.1007/s13755-019-0084-2
_version_ 1783458754568650752
author Du, Jiahua
Michalska, Sandra
Subramani, Sudha
Wang, Hua
Zhang, Yanchun
author_facet Du, Jiahua
Michalska, Sandra
Subramani, Sudha
Wang, Hua
Zhang, Yanchun
author_sort Du, Jiahua
collection PubMed
description The paper aims to leverage the highly unstructured user-generated content in the context of pollen allergy surveillance using neural networks with character embeddings and the attention mechanism. Currently, there is no accurate representation of hay fever prevalence, particularly in real-time scenarios. Social media serves as an alternative to extract knowledge about the condition, which is valuable for allergy sufferers, general practitioners, and policy makers. Despite tremendous potential offered, conventional natural language processing methods prove limited when exposed to the challenging nature of user-generated content. As a result, the detection of actual hay fever instances among the number of false positives, as well as the correct identification of non-technical expressions as pollen allergy symptoms poses a major problem. We propose a deep architecture enhanced with character embeddings and neural attention to improve the performance of hay fever-related content classification from Twitter data. Improvement in prediction is achieved due to the character-level semantics introduced, which effectively addresses the out-of-vocabulary problem in our dataset where the rate is approximately 9%. Overall, the study is a step forward towards improved real-time pollen allergy surveillance from social media with state-of-art technology.
format Online
Article
Text
id pubmed-6790203
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-67902032019-10-25 Neural attention with character embeddings for hay fever detection from twitter Du, Jiahua Michalska, Sandra Subramani, Sudha Wang, Hua Zhang, Yanchun Health Inf Sci Syst Research The paper aims to leverage the highly unstructured user-generated content in the context of pollen allergy surveillance using neural networks with character embeddings and the attention mechanism. Currently, there is no accurate representation of hay fever prevalence, particularly in real-time scenarios. Social media serves as an alternative to extract knowledge about the condition, which is valuable for allergy sufferers, general practitioners, and policy makers. Despite tremendous potential offered, conventional natural language processing methods prove limited when exposed to the challenging nature of user-generated content. As a result, the detection of actual hay fever instances among the number of false positives, as well as the correct identification of non-technical expressions as pollen allergy symptoms poses a major problem. We propose a deep architecture enhanced with character embeddings and neural attention to improve the performance of hay fever-related content classification from Twitter data. Improvement in prediction is achieved due to the character-level semantics introduced, which effectively addresses the out-of-vocabulary problem in our dataset where the rate is approximately 9%. Overall, the study is a step forward towards improved real-time pollen allergy surveillance from social media with state-of-art technology. Springer International Publishing 2019-10-12 /pmc/articles/PMC6790203/ /pubmed/31656594 http://dx.doi.org/10.1007/s13755-019-0084-2 Text en © The Author(s) 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
spellingShingle Research
Du, Jiahua
Michalska, Sandra
Subramani, Sudha
Wang, Hua
Zhang, Yanchun
Neural attention with character embeddings for hay fever detection from twitter
title Neural attention with character embeddings for hay fever detection from twitter
title_full Neural attention with character embeddings for hay fever detection from twitter
title_fullStr Neural attention with character embeddings for hay fever detection from twitter
title_full_unstemmed Neural attention with character embeddings for hay fever detection from twitter
title_short Neural attention with character embeddings for hay fever detection from twitter
title_sort neural attention with character embeddings for hay fever detection from twitter
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6790203/
https://www.ncbi.nlm.nih.gov/pubmed/31656594
http://dx.doi.org/10.1007/s13755-019-0084-2
work_keys_str_mv AT dujiahua neuralattentionwithcharacterembeddingsforhayfeverdetectionfromtwitter
AT michalskasandra neuralattentionwithcharacterembeddingsforhayfeverdetectionfromtwitter
AT subramanisudha neuralattentionwithcharacterembeddingsforhayfeverdetectionfromtwitter
AT wanghua neuralattentionwithcharacterembeddingsforhayfeverdetectionfromtwitter
AT zhangyanchun neuralattentionwithcharacterembeddingsforhayfeverdetectionfromtwitter