Cargando…

Sentiment Analysis in Health and Well-Being: Systematic Review

BACKGROUND: Sentiment analysis (SA) is a subfield of natural language processing whose aim is to automatically classify the sentiment expressed in a free text. It has found practical applications across a wide range of societal contexts including marketing, economy, and politics. This review focuses...

Descripción completa

Detalles Bibliográficos
Autores principales: Zunic, Anastazia, Corcoran, Padraig, Spasic, Irena
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7013658/
https://www.ncbi.nlm.nih.gov/pubmed/32012057
http://dx.doi.org/10.2196/16023
_version_ 1783496454388580352
author Zunic, Anastazia
Corcoran, Padraig
Spasic, Irena
author_facet Zunic, Anastazia
Corcoran, Padraig
Spasic, Irena
author_sort Zunic, Anastazia
collection PubMed
description BACKGROUND: Sentiment analysis (SA) is a subfield of natural language processing whose aim is to automatically classify the sentiment expressed in a free text. It has found practical applications across a wide range of societal contexts including marketing, economy, and politics. This review focuses specifically on applications related to health, which is defined as “a state of complete physical, mental, and social well-being and not merely the absence of disease or infirmity.” OBJECTIVE: This study aimed to establish the state of the art in SA related to health and well-being by conducting a systematic review of the recent literature. To capture the perspective of those individuals whose health and well-being are affected, we focused specifically on spontaneously generated content and not necessarily that of health care professionals. METHODS: Our methodology is based on the guidelines for performing systematic reviews. In January 2019, we used PubMed, a multifaceted interface, to perform a literature search against MEDLINE. We identified a total of 86 relevant studies and extracted data about the datasets analyzed, discourse topics, data creators, downstream applications, algorithms used, and their evaluation. RESULTS: The majority of data were collected from social networking and Web-based retailing platforms. The primary purpose of online conversations is to exchange information and provide social support online. These communities tend to form around health conditions with high severity and chronicity rates. Different treatments and services discussed include medications, vaccination, surgery, orthodontic services, individual physicians, and health care services in general. We identified 5 roles with respect to health and well-being among the authors of the types of spontaneously generated narratives considered in this review: a sufferer, an addict, a patient, a carer, and a suicide victim. Out of 86 studies considered, only 4 reported the demographic characteristics. A wide range of methods were used to perform SA. Most common choices included support vector machines, naïve Bayesian learning, decision trees, logistic regression, and adaptive boosting. In contrast with general trends in SA research, only 1 study used deep learning. The performance lags behind the state of the art achieved in other domains when measured by F-score, which was found to be below 60% on average. In the context of SA, the domain of health and well-being was found to be resource poor: few domain-specific corpora and lexica are shared publicly for research purposes. CONCLUSIONS: SA results in the area of health and well-being lag behind those in other domains. It is yet unclear if this is because of the intrinsic differences between the domains and their respective sublanguages, the size of training datasets, the lack of domain-specific sentiment lexica, or the choice of algorithms.
format Online
Article
Text
id pubmed-7013658
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-70136582020-03-05 Sentiment Analysis in Health and Well-Being: Systematic Review Zunic, Anastazia Corcoran, Padraig Spasic, Irena JMIR Med Inform Review BACKGROUND: Sentiment analysis (SA) is a subfield of natural language processing whose aim is to automatically classify the sentiment expressed in a free text. It has found practical applications across a wide range of societal contexts including marketing, economy, and politics. This review focuses specifically on applications related to health, which is defined as “a state of complete physical, mental, and social well-being and not merely the absence of disease or infirmity.” OBJECTIVE: This study aimed to establish the state of the art in SA related to health and well-being by conducting a systematic review of the recent literature. To capture the perspective of those individuals whose health and well-being are affected, we focused specifically on spontaneously generated content and not necessarily that of health care professionals. METHODS: Our methodology is based on the guidelines for performing systematic reviews. In January 2019, we used PubMed, a multifaceted interface, to perform a literature search against MEDLINE. We identified a total of 86 relevant studies and extracted data about the datasets analyzed, discourse topics, data creators, downstream applications, algorithms used, and their evaluation. RESULTS: The majority of data were collected from social networking and Web-based retailing platforms. The primary purpose of online conversations is to exchange information and provide social support online. These communities tend to form around health conditions with high severity and chronicity rates. Different treatments and services discussed include medications, vaccination, surgery, orthodontic services, individual physicians, and health care services in general. We identified 5 roles with respect to health and well-being among the authors of the types of spontaneously generated narratives considered in this review: a sufferer, an addict, a patient, a carer, and a suicide victim. Out of 86 studies considered, only 4 reported the demographic characteristics. A wide range of methods were used to perform SA. Most common choices included support vector machines, naïve Bayesian learning, decision trees, logistic regression, and adaptive boosting. In contrast with general trends in SA research, only 1 study used deep learning. The performance lags behind the state of the art achieved in other domains when measured by F-score, which was found to be below 60% on average. In the context of SA, the domain of health and well-being was found to be resource poor: few domain-specific corpora and lexica are shared publicly for research purposes. CONCLUSIONS: SA results in the area of health and well-being lag behind those in other domains. It is yet unclear if this is because of the intrinsic differences between the domains and their respective sublanguages, the size of training datasets, the lack of domain-specific sentiment lexica, or the choice of algorithms. JMIR Publications 2020-01-28 /pmc/articles/PMC7013658/ /pubmed/32012057 http://dx.doi.org/10.2196/16023 Text en ©Anastazia Zunic, Padraig Corcoran, Irena Spasic. Originally published in JMIR Medical Informatics (http://medinform.jmir.org), 28.01.2020. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Informatics, is properly cited. The complete bibliographic information, a link to the original publication on http://medinform.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Review
Zunic, Anastazia
Corcoran, Padraig
Spasic, Irena
Sentiment Analysis in Health and Well-Being: Systematic Review
title Sentiment Analysis in Health and Well-Being: Systematic Review
title_full Sentiment Analysis in Health and Well-Being: Systematic Review
title_fullStr Sentiment Analysis in Health and Well-Being: Systematic Review
title_full_unstemmed Sentiment Analysis in Health and Well-Being: Systematic Review
title_short Sentiment Analysis in Health and Well-Being: Systematic Review
title_sort sentiment analysis in health and well-being: systematic review
topic Review
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7013658/
https://www.ncbi.nlm.nih.gov/pubmed/32012057
http://dx.doi.org/10.2196/16023
work_keys_str_mv AT zunicanastazia sentimentanalysisinhealthandwellbeingsystematicreview
AT corcoranpadraig sentimentanalysisinhealthandwellbeingsystematicreview
AT spasicirena sentimentanalysisinhealthandwellbeingsystematicreview