Cargando…

Twitter sentiment classification for measuring public health concerns

An important task of public health officials is to keep track of health issues, such as spreading epidemics. In this paper, we are addressing the issue of spreading public concern about epidemics. Public concern about a communicable disease can be seen as a problem of its own. Keeping track of trend...

Descripción completa

Detalles Bibliográficos
Autores principales:	Ji, Xiang, Chun, Soon Ae, Wei, Zhi, Geller, James
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Springer Vienna 2015
Materias:	Original Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7096866/ https://www.ncbi.nlm.nih.gov/pubmed/32226558 http://dx.doi.org/10.1007/s13278-015-0253-5

_version_	1783510920120500224
author	Ji, Xiang Chun, Soon Ae Wei, Zhi Geller, James
author_facet	Ji, Xiang Chun, Soon Ae Wei, Zhi Geller, James
author_sort	Ji, Xiang
collection	PubMed
description	An important task of public health officials is to keep track of health issues, such as spreading epidemics. In this paper, we are addressing the issue of spreading public concern about epidemics. Public concern about a communicable disease can be seen as a problem of its own. Keeping track of trends in concern about public health and identifying peaks of public concern are therefore crucial tasks. However, monitoring public health concerns is not only expensive with traditional surveillance systems, but also suffers from limited coverage and significant delays. To address these problems, we are using Twitter messages, which are available free of cost, are generated world-wide, and are posted in real time. We are measuring public concern using a two-step sentiment classification approach. In the first step, we distinguish Personal tweets from News (i.e., Non-Personal) tweets. In the second step, we further separate Personal Negative from Personal Non-Negative tweets. Both these steps consist themselves of two sub-steps. In the first sub-step (of both steps), our programs automatically generate training data using an emotion-oriented, clue-based method. In the second sub-step, we are training and testing three different Machine Learning (ML) models with the training data from the first sub-step; this allows us to determine the best ML model for different datasets. Furthermore, we are testing the already trained ML models with a human annotated, disjoint dataset. Based on the number of tweets classified as Personal Negative, we compute a Measure of Concern (MOC) and a timeline of the MOC. We attempt to correlate peaks of the MOC timeline to peaks of the News (Non-Personal) timeline. Our best accuracy results are achieved using the two-step method with a Naïve Bayes classifier for the Epidemic domain (six datasets) and the Mental Health domain (three datasets).
format	Online Article Text
id	pubmed-7096866
institution	National Center for Biotechnology Information
language	English
publishDate	2015
publisher	Springer Vienna
record_format	MEDLINE/PubMed
spelling	pubmed-70968662020-03-26 Twitter sentiment classification for measuring public health concerns Ji, Xiang Chun, Soon Ae Wei, Zhi Geller, James Soc Netw Anal Min Original Article An important task of public health officials is to keep track of health issues, such as spreading epidemics. In this paper, we are addressing the issue of spreading public concern about epidemics. Public concern about a communicable disease can be seen as a problem of its own. Keeping track of trends in concern about public health and identifying peaks of public concern are therefore crucial tasks. However, monitoring public health concerns is not only expensive with traditional surveillance systems, but also suffers from limited coverage and significant delays. To address these problems, we are using Twitter messages, which are available free of cost, are generated world-wide, and are posted in real time. We are measuring public concern using a two-step sentiment classification approach. In the first step, we distinguish Personal tweets from News (i.e., Non-Personal) tweets. In the second step, we further separate Personal Negative from Personal Non-Negative tweets. Both these steps consist themselves of two sub-steps. In the first sub-step (of both steps), our programs automatically generate training data using an emotion-oriented, clue-based method. In the second sub-step, we are training and testing three different Machine Learning (ML) models with the training data from the first sub-step; this allows us to determine the best ML model for different datasets. Furthermore, we are testing the already trained ML models with a human annotated, disjoint dataset. Based on the number of tweets classified as Personal Negative, we compute a Measure of Concern (MOC) and a timeline of the MOC. We attempt to correlate peaks of the MOC timeline to peaks of the News (Non-Personal) timeline. Our best accuracy results are achieved using the two-step method with a Naïve Bayes classifier for the Epidemic domain (six datasets) and the Mental Health domain (three datasets). Springer Vienna 2015-05-12 2015 /pmc/articles/PMC7096866/ /pubmed/32226558 http://dx.doi.org/10.1007/s13278-015-0253-5 Text en © Springer-Verlag Wien 2015 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle	Original Article Ji, Xiang Chun, Soon Ae Wei, Zhi Geller, James Twitter sentiment classification for measuring public health concerns
title	Twitter sentiment classification for measuring public health concerns
title_full	Twitter sentiment classification for measuring public health concerns
title_fullStr	Twitter sentiment classification for measuring public health concerns
title_full_unstemmed	Twitter sentiment classification for measuring public health concerns
title_short	Twitter sentiment classification for measuring public health concerns
title_sort	twitter sentiment classification for measuring public health concerns
topic	Original Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7096866/ https://www.ncbi.nlm.nih.gov/pubmed/32226558 http://dx.doi.org/10.1007/s13278-015-0253-5
work_keys_str_mv	AT jixiang twittersentimentclassificationformeasuringpublichealthconcerns AT chunsoonae twittersentimentclassificationformeasuringpublichealthconcerns AT weizhi twittersentimentclassificationformeasuringpublichealthconcerns AT gellerjames twittersentimentclassificationformeasuringpublichealthconcerns

Twitter sentiment classification for measuring public health concerns

Ejemplares similares