Cargando…
Identifying Sentiment of Hookah-Related Posts on Twitter
BACKGROUND: The increasing popularity of hookah (or waterpipe) use in the United States and elsewhere has consequences for public health because it has similar health risks to that of combustible cigarettes. While hookah use rapidly increases in popularity, social media data (Twitter, Instagram) can...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
JMIR Publications
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5667930/ https://www.ncbi.nlm.nih.gov/pubmed/29046267 http://dx.doi.org/10.2196/publichealth.8133 |
_version_ | 1783275579129200640 |
---|---|
author | Allem, Jon-Patrick Ramanujam, Jagannathan Lerman, Kristina Chu, Kar-Hai Boley Cruz, Tess Unger, Jennifer B |
author_facet | Allem, Jon-Patrick Ramanujam, Jagannathan Lerman, Kristina Chu, Kar-Hai Boley Cruz, Tess Unger, Jennifer B |
author_sort | Allem, Jon-Patrick |
collection | PubMed |
description | BACKGROUND: The increasing popularity of hookah (or waterpipe) use in the United States and elsewhere has consequences for public health because it has similar health risks to that of combustible cigarettes. While hookah use rapidly increases in popularity, social media data (Twitter, Instagram) can be used to capture and describe the social and environmental contexts in which individuals use, perceive, discuss, and are marketed this tobacco product. These data may allow people to organically report on their sentiment toward tobacco products like hookah unprimed by a researcher, without instrument bias, and at low costs. OBJECTIVE: This study describes the sentiment of hookah-related posts on Twitter and describes the importance of debiasing Twitter data when attempting to understand attitudes. METHODS: Hookah-related posts on Twitter (N=986,320) were collected from March 24, 2015, to December 2, 2016. Machine learning models were used to describe sentiment on 20 different emotions and to debias the data so that Twitter posts reflected sentiment of legitimate human users and not of social bots or marketing-oriented accounts that would possibly provide overly positive or overly negative sentiment of hookah. RESULTS: From the analytical sample, 352,116 tweets (59.50%) were classified as positive while 177,537 (30.00%) were classified as negative, and 62,139 (10.50%) neutral. Among all positive tweets, 218,312 (62.00%) were classified as highly positive emotions (eg, active, alert, excited, elated, happy, and pleasant), while 133,804 (38.00%) positive tweets were classified as passive positive emotions (eg, contented, serene, calm, relaxed, and subdued). Among all negative tweets, 95,870 (54.00%) were classified as subdued negative emotions (eg, sad, unhappy, depressed, and bored) while the remaining 81,667 (46.00%) negative tweets were classified as highly negative emotions (eg, tense, nervous, stressed, upset, and unpleasant). Sentiment changed drastically when comparing a corpus of tweets with social bots to one without. For example, the probability of any one tweet reflecting joy was 61.30% from the debiased (or bot free) corpus of tweets. In contrast, the probability of any one tweet reflecting joy was 16.40% from the biased corpus. CONCLUSIONS: Social media data provide researchers the ability to understand public sentiment and attitudes by listening to what people are saying in their own words. Tobacco control programmers in charge of risk communication may consider targeting individuals posting positive messages about hookah on Twitter or designing messages that amplify the negative sentiments. Posts on Twitter communicating positive sentiment toward hookah could add to the normalization of hookah use and is an area of future research. Findings from this study demonstrated the importance of debiasing data when attempting to understand attitudes from Twitter data. |
format | Online Article Text |
id | pubmed-5667930 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | JMIR Publications |
record_format | MEDLINE/PubMed |
spelling | pubmed-56679302017-11-30 Identifying Sentiment of Hookah-Related Posts on Twitter Allem, Jon-Patrick Ramanujam, Jagannathan Lerman, Kristina Chu, Kar-Hai Boley Cruz, Tess Unger, Jennifer B JMIR Public Health Surveill Original Paper BACKGROUND: The increasing popularity of hookah (or waterpipe) use in the United States and elsewhere has consequences for public health because it has similar health risks to that of combustible cigarettes. While hookah use rapidly increases in popularity, social media data (Twitter, Instagram) can be used to capture and describe the social and environmental contexts in which individuals use, perceive, discuss, and are marketed this tobacco product. These data may allow people to organically report on their sentiment toward tobacco products like hookah unprimed by a researcher, without instrument bias, and at low costs. OBJECTIVE: This study describes the sentiment of hookah-related posts on Twitter and describes the importance of debiasing Twitter data when attempting to understand attitudes. METHODS: Hookah-related posts on Twitter (N=986,320) were collected from March 24, 2015, to December 2, 2016. Machine learning models were used to describe sentiment on 20 different emotions and to debias the data so that Twitter posts reflected sentiment of legitimate human users and not of social bots or marketing-oriented accounts that would possibly provide overly positive or overly negative sentiment of hookah. RESULTS: From the analytical sample, 352,116 tweets (59.50%) were classified as positive while 177,537 (30.00%) were classified as negative, and 62,139 (10.50%) neutral. Among all positive tweets, 218,312 (62.00%) were classified as highly positive emotions (eg, active, alert, excited, elated, happy, and pleasant), while 133,804 (38.00%) positive tweets were classified as passive positive emotions (eg, contented, serene, calm, relaxed, and subdued). Among all negative tweets, 95,870 (54.00%) were classified as subdued negative emotions (eg, sad, unhappy, depressed, and bored) while the remaining 81,667 (46.00%) negative tweets were classified as highly negative emotions (eg, tense, nervous, stressed, upset, and unpleasant). Sentiment changed drastically when comparing a corpus of tweets with social bots to one without. For example, the probability of any one tweet reflecting joy was 61.30% from the debiased (or bot free) corpus of tweets. In contrast, the probability of any one tweet reflecting joy was 16.40% from the biased corpus. CONCLUSIONS: Social media data provide researchers the ability to understand public sentiment and attitudes by listening to what people are saying in their own words. Tobacco control programmers in charge of risk communication may consider targeting individuals posting positive messages about hookah on Twitter or designing messages that amplify the negative sentiments. Posts on Twitter communicating positive sentiment toward hookah could add to the normalization of hookah use and is an area of future research. Findings from this study demonstrated the importance of debiasing data when attempting to understand attitudes from Twitter data. JMIR Publications 2017-10-18 /pmc/articles/PMC5667930/ /pubmed/29046267 http://dx.doi.org/10.2196/publichealth.8133 Text en ©Jon-Patrick Allem, Jagannathan Ramanujam, Kristina Lerman, Kar-Hai Chu, Tess Boley Cruz, Jennifer B Unger. Originally published in JMIR Public Health and Surveillance (http://publichealth.jmir.org), 18.10.2017. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Public Health and Surveillance, is properly cited. The complete bibliographic information, a link to the original publication on http://publichealth.jmir.org, as well as this copyright and license information must be included. |
spellingShingle | Original Paper Allem, Jon-Patrick Ramanujam, Jagannathan Lerman, Kristina Chu, Kar-Hai Boley Cruz, Tess Unger, Jennifer B Identifying Sentiment of Hookah-Related Posts on Twitter |
title | Identifying Sentiment of Hookah-Related Posts on Twitter |
title_full | Identifying Sentiment of Hookah-Related Posts on Twitter |
title_fullStr | Identifying Sentiment of Hookah-Related Posts on Twitter |
title_full_unstemmed | Identifying Sentiment of Hookah-Related Posts on Twitter |
title_short | Identifying Sentiment of Hookah-Related Posts on Twitter |
title_sort | identifying sentiment of hookah-related posts on twitter |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5667930/ https://www.ncbi.nlm.nih.gov/pubmed/29046267 http://dx.doi.org/10.2196/publichealth.8133 |
work_keys_str_mv | AT allemjonpatrick identifyingsentimentofhookahrelatedpostsontwitter AT ramanujamjagannathan identifyingsentimentofhookahrelatedpostsontwitter AT lermankristina identifyingsentimentofhookahrelatedpostsontwitter AT chukarhai identifyingsentimentofhookahrelatedpostsontwitter AT boleycruztess identifyingsentimentofhookahrelatedpostsontwitter AT ungerjenniferb identifyingsentimentofhookahrelatedpostsontwitter |