Cargando…

E-Cigarette Surveillance With Social Media Data: Social Bots, Emerging Topics, and Trends

BACKGROUND: As e-cigarette use rapidly increases in popularity, data from online social systems (Twitter, Instagram, Google Web Search) can be used to capture and describe the social and environmental context in which individuals use, perceive, and are marketed this tobacco product. Social media dat...

Descripción completa

Detalles Bibliográficos
Autores principales: Allem, Jon-Patrick, Ferrara, Emilio, Uppu, Sree Priyanka, Cruz, Tess Boley, Unger, Jennifer B
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5752967/
https://www.ncbi.nlm.nih.gov/pubmed/29263018
http://dx.doi.org/10.2196/publichealth.8641
_version_ 1783290178957213696
author Allem, Jon-Patrick
Ferrara, Emilio
Uppu, Sree Priyanka
Cruz, Tess Boley
Unger, Jennifer B
author_facet Allem, Jon-Patrick
Ferrara, Emilio
Uppu, Sree Priyanka
Cruz, Tess Boley
Unger, Jennifer B
author_sort Allem, Jon-Patrick
collection PubMed
description BACKGROUND: As e-cigarette use rapidly increases in popularity, data from online social systems (Twitter, Instagram, Google Web Search) can be used to capture and describe the social and environmental context in which individuals use, perceive, and are marketed this tobacco product. Social media data may serve as a massive focus group where people organically discuss e-cigarettes unprimed by a researcher, without instrument bias, captured in near real time and at low costs. OBJECTIVE: This study documents e-cigarette–related discussions on Twitter, describing themes of conversations and locations where Twitter users often discuss e-cigarettes, to identify priority areas for e-cigarette education campaigns. Additionally, this study demonstrates the importance of distinguishing between social bots and human users when attempting to understand public health–related behaviors and attitudes. METHODS: E-cigarette–related posts on Twitter (N=6,185,153) were collected from December 24, 2016, to April 21, 2017. Techniques drawn from network science were used to determine discussions of e-cigarettes by describing which hashtags co-occur (concept clusters) in a Twitter network. Posts and metadata were used to describe where geographically e-cigarette–related discussions in the United States occurred. Machine learning models were used to distinguish between Twitter posts reflecting attitudes and behaviors of genuine human users from those of social bots. Odds ratios were computed from 2x2 contingency tables to detect if hashtags varied by source (social bot vs human user) using the Fisher exact test to determine statistical significance. RESULTS: Clusters found in the corpus of hashtags from human users included behaviors (eg, #vaping), vaping identity (eg, #vapelife), and vaping community (eg, #vapenation). Additional clusters included products (eg, #eliquids), dual tobacco use (eg, #hookah), and polysubstance use (eg, #marijuana). Clusters found in the corpus of hashtags from social bots included health (eg, #health), smoking cessation (eg, #quitsmoking), and new products (eg, #ismog). Social bots were significantly more likely to post hashtags that referenced smoking cessation and new products compared to human users. The volume of tweets was highest in the Mid-Atlantic (eg, Pennsylvania, New Jersey, Maryland, and New York), followed by the West Coast and Southwest (eg, California, Arizona and Nevada). CONCLUSIONS: Social media data may be used to complement and extend the surveillance of health behaviors including tobacco product use. Public health researchers could harness these data and methods to identify new products or devices. Furthermore, findings from this study demonstrate the importance of distinguishing between Twitter posts from social bots and humans when attempting to understand attitudes and behaviors. Social bots may be used to perpetuate the idea that e-cigarettes are helpful in cessation and to promote new products as they enter the marketplace.
format Online
Article
Text
id pubmed-5752967
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-57529672018-01-08 E-Cigarette Surveillance With Social Media Data: Social Bots, Emerging Topics, and Trends Allem, Jon-Patrick Ferrara, Emilio Uppu, Sree Priyanka Cruz, Tess Boley Unger, Jennifer B JMIR Public Health Surveill Original Paper BACKGROUND: As e-cigarette use rapidly increases in popularity, data from online social systems (Twitter, Instagram, Google Web Search) can be used to capture and describe the social and environmental context in which individuals use, perceive, and are marketed this tobacco product. Social media data may serve as a massive focus group where people organically discuss e-cigarettes unprimed by a researcher, without instrument bias, captured in near real time and at low costs. OBJECTIVE: This study documents e-cigarette–related discussions on Twitter, describing themes of conversations and locations where Twitter users often discuss e-cigarettes, to identify priority areas for e-cigarette education campaigns. Additionally, this study demonstrates the importance of distinguishing between social bots and human users when attempting to understand public health–related behaviors and attitudes. METHODS: E-cigarette–related posts on Twitter (N=6,185,153) were collected from December 24, 2016, to April 21, 2017. Techniques drawn from network science were used to determine discussions of e-cigarettes by describing which hashtags co-occur (concept clusters) in a Twitter network. Posts and metadata were used to describe where geographically e-cigarette–related discussions in the United States occurred. Machine learning models were used to distinguish between Twitter posts reflecting attitudes and behaviors of genuine human users from those of social bots. Odds ratios were computed from 2x2 contingency tables to detect if hashtags varied by source (social bot vs human user) using the Fisher exact test to determine statistical significance. RESULTS: Clusters found in the corpus of hashtags from human users included behaviors (eg, #vaping), vaping identity (eg, #vapelife), and vaping community (eg, #vapenation). Additional clusters included products (eg, #eliquids), dual tobacco use (eg, #hookah), and polysubstance use (eg, #marijuana). Clusters found in the corpus of hashtags from social bots included health (eg, #health), smoking cessation (eg, #quitsmoking), and new products (eg, #ismog). Social bots were significantly more likely to post hashtags that referenced smoking cessation and new products compared to human users. The volume of tweets was highest in the Mid-Atlantic (eg, Pennsylvania, New Jersey, Maryland, and New York), followed by the West Coast and Southwest (eg, California, Arizona and Nevada). CONCLUSIONS: Social media data may be used to complement and extend the surveillance of health behaviors including tobacco product use. Public health researchers could harness these data and methods to identify new products or devices. Furthermore, findings from this study demonstrate the importance of distinguishing between Twitter posts from social bots and humans when attempting to understand attitudes and behaviors. Social bots may be used to perpetuate the idea that e-cigarettes are helpful in cessation and to promote new products as they enter the marketplace. JMIR Publications 2017-12-20 /pmc/articles/PMC5752967/ /pubmed/29263018 http://dx.doi.org/10.2196/publichealth.8641 Text en ©Jon-Patrick Allem, Emilio Ferrara, Sree Priyanka Uppu, Tess Boley Cruz, Jennifer B Unger. Originally published in JMIR Public Health and Surveillance (http://publichealth.jmir.org), 20.12.2017. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Public Health and Surveillance, is properly cited. The complete bibliographic information, a link to the original publication on http://publichealth.jmir.org, as well as this copyright and license information must be included.
spellingShingle Original Paper
Allem, Jon-Patrick
Ferrara, Emilio
Uppu, Sree Priyanka
Cruz, Tess Boley
Unger, Jennifer B
E-Cigarette Surveillance With Social Media Data: Social Bots, Emerging Topics, and Trends
title E-Cigarette Surveillance With Social Media Data: Social Bots, Emerging Topics, and Trends
title_full E-Cigarette Surveillance With Social Media Data: Social Bots, Emerging Topics, and Trends
title_fullStr E-Cigarette Surveillance With Social Media Data: Social Bots, Emerging Topics, and Trends
title_full_unstemmed E-Cigarette Surveillance With Social Media Data: Social Bots, Emerging Topics, and Trends
title_short E-Cigarette Surveillance With Social Media Data: Social Bots, Emerging Topics, and Trends
title_sort e-cigarette surveillance with social media data: social bots, emerging topics, and trends
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5752967/
https://www.ncbi.nlm.nih.gov/pubmed/29263018
http://dx.doi.org/10.2196/publichealth.8641
work_keys_str_mv AT allemjonpatrick ecigarettesurveillancewithsocialmediadatasocialbotsemergingtopicsandtrends
AT ferraraemilio ecigarettesurveillancewithsocialmediadatasocialbotsemergingtopicsandtrends
AT uppusreepriyanka ecigarettesurveillancewithsocialmediadatasocialbotsemergingtopicsandtrends
AT cruztessboley ecigarettesurveillancewithsocialmediadatasocialbotsemergingtopicsandtrends
AT ungerjenniferb ecigarettesurveillancewithsocialmediadatasocialbotsemergingtopicsandtrends