Cargando…

Identifying Topics for E-Cigarette User-Generated Contents: A Case Study From Multiple Social Media Platforms

BACKGROUND: Electronic cigarette (e-cigarette) is an emerging product with a rapid-growth market in recent years. Social media has become an important platform for information seeking and sharing. We aim to mine hidden topics from e-cigarette datasets collected from different social media platforms....

Descripción completa

Detalles Bibliográficos
Autores principales: Zhan, Yongcheng, Liu, Ruoran, Li, Qiudan, Leischow, Scott James, Zeng, Daniel Dajun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5291865/
https://www.ncbi.nlm.nih.gov/pubmed/28108428
http://dx.doi.org/10.2196/jmir.5780
_version_ 1782504840485666816
author Zhan, Yongcheng
Liu, Ruoran
Li, Qiudan
Leischow, Scott James
Zeng, Daniel Dajun
author_facet Zhan, Yongcheng
Liu, Ruoran
Li, Qiudan
Leischow, Scott James
Zeng, Daniel Dajun
author_sort Zhan, Yongcheng
collection PubMed
description BACKGROUND: Electronic cigarette (e-cigarette) is an emerging product with a rapid-growth market in recent years. Social media has become an important platform for information seeking and sharing. We aim to mine hidden topics from e-cigarette datasets collected from different social media platforms. OBJECTIVE: This paper aims to gain a systematic understanding of the characteristics of various types of social media, which will provide deep insights into how consumers and policy makers effectively use social media to track e-cigarette-related content and adjust their decisions and policies. METHODS: We collected data from Reddit (27,638 e-cigarette flavor-related posts from January 1, 2011, to June 30, 2015), JuiceDB (14,433 e-juice reviews from June 26, 2013 to November 12, 2015), and Twitter (13,356 “e-cig ban”-related tweets from January, 1, 2010 to June 30, 2015). Latent Dirichlet Allocation, a generative model for topic modeling, was used to analyze the topics from these data. RESULTS: We found four types of topics across the platforms: (1) promotions, (2) flavor discussions, (3) experience sharing, and (4) regulation debates. Promotions included sales from vendors to users, as well as trades among users. A total of 10.72% (2,962/27,638) of the posts from Reddit were related to trading. Promotion links were found between social media platforms. Most of the links (87.30%) in JuiceDB were related to Reddit posts. JuiceDB and Reddit identified consistent flavor categories. E-cigarette vaping methods and features such as steeping, throat hit, and vapor production were broadly discussed both on Reddit and on JuiceDB. Reddit provided space for policy discussions and majority of the posts (60.7%) holding a negative attitude toward regulations, whereas Twitter was used to launch campaigns using certain hashtags. Our findings are based on data across different platforms. The topic distribution between Reddit and JuiceDB was significantly different (P<.001), which indicated that the user discussions focused on different perspectives across the platforms. CONCLUSIONS: This study examined Reddit, JuiceDB, and Twitter as social media data sources for e-cigarette research. These mined findings could be further used by other researchers and policy makers. By utilizing the automatic topic-modeling method, the proposed unified feedback model could be a useful tool for policy makers to comprehensively consider how to collect valuable feedback from social media.
format Online
Article
Text
id pubmed-5291865
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-52918652017-02-15 Identifying Topics for E-Cigarette User-Generated Contents: A Case Study From Multiple Social Media Platforms Zhan, Yongcheng Liu, Ruoran Li, Qiudan Leischow, Scott James Zeng, Daniel Dajun J Med Internet Res Original Paper BACKGROUND: Electronic cigarette (e-cigarette) is an emerging product with a rapid-growth market in recent years. Social media has become an important platform for information seeking and sharing. We aim to mine hidden topics from e-cigarette datasets collected from different social media platforms. OBJECTIVE: This paper aims to gain a systematic understanding of the characteristics of various types of social media, which will provide deep insights into how consumers and policy makers effectively use social media to track e-cigarette-related content and adjust their decisions and policies. METHODS: We collected data from Reddit (27,638 e-cigarette flavor-related posts from January 1, 2011, to June 30, 2015), JuiceDB (14,433 e-juice reviews from June 26, 2013 to November 12, 2015), and Twitter (13,356 “e-cig ban”-related tweets from January, 1, 2010 to June 30, 2015). Latent Dirichlet Allocation, a generative model for topic modeling, was used to analyze the topics from these data. RESULTS: We found four types of topics across the platforms: (1) promotions, (2) flavor discussions, (3) experience sharing, and (4) regulation debates. Promotions included sales from vendors to users, as well as trades among users. A total of 10.72% (2,962/27,638) of the posts from Reddit were related to trading. Promotion links were found between social media platforms. Most of the links (87.30%) in JuiceDB were related to Reddit posts. JuiceDB and Reddit identified consistent flavor categories. E-cigarette vaping methods and features such as steeping, throat hit, and vapor production were broadly discussed both on Reddit and on JuiceDB. Reddit provided space for policy discussions and majority of the posts (60.7%) holding a negative attitude toward regulations, whereas Twitter was used to launch campaigns using certain hashtags. Our findings are based on data across different platforms. The topic distribution between Reddit and JuiceDB was significantly different (P<.001), which indicated that the user discussions focused on different perspectives across the platforms. CONCLUSIONS: This study examined Reddit, JuiceDB, and Twitter as social media data sources for e-cigarette research. These mined findings could be further used by other researchers and policy makers. By utilizing the automatic topic-modeling method, the proposed unified feedback model could be a useful tool for policy makers to comprehensively consider how to collect valuable feedback from social media. JMIR Publications 2017-01-20 /pmc/articles/PMC5291865/ /pubmed/28108428 http://dx.doi.org/10.2196/jmir.5780 Text en ©Yongcheng Zhan, Ruoran Liu, Qiudan Li, Scott James Leischow, Daniel Dajun Zeng. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 20.01.2017. https://creativecommons.org/licenses/by/2.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0/ (https://creativecommons.org/licenses/by/2.0/) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on http://www.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Zhan, Yongcheng
Liu, Ruoran
Li, Qiudan
Leischow, Scott James
Zeng, Daniel Dajun
Identifying Topics for E-Cigarette User-Generated Contents: A Case Study From Multiple Social Media Platforms
title Identifying Topics for E-Cigarette User-Generated Contents: A Case Study From Multiple Social Media Platforms
title_full Identifying Topics for E-Cigarette User-Generated Contents: A Case Study From Multiple Social Media Platforms
title_fullStr Identifying Topics for E-Cigarette User-Generated Contents: A Case Study From Multiple Social Media Platforms
title_full_unstemmed Identifying Topics for E-Cigarette User-Generated Contents: A Case Study From Multiple Social Media Platforms
title_short Identifying Topics for E-Cigarette User-Generated Contents: A Case Study From Multiple Social Media Platforms
title_sort identifying topics for e-cigarette user-generated contents: a case study from multiple social media platforms
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5291865/
https://www.ncbi.nlm.nih.gov/pubmed/28108428
http://dx.doi.org/10.2196/jmir.5780
work_keys_str_mv AT zhanyongcheng identifyingtopicsforecigaretteusergeneratedcontentsacasestudyfrommultiplesocialmediaplatforms
AT liuruoran identifyingtopicsforecigaretteusergeneratedcontentsacasestudyfrommultiplesocialmediaplatforms
AT liqiudan identifyingtopicsforecigaretteusergeneratedcontentsacasestudyfrommultiplesocialmediaplatforms
AT leischowscottjames identifyingtopicsforecigaretteusergeneratedcontentsacasestudyfrommultiplesocialmediaplatforms
AT zengdanieldajun identifyingtopicsforecigaretteusergeneratedcontentsacasestudyfrommultiplesocialmediaplatforms