Cargando…

Identifying Key Target Audiences for Public Health Campaigns: Leveraging Machine Learning in the Case of Hookah Tobacco Smoking

BACKGROUND: Hookah tobacco smoking (HTS) is a particularly important issue for public health professionals to address owing to its prevalence and deleterious health effects. Social media sites can be a valuable tool for public health officials to conduct informational health campaigns. Current socia...

Descripción completa

Detalles Bibliográficos
Autores principales: Chu, Kar-Hai, Colditz, Jason, Malik, Momin, Yates, Tabitha, Primack, Brian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6643764/
https://www.ncbi.nlm.nih.gov/pubmed/31287063
http://dx.doi.org/10.2196/12443
_version_ 1783437160458747904
author Chu, Kar-Hai
Colditz, Jason
Malik, Momin
Yates, Tabitha
Primack, Brian
author_facet Chu, Kar-Hai
Colditz, Jason
Malik, Momin
Yates, Tabitha
Primack, Brian
author_sort Chu, Kar-Hai
collection PubMed
description BACKGROUND: Hookah tobacco smoking (HTS) is a particularly important issue for public health professionals to address owing to its prevalence and deleterious health effects. Social media sites can be a valuable tool for public health officials to conduct informational health campaigns. Current social media platforms provide researchers with opportunities to better identify and target specific audiences and even individuals. However, we are not aware of systematic research attempting to identify audiences with mixed or ambivalent views toward HTS. OBJECTIVE: The objective of this study was to (1) confirm previous research showing positively skewed HTS sentiment on Twitter using a larger dataset by leveraging machine learning techniques and (2) systematically identify individuals who exhibit mixed opinions about HTS via the Twitter platform and therefore represent key audiences for intervention. METHODS: We prospectively collected tweets related to HTS from January to June 2016. We double-coded sentiment for a subset of approximately 5000 randomly sampled tweets for sentiment toward HTS and used these data to train a machine learning classifier to assess the remaining approximately 556,000 HTS-related Twitter posts. Natural language processing software was used to extract linguistic features (ie, language-based covariates). The data were processed by machine learning tools and algorithms using R. Finally, we used the results to identify individuals who, because they had consistently posted both positive and negative content, might be ambivalent toward HTS and represent an ideal audience for intervention. RESULTS: There were 561,960 HTS-related tweets: 373,911 were classified as positive and 183,139 were classified as negative. A set of 12,861 users met a priori criteria indicating that they posted both positive and negative tweets about HTS. CONCLUSIONS: Sentiment analysis can allow researchers to identify audience segments on social media that demonstrate ambiguity toward key public health issues, such as HTS, and therefore represent ideal populations for intervention. Using large social media datasets can help public health officials to preemptively identify specific audience segments that would be most receptive to targeted campaigns.
format Online
Article
Text
id pubmed-6643764
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-66437642019-07-30 Identifying Key Target Audiences for Public Health Campaigns: Leveraging Machine Learning in the Case of Hookah Tobacco Smoking Chu, Kar-Hai Colditz, Jason Malik, Momin Yates, Tabitha Primack, Brian J Med Internet Res Original Paper BACKGROUND: Hookah tobacco smoking (HTS) is a particularly important issue for public health professionals to address owing to its prevalence and deleterious health effects. Social media sites can be a valuable tool for public health officials to conduct informational health campaigns. Current social media platforms provide researchers with opportunities to better identify and target specific audiences and even individuals. However, we are not aware of systematic research attempting to identify audiences with mixed or ambivalent views toward HTS. OBJECTIVE: The objective of this study was to (1) confirm previous research showing positively skewed HTS sentiment on Twitter using a larger dataset by leveraging machine learning techniques and (2) systematically identify individuals who exhibit mixed opinions about HTS via the Twitter platform and therefore represent key audiences for intervention. METHODS: We prospectively collected tweets related to HTS from January to June 2016. We double-coded sentiment for a subset of approximately 5000 randomly sampled tweets for sentiment toward HTS and used these data to train a machine learning classifier to assess the remaining approximately 556,000 HTS-related Twitter posts. Natural language processing software was used to extract linguistic features (ie, language-based covariates). The data were processed by machine learning tools and algorithms using R. Finally, we used the results to identify individuals who, because they had consistently posted both positive and negative content, might be ambivalent toward HTS and represent an ideal audience for intervention. RESULTS: There were 561,960 HTS-related tweets: 373,911 were classified as positive and 183,139 were classified as negative. A set of 12,861 users met a priori criteria indicating that they posted both positive and negative tweets about HTS. CONCLUSIONS: Sentiment analysis can allow researchers to identify audience segments on social media that demonstrate ambiguity toward key public health issues, such as HTS, and therefore represent ideal populations for intervention. Using large social media datasets can help public health officials to preemptively identify specific audience segments that would be most receptive to targeted campaigns. JMIR Publications 2019-07-08 /pmc/articles/PMC6643764/ /pubmed/31287063 http://dx.doi.org/10.2196/12443 Text en ©Kar-Hai Chu, Jason Colditz, Momin Malik, Tabitha Yates, Brian Primack. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 08.07.2019. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on http://www.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Chu, Kar-Hai
Colditz, Jason
Malik, Momin
Yates, Tabitha
Primack, Brian
Identifying Key Target Audiences for Public Health Campaigns: Leveraging Machine Learning in the Case of Hookah Tobacco Smoking
title Identifying Key Target Audiences for Public Health Campaigns: Leveraging Machine Learning in the Case of Hookah Tobacco Smoking
title_full Identifying Key Target Audiences for Public Health Campaigns: Leveraging Machine Learning in the Case of Hookah Tobacco Smoking
title_fullStr Identifying Key Target Audiences for Public Health Campaigns: Leveraging Machine Learning in the Case of Hookah Tobacco Smoking
title_full_unstemmed Identifying Key Target Audiences for Public Health Campaigns: Leveraging Machine Learning in the Case of Hookah Tobacco Smoking
title_short Identifying Key Target Audiences for Public Health Campaigns: Leveraging Machine Learning in the Case of Hookah Tobacco Smoking
title_sort identifying key target audiences for public health campaigns: leveraging machine learning in the case of hookah tobacco smoking
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6643764/
https://www.ncbi.nlm.nih.gov/pubmed/31287063
http://dx.doi.org/10.2196/12443
work_keys_str_mv AT chukarhai identifyingkeytargetaudiencesforpublichealthcampaignsleveragingmachinelearninginthecaseofhookahtobaccosmoking
AT colditzjason identifyingkeytargetaudiencesforpublichealthcampaignsleveragingmachinelearninginthecaseofhookahtobaccosmoking
AT malikmomin identifyingkeytargetaudiencesforpublichealthcampaignsleveragingmachinelearninginthecaseofhookahtobaccosmoking
AT yatestabitha identifyingkeytargetaudiencesforpublichealthcampaignsleveragingmachinelearninginthecaseofhookahtobaccosmoking
AT primackbrian identifyingkeytargetaudiencesforpublichealthcampaignsleveragingmachinelearninginthecaseofhookahtobaccosmoking