Cargando…
Identifying Key Target Audiences for Public Health Campaigns: Leveraging Machine Learning in the Case of Hookah Tobacco Smoking
BACKGROUND: Hookah tobacco smoking (HTS) is a particularly important issue for public health professionals to address owing to its prevalence and deleterious health effects. Social media sites can be a valuable tool for public health officials to conduct informational health campaigns. Current socia...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
JMIR Publications
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6643764/ https://www.ncbi.nlm.nih.gov/pubmed/31287063 http://dx.doi.org/10.2196/12443 |
_version_ | 1783437160458747904 |
---|---|
author | Chu, Kar-Hai Colditz, Jason Malik, Momin Yates, Tabitha Primack, Brian |
author_facet | Chu, Kar-Hai Colditz, Jason Malik, Momin Yates, Tabitha Primack, Brian |
author_sort | Chu, Kar-Hai |
collection | PubMed |
description | BACKGROUND: Hookah tobacco smoking (HTS) is a particularly important issue for public health professionals to address owing to its prevalence and deleterious health effects. Social media sites can be a valuable tool for public health officials to conduct informational health campaigns. Current social media platforms provide researchers with opportunities to better identify and target specific audiences and even individuals. However, we are not aware of systematic research attempting to identify audiences with mixed or ambivalent views toward HTS. OBJECTIVE: The objective of this study was to (1) confirm previous research showing positively skewed HTS sentiment on Twitter using a larger dataset by leveraging machine learning techniques and (2) systematically identify individuals who exhibit mixed opinions about HTS via the Twitter platform and therefore represent key audiences for intervention. METHODS: We prospectively collected tweets related to HTS from January to June 2016. We double-coded sentiment for a subset of approximately 5000 randomly sampled tweets for sentiment toward HTS and used these data to train a machine learning classifier to assess the remaining approximately 556,000 HTS-related Twitter posts. Natural language processing software was used to extract linguistic features (ie, language-based covariates). The data were processed by machine learning tools and algorithms using R. Finally, we used the results to identify individuals who, because they had consistently posted both positive and negative content, might be ambivalent toward HTS and represent an ideal audience for intervention. RESULTS: There were 561,960 HTS-related tweets: 373,911 were classified as positive and 183,139 were classified as negative. A set of 12,861 users met a priori criteria indicating that they posted both positive and negative tweets about HTS. CONCLUSIONS: Sentiment analysis can allow researchers to identify audience segments on social media that demonstrate ambiguity toward key public health issues, such as HTS, and therefore represent ideal populations for intervention. Using large social media datasets can help public health officials to preemptively identify specific audience segments that would be most receptive to targeted campaigns. |
format | Online Article Text |
id | pubmed-6643764 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | JMIR Publications |
record_format | MEDLINE/PubMed |
spelling | pubmed-66437642019-07-30 Identifying Key Target Audiences for Public Health Campaigns: Leveraging Machine Learning in the Case of Hookah Tobacco Smoking Chu, Kar-Hai Colditz, Jason Malik, Momin Yates, Tabitha Primack, Brian J Med Internet Res Original Paper BACKGROUND: Hookah tobacco smoking (HTS) is a particularly important issue for public health professionals to address owing to its prevalence and deleterious health effects. Social media sites can be a valuable tool for public health officials to conduct informational health campaigns. Current social media platforms provide researchers with opportunities to better identify and target specific audiences and even individuals. However, we are not aware of systematic research attempting to identify audiences with mixed or ambivalent views toward HTS. OBJECTIVE: The objective of this study was to (1) confirm previous research showing positively skewed HTS sentiment on Twitter using a larger dataset by leveraging machine learning techniques and (2) systematically identify individuals who exhibit mixed opinions about HTS via the Twitter platform and therefore represent key audiences for intervention. METHODS: We prospectively collected tweets related to HTS from January to June 2016. We double-coded sentiment for a subset of approximately 5000 randomly sampled tweets for sentiment toward HTS and used these data to train a machine learning classifier to assess the remaining approximately 556,000 HTS-related Twitter posts. Natural language processing software was used to extract linguistic features (ie, language-based covariates). The data were processed by machine learning tools and algorithms using R. Finally, we used the results to identify individuals who, because they had consistently posted both positive and negative content, might be ambivalent toward HTS and represent an ideal audience for intervention. RESULTS: There were 561,960 HTS-related tweets: 373,911 were classified as positive and 183,139 were classified as negative. A set of 12,861 users met a priori criteria indicating that they posted both positive and negative tweets about HTS. CONCLUSIONS: Sentiment analysis can allow researchers to identify audience segments on social media that demonstrate ambiguity toward key public health issues, such as HTS, and therefore represent ideal populations for intervention. Using large social media datasets can help public health officials to preemptively identify specific audience segments that would be most receptive to targeted campaigns. JMIR Publications 2019-07-08 /pmc/articles/PMC6643764/ /pubmed/31287063 http://dx.doi.org/10.2196/12443 Text en ©Kar-Hai Chu, Jason Colditz, Momin Malik, Tabitha Yates, Brian Primack. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 08.07.2019. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on http://www.jmir.org/, as well as this copyright and license information must be included. |
spellingShingle | Original Paper Chu, Kar-Hai Colditz, Jason Malik, Momin Yates, Tabitha Primack, Brian Identifying Key Target Audiences for Public Health Campaigns: Leveraging Machine Learning in the Case of Hookah Tobacco Smoking |
title | Identifying Key Target Audiences for Public Health Campaigns: Leveraging Machine Learning in the Case of Hookah Tobacco Smoking |
title_full | Identifying Key Target Audiences for Public Health Campaigns: Leveraging Machine Learning in the Case of Hookah Tobacco Smoking |
title_fullStr | Identifying Key Target Audiences for Public Health Campaigns: Leveraging Machine Learning in the Case of Hookah Tobacco Smoking |
title_full_unstemmed | Identifying Key Target Audiences for Public Health Campaigns: Leveraging Machine Learning in the Case of Hookah Tobacco Smoking |
title_short | Identifying Key Target Audiences for Public Health Campaigns: Leveraging Machine Learning in the Case of Hookah Tobacco Smoking |
title_sort | identifying key target audiences for public health campaigns: leveraging machine learning in the case of hookah tobacco smoking |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6643764/ https://www.ncbi.nlm.nih.gov/pubmed/31287063 http://dx.doi.org/10.2196/12443 |
work_keys_str_mv | AT chukarhai identifyingkeytargetaudiencesforpublichealthcampaignsleveragingmachinelearninginthecaseofhookahtobaccosmoking AT colditzjason identifyingkeytargetaudiencesforpublichealthcampaignsleveragingmachinelearninginthecaseofhookahtobaccosmoking AT malikmomin identifyingkeytargetaudiencesforpublichealthcampaignsleveragingmachinelearninginthecaseofhookahtobaccosmoking AT yatestabitha identifyingkeytargetaudiencesforpublichealthcampaignsleveragingmachinelearninginthecaseofhookahtobaccosmoking AT primackbrian identifyingkeytargetaudiencesforpublichealthcampaignsleveragingmachinelearninginthecaseofhookahtobaccosmoking |