Cargando…
Applying Multiple Data Collection Tools to Quantify Human Papillomavirus Vaccine Communication on Twitter
BACKGROUND: Human papillomavirus (HPV) is the most common sexually transmitted infection in the United States. There are several vaccines that protect against strains of HPV most associated with cervical and other cancers. Thus, HPV vaccination has become an important component of adolescent prevent...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
JMIR Publications
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5168526/ https://www.ncbi.nlm.nih.gov/pubmed/27919863 http://dx.doi.org/10.2196/jmir.6670 |
_version_ | 1782483361619509248 |
---|---|
author | Massey, Philip M Leader, Amy Yom-Tov, Elad Budenz, Alexandra Fisher, Kara Klassen, Ann C |
author_facet | Massey, Philip M Leader, Amy Yom-Tov, Elad Budenz, Alexandra Fisher, Kara Klassen, Ann C |
author_sort | Massey, Philip M |
collection | PubMed |
description | BACKGROUND: Human papillomavirus (HPV) is the most common sexually transmitted infection in the United States. There are several vaccines that protect against strains of HPV most associated with cervical and other cancers. Thus, HPV vaccination has become an important component of adolescent preventive health care. As media evolves, more information about HPV vaccination is shifting to social media platforms such as Twitter. Health information consumed on social media may be especially influential for segments of society such as younger populations, as well as ethnic and racial minorities. OBJECTIVE: The objectives of our study were to quantify HPV vaccine communication on Twitter, and to develop a novel methodology to improve the collection and analysis of Twitter data. METHODS: We collected Twitter data using 10 keywords related to HPV vaccination from August 1, 2014 to July 31, 2015. Prospective data collection used the Twitter Search API and retrospective data collection used Twitter Firehose. Using a codebook to characterize tweet sentiment and content, we coded a subsample of tweets by hand to develop classification models to code the entire sample using machine learning procedures. We also documented the words in the 140-character tweet text most associated with each keyword. We used chi-square tests, analysis of variance, and nonparametric equality of medians to test for significant differences in tweet characteristic by sentiment. RESULTS: A total of 193,379 English-language tweets were collected, classified, and analyzed. Associated words varied with each keyword, with more positive and preventive words associated with “HPV vaccine” and more negative words associated with name-brand vaccines. Positive sentiment was the largest type of sentiment in the sample, with 75,393 positive tweets (38.99% of the sample), followed by negative sentiment with 48,940 tweets (25.31% of the sample). Positive and neutral tweets constituted the largest percentage of tweets mentioning prevention or protection (20,425/75,393, 27.09% and 6477/25,110, 25.79%, respectively), compared with only 11.5% of negative tweets (5647/48,940; P<.001). Nearly one-half (22,726/48,940, 46.44%) of negative tweets mentioned side effects, compared with only 17.14% (12,921/75,393) of positive tweets and 15.08% of neutral tweets (3787/25,110; P<.001). CONCLUSIONS: Examining social media to detect health trends, as well as to communicate important health information, is a growing area of research in public health. Understanding the content and implications of conversations that form around HPV vaccination on social media can aid health organizations and health-focused Twitter users in creating a meaningful exchange of ideas and in having a significant impact on vaccine uptake. This area of research is inherently interdisciplinary, and this study supports this movement by applying public health, health communication, and data science approaches to extend methodologies across fields. |
format | Online Article Text |
id | pubmed-5168526 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | JMIR Publications |
record_format | MEDLINE/PubMed |
spelling | pubmed-51685262017-01-03 Applying Multiple Data Collection Tools to Quantify Human Papillomavirus Vaccine Communication on Twitter Massey, Philip M Leader, Amy Yom-Tov, Elad Budenz, Alexandra Fisher, Kara Klassen, Ann C J Med Internet Res Original Paper BACKGROUND: Human papillomavirus (HPV) is the most common sexually transmitted infection in the United States. There are several vaccines that protect against strains of HPV most associated with cervical and other cancers. Thus, HPV vaccination has become an important component of adolescent preventive health care. As media evolves, more information about HPV vaccination is shifting to social media platforms such as Twitter. Health information consumed on social media may be especially influential for segments of society such as younger populations, as well as ethnic and racial minorities. OBJECTIVE: The objectives of our study were to quantify HPV vaccine communication on Twitter, and to develop a novel methodology to improve the collection and analysis of Twitter data. METHODS: We collected Twitter data using 10 keywords related to HPV vaccination from August 1, 2014 to July 31, 2015. Prospective data collection used the Twitter Search API and retrospective data collection used Twitter Firehose. Using a codebook to characterize tweet sentiment and content, we coded a subsample of tweets by hand to develop classification models to code the entire sample using machine learning procedures. We also documented the words in the 140-character tweet text most associated with each keyword. We used chi-square tests, analysis of variance, and nonparametric equality of medians to test for significant differences in tweet characteristic by sentiment. RESULTS: A total of 193,379 English-language tweets were collected, classified, and analyzed. Associated words varied with each keyword, with more positive and preventive words associated with “HPV vaccine” and more negative words associated with name-brand vaccines. Positive sentiment was the largest type of sentiment in the sample, with 75,393 positive tweets (38.99% of the sample), followed by negative sentiment with 48,940 tweets (25.31% of the sample). Positive and neutral tweets constituted the largest percentage of tweets mentioning prevention or protection (20,425/75,393, 27.09% and 6477/25,110, 25.79%, respectively), compared with only 11.5% of negative tweets (5647/48,940; P<.001). Nearly one-half (22,726/48,940, 46.44%) of negative tweets mentioned side effects, compared with only 17.14% (12,921/75,393) of positive tweets and 15.08% of neutral tweets (3787/25,110; P<.001). CONCLUSIONS: Examining social media to detect health trends, as well as to communicate important health information, is a growing area of research in public health. Understanding the content and implications of conversations that form around HPV vaccination on social media can aid health organizations and health-focused Twitter users in creating a meaningful exchange of ideas and in having a significant impact on vaccine uptake. This area of research is inherently interdisciplinary, and this study supports this movement by applying public health, health communication, and data science approaches to extend methodologies across fields. JMIR Publications 2016-12-05 /pmc/articles/PMC5168526/ /pubmed/27919863 http://dx.doi.org/10.2196/jmir.6670 Text en ©Philip M Massey, Amy Leader, Elad Yom-Tov, Alexandra Budenz, Kara Fisher, Ann C Klassen. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 05.12.2016. https://creativecommons.org/licenses/by/2.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0/ (https://creativecommons.org/licenses/by/2.0/) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on http://www.jmir.org/, as well as this copyright and license information must be included. |
spellingShingle | Original Paper Massey, Philip M Leader, Amy Yom-Tov, Elad Budenz, Alexandra Fisher, Kara Klassen, Ann C Applying Multiple Data Collection Tools to Quantify Human Papillomavirus Vaccine Communication on Twitter |
title | Applying Multiple Data Collection Tools to Quantify Human Papillomavirus Vaccine Communication on Twitter |
title_full | Applying Multiple Data Collection Tools to Quantify Human Papillomavirus Vaccine Communication on Twitter |
title_fullStr | Applying Multiple Data Collection Tools to Quantify Human Papillomavirus Vaccine Communication on Twitter |
title_full_unstemmed | Applying Multiple Data Collection Tools to Quantify Human Papillomavirus Vaccine Communication on Twitter |
title_short | Applying Multiple Data Collection Tools to Quantify Human Papillomavirus Vaccine Communication on Twitter |
title_sort | applying multiple data collection tools to quantify human papillomavirus vaccine communication on twitter |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5168526/ https://www.ncbi.nlm.nih.gov/pubmed/27919863 http://dx.doi.org/10.2196/jmir.6670 |
work_keys_str_mv | AT masseyphilipm applyingmultipledatacollectiontoolstoquantifyhumanpapillomavirusvaccinecommunicationontwitter AT leaderamy applyingmultipledatacollectiontoolstoquantifyhumanpapillomavirusvaccinecommunicationontwitter AT yomtovelad applyingmultipledatacollectiontoolstoquantifyhumanpapillomavirusvaccinecommunicationontwitter AT budenzalexandra applyingmultipledatacollectiontoolstoquantifyhumanpapillomavirusvaccinecommunicationontwitter AT fisherkara applyingmultipledatacollectiontoolstoquantifyhumanpapillomavirusvaccinecommunicationontwitter AT klassenannc applyingmultipledatacollectiontoolstoquantifyhumanpapillomavirusvaccinecommunicationontwitter |