Cargando…

Applying Multiple Data Collection Tools to Quantify Human Papillomavirus Vaccine Communication on Twitter

BACKGROUND: Human papillomavirus (HPV) is the most common sexually transmitted infection in the United States. There are several vaccines that protect against strains of HPV most associated with cervical and other cancers. Thus, HPV vaccination has become an important component of adolescent prevent...

Descripción completa

Detalles Bibliográficos
Autores principales: Massey, Philip M, Leader, Amy, Yom-Tov, Elad, Budenz, Alexandra, Fisher, Kara, Klassen, Ann C
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5168526/
https://www.ncbi.nlm.nih.gov/pubmed/27919863
http://dx.doi.org/10.2196/jmir.6670
_version_ 1782483361619509248
author Massey, Philip M
Leader, Amy
Yom-Tov, Elad
Budenz, Alexandra
Fisher, Kara
Klassen, Ann C
author_facet Massey, Philip M
Leader, Amy
Yom-Tov, Elad
Budenz, Alexandra
Fisher, Kara
Klassen, Ann C
author_sort Massey, Philip M
collection PubMed
description BACKGROUND: Human papillomavirus (HPV) is the most common sexually transmitted infection in the United States. There are several vaccines that protect against strains of HPV most associated with cervical and other cancers. Thus, HPV vaccination has become an important component of adolescent preventive health care. As media evolves, more information about HPV vaccination is shifting to social media platforms such as Twitter. Health information consumed on social media may be especially influential for segments of society such as younger populations, as well as ethnic and racial minorities. OBJECTIVE: The objectives of our study were to quantify HPV vaccine communication on Twitter, and to develop a novel methodology to improve the collection and analysis of Twitter data. METHODS: We collected Twitter data using 10 keywords related to HPV vaccination from August 1, 2014 to July 31, 2015. Prospective data collection used the Twitter Search API and retrospective data collection used Twitter Firehose. Using a codebook to characterize tweet sentiment and content, we coded a subsample of tweets by hand to develop classification models to code the entire sample using machine learning procedures. We also documented the words in the 140-character tweet text most associated with each keyword. We used chi-square tests, analysis of variance, and nonparametric equality of medians to test for significant differences in tweet characteristic by sentiment. RESULTS: A total of 193,379 English-language tweets were collected, classified, and analyzed. Associated words varied with each keyword, with more positive and preventive words associated with “HPV vaccine” and more negative words associated with name-brand vaccines. Positive sentiment was the largest type of sentiment in the sample, with 75,393 positive tweets (38.99% of the sample), followed by negative sentiment with 48,940 tweets (25.31% of the sample). Positive and neutral tweets constituted the largest percentage of tweets mentioning prevention or protection (20,425/75,393, 27.09% and 6477/25,110, 25.79%, respectively), compared with only 11.5% of negative tweets (5647/48,940; P<.001). Nearly one-half (22,726/48,940, 46.44%) of negative tweets mentioned side effects, compared with only 17.14% (12,921/75,393) of positive tweets and 15.08% of neutral tweets (3787/25,110; P<.001). CONCLUSIONS: Examining social media to detect health trends, as well as to communicate important health information, is a growing area of research in public health. Understanding the content and implications of conversations that form around HPV vaccination on social media can aid health organizations and health-focused Twitter users in creating a meaningful exchange of ideas and in having a significant impact on vaccine uptake. This area of research is inherently interdisciplinary, and this study supports this movement by applying public health, health communication, and data science approaches to extend methodologies across fields.
format Online
Article
Text
id pubmed-5168526
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-51685262017-01-03 Applying Multiple Data Collection Tools to Quantify Human Papillomavirus Vaccine Communication on Twitter Massey, Philip M Leader, Amy Yom-Tov, Elad Budenz, Alexandra Fisher, Kara Klassen, Ann C J Med Internet Res Original Paper BACKGROUND: Human papillomavirus (HPV) is the most common sexually transmitted infection in the United States. There are several vaccines that protect against strains of HPV most associated with cervical and other cancers. Thus, HPV vaccination has become an important component of adolescent preventive health care. As media evolves, more information about HPV vaccination is shifting to social media platforms such as Twitter. Health information consumed on social media may be especially influential for segments of society such as younger populations, as well as ethnic and racial minorities. OBJECTIVE: The objectives of our study were to quantify HPV vaccine communication on Twitter, and to develop a novel methodology to improve the collection and analysis of Twitter data. METHODS: We collected Twitter data using 10 keywords related to HPV vaccination from August 1, 2014 to July 31, 2015. Prospective data collection used the Twitter Search API and retrospective data collection used Twitter Firehose. Using a codebook to characterize tweet sentiment and content, we coded a subsample of tweets by hand to develop classification models to code the entire sample using machine learning procedures. We also documented the words in the 140-character tweet text most associated with each keyword. We used chi-square tests, analysis of variance, and nonparametric equality of medians to test for significant differences in tweet characteristic by sentiment. RESULTS: A total of 193,379 English-language tweets were collected, classified, and analyzed. Associated words varied with each keyword, with more positive and preventive words associated with “HPV vaccine” and more negative words associated with name-brand vaccines. Positive sentiment was the largest type of sentiment in the sample, with 75,393 positive tweets (38.99% of the sample), followed by negative sentiment with 48,940 tweets (25.31% of the sample). Positive and neutral tweets constituted the largest percentage of tweets mentioning prevention or protection (20,425/75,393, 27.09% and 6477/25,110, 25.79%, respectively), compared with only 11.5% of negative tweets (5647/48,940; P<.001). Nearly one-half (22,726/48,940, 46.44%) of negative tweets mentioned side effects, compared with only 17.14% (12,921/75,393) of positive tweets and 15.08% of neutral tweets (3787/25,110; P<.001). CONCLUSIONS: Examining social media to detect health trends, as well as to communicate important health information, is a growing area of research in public health. Understanding the content and implications of conversations that form around HPV vaccination on social media can aid health organizations and health-focused Twitter users in creating a meaningful exchange of ideas and in having a significant impact on vaccine uptake. This area of research is inherently interdisciplinary, and this study supports this movement by applying public health, health communication, and data science approaches to extend methodologies across fields. JMIR Publications 2016-12-05 /pmc/articles/PMC5168526/ /pubmed/27919863 http://dx.doi.org/10.2196/jmir.6670 Text en ©Philip M Massey, Amy Leader, Elad Yom-Tov, Alexandra Budenz, Kara Fisher, Ann C Klassen. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 05.12.2016. https://creativecommons.org/licenses/by/2.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0/ (https://creativecommons.org/licenses/by/2.0/) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on http://www.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Massey, Philip M
Leader, Amy
Yom-Tov, Elad
Budenz, Alexandra
Fisher, Kara
Klassen, Ann C
Applying Multiple Data Collection Tools to Quantify Human Papillomavirus Vaccine Communication on Twitter
title Applying Multiple Data Collection Tools to Quantify Human Papillomavirus Vaccine Communication on Twitter
title_full Applying Multiple Data Collection Tools to Quantify Human Papillomavirus Vaccine Communication on Twitter
title_fullStr Applying Multiple Data Collection Tools to Quantify Human Papillomavirus Vaccine Communication on Twitter
title_full_unstemmed Applying Multiple Data Collection Tools to Quantify Human Papillomavirus Vaccine Communication on Twitter
title_short Applying Multiple Data Collection Tools to Quantify Human Papillomavirus Vaccine Communication on Twitter
title_sort applying multiple data collection tools to quantify human papillomavirus vaccine communication on twitter
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5168526/
https://www.ncbi.nlm.nih.gov/pubmed/27919863
http://dx.doi.org/10.2196/jmir.6670
work_keys_str_mv AT masseyphilipm applyingmultipledatacollectiontoolstoquantifyhumanpapillomavirusvaccinecommunicationontwitter
AT leaderamy applyingmultipledatacollectiontoolstoquantifyhumanpapillomavirusvaccinecommunicationontwitter
AT yomtovelad applyingmultipledatacollectiontoolstoquantifyhumanpapillomavirusvaccinecommunicationontwitter
AT budenzalexandra applyingmultipledatacollectiontoolstoquantifyhumanpapillomavirusvaccinecommunicationontwitter
AT fisherkara applyingmultipledatacollectiontoolstoquantifyhumanpapillomavirusvaccinecommunicationontwitter
AT klassenannc applyingmultipledatacollectiontoolstoquantifyhumanpapillomavirusvaccinecommunicationontwitter