Cargando…

Fine-tuned Sentiment Analysis of COVID-19 Vaccine–Related Social Media Data: Comparative Study

BACKGROUND: The emergence of the novel coronavirus (COVID-19) and the necessary separation of populations have led to an unprecedented number of new social media users seeking information related to the pandemic. Currently, with an estimated 4.5 billion users worldwide, social media data offer an op...

Descripción completa

Detalles Bibliográficos
Autores principales: Melton, Chad A, White, Brianna M, Davis, Robert L, Bednarczyk, Robert A, Shaban-Nejad, Arash
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9578521/
https://www.ncbi.nlm.nih.gov/pubmed/36174192
http://dx.doi.org/10.2196/40408
_version_ 1784811981949108224
author Melton, Chad A
White, Brianna M
Davis, Robert L
Bednarczyk, Robert A
Shaban-Nejad, Arash
author_facet Melton, Chad A
White, Brianna M
Davis, Robert L
Bednarczyk, Robert A
Shaban-Nejad, Arash
author_sort Melton, Chad A
collection PubMed
description BACKGROUND: The emergence of the novel coronavirus (COVID-19) and the necessary separation of populations have led to an unprecedented number of new social media users seeking information related to the pandemic. Currently, with an estimated 4.5 billion users worldwide, social media data offer an opportunity for near real-time analysis of large bodies of text related to disease outbreaks and vaccination. These analyses can be used by officials to develop appropriate public health messaging, digital interventions, educational materials, and policies. OBJECTIVE: Our study investigated and compared public sentiment related to COVID-19 vaccines expressed on 2 popular social media platforms—Reddit and Twitter—harvested from January 1, 2020, to March 1, 2022. METHODS: To accomplish this task, we created a fine-tuned DistilRoBERTa model to predict the sentiments of approximately 9.5 million tweets and 70 thousand Reddit comments. To fine-tune our model, our team manually labeled the sentiment of 3600 tweets and then augmented our data set through back-translation. Text sentiment for each social media platform was then classified with our fine-tuned model using Python programming language and the Hugging Face sentiment analysis pipeline. RESULTS: Our results determined that the average sentiment expressed on Twitter was more negative (5,215,830/9,518,270, 54.8%) than positive, and the sentiment expressed on Reddit was more positive (42,316/67,962, 62.3%) than negative. Although the average sentiment was found to vary between these social media platforms, both platforms displayed similar behavior related to the sentiment shared at key vaccine-related developments during the pandemic. CONCLUSIONS: Considering this similar trend in shared sentiment demonstrated across social media platforms, Twitter and Reddit continue to be valuable data sources that public health officials can use to strengthen vaccine confidence and combat misinformation. As the spread of misinformation poses a range of psychological and psychosocial risks (anxiety and fear, etc), there is an urgency in understanding the public perspective and attitude toward shared falsities. Comprehensive educational delivery systems tailored to a population’s expressed sentiments that facilitate digital literacy, health information–seeking behavior, and precision health promotion could aid in clarifying such misinformation.
format Online
Article
Text
id pubmed-9578521
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-95785212022-10-19 Fine-tuned Sentiment Analysis of COVID-19 Vaccine–Related Social Media Data: Comparative Study Melton, Chad A White, Brianna M Davis, Robert L Bednarczyk, Robert A Shaban-Nejad, Arash J Med Internet Res Original Paper BACKGROUND: The emergence of the novel coronavirus (COVID-19) and the necessary separation of populations have led to an unprecedented number of new social media users seeking information related to the pandemic. Currently, with an estimated 4.5 billion users worldwide, social media data offer an opportunity for near real-time analysis of large bodies of text related to disease outbreaks and vaccination. These analyses can be used by officials to develop appropriate public health messaging, digital interventions, educational materials, and policies. OBJECTIVE: Our study investigated and compared public sentiment related to COVID-19 vaccines expressed on 2 popular social media platforms—Reddit and Twitter—harvested from January 1, 2020, to March 1, 2022. METHODS: To accomplish this task, we created a fine-tuned DistilRoBERTa model to predict the sentiments of approximately 9.5 million tweets and 70 thousand Reddit comments. To fine-tune our model, our team manually labeled the sentiment of 3600 tweets and then augmented our data set through back-translation. Text sentiment for each social media platform was then classified with our fine-tuned model using Python programming language and the Hugging Face sentiment analysis pipeline. RESULTS: Our results determined that the average sentiment expressed on Twitter was more negative (5,215,830/9,518,270, 54.8%) than positive, and the sentiment expressed on Reddit was more positive (42,316/67,962, 62.3%) than negative. Although the average sentiment was found to vary between these social media platforms, both platforms displayed similar behavior related to the sentiment shared at key vaccine-related developments during the pandemic. CONCLUSIONS: Considering this similar trend in shared sentiment demonstrated across social media platforms, Twitter and Reddit continue to be valuable data sources that public health officials can use to strengthen vaccine confidence and combat misinformation. As the spread of misinformation poses a range of psychological and psychosocial risks (anxiety and fear, etc), there is an urgency in understanding the public perspective and attitude toward shared falsities. Comprehensive educational delivery systems tailored to a population’s expressed sentiments that facilitate digital literacy, health information–seeking behavior, and precision health promotion could aid in clarifying such misinformation. JMIR Publications 2022-10-17 /pmc/articles/PMC9578521/ /pubmed/36174192 http://dx.doi.org/10.2196/40408 Text en ©Chad A Melton, Brianna M White, Robert L Davis, Robert A Bednarczyk, Arash Shaban-Nejad. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 17.10.2022. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on https://www.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Melton, Chad A
White, Brianna M
Davis, Robert L
Bednarczyk, Robert A
Shaban-Nejad, Arash
Fine-tuned Sentiment Analysis of COVID-19 Vaccine–Related Social Media Data: Comparative Study
title Fine-tuned Sentiment Analysis of COVID-19 Vaccine–Related Social Media Data: Comparative Study
title_full Fine-tuned Sentiment Analysis of COVID-19 Vaccine–Related Social Media Data: Comparative Study
title_fullStr Fine-tuned Sentiment Analysis of COVID-19 Vaccine–Related Social Media Data: Comparative Study
title_full_unstemmed Fine-tuned Sentiment Analysis of COVID-19 Vaccine–Related Social Media Data: Comparative Study
title_short Fine-tuned Sentiment Analysis of COVID-19 Vaccine–Related Social Media Data: Comparative Study
title_sort fine-tuned sentiment analysis of covid-19 vaccine–related social media data: comparative study
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9578521/
https://www.ncbi.nlm.nih.gov/pubmed/36174192
http://dx.doi.org/10.2196/40408
work_keys_str_mv AT meltonchada finetunedsentimentanalysisofcovid19vaccinerelatedsocialmediadatacomparativestudy
AT whitebriannam finetunedsentimentanalysisofcovid19vaccinerelatedsocialmediadatacomparativestudy
AT davisrobertl finetunedsentimentanalysisofcovid19vaccinerelatedsocialmediadatacomparativestudy
AT bednarczykroberta finetunedsentimentanalysisofcovid19vaccinerelatedsocialmediadatacomparativestudy
AT shabannejadarash finetunedsentimentanalysisofcovid19vaccinerelatedsocialmediadatacomparativestudy