Cargando…

Deciphering Latent Health Information in Social Media Using a Mixed-Methods Design

Natural language processing techniques have increased the volume and variety of text data that can be analyzed. The aim of this study was to identify the positive and negative topical sentiments among diet, diabetes, exercise, and obesity tweets. Using a sequential explanatory mixed-method design fo...

Descripción completa

Detalles Bibliográficos
Autores principales: Shaw, George, Zimmerman, Margaret, Vasquez-Huot, Ligia, Karami, Amir
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9691243/
https://www.ncbi.nlm.nih.gov/pubmed/36421644
http://dx.doi.org/10.3390/healthcare10112320
_version_ 1784836996674355200
author Shaw, George
Zimmerman, Margaret
Vasquez-Huot, Ligia
Karami, Amir
author_facet Shaw, George
Zimmerman, Margaret
Vasquez-Huot, Ligia
Karami, Amir
author_sort Shaw, George
collection PubMed
description Natural language processing techniques have increased the volume and variety of text data that can be analyzed. The aim of this study was to identify the positive and negative topical sentiments among diet, diabetes, exercise, and obesity tweets. Using a sequential explanatory mixed-method design for our analytical framework, we analyzed a data corpus of 1.7 million diet, diabetes, exercise, and obesity (DDEO)-related tweets collected over 12 months. Sentiment analysis and topic modeling were used to analyze the data. The results show that overall, 29% of the tweets were positive, and 17% were negative. Using sentiment analysis and latent Dirichlet allocation (LDA) topic modeling, we analyzed 800 positive and negative DDEO topics. From the 800 LDA topics—after the qualitative and computational removal of incoherent topics—473 topics were characterized as coherent. Obesity was the only query health topic with a higher percentage of negative tweets. The use of social media by public health practitioners should focus not only on the dissemination of health information based on the topics discovered but also consider what they can do for the health consumer as a result of the interaction in digital spaces such as social media. Future studies will benefit from using multiclass sentiment analysis methods associated with other novel topic modeling approaches.
format Online
Article
Text
id pubmed-9691243
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-96912432022-11-25 Deciphering Latent Health Information in Social Media Using a Mixed-Methods Design Shaw, George Zimmerman, Margaret Vasquez-Huot, Ligia Karami, Amir Healthcare (Basel) Article Natural language processing techniques have increased the volume and variety of text data that can be analyzed. The aim of this study was to identify the positive and negative topical sentiments among diet, diabetes, exercise, and obesity tweets. Using a sequential explanatory mixed-method design for our analytical framework, we analyzed a data corpus of 1.7 million diet, diabetes, exercise, and obesity (DDEO)-related tweets collected over 12 months. Sentiment analysis and topic modeling were used to analyze the data. The results show that overall, 29% of the tweets were positive, and 17% were negative. Using sentiment analysis and latent Dirichlet allocation (LDA) topic modeling, we analyzed 800 positive and negative DDEO topics. From the 800 LDA topics—after the qualitative and computational removal of incoherent topics—473 topics were characterized as coherent. Obesity was the only query health topic with a higher percentage of negative tweets. The use of social media by public health practitioners should focus not only on the dissemination of health information based on the topics discovered but also consider what they can do for the health consumer as a result of the interaction in digital spaces such as social media. Future studies will benefit from using multiclass sentiment analysis methods associated with other novel topic modeling approaches. MDPI 2022-11-19 /pmc/articles/PMC9691243/ /pubmed/36421644 http://dx.doi.org/10.3390/healthcare10112320 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Shaw, George
Zimmerman, Margaret
Vasquez-Huot, Ligia
Karami, Amir
Deciphering Latent Health Information in Social Media Using a Mixed-Methods Design
title Deciphering Latent Health Information in Social Media Using a Mixed-Methods Design
title_full Deciphering Latent Health Information in Social Media Using a Mixed-Methods Design
title_fullStr Deciphering Latent Health Information in Social Media Using a Mixed-Methods Design
title_full_unstemmed Deciphering Latent Health Information in Social Media Using a Mixed-Methods Design
title_short Deciphering Latent Health Information in Social Media Using a Mixed-Methods Design
title_sort deciphering latent health information in social media using a mixed-methods design
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9691243/
https://www.ncbi.nlm.nih.gov/pubmed/36421644
http://dx.doi.org/10.3390/healthcare10112320
work_keys_str_mv AT shawgeorge decipheringlatenthealthinformationinsocialmediausingamixedmethodsdesign
AT zimmermanmargaret decipheringlatenthealthinformationinsocialmediausingamixedmethodsdesign
AT vasquezhuotligia decipheringlatenthealthinformationinsocialmediausingamixedmethodsdesign
AT karamiamir decipheringlatenthealthinformationinsocialmediausingamixedmethodsdesign