Cargando…

Leveraging Social Media Activity and Machine Learning for HIV and Substance Abuse Risk Assessment: Development and Validation Study

BACKGROUND: Social media networks provide an abundance of diverse information that can be leveraged for data-driven applications across various social and physical sciences. One opportunity to utilize such data exists in the public health domain, where data collection is often constrained by organiz...

Descripción completa

Detalles Bibliográficos
Autores principales: Ovalle, Anaelia, Goldstein, Orpaz, Kachuee, Mohammad, Wu, Elizabeth S C, Hong, Chenglin, Holloway, Ian W, Sarrafzadeh, Majid
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8111510/
https://www.ncbi.nlm.nih.gov/pubmed/33900200
http://dx.doi.org/10.2196/22042
_version_ 1783690515650183168
author Ovalle, Anaelia
Goldstein, Orpaz
Kachuee, Mohammad
Wu, Elizabeth S C
Hong, Chenglin
Holloway, Ian W
Sarrafzadeh, Majid
author_facet Ovalle, Anaelia
Goldstein, Orpaz
Kachuee, Mohammad
Wu, Elizabeth S C
Hong, Chenglin
Holloway, Ian W
Sarrafzadeh, Majid
author_sort Ovalle, Anaelia
collection PubMed
description BACKGROUND: Social media networks provide an abundance of diverse information that can be leveraged for data-driven applications across various social and physical sciences. One opportunity to utilize such data exists in the public health domain, where data collection is often constrained by organizational funding and limited user adoption. Furthermore, the efficacy of health interventions is often based on self-reported data, which are not always reliable. Health-promotion strategies for communities facing multiple vulnerabilities, such as men who have sex with men, can benefit from an automated system that not only determines health behavior risk but also suggests appropriate intervention targets. OBJECTIVE: This study aims to determine the value of leveraging social media messages to identify health risk behavior for men who have sex with men. METHODS: The Gay Social Networking Analysis Program was created as a preliminary framework for intelligent web-based health-promotion intervention. The program consisted of a data collection system that automatically gathered social media data, health questionnaires, and clinical results for sexually transmitted diseases and drug tests across 51 participants over 3 months. Machine learning techniques were utilized to assess the relationship between social media messages and participants' offline sexual health and substance use biological outcomes. The F1 score, a weighted average of precision and recall, was used to evaluate each algorithm. Natural language processing techniques were employed to create health behavior risk scores from participant messages. RESULTS: Offline HIV, amphetamine, and methamphetamine use were correctly identified using only social media data, with machine learning models obtaining F1 scores of 82.6%, 85.9%, and 85.3%, respectively. Additionally, constructed risk scores were found to be reasonably comparable to risk scores adapted from the Center for Disease Control. CONCLUSIONS: To our knowledge, our study is the first empirical evaluation of a social media–based public health intervention framework for men who have sex with men. We found that social media data were correlated with offline sexual health and substance use, verified through biological testing. The proof of concept and initial results validate that public health interventions can indeed use social media–based systems to successfully determine offline health risk behaviors. The findings demonstrate the promise of deploying a social media–based just-in-time adaptive intervention to target substance use and HIV risk behavior.
format Online
Article
Text
id pubmed-8111510
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-81115102021-05-13 Leveraging Social Media Activity and Machine Learning for HIV and Substance Abuse Risk Assessment: Development and Validation Study Ovalle, Anaelia Goldstein, Orpaz Kachuee, Mohammad Wu, Elizabeth S C Hong, Chenglin Holloway, Ian W Sarrafzadeh, Majid J Med Internet Res Original Paper BACKGROUND: Social media networks provide an abundance of diverse information that can be leveraged for data-driven applications across various social and physical sciences. One opportunity to utilize such data exists in the public health domain, where data collection is often constrained by organizational funding and limited user adoption. Furthermore, the efficacy of health interventions is often based on self-reported data, which are not always reliable. Health-promotion strategies for communities facing multiple vulnerabilities, such as men who have sex with men, can benefit from an automated system that not only determines health behavior risk but also suggests appropriate intervention targets. OBJECTIVE: This study aims to determine the value of leveraging social media messages to identify health risk behavior for men who have sex with men. METHODS: The Gay Social Networking Analysis Program was created as a preliminary framework for intelligent web-based health-promotion intervention. The program consisted of a data collection system that automatically gathered social media data, health questionnaires, and clinical results for sexually transmitted diseases and drug tests across 51 participants over 3 months. Machine learning techniques were utilized to assess the relationship between social media messages and participants' offline sexual health and substance use biological outcomes. The F1 score, a weighted average of precision and recall, was used to evaluate each algorithm. Natural language processing techniques were employed to create health behavior risk scores from participant messages. RESULTS: Offline HIV, amphetamine, and methamphetamine use were correctly identified using only social media data, with machine learning models obtaining F1 scores of 82.6%, 85.9%, and 85.3%, respectively. Additionally, constructed risk scores were found to be reasonably comparable to risk scores adapted from the Center for Disease Control. CONCLUSIONS: To our knowledge, our study is the first empirical evaluation of a social media–based public health intervention framework for men who have sex with men. We found that social media data were correlated with offline sexual health and substance use, verified through biological testing. The proof of concept and initial results validate that public health interventions can indeed use social media–based systems to successfully determine offline health risk behaviors. The findings demonstrate the promise of deploying a social media–based just-in-time adaptive intervention to target substance use and HIV risk behavior. JMIR Publications 2021-04-26 /pmc/articles/PMC8111510/ /pubmed/33900200 http://dx.doi.org/10.2196/22042 Text en ©Anaelia Ovalle, Orpaz Goldstein, Mohammad Kachuee, Elizabeth S C Wu, Chenglin Hong, Ian W Holloway, Majid Sarrafzadeh. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 26.04.2021. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on http://www.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Ovalle, Anaelia
Goldstein, Orpaz
Kachuee, Mohammad
Wu, Elizabeth S C
Hong, Chenglin
Holloway, Ian W
Sarrafzadeh, Majid
Leveraging Social Media Activity and Machine Learning for HIV and Substance Abuse Risk Assessment: Development and Validation Study
title Leveraging Social Media Activity and Machine Learning for HIV and Substance Abuse Risk Assessment: Development and Validation Study
title_full Leveraging Social Media Activity and Machine Learning for HIV and Substance Abuse Risk Assessment: Development and Validation Study
title_fullStr Leveraging Social Media Activity and Machine Learning for HIV and Substance Abuse Risk Assessment: Development and Validation Study
title_full_unstemmed Leveraging Social Media Activity and Machine Learning for HIV and Substance Abuse Risk Assessment: Development and Validation Study
title_short Leveraging Social Media Activity and Machine Learning for HIV and Substance Abuse Risk Assessment: Development and Validation Study
title_sort leveraging social media activity and machine learning for hiv and substance abuse risk assessment: development and validation study
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8111510/
https://www.ncbi.nlm.nih.gov/pubmed/33900200
http://dx.doi.org/10.2196/22042
work_keys_str_mv AT ovalleanaelia leveragingsocialmediaactivityandmachinelearningforhivandsubstanceabuseriskassessmentdevelopmentandvalidationstudy
AT goldsteinorpaz leveragingsocialmediaactivityandmachinelearningforhivandsubstanceabuseriskassessmentdevelopmentandvalidationstudy
AT kachueemohammad leveragingsocialmediaactivityandmachinelearningforhivandsubstanceabuseriskassessmentdevelopmentandvalidationstudy
AT wuelizabethsc leveragingsocialmediaactivityandmachinelearningforhivandsubstanceabuseriskassessmentdevelopmentandvalidationstudy
AT hongchenglin leveragingsocialmediaactivityandmachinelearningforhivandsubstanceabuseriskassessmentdevelopmentandvalidationstudy
AT hollowayianw leveragingsocialmediaactivityandmachinelearningforhivandsubstanceabuseriskassessmentdevelopmentandvalidationstudy
AT sarrafzadehmajid leveragingsocialmediaactivityandmachinelearningforhivandsubstanceabuseriskassessmentdevelopmentandvalidationstudy