Cargando…
Leveraging Social Media Activity and Machine Learning for HIV and Substance Abuse Risk Assessment: Development and Validation Study
BACKGROUND: Social media networks provide an abundance of diverse information that can be leveraged for data-driven applications across various social and physical sciences. One opportunity to utilize such data exists in the public health domain, where data collection is often constrained by organiz...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
JMIR Publications
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8111510/ https://www.ncbi.nlm.nih.gov/pubmed/33900200 http://dx.doi.org/10.2196/22042 |
_version_ | 1783690515650183168 |
---|---|
author | Ovalle, Anaelia Goldstein, Orpaz Kachuee, Mohammad Wu, Elizabeth S C Hong, Chenglin Holloway, Ian W Sarrafzadeh, Majid |
author_facet | Ovalle, Anaelia Goldstein, Orpaz Kachuee, Mohammad Wu, Elizabeth S C Hong, Chenglin Holloway, Ian W Sarrafzadeh, Majid |
author_sort | Ovalle, Anaelia |
collection | PubMed |
description | BACKGROUND: Social media networks provide an abundance of diverse information that can be leveraged for data-driven applications across various social and physical sciences. One opportunity to utilize such data exists in the public health domain, where data collection is often constrained by organizational funding and limited user adoption. Furthermore, the efficacy of health interventions is often based on self-reported data, which are not always reliable. Health-promotion strategies for communities facing multiple vulnerabilities, such as men who have sex with men, can benefit from an automated system that not only determines health behavior risk but also suggests appropriate intervention targets. OBJECTIVE: This study aims to determine the value of leveraging social media messages to identify health risk behavior for men who have sex with men. METHODS: The Gay Social Networking Analysis Program was created as a preliminary framework for intelligent web-based health-promotion intervention. The program consisted of a data collection system that automatically gathered social media data, health questionnaires, and clinical results for sexually transmitted diseases and drug tests across 51 participants over 3 months. Machine learning techniques were utilized to assess the relationship between social media messages and participants' offline sexual health and substance use biological outcomes. The F1 score, a weighted average of precision and recall, was used to evaluate each algorithm. Natural language processing techniques were employed to create health behavior risk scores from participant messages. RESULTS: Offline HIV, amphetamine, and methamphetamine use were correctly identified using only social media data, with machine learning models obtaining F1 scores of 82.6%, 85.9%, and 85.3%, respectively. Additionally, constructed risk scores were found to be reasonably comparable to risk scores adapted from the Center for Disease Control. CONCLUSIONS: To our knowledge, our study is the first empirical evaluation of a social media–based public health intervention framework for men who have sex with men. We found that social media data were correlated with offline sexual health and substance use, verified through biological testing. The proof of concept and initial results validate that public health interventions can indeed use social media–based systems to successfully determine offline health risk behaviors. The findings demonstrate the promise of deploying a social media–based just-in-time adaptive intervention to target substance use and HIV risk behavior. |
format | Online Article Text |
id | pubmed-8111510 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | JMIR Publications |
record_format | MEDLINE/PubMed |
spelling | pubmed-81115102021-05-13 Leveraging Social Media Activity and Machine Learning for HIV and Substance Abuse Risk Assessment: Development and Validation Study Ovalle, Anaelia Goldstein, Orpaz Kachuee, Mohammad Wu, Elizabeth S C Hong, Chenglin Holloway, Ian W Sarrafzadeh, Majid J Med Internet Res Original Paper BACKGROUND: Social media networks provide an abundance of diverse information that can be leveraged for data-driven applications across various social and physical sciences. One opportunity to utilize such data exists in the public health domain, where data collection is often constrained by organizational funding and limited user adoption. Furthermore, the efficacy of health interventions is often based on self-reported data, which are not always reliable. Health-promotion strategies for communities facing multiple vulnerabilities, such as men who have sex with men, can benefit from an automated system that not only determines health behavior risk but also suggests appropriate intervention targets. OBJECTIVE: This study aims to determine the value of leveraging social media messages to identify health risk behavior for men who have sex with men. METHODS: The Gay Social Networking Analysis Program was created as a preliminary framework for intelligent web-based health-promotion intervention. The program consisted of a data collection system that automatically gathered social media data, health questionnaires, and clinical results for sexually transmitted diseases and drug tests across 51 participants over 3 months. Machine learning techniques were utilized to assess the relationship between social media messages and participants' offline sexual health and substance use biological outcomes. The F1 score, a weighted average of precision and recall, was used to evaluate each algorithm. Natural language processing techniques were employed to create health behavior risk scores from participant messages. RESULTS: Offline HIV, amphetamine, and methamphetamine use were correctly identified using only social media data, with machine learning models obtaining F1 scores of 82.6%, 85.9%, and 85.3%, respectively. Additionally, constructed risk scores were found to be reasonably comparable to risk scores adapted from the Center for Disease Control. CONCLUSIONS: To our knowledge, our study is the first empirical evaluation of a social media–based public health intervention framework for men who have sex with men. We found that social media data were correlated with offline sexual health and substance use, verified through biological testing. The proof of concept and initial results validate that public health interventions can indeed use social media–based systems to successfully determine offline health risk behaviors. The findings demonstrate the promise of deploying a social media–based just-in-time adaptive intervention to target substance use and HIV risk behavior. JMIR Publications 2021-04-26 /pmc/articles/PMC8111510/ /pubmed/33900200 http://dx.doi.org/10.2196/22042 Text en ©Anaelia Ovalle, Orpaz Goldstein, Mohammad Kachuee, Elizabeth S C Wu, Chenglin Hong, Ian W Holloway, Majid Sarrafzadeh. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 26.04.2021. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on http://www.jmir.org/, as well as this copyright and license information must be included. |
spellingShingle | Original Paper Ovalle, Anaelia Goldstein, Orpaz Kachuee, Mohammad Wu, Elizabeth S C Hong, Chenglin Holloway, Ian W Sarrafzadeh, Majid Leveraging Social Media Activity and Machine Learning for HIV and Substance Abuse Risk Assessment: Development and Validation Study |
title | Leveraging Social Media Activity and Machine Learning for HIV and Substance Abuse Risk Assessment: Development and Validation Study |
title_full | Leveraging Social Media Activity and Machine Learning for HIV and Substance Abuse Risk Assessment: Development and Validation Study |
title_fullStr | Leveraging Social Media Activity and Machine Learning for HIV and Substance Abuse Risk Assessment: Development and Validation Study |
title_full_unstemmed | Leveraging Social Media Activity and Machine Learning for HIV and Substance Abuse Risk Assessment: Development and Validation Study |
title_short | Leveraging Social Media Activity and Machine Learning for HIV and Substance Abuse Risk Assessment: Development and Validation Study |
title_sort | leveraging social media activity and machine learning for hiv and substance abuse risk assessment: development and validation study |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8111510/ https://www.ncbi.nlm.nih.gov/pubmed/33900200 http://dx.doi.org/10.2196/22042 |
work_keys_str_mv | AT ovalleanaelia leveragingsocialmediaactivityandmachinelearningforhivandsubstanceabuseriskassessmentdevelopmentandvalidationstudy AT goldsteinorpaz leveragingsocialmediaactivityandmachinelearningforhivandsubstanceabuseriskassessmentdevelopmentandvalidationstudy AT kachueemohammad leveragingsocialmediaactivityandmachinelearningforhivandsubstanceabuseriskassessmentdevelopmentandvalidationstudy AT wuelizabethsc leveragingsocialmediaactivityandmachinelearningforhivandsubstanceabuseriskassessmentdevelopmentandvalidationstudy AT hongchenglin leveragingsocialmediaactivityandmachinelearningforhivandsubstanceabuseriskassessmentdevelopmentandvalidationstudy AT hollowayianw leveragingsocialmediaactivityandmachinelearningforhivandsubstanceabuseriskassessmentdevelopmentandvalidationstudy AT sarrafzadehmajid leveragingsocialmediaactivityandmachinelearningforhivandsubstanceabuseriskassessmentdevelopmentandvalidationstudy |