Cargando…

Identification and characterization of tweets related to the 2015 Indiana HIV outbreak: A retrospective infoveillance study

INTRODUCTION: From late 2014 through 2015, Scott County, Indiana faced an HIV outbreak triggered by opioid abuse and transition to injection drug use. Investigating the origins, risk factors, and responses related to this outbreak is critical to inform future surveillance, interventions, and policym...

Descripción completa

Detalles Bibliográficos
Autores principales: Cai, Mingxiang, Shah, Neal, Li, Jiawei, Chen, Wen-Hao, Cuomo, Raphael E., Obradovich, Nick, Mackey, Tim K.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7449407/
https://www.ncbi.nlm.nih.gov/pubmed/32845882
http://dx.doi.org/10.1371/journal.pone.0235150
_version_ 1783574634183000064
author Cai, Mingxiang
Shah, Neal
Li, Jiawei
Chen, Wen-Hao
Cuomo, Raphael E.
Obradovich, Nick
Mackey, Tim K.
author_facet Cai, Mingxiang
Shah, Neal
Li, Jiawei
Chen, Wen-Hao
Cuomo, Raphael E.
Obradovich, Nick
Mackey, Tim K.
author_sort Cai, Mingxiang
collection PubMed
description INTRODUCTION: From late 2014 through 2015, Scott County, Indiana faced an HIV outbreak triggered by opioid abuse and transition to injection drug use. Investigating the origins, risk factors, and responses related to this outbreak is critical to inform future surveillance, interventions, and policymaking. In response, this retrospective infoveillance study identifies and characterizes user-generated messages related to opioid abuse, heroin injection drug use, and HIV status using natural language processing (NLP) among Twitter users in Indiana during the period of this HIV outbreak. MATERIALS AND METHODS: Our study consisted of two phases: data collection and processing, and data analysis. We collected Indiana geolocated tweets from the public Twitter API using Amazon Web Services EC2 instances filtered for geocoded messages in the immediate pre and post period of the outbreak. In the data analysis phase we applied an unsupervised machine learning approach using NLP called the Biterm Topic Model (BTM) to identify tweets related to opioid, heroin/injection, and HIV behavior and then examined these messages for HIV risk-related topics that could be associated with the outbreak. RESULTS: More than 10 million geocoded tweets occurring in Indiana during the immediate pre and post period of the outbreak were collected for analysis. Using BTM, we identified 1350 tweets thought to be relevant to the outbreak and then confirmed 358 tweets using human annotation. The most prevalent themes identified were tweets related to self-reported abuse of illicit and prescription drugs, opioid use disorder, self-reported HIV status, and public sentiment regarding the outbreak. Geospatial analysis found that these messages clustered in population dense areas outside of the outbreak, including Indianapolis and neighboring Clark County. DISCUSSION: This infoveillance study characterized the social media conversations of communities in Indiana in the pre and post period of the 2015 HIV outbreak. Behavioral themes detected reflect discussion about risk factors related to HIV transmission stemming from opioid and heroin abuse for priority populations, and also help identify community attitudes that could have motivated or detracted the use of HIV prevention methods, along with helping identify factors that can impede access to prevention services. CONCLUSIONS: Infoveillance approaches, such as the analysis conducted in this study, represent a possibly strategy to detect “signal” of the emergence of risk factors associated with an outbreak though may be limited in their scope and generalizability. Our results, in conjunction with other forms of public health surveillance, can leverage the growing ubiquity of social media platforms to better detect opioid-related HIV risk knowledge, attitudes and behavior, as well as inform future prevention efforts.
format Online
Article
Text
id pubmed-7449407
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-74494072020-09-02 Identification and characterization of tweets related to the 2015 Indiana HIV outbreak: A retrospective infoveillance study Cai, Mingxiang Shah, Neal Li, Jiawei Chen, Wen-Hao Cuomo, Raphael E. Obradovich, Nick Mackey, Tim K. PLoS One Research Article INTRODUCTION: From late 2014 through 2015, Scott County, Indiana faced an HIV outbreak triggered by opioid abuse and transition to injection drug use. Investigating the origins, risk factors, and responses related to this outbreak is critical to inform future surveillance, interventions, and policymaking. In response, this retrospective infoveillance study identifies and characterizes user-generated messages related to opioid abuse, heroin injection drug use, and HIV status using natural language processing (NLP) among Twitter users in Indiana during the period of this HIV outbreak. MATERIALS AND METHODS: Our study consisted of two phases: data collection and processing, and data analysis. We collected Indiana geolocated tweets from the public Twitter API using Amazon Web Services EC2 instances filtered for geocoded messages in the immediate pre and post period of the outbreak. In the data analysis phase we applied an unsupervised machine learning approach using NLP called the Biterm Topic Model (BTM) to identify tweets related to opioid, heroin/injection, and HIV behavior and then examined these messages for HIV risk-related topics that could be associated with the outbreak. RESULTS: More than 10 million geocoded tweets occurring in Indiana during the immediate pre and post period of the outbreak were collected for analysis. Using BTM, we identified 1350 tweets thought to be relevant to the outbreak and then confirmed 358 tweets using human annotation. The most prevalent themes identified were tweets related to self-reported abuse of illicit and prescription drugs, opioid use disorder, self-reported HIV status, and public sentiment regarding the outbreak. Geospatial analysis found that these messages clustered in population dense areas outside of the outbreak, including Indianapolis and neighboring Clark County. DISCUSSION: This infoveillance study characterized the social media conversations of communities in Indiana in the pre and post period of the 2015 HIV outbreak. Behavioral themes detected reflect discussion about risk factors related to HIV transmission stemming from opioid and heroin abuse for priority populations, and also help identify community attitudes that could have motivated or detracted the use of HIV prevention methods, along with helping identify factors that can impede access to prevention services. CONCLUSIONS: Infoveillance approaches, such as the analysis conducted in this study, represent a possibly strategy to detect “signal” of the emergence of risk factors associated with an outbreak though may be limited in their scope and generalizability. Our results, in conjunction with other forms of public health surveillance, can leverage the growing ubiquity of social media platforms to better detect opioid-related HIV risk knowledge, attitudes and behavior, as well as inform future prevention efforts. Public Library of Science 2020-08-26 /pmc/articles/PMC7449407/ /pubmed/32845882 http://dx.doi.org/10.1371/journal.pone.0235150 Text en © 2020 Cai et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Cai, Mingxiang
Shah, Neal
Li, Jiawei
Chen, Wen-Hao
Cuomo, Raphael E.
Obradovich, Nick
Mackey, Tim K.
Identification and characterization of tweets related to the 2015 Indiana HIV outbreak: A retrospective infoveillance study
title Identification and characterization of tweets related to the 2015 Indiana HIV outbreak: A retrospective infoveillance study
title_full Identification and characterization of tweets related to the 2015 Indiana HIV outbreak: A retrospective infoveillance study
title_fullStr Identification and characterization of tweets related to the 2015 Indiana HIV outbreak: A retrospective infoveillance study
title_full_unstemmed Identification and characterization of tweets related to the 2015 Indiana HIV outbreak: A retrospective infoveillance study
title_short Identification and characterization of tweets related to the 2015 Indiana HIV outbreak: A retrospective infoveillance study
title_sort identification and characterization of tweets related to the 2015 indiana hiv outbreak: a retrospective infoveillance study
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7449407/
https://www.ncbi.nlm.nih.gov/pubmed/32845882
http://dx.doi.org/10.1371/journal.pone.0235150
work_keys_str_mv AT caimingxiang identificationandcharacterizationoftweetsrelatedtothe2015indianahivoutbreakaretrospectiveinfoveillancestudy
AT shahneal identificationandcharacterizationoftweetsrelatedtothe2015indianahivoutbreakaretrospectiveinfoveillancestudy
AT lijiawei identificationandcharacterizationoftweetsrelatedtothe2015indianahivoutbreakaretrospectiveinfoveillancestudy
AT chenwenhao identificationandcharacterizationoftweetsrelatedtothe2015indianahivoutbreakaretrospectiveinfoveillancestudy
AT cuomoraphaele identificationandcharacterizationoftweetsrelatedtothe2015indianahivoutbreakaretrospectiveinfoveillancestudy
AT obradovichnick identificationandcharacterizationoftweetsrelatedtothe2015indianahivoutbreakaretrospectiveinfoveillancestudy
AT mackeytimk identificationandcharacterizationoftweetsrelatedtothe2015indianahivoutbreakaretrospectiveinfoveillancestudy