Cargando…
Identification and characterization of tweets related to the 2015 Indiana HIV outbreak: A retrospective infoveillance study
INTRODUCTION: From late 2014 through 2015, Scott County, Indiana faced an HIV outbreak triggered by opioid abuse and transition to injection drug use. Investigating the origins, risk factors, and responses related to this outbreak is critical to inform future surveillance, interventions, and policym...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7449407/ https://www.ncbi.nlm.nih.gov/pubmed/32845882 http://dx.doi.org/10.1371/journal.pone.0235150 |
_version_ | 1783574634183000064 |
---|---|
author | Cai, Mingxiang Shah, Neal Li, Jiawei Chen, Wen-Hao Cuomo, Raphael E. Obradovich, Nick Mackey, Tim K. |
author_facet | Cai, Mingxiang Shah, Neal Li, Jiawei Chen, Wen-Hao Cuomo, Raphael E. Obradovich, Nick Mackey, Tim K. |
author_sort | Cai, Mingxiang |
collection | PubMed |
description | INTRODUCTION: From late 2014 through 2015, Scott County, Indiana faced an HIV outbreak triggered by opioid abuse and transition to injection drug use. Investigating the origins, risk factors, and responses related to this outbreak is critical to inform future surveillance, interventions, and policymaking. In response, this retrospective infoveillance study identifies and characterizes user-generated messages related to opioid abuse, heroin injection drug use, and HIV status using natural language processing (NLP) among Twitter users in Indiana during the period of this HIV outbreak. MATERIALS AND METHODS: Our study consisted of two phases: data collection and processing, and data analysis. We collected Indiana geolocated tweets from the public Twitter API using Amazon Web Services EC2 instances filtered for geocoded messages in the immediate pre and post period of the outbreak. In the data analysis phase we applied an unsupervised machine learning approach using NLP called the Biterm Topic Model (BTM) to identify tweets related to opioid, heroin/injection, and HIV behavior and then examined these messages for HIV risk-related topics that could be associated with the outbreak. RESULTS: More than 10 million geocoded tweets occurring in Indiana during the immediate pre and post period of the outbreak were collected for analysis. Using BTM, we identified 1350 tweets thought to be relevant to the outbreak and then confirmed 358 tweets using human annotation. The most prevalent themes identified were tweets related to self-reported abuse of illicit and prescription drugs, opioid use disorder, self-reported HIV status, and public sentiment regarding the outbreak. Geospatial analysis found that these messages clustered in population dense areas outside of the outbreak, including Indianapolis and neighboring Clark County. DISCUSSION: This infoveillance study characterized the social media conversations of communities in Indiana in the pre and post period of the 2015 HIV outbreak. Behavioral themes detected reflect discussion about risk factors related to HIV transmission stemming from opioid and heroin abuse for priority populations, and also help identify community attitudes that could have motivated or detracted the use of HIV prevention methods, along with helping identify factors that can impede access to prevention services. CONCLUSIONS: Infoveillance approaches, such as the analysis conducted in this study, represent a possibly strategy to detect “signal” of the emergence of risk factors associated with an outbreak though may be limited in their scope and generalizability. Our results, in conjunction with other forms of public health surveillance, can leverage the growing ubiquity of social media platforms to better detect opioid-related HIV risk knowledge, attitudes and behavior, as well as inform future prevention efforts. |
format | Online Article Text |
id | pubmed-7449407 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-74494072020-09-02 Identification and characterization of tweets related to the 2015 Indiana HIV outbreak: A retrospective infoveillance study Cai, Mingxiang Shah, Neal Li, Jiawei Chen, Wen-Hao Cuomo, Raphael E. Obradovich, Nick Mackey, Tim K. PLoS One Research Article INTRODUCTION: From late 2014 through 2015, Scott County, Indiana faced an HIV outbreak triggered by opioid abuse and transition to injection drug use. Investigating the origins, risk factors, and responses related to this outbreak is critical to inform future surveillance, interventions, and policymaking. In response, this retrospective infoveillance study identifies and characterizes user-generated messages related to opioid abuse, heroin injection drug use, and HIV status using natural language processing (NLP) among Twitter users in Indiana during the period of this HIV outbreak. MATERIALS AND METHODS: Our study consisted of two phases: data collection and processing, and data analysis. We collected Indiana geolocated tweets from the public Twitter API using Amazon Web Services EC2 instances filtered for geocoded messages in the immediate pre and post period of the outbreak. In the data analysis phase we applied an unsupervised machine learning approach using NLP called the Biterm Topic Model (BTM) to identify tweets related to opioid, heroin/injection, and HIV behavior and then examined these messages for HIV risk-related topics that could be associated with the outbreak. RESULTS: More than 10 million geocoded tweets occurring in Indiana during the immediate pre and post period of the outbreak were collected for analysis. Using BTM, we identified 1350 tweets thought to be relevant to the outbreak and then confirmed 358 tweets using human annotation. The most prevalent themes identified were tweets related to self-reported abuse of illicit and prescription drugs, opioid use disorder, self-reported HIV status, and public sentiment regarding the outbreak. Geospatial analysis found that these messages clustered in population dense areas outside of the outbreak, including Indianapolis and neighboring Clark County. DISCUSSION: This infoveillance study characterized the social media conversations of communities in Indiana in the pre and post period of the 2015 HIV outbreak. Behavioral themes detected reflect discussion about risk factors related to HIV transmission stemming from opioid and heroin abuse for priority populations, and also help identify community attitudes that could have motivated or detracted the use of HIV prevention methods, along with helping identify factors that can impede access to prevention services. CONCLUSIONS: Infoveillance approaches, such as the analysis conducted in this study, represent a possibly strategy to detect “signal” of the emergence of risk factors associated with an outbreak though may be limited in their scope and generalizability. Our results, in conjunction with other forms of public health surveillance, can leverage the growing ubiquity of social media platforms to better detect opioid-related HIV risk knowledge, attitudes and behavior, as well as inform future prevention efforts. Public Library of Science 2020-08-26 /pmc/articles/PMC7449407/ /pubmed/32845882 http://dx.doi.org/10.1371/journal.pone.0235150 Text en © 2020 Cai et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Cai, Mingxiang Shah, Neal Li, Jiawei Chen, Wen-Hao Cuomo, Raphael E. Obradovich, Nick Mackey, Tim K. Identification and characterization of tweets related to the 2015 Indiana HIV outbreak: A retrospective infoveillance study |
title | Identification and characterization of tweets related to the 2015 Indiana HIV outbreak: A retrospective infoveillance study |
title_full | Identification and characterization of tweets related to the 2015 Indiana HIV outbreak: A retrospective infoveillance study |
title_fullStr | Identification and characterization of tweets related to the 2015 Indiana HIV outbreak: A retrospective infoveillance study |
title_full_unstemmed | Identification and characterization of tweets related to the 2015 Indiana HIV outbreak: A retrospective infoveillance study |
title_short | Identification and characterization of tweets related to the 2015 Indiana HIV outbreak: A retrospective infoveillance study |
title_sort | identification and characterization of tweets related to the 2015 indiana hiv outbreak: a retrospective infoveillance study |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7449407/ https://www.ncbi.nlm.nih.gov/pubmed/32845882 http://dx.doi.org/10.1371/journal.pone.0235150 |
work_keys_str_mv | AT caimingxiang identificationandcharacterizationoftweetsrelatedtothe2015indianahivoutbreakaretrospectiveinfoveillancestudy AT shahneal identificationandcharacterizationoftweetsrelatedtothe2015indianahivoutbreakaretrospectiveinfoveillancestudy AT lijiawei identificationandcharacterizationoftweetsrelatedtothe2015indianahivoutbreakaretrospectiveinfoveillancestudy AT chenwenhao identificationandcharacterizationoftweetsrelatedtothe2015indianahivoutbreakaretrospectiveinfoveillancestudy AT cuomoraphaele identificationandcharacterizationoftweetsrelatedtothe2015indianahivoutbreakaretrospectiveinfoveillancestudy AT obradovichnick identificationandcharacterizationoftweetsrelatedtothe2015indianahivoutbreakaretrospectiveinfoveillancestudy AT mackeytimk identificationandcharacterizationoftweetsrelatedtothe2015indianahivoutbreakaretrospectiveinfoveillancestudy |