Cargando…

Automating Detection of Drug-Related Harms on Social Media: Machine Learning Framework

BACKGROUND: A hallmark of unregulated drug markets is their unpredictability and constant evolution with newly introduced substances. People who use drugs and the public health workforce are often unaware of the appearance of new drugs on the unregulated market and their type, safe dosage, and poten...

Descripción completa

Detalles Bibliográficos
Autores principales: Fisher, Andrew, Young, Matthew Maclaren, Payer, Doris, Pacheco, Karen, Dubeau, Chad, Mago, Vijay
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10548323/
https://www.ncbi.nlm.nih.gov/pubmed/37725410
http://dx.doi.org/10.2196/43630
_version_ 1785115250292424704
author Fisher, Andrew
Young, Matthew Maclaren
Payer, Doris
Pacheco, Karen
Dubeau, Chad
Mago, Vijay
author_facet Fisher, Andrew
Young, Matthew Maclaren
Payer, Doris
Pacheco, Karen
Dubeau, Chad
Mago, Vijay
author_sort Fisher, Andrew
collection PubMed
description BACKGROUND: A hallmark of unregulated drug markets is their unpredictability and constant evolution with newly introduced substances. People who use drugs and the public health workforce are often unaware of the appearance of new drugs on the unregulated market and their type, safe dosage, and potential adverse effects. This increases risks to people who use drugs, including the risk of unknown consumption and unintentional drug poisoning. Early warning systems (EWSs) can help monitor the landscape of emerging drugs in a given community by collecting and tracking up-to-date information and determining trends. However, there are currently few ways to systematically monitor the appearance and harms of new drugs on the unregulated market in Canada. OBJECTIVE: The goal of this work is to examine how artificial intelligence can assist in identifying patterns of drug-related risks and harms, by monitoring the social media activity of public health and law enforcement groups. This information is beneficial in the form of an EWS as it can be used to identify new and emerging drug trends in various communities. METHODS: To collect data for this study, 145 relevant Twitter accounts throughout Quebec (n=33), Ontario (n=78), and British Columbia (n=34) were manually identified. Tweets posted between August 23 and December 21, 2021, were collected via the application programming interface developed by Twitter for a total of 40,393 tweets. Next, subject matter experts (1) developed keyword filters that reduced the data set to 3746 tweets and (2) manually identified relevant tweets for monitoring and early warning efforts for a total of 464 tweets. Using this information, a zero-shot classifier was applied to tweets from step 1 with a set of keep (drug arrest, drug discovery, and drug report) and not-keep (drug addiction support, public safety report, and others) labels to see how accurately it could extract the tweets identified in step 2. RESULTS: When looking at the accuracy in identifying relevant posts, the system extracted a total of 584 tweets and had an overlap of 392 out of 477 (specificity of ~84.5%) with the subject matter experts. Conversely, the system identified a total of 3162 irrelevant tweets and had an overlap of 3090 (sensitivity of ~94.1%) with the subject matter experts. CONCLUSIONS: This study demonstrates the benefits of using artificial intelligence to assist in finding relevant tweets for an EWS. The results showed that it can be quite accurate in filtering out irrelevant information, which greatly reduces the amount of manual work required. Although the accuracy in retaining relevant information was observed to be lower, an analysis showed that the label definitions can impact the results significantly and would therefore be suitable for future work to refine. Nonetheless, the performance is promising and demonstrates the usefulness of artificial intelligence in this domain.
format Online
Article
Text
id pubmed-10548323
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-105483232023-10-05 Automating Detection of Drug-Related Harms on Social Media: Machine Learning Framework Fisher, Andrew Young, Matthew Maclaren Payer, Doris Pacheco, Karen Dubeau, Chad Mago, Vijay J Med Internet Res Original Paper BACKGROUND: A hallmark of unregulated drug markets is their unpredictability and constant evolution with newly introduced substances. People who use drugs and the public health workforce are often unaware of the appearance of new drugs on the unregulated market and their type, safe dosage, and potential adverse effects. This increases risks to people who use drugs, including the risk of unknown consumption and unintentional drug poisoning. Early warning systems (EWSs) can help monitor the landscape of emerging drugs in a given community by collecting and tracking up-to-date information and determining trends. However, there are currently few ways to systematically monitor the appearance and harms of new drugs on the unregulated market in Canada. OBJECTIVE: The goal of this work is to examine how artificial intelligence can assist in identifying patterns of drug-related risks and harms, by monitoring the social media activity of public health and law enforcement groups. This information is beneficial in the form of an EWS as it can be used to identify new and emerging drug trends in various communities. METHODS: To collect data for this study, 145 relevant Twitter accounts throughout Quebec (n=33), Ontario (n=78), and British Columbia (n=34) were manually identified. Tweets posted between August 23 and December 21, 2021, were collected via the application programming interface developed by Twitter for a total of 40,393 tweets. Next, subject matter experts (1) developed keyword filters that reduced the data set to 3746 tweets and (2) manually identified relevant tweets for monitoring and early warning efforts for a total of 464 tweets. Using this information, a zero-shot classifier was applied to tweets from step 1 with a set of keep (drug arrest, drug discovery, and drug report) and not-keep (drug addiction support, public safety report, and others) labels to see how accurately it could extract the tweets identified in step 2. RESULTS: When looking at the accuracy in identifying relevant posts, the system extracted a total of 584 tweets and had an overlap of 392 out of 477 (specificity of ~84.5%) with the subject matter experts. Conversely, the system identified a total of 3162 irrelevant tweets and had an overlap of 3090 (sensitivity of ~94.1%) with the subject matter experts. CONCLUSIONS: This study demonstrates the benefits of using artificial intelligence to assist in finding relevant tweets for an EWS. The results showed that it can be quite accurate in filtering out irrelevant information, which greatly reduces the amount of manual work required. Although the accuracy in retaining relevant information was observed to be lower, an analysis showed that the label definitions can impact the results significantly and would therefore be suitable for future work to refine. Nonetheless, the performance is promising and demonstrates the usefulness of artificial intelligence in this domain. JMIR Publications 2023-09-19 /pmc/articles/PMC10548323/ /pubmed/37725410 http://dx.doi.org/10.2196/43630 Text en ©Andrew Fisher, Matthew Maclaren Young, Doris Payer, Karen Pacheco, Chad Dubeau, Vijay Mago. Originally published in the Journal of Medical Internet Research (https://www.jmir.org), 19.09.2023. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on https://www.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Fisher, Andrew
Young, Matthew Maclaren
Payer, Doris
Pacheco, Karen
Dubeau, Chad
Mago, Vijay
Automating Detection of Drug-Related Harms on Social Media: Machine Learning Framework
title Automating Detection of Drug-Related Harms on Social Media: Machine Learning Framework
title_full Automating Detection of Drug-Related Harms on Social Media: Machine Learning Framework
title_fullStr Automating Detection of Drug-Related Harms on Social Media: Machine Learning Framework
title_full_unstemmed Automating Detection of Drug-Related Harms on Social Media: Machine Learning Framework
title_short Automating Detection of Drug-Related Harms on Social Media: Machine Learning Framework
title_sort automating detection of drug-related harms on social media: machine learning framework
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10548323/
https://www.ncbi.nlm.nih.gov/pubmed/37725410
http://dx.doi.org/10.2196/43630
work_keys_str_mv AT fisherandrew automatingdetectionofdrugrelatedharmsonsocialmediamachinelearningframework
AT youngmatthewmaclaren automatingdetectionofdrugrelatedharmsonsocialmediamachinelearningframework
AT payerdoris automatingdetectionofdrugrelatedharmsonsocialmediamachinelearningframework
AT pachecokaren automatingdetectionofdrugrelatedharmsonsocialmediamachinelearningframework
AT dubeauchad automatingdetectionofdrugrelatedharmsonsocialmediamachinelearningframework
AT magovijay automatingdetectionofdrugrelatedharmsonsocialmediamachinelearningframework