Cargando…

A Machine Learning Approach for the Detection and Characterization of Illicit Drug Dealers on Instagram: Model Evaluation Study

BACKGROUND: Social media use is now ubiquitous, but the growth in social media communications has also made it a convenient digital platform for drug dealers selling controlled substances, opioids, and other illicit drugs. Previous studies and news investigations have reported the use of popular soc...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Jiawei, Xu, Qing, Shah, Neal, Mackey, Tim K
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6598421/
https://www.ncbi.nlm.nih.gov/pubmed/31199298
http://dx.doi.org/10.2196/13803
_version_ 1783430770349572096
author Li, Jiawei
Xu, Qing
Shah, Neal
Mackey, Tim K
author_facet Li, Jiawei
Xu, Qing
Shah, Neal
Mackey, Tim K
author_sort Li, Jiawei
collection PubMed
description BACKGROUND: Social media use is now ubiquitous, but the growth in social media communications has also made it a convenient digital platform for drug dealers selling controlled substances, opioids, and other illicit drugs. Previous studies and news investigations have reported the use of popular social media platforms as conduits for opioid sales. This study uses deep learning to detect illicit drug dealing on the image and video sharing platform Instagram. OBJECTIVE: The aim of this study was to develop and evaluate a machine learning approach to detect Instagram posts related to illegal internet drug dealing. METHODS: In this paper, we describe an approach to detect drug dealers by using a deep learning model on Instagram. We collected Instagram posts using a Web scraper between July 2018 and October 2018 and then compared our deep learning model against 3 different machine learning models (eg, random forest, decision tree, and support vector machine) to assess the performance and accuracy of the model. For our deep learning model, we used the long short-term memory unit in the recurrent neural network to learn the pattern of the text of drug dealing posts. We also manually annotated all posts collected to evaluate our model performance and to characterize drug selling conversations. RESULTS: From the 12,857 posts we collected, we detected 1228 drug dealer posts comprising 267 unique users. We used cross-validation to evaluate the 4 models, with our deep learning model reaching 95% on F1 score and performing better than the other 3 models. We also found that by removing the hashtags in the text, the model had better performance. Detected posts contained hashtags related to several drugs, including the controlled substance Xanax (1078/1228, 87.78%), oxycodone/OxyContin (321/1228, 26.14%), and illicit drugs lysergic acid diethylamide (213/1228, 17.34%) and 3,4-methylenedioxy-methamphetamine (94/1228, 7.65%). We also observed the use of communication applications for suspected drug trading through user comments. CONCLUSIONS: Our approach using a combination of Web scraping and deep learning was able to detect illegal online drug sellers on Instagram, with high accuracy. Despite increased scrutiny by regulators and policymakers, the Instagram platform continues to host posts from drug dealers, in violation of federal law. Further action needs to be taken to ensure the safety of social media communities and help put an end to this illicit digital channel of sourcing.
format Online
Article
Text
id pubmed-6598421
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-65984212019-07-17 A Machine Learning Approach for the Detection and Characterization of Illicit Drug Dealers on Instagram: Model Evaluation Study Li, Jiawei Xu, Qing Shah, Neal Mackey, Tim K J Med Internet Res Original Paper BACKGROUND: Social media use is now ubiquitous, but the growth in social media communications has also made it a convenient digital platform for drug dealers selling controlled substances, opioids, and other illicit drugs. Previous studies and news investigations have reported the use of popular social media platforms as conduits for opioid sales. This study uses deep learning to detect illicit drug dealing on the image and video sharing platform Instagram. OBJECTIVE: The aim of this study was to develop and evaluate a machine learning approach to detect Instagram posts related to illegal internet drug dealing. METHODS: In this paper, we describe an approach to detect drug dealers by using a deep learning model on Instagram. We collected Instagram posts using a Web scraper between July 2018 and October 2018 and then compared our deep learning model against 3 different machine learning models (eg, random forest, decision tree, and support vector machine) to assess the performance and accuracy of the model. For our deep learning model, we used the long short-term memory unit in the recurrent neural network to learn the pattern of the text of drug dealing posts. We also manually annotated all posts collected to evaluate our model performance and to characterize drug selling conversations. RESULTS: From the 12,857 posts we collected, we detected 1228 drug dealer posts comprising 267 unique users. We used cross-validation to evaluate the 4 models, with our deep learning model reaching 95% on F1 score and performing better than the other 3 models. We also found that by removing the hashtags in the text, the model had better performance. Detected posts contained hashtags related to several drugs, including the controlled substance Xanax (1078/1228, 87.78%), oxycodone/OxyContin (321/1228, 26.14%), and illicit drugs lysergic acid diethylamide (213/1228, 17.34%) and 3,4-methylenedioxy-methamphetamine (94/1228, 7.65%). We also observed the use of communication applications for suspected drug trading through user comments. CONCLUSIONS: Our approach using a combination of Web scraping and deep learning was able to detect illegal online drug sellers on Instagram, with high accuracy. Despite increased scrutiny by regulators and policymakers, the Instagram platform continues to host posts from drug dealers, in violation of federal law. Further action needs to be taken to ensure the safety of social media communities and help put an end to this illicit digital channel of sourcing. JMIR Publications 2019-06-15 /pmc/articles/PMC6598421/ /pubmed/31199298 http://dx.doi.org/10.2196/13803 Text en ©Jiawei Li, Qing Xu, Neal Shah, Tim K Mackey. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 15.06.2019. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on http://www.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Li, Jiawei
Xu, Qing
Shah, Neal
Mackey, Tim K
A Machine Learning Approach for the Detection and Characterization of Illicit Drug Dealers on Instagram: Model Evaluation Study
title A Machine Learning Approach for the Detection and Characterization of Illicit Drug Dealers on Instagram: Model Evaluation Study
title_full A Machine Learning Approach for the Detection and Characterization of Illicit Drug Dealers on Instagram: Model Evaluation Study
title_fullStr A Machine Learning Approach for the Detection and Characterization of Illicit Drug Dealers on Instagram: Model Evaluation Study
title_full_unstemmed A Machine Learning Approach for the Detection and Characterization of Illicit Drug Dealers on Instagram: Model Evaluation Study
title_short A Machine Learning Approach for the Detection and Characterization of Illicit Drug Dealers on Instagram: Model Evaluation Study
title_sort machine learning approach for the detection and characterization of illicit drug dealers on instagram: model evaluation study
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6598421/
https://www.ncbi.nlm.nih.gov/pubmed/31199298
http://dx.doi.org/10.2196/13803
work_keys_str_mv AT lijiawei amachinelearningapproachforthedetectionandcharacterizationofillicitdrugdealersoninstagrammodelevaluationstudy
AT xuqing amachinelearningapproachforthedetectionandcharacterizationofillicitdrugdealersoninstagrammodelevaluationstudy
AT shahneal amachinelearningapproachforthedetectionandcharacterizationofillicitdrugdealersoninstagrammodelevaluationstudy
AT mackeytimk amachinelearningapproachforthedetectionandcharacterizationofillicitdrugdealersoninstagrammodelevaluationstudy
AT lijiawei machinelearningapproachforthedetectionandcharacterizationofillicitdrugdealersoninstagrammodelevaluationstudy
AT xuqing machinelearningapproachforthedetectionandcharacterizationofillicitdrugdealersoninstagrammodelevaluationstudy
AT shahneal machinelearningapproachforthedetectionandcharacterizationofillicitdrugdealersoninstagrammodelevaluationstudy
AT mackeytimk machinelearningapproachforthedetectionandcharacterizationofillicitdrugdealersoninstagrammodelevaluationstudy