Cargando…
A Machine Learning Approach for the Detection and Characterization of Illicit Drug Dealers on Instagram: Model Evaluation Study
BACKGROUND: Social media use is now ubiquitous, but the growth in social media communications has also made it a convenient digital platform for drug dealers selling controlled substances, opioids, and other illicit drugs. Previous studies and news investigations have reported the use of popular soc...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
JMIR Publications
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6598421/ https://www.ncbi.nlm.nih.gov/pubmed/31199298 http://dx.doi.org/10.2196/13803 |
_version_ | 1783430770349572096 |
---|---|
author | Li, Jiawei Xu, Qing Shah, Neal Mackey, Tim K |
author_facet | Li, Jiawei Xu, Qing Shah, Neal Mackey, Tim K |
author_sort | Li, Jiawei |
collection | PubMed |
description | BACKGROUND: Social media use is now ubiquitous, but the growth in social media communications has also made it a convenient digital platform for drug dealers selling controlled substances, opioids, and other illicit drugs. Previous studies and news investigations have reported the use of popular social media platforms as conduits for opioid sales. This study uses deep learning to detect illicit drug dealing on the image and video sharing platform Instagram. OBJECTIVE: The aim of this study was to develop and evaluate a machine learning approach to detect Instagram posts related to illegal internet drug dealing. METHODS: In this paper, we describe an approach to detect drug dealers by using a deep learning model on Instagram. We collected Instagram posts using a Web scraper between July 2018 and October 2018 and then compared our deep learning model against 3 different machine learning models (eg, random forest, decision tree, and support vector machine) to assess the performance and accuracy of the model. For our deep learning model, we used the long short-term memory unit in the recurrent neural network to learn the pattern of the text of drug dealing posts. We also manually annotated all posts collected to evaluate our model performance and to characterize drug selling conversations. RESULTS: From the 12,857 posts we collected, we detected 1228 drug dealer posts comprising 267 unique users. We used cross-validation to evaluate the 4 models, with our deep learning model reaching 95% on F1 score and performing better than the other 3 models. We also found that by removing the hashtags in the text, the model had better performance. Detected posts contained hashtags related to several drugs, including the controlled substance Xanax (1078/1228, 87.78%), oxycodone/OxyContin (321/1228, 26.14%), and illicit drugs lysergic acid diethylamide (213/1228, 17.34%) and 3,4-methylenedioxy-methamphetamine (94/1228, 7.65%). We also observed the use of communication applications for suspected drug trading through user comments. CONCLUSIONS: Our approach using a combination of Web scraping and deep learning was able to detect illegal online drug sellers on Instagram, with high accuracy. Despite increased scrutiny by regulators and policymakers, the Instagram platform continues to host posts from drug dealers, in violation of federal law. Further action needs to be taken to ensure the safety of social media communities and help put an end to this illicit digital channel of sourcing. |
format | Online Article Text |
id | pubmed-6598421 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | JMIR Publications |
record_format | MEDLINE/PubMed |
spelling | pubmed-65984212019-07-17 A Machine Learning Approach for the Detection and Characterization of Illicit Drug Dealers on Instagram: Model Evaluation Study Li, Jiawei Xu, Qing Shah, Neal Mackey, Tim K J Med Internet Res Original Paper BACKGROUND: Social media use is now ubiquitous, but the growth in social media communications has also made it a convenient digital platform for drug dealers selling controlled substances, opioids, and other illicit drugs. Previous studies and news investigations have reported the use of popular social media platforms as conduits for opioid sales. This study uses deep learning to detect illicit drug dealing on the image and video sharing platform Instagram. OBJECTIVE: The aim of this study was to develop and evaluate a machine learning approach to detect Instagram posts related to illegal internet drug dealing. METHODS: In this paper, we describe an approach to detect drug dealers by using a deep learning model on Instagram. We collected Instagram posts using a Web scraper between July 2018 and October 2018 and then compared our deep learning model against 3 different machine learning models (eg, random forest, decision tree, and support vector machine) to assess the performance and accuracy of the model. For our deep learning model, we used the long short-term memory unit in the recurrent neural network to learn the pattern of the text of drug dealing posts. We also manually annotated all posts collected to evaluate our model performance and to characterize drug selling conversations. RESULTS: From the 12,857 posts we collected, we detected 1228 drug dealer posts comprising 267 unique users. We used cross-validation to evaluate the 4 models, with our deep learning model reaching 95% on F1 score and performing better than the other 3 models. We also found that by removing the hashtags in the text, the model had better performance. Detected posts contained hashtags related to several drugs, including the controlled substance Xanax (1078/1228, 87.78%), oxycodone/OxyContin (321/1228, 26.14%), and illicit drugs lysergic acid diethylamide (213/1228, 17.34%) and 3,4-methylenedioxy-methamphetamine (94/1228, 7.65%). We also observed the use of communication applications for suspected drug trading through user comments. CONCLUSIONS: Our approach using a combination of Web scraping and deep learning was able to detect illegal online drug sellers on Instagram, with high accuracy. Despite increased scrutiny by regulators and policymakers, the Instagram platform continues to host posts from drug dealers, in violation of federal law. Further action needs to be taken to ensure the safety of social media communities and help put an end to this illicit digital channel of sourcing. JMIR Publications 2019-06-15 /pmc/articles/PMC6598421/ /pubmed/31199298 http://dx.doi.org/10.2196/13803 Text en ©Jiawei Li, Qing Xu, Neal Shah, Tim K Mackey. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 15.06.2019. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on http://www.jmir.org/, as well as this copyright and license information must be included. |
spellingShingle | Original Paper Li, Jiawei Xu, Qing Shah, Neal Mackey, Tim K A Machine Learning Approach for the Detection and Characterization of Illicit Drug Dealers on Instagram: Model Evaluation Study |
title | A Machine Learning Approach for the Detection and Characterization of Illicit Drug Dealers on Instagram: Model Evaluation Study |
title_full | A Machine Learning Approach for the Detection and Characterization of Illicit Drug Dealers on Instagram: Model Evaluation Study |
title_fullStr | A Machine Learning Approach for the Detection and Characterization of Illicit Drug Dealers on Instagram: Model Evaluation Study |
title_full_unstemmed | A Machine Learning Approach for the Detection and Characterization of Illicit Drug Dealers on Instagram: Model Evaluation Study |
title_short | A Machine Learning Approach for the Detection and Characterization of Illicit Drug Dealers on Instagram: Model Evaluation Study |
title_sort | machine learning approach for the detection and characterization of illicit drug dealers on instagram: model evaluation study |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6598421/ https://www.ncbi.nlm.nih.gov/pubmed/31199298 http://dx.doi.org/10.2196/13803 |
work_keys_str_mv | AT lijiawei amachinelearningapproachforthedetectionandcharacterizationofillicitdrugdealersoninstagrammodelevaluationstudy AT xuqing amachinelearningapproachforthedetectionandcharacterizationofillicitdrugdealersoninstagrammodelevaluationstudy AT shahneal amachinelearningapproachforthedetectionandcharacterizationofillicitdrugdealersoninstagrammodelevaluationstudy AT mackeytimk amachinelearningapproachforthedetectionandcharacterizationofillicitdrugdealersoninstagrammodelevaluationstudy AT lijiawei machinelearningapproachforthedetectionandcharacterizationofillicitdrugdealersoninstagrammodelevaluationstudy AT xuqing machinelearningapproachforthedetectionandcharacterizationofillicitdrugdealersoninstagrammodelevaluationstudy AT shahneal machinelearningapproachforthedetectionandcharacterizationofillicitdrugdealersoninstagrammodelevaluationstudy AT mackeytimk machinelearningapproachforthedetectionandcharacterizationofillicitdrugdealersoninstagrammodelevaluationstudy |