Cargando…
Detection of Suicidality Among Opioid Users on Reddit: Machine Learning–Based Approach
BACKGROUND: In recent years, both suicide and overdose rates have been increasing. Many individuals who struggle with opioid use disorder are prone to suicidal ideation; this may often result in overdose. However, these fatal overdoses are difficult to classify as intentional or unintentional. Inten...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
JMIR Publications
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7732714/ https://www.ncbi.nlm.nih.gov/pubmed/33245287 http://dx.doi.org/10.2196/15293 |
_version_ | 1783622155385176064 |
---|---|
author | Yao, Hannah Rashidian, Sina Dong, Xinyu Duanmu, Hongyi Rosenthal, Richard N Wang, Fusheng |
author_facet | Yao, Hannah Rashidian, Sina Dong, Xinyu Duanmu, Hongyi Rosenthal, Richard N Wang, Fusheng |
author_sort | Yao, Hannah |
collection | PubMed |
description | BACKGROUND: In recent years, both suicide and overdose rates have been increasing. Many individuals who struggle with opioid use disorder are prone to suicidal ideation; this may often result in overdose. However, these fatal overdoses are difficult to classify as intentional or unintentional. Intentional overdose is difficult to detect, partially due to the lack of predictors and social stigmas that push individuals away from seeking help. These individuals may instead use web-based means to articulate their concerns. OBJECTIVE: This study aimed to extract posts of suicidality among opioid users on Reddit using machine learning methods. The performance of the models is derivative of the data purity, and the results will help us to better understand the rationale of these users, providing new insights into individuals who are part of the opioid epidemic. METHODS: Reddit posts between June 2017 and June 2018 were collected from r/suicidewatch, r/depression, a set of opioid-related subreddits, and a control subreddit set. We first classified suicidal versus nonsuicidal languages and then classified users with opioid usage versus those without opioid usage. Several traditional baselines and neural network (NN) text classifiers were trained using subreddit names as the labels and combinations of semantic inputs. We then attempted to extract out-of-sample data belonging to the intersection of suicide ideation and opioid abuse. Amazon Mechanical Turk was used to provide labels for the out-of-sample data. RESULTS: Classification results were at least 90% across all models for at least one combination of input; the best classifier was convolutional neural network, which obtained an F(1) score of 96.6%. When predicting out-of-sample data for posts containing both suicidal ideation and signs of opioid addiction, NN classifiers produced more false positives and traditional methods produced more false negatives, which is less desirable for predicting suicidal sentiments. CONCLUSIONS: Opioid abuse is linked to the risk of unintentional overdose and suicide risk. Social media platforms such as Reddit contain metadata that can aid machine learning and provide information at a personal level that cannot be obtained elsewhere. We demonstrate that it is possible to use NNs as a tool to predict an out-of-sample target with a model built from data sets labeled by characteristics we wish to distinguish in the out-of-sample target. |
format | Online Article Text |
id | pubmed-7732714 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | JMIR Publications |
record_format | MEDLINE/PubMed |
spelling | pubmed-77327142020-12-22 Detection of Suicidality Among Opioid Users on Reddit: Machine Learning–Based Approach Yao, Hannah Rashidian, Sina Dong, Xinyu Duanmu, Hongyi Rosenthal, Richard N Wang, Fusheng J Med Internet Res Original Paper BACKGROUND: In recent years, both suicide and overdose rates have been increasing. Many individuals who struggle with opioid use disorder are prone to suicidal ideation; this may often result in overdose. However, these fatal overdoses are difficult to classify as intentional or unintentional. Intentional overdose is difficult to detect, partially due to the lack of predictors and social stigmas that push individuals away from seeking help. These individuals may instead use web-based means to articulate their concerns. OBJECTIVE: This study aimed to extract posts of suicidality among opioid users on Reddit using machine learning methods. The performance of the models is derivative of the data purity, and the results will help us to better understand the rationale of these users, providing new insights into individuals who are part of the opioid epidemic. METHODS: Reddit posts between June 2017 and June 2018 were collected from r/suicidewatch, r/depression, a set of opioid-related subreddits, and a control subreddit set. We first classified suicidal versus nonsuicidal languages and then classified users with opioid usage versus those without opioid usage. Several traditional baselines and neural network (NN) text classifiers were trained using subreddit names as the labels and combinations of semantic inputs. We then attempted to extract out-of-sample data belonging to the intersection of suicide ideation and opioid abuse. Amazon Mechanical Turk was used to provide labels for the out-of-sample data. RESULTS: Classification results were at least 90% across all models for at least one combination of input; the best classifier was convolutional neural network, which obtained an F(1) score of 96.6%. When predicting out-of-sample data for posts containing both suicidal ideation and signs of opioid addiction, NN classifiers produced more false positives and traditional methods produced more false negatives, which is less desirable for predicting suicidal sentiments. CONCLUSIONS: Opioid abuse is linked to the risk of unintentional overdose and suicide risk. Social media platforms such as Reddit contain metadata that can aid machine learning and provide information at a personal level that cannot be obtained elsewhere. We demonstrate that it is possible to use NNs as a tool to predict an out-of-sample target with a model built from data sets labeled by characteristics we wish to distinguish in the out-of-sample target. JMIR Publications 2020-11-27 /pmc/articles/PMC7732714/ /pubmed/33245287 http://dx.doi.org/10.2196/15293 Text en ©Hannah Yao, Sina Rashidian, Xinyu Dong, Hongyi Duanmu, Richard N Rosenthal, Fusheng Wang. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 27.11.2020. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on http://www.jmir.org/, as well as this copyright and license information must be included. |
spellingShingle | Original Paper Yao, Hannah Rashidian, Sina Dong, Xinyu Duanmu, Hongyi Rosenthal, Richard N Wang, Fusheng Detection of Suicidality Among Opioid Users on Reddit: Machine Learning–Based Approach |
title | Detection of Suicidality Among Opioid Users on Reddit: Machine Learning–Based Approach |
title_full | Detection of Suicidality Among Opioid Users on Reddit: Machine Learning–Based Approach |
title_fullStr | Detection of Suicidality Among Opioid Users on Reddit: Machine Learning–Based Approach |
title_full_unstemmed | Detection of Suicidality Among Opioid Users on Reddit: Machine Learning–Based Approach |
title_short | Detection of Suicidality Among Opioid Users on Reddit: Machine Learning–Based Approach |
title_sort | detection of suicidality among opioid users on reddit: machine learning–based approach |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7732714/ https://www.ncbi.nlm.nih.gov/pubmed/33245287 http://dx.doi.org/10.2196/15293 |
work_keys_str_mv | AT yaohannah detectionofsuicidalityamongopioidusersonredditmachinelearningbasedapproach AT rashidiansina detectionofsuicidalityamongopioidusersonredditmachinelearningbasedapproach AT dongxinyu detectionofsuicidalityamongopioidusersonredditmachinelearningbasedapproach AT duanmuhongyi detectionofsuicidalityamongopioidusersonredditmachinelearningbasedapproach AT rosenthalrichardn detectionofsuicidalityamongopioidusersonredditmachinelearningbasedapproach AT wangfusheng detectionofsuicidalityamongopioidusersonredditmachinelearningbasedapproach |