Cargando…

Deep Hierarchical Ensemble Model for Suicide Detection on Imbalanced Social Media Data

As a serious worldwide problem, suicide often causes huge and irreversible losses to families and society. Therefore, it is necessary to detect and help individuals with suicidal ideation in time. In recent years, the prosperous development of social media has provided new perspectives on suicide de...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Zepeng, Zhou, Jiawei, An, Zhengyi, Cheng, Wenchuan, Hu, Bin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9029105/
https://www.ncbi.nlm.nih.gov/pubmed/35455105
http://dx.doi.org/10.3390/e24040442
_version_ 1784691792802742272
author Li, Zepeng
Zhou, Jiawei
An, Zhengyi
Cheng, Wenchuan
Hu, Bin
author_facet Li, Zepeng
Zhou, Jiawei
An, Zhengyi
Cheng, Wenchuan
Hu, Bin
author_sort Li, Zepeng
collection PubMed
description As a serious worldwide problem, suicide often causes huge and irreversible losses to families and society. Therefore, it is necessary to detect and help individuals with suicidal ideation in time. In recent years, the prosperous development of social media has provided new perspectives on suicide detection, but related research still faces some difficulties, such as data imbalance and expression implicitness. In this paper, we propose a Deep Hierarchical Ensemble model for Suicide Detection (DHE-SD) based on a hierarchical ensemble strategy, and construct a dataset based on Sina Weibo, which contains more than 550 thousand posts from 4521 users. To verify the effectiveness of the model, we also conduct experiments on a public Weibo dataset containing 7329 users’ posts. The proposed model achieves the best performance on both the constructed dataset and the public dataset. In addition, in order to make the model applicable to a wider population, we use the proposed sentence-level mask mechanism to delete user posts with strong suicidal ideation. Experiments show that the proposed model can still effectively identify social media users with suicidal ideation even when the performance of the baseline models decrease significantly.
format Online
Article
Text
id pubmed-9029105
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-90291052022-04-23 Deep Hierarchical Ensemble Model for Suicide Detection on Imbalanced Social Media Data Li, Zepeng Zhou, Jiawei An, Zhengyi Cheng, Wenchuan Hu, Bin Entropy (Basel) Article As a serious worldwide problem, suicide often causes huge and irreversible losses to families and society. Therefore, it is necessary to detect and help individuals with suicidal ideation in time. In recent years, the prosperous development of social media has provided new perspectives on suicide detection, but related research still faces some difficulties, such as data imbalance and expression implicitness. In this paper, we propose a Deep Hierarchical Ensemble model for Suicide Detection (DHE-SD) based on a hierarchical ensemble strategy, and construct a dataset based on Sina Weibo, which contains more than 550 thousand posts from 4521 users. To verify the effectiveness of the model, we also conduct experiments on a public Weibo dataset containing 7329 users’ posts. The proposed model achieves the best performance on both the constructed dataset and the public dataset. In addition, in order to make the model applicable to a wider population, we use the proposed sentence-level mask mechanism to delete user posts with strong suicidal ideation. Experiments show that the proposed model can still effectively identify social media users with suicidal ideation even when the performance of the baseline models decrease significantly. MDPI 2022-03-23 /pmc/articles/PMC9029105/ /pubmed/35455105 http://dx.doi.org/10.3390/e24040442 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Li, Zepeng
Zhou, Jiawei
An, Zhengyi
Cheng, Wenchuan
Hu, Bin
Deep Hierarchical Ensemble Model for Suicide Detection on Imbalanced Social Media Data
title Deep Hierarchical Ensemble Model for Suicide Detection on Imbalanced Social Media Data
title_full Deep Hierarchical Ensemble Model for Suicide Detection on Imbalanced Social Media Data
title_fullStr Deep Hierarchical Ensemble Model for Suicide Detection on Imbalanced Social Media Data
title_full_unstemmed Deep Hierarchical Ensemble Model for Suicide Detection on Imbalanced Social Media Data
title_short Deep Hierarchical Ensemble Model for Suicide Detection on Imbalanced Social Media Data
title_sort deep hierarchical ensemble model for suicide detection on imbalanced social media data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9029105/
https://www.ncbi.nlm.nih.gov/pubmed/35455105
http://dx.doi.org/10.3390/e24040442
work_keys_str_mv AT lizepeng deephierarchicalensemblemodelforsuicidedetectiononimbalancedsocialmediadata
AT zhoujiawei deephierarchicalensemblemodelforsuicidedetectiononimbalancedsocialmediadata
AT anzhengyi deephierarchicalensemblemodelforsuicidedetectiononimbalancedsocialmediadata
AT chengwenchuan deephierarchicalensemblemodelforsuicidedetectiononimbalancedsocialmediadata
AT hubin deephierarchicalensemblemodelforsuicidedetectiononimbalancedsocialmediadata