Cargando…

Creating a Chinese suicide dictionary for identifying suicide risk on social media

Introduction. Suicide has become a serious worldwide epidemic. Early detection of individual suicide risk in population is important for reducing suicide rates. Traditional methods are ineffective in identifying suicide risk in time, suggesting a need for novel techniques. This paper proposes to det...

Descripción completa

Detalles Bibliográficos
Autores principales: Lv, Meizhen, Li, Ang, Liu, Tianli, Zhu, Tingshao
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4690390/
https://www.ncbi.nlm.nih.gov/pubmed/26713232
http://dx.doi.org/10.7717/peerj.1455
_version_ 1782407010608742400
author Lv, Meizhen
Li, Ang
Liu, Tianli
Zhu, Tingshao
author_facet Lv, Meizhen
Li, Ang
Liu, Tianli
Zhu, Tingshao
author_sort Lv, Meizhen
collection PubMed
description Introduction. Suicide has become a serious worldwide epidemic. Early detection of individual suicide risk in population is important for reducing suicide rates. Traditional methods are ineffective in identifying suicide risk in time, suggesting a need for novel techniques. This paper proposes to detect suicide risk on social media using a Chinese suicide dictionary. Methods. To build the Chinese suicide dictionary, eight researchers were recruited to select initial words from 4,653 posts published on Sina Weibo (the largest social media service provider in China) and two Chinese sentiment dictionaries (HowNet and NTUSD). Then, another three researchers were recruited to filter out irrelevant words. Finally, remaining words were further expanded using a corpus-based method. After building the Chinese suicide dictionary, we tested its performance in identifying suicide risk on Weibo. First, we made a comparison of the performance in both detecting suicidal expression in Weibo posts and evaluating individual levels of suicide risk between the dictionary-based identifications and the expert ratings. Second, to differentiate between individuals with high and non-high scores on self-rating measure of suicide risk (Suicidal Possibility Scale, SPS), we built Support Vector Machines (SVM) models on the Chinese suicide dictionary and the Simplified Chinese Linguistic Inquiry and Word Count (SCLIWC) program, respectively. After that, we made a comparison of the classification performance between two types of SVM models. Results and Discussion. Dictionary-based identifications were significantly correlated with expert ratings in terms of both detecting suicidal expression (r = 0.507) and evaluating individual suicide risk (r = 0.455). For the differentiation between individuals with high and non-high scores on SPS, the Chinese suicide dictionary (t1: F(1) = 0.48; t2: F(1) = 0.56) produced a more accurate identification than SCLIWC (t1: F(1) = 0.41; t2: F(1) = 0.48) on different observation windows. Conclusions. This paper confirms that, using social media, it is possible to implement real-time monitoring individual suicide risk in population. Results of this study may be useful to improve Chinese suicide prevention programs and may be insightful for other countries.
format Online
Article
Text
id pubmed-4690390
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-46903902015-12-28 Creating a Chinese suicide dictionary for identifying suicide risk on social media Lv, Meizhen Li, Ang Liu, Tianli Zhu, Tingshao PeerJ Psychiatry and Psychology Introduction. Suicide has become a serious worldwide epidemic. Early detection of individual suicide risk in population is important for reducing suicide rates. Traditional methods are ineffective in identifying suicide risk in time, suggesting a need for novel techniques. This paper proposes to detect suicide risk on social media using a Chinese suicide dictionary. Methods. To build the Chinese suicide dictionary, eight researchers were recruited to select initial words from 4,653 posts published on Sina Weibo (the largest social media service provider in China) and two Chinese sentiment dictionaries (HowNet and NTUSD). Then, another three researchers were recruited to filter out irrelevant words. Finally, remaining words were further expanded using a corpus-based method. After building the Chinese suicide dictionary, we tested its performance in identifying suicide risk on Weibo. First, we made a comparison of the performance in both detecting suicidal expression in Weibo posts and evaluating individual levels of suicide risk between the dictionary-based identifications and the expert ratings. Second, to differentiate between individuals with high and non-high scores on self-rating measure of suicide risk (Suicidal Possibility Scale, SPS), we built Support Vector Machines (SVM) models on the Chinese suicide dictionary and the Simplified Chinese Linguistic Inquiry and Word Count (SCLIWC) program, respectively. After that, we made a comparison of the classification performance between two types of SVM models. Results and Discussion. Dictionary-based identifications were significantly correlated with expert ratings in terms of both detecting suicidal expression (r = 0.507) and evaluating individual suicide risk (r = 0.455). For the differentiation between individuals with high and non-high scores on SPS, the Chinese suicide dictionary (t1: F(1) = 0.48; t2: F(1) = 0.56) produced a more accurate identification than SCLIWC (t1: F(1) = 0.41; t2: F(1) = 0.48) on different observation windows. Conclusions. This paper confirms that, using social media, it is possible to implement real-time monitoring individual suicide risk in population. Results of this study may be useful to improve Chinese suicide prevention programs and may be insightful for other countries. PeerJ Inc. 2015-12-15 /pmc/articles/PMC4690390/ /pubmed/26713232 http://dx.doi.org/10.7717/peerj.1455 Text en © 2015 Lv et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.
spellingShingle Psychiatry and Psychology
Lv, Meizhen
Li, Ang
Liu, Tianli
Zhu, Tingshao
Creating a Chinese suicide dictionary for identifying suicide risk on social media
title Creating a Chinese suicide dictionary for identifying suicide risk on social media
title_full Creating a Chinese suicide dictionary for identifying suicide risk on social media
title_fullStr Creating a Chinese suicide dictionary for identifying suicide risk on social media
title_full_unstemmed Creating a Chinese suicide dictionary for identifying suicide risk on social media
title_short Creating a Chinese suicide dictionary for identifying suicide risk on social media
title_sort creating a chinese suicide dictionary for identifying suicide risk on social media
topic Psychiatry and Psychology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4690390/
https://www.ncbi.nlm.nih.gov/pubmed/26713232
http://dx.doi.org/10.7717/peerj.1455
work_keys_str_mv AT lvmeizhen creatingachinesesuicidedictionaryforidentifyingsuicideriskonsocialmedia
AT liang creatingachinesesuicidedictionaryforidentifyingsuicideriskonsocialmedia
AT liutianli creatingachinesesuicidedictionaryforidentifyingsuicideriskonsocialmedia
AT zhutingshao creatingachinesesuicidedictionaryforidentifyingsuicideriskonsocialmedia