Cargando…

Suicide Note Classification Using Natural Language Processing: A Content Analysis

Suicide is the second leading cause of death among 25–34 year olds and the third leading cause of death among 15–25 year olds in the United States. In the Emergency Department, where suicidal patients often present, estimating the risk of repeated attempts is generally left to clinical judgment. Thi...

Descripción completa

Detalles Bibliográficos
Autores principales: Pestian, John, Nasrallah, Henry, Matykiewicz, Pawel, Bennett, Aurora, Leenaars, Antoon
Formato: Online Artículo Texto
Lenguaje:English
Publicado: SAGE Publications 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3107011/
https://www.ncbi.nlm.nih.gov/pubmed/21643548
http://dx.doi.org/10.4137/BII.S4706
_version_ 1782205192687583232
author Pestian, John
Nasrallah, Henry
Matykiewicz, Pawel
Bennett, Aurora
Leenaars, Antoon
author_facet Pestian, John
Nasrallah, Henry
Matykiewicz, Pawel
Bennett, Aurora
Leenaars, Antoon
author_sort Pestian, John
collection PubMed
description Suicide is the second leading cause of death among 25–34 year olds and the third leading cause of death among 15–25 year olds in the United States. In the Emergency Department, where suicidal patients often present, estimating the risk of repeated attempts is generally left to clinical judgment. This paper presents our second attempt to determine the role of computational algorithms in understanding a suicidal patient's thoughts, as represented by suicide notes. We focus on developing methods of natural language processing that distinguish between genuine and elicited suicide notes. We hypothesize that machine learning algorithms can categorize suicide notes as well as mental health professionals and psychiatric physician trainees do. The data used are comprised of suicide notes from 33 suicide completers and matched to 33 elicited notes from healthy control group members. Eleven mental health professionals and 31 psychiatric trainees were asked to decide if a note was genuine or elicited. Their decisions were compared to nine different machine-learning algorithms. The results indicate that trainees accurately classified notes 49% of the time, mental health professionals accurately classified notes 63% of the time, and the best machine learning algorithm accurately classified the notes 78% of the time. This is an important step in developing an evidence-based predictor of repeated suicide attempts because it shows that natural language processing can aid in distinguishing between classes of suicidal notes.
format Online
Article
Text
id pubmed-3107011
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher SAGE Publications
record_format MEDLINE/PubMed
spelling pubmed-31070112011-06-02 Suicide Note Classification Using Natural Language Processing: A Content Analysis Pestian, John Nasrallah, Henry Matykiewicz, Pawel Bennett, Aurora Leenaars, Antoon Biomed Inform Insights Original Research Suicide is the second leading cause of death among 25–34 year olds and the third leading cause of death among 15–25 year olds in the United States. In the Emergency Department, where suicidal patients often present, estimating the risk of repeated attempts is generally left to clinical judgment. This paper presents our second attempt to determine the role of computational algorithms in understanding a suicidal patient's thoughts, as represented by suicide notes. We focus on developing methods of natural language processing that distinguish between genuine and elicited suicide notes. We hypothesize that machine learning algorithms can categorize suicide notes as well as mental health professionals and psychiatric physician trainees do. The data used are comprised of suicide notes from 33 suicide completers and matched to 33 elicited notes from healthy control group members. Eleven mental health professionals and 31 psychiatric trainees were asked to decide if a note was genuine or elicited. Their decisions were compared to nine different machine-learning algorithms. The results indicate that trainees accurately classified notes 49% of the time, mental health professionals accurately classified notes 63% of the time, and the best machine learning algorithm accurately classified the notes 78% of the time. This is an important step in developing an evidence-based predictor of repeated suicide attempts because it shows that natural language processing can aid in distinguishing between classes of suicidal notes. SAGE Publications 2010-08-04 /pmc/articles/PMC3107011/ /pubmed/21643548 http://dx.doi.org/10.4137/BII.S4706 Text en © 2010 SAGE Publications. http://creativecommons.org/licenses/by-nc/3.0/ This article is distributed under the terms of the Creative Commons Attribution-NonCommercial 3.0 License (http://www.creativecommons.org/licenses/by-nc/3.0/) which permits non-commercial use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access page (https://us.sagepub.com/en-us/nam/open-access-at-sage).
spellingShingle Original Research
Pestian, John
Nasrallah, Henry
Matykiewicz, Pawel
Bennett, Aurora
Leenaars, Antoon
Suicide Note Classification Using Natural Language Processing: A Content Analysis
title Suicide Note Classification Using Natural Language Processing: A Content Analysis
title_full Suicide Note Classification Using Natural Language Processing: A Content Analysis
title_fullStr Suicide Note Classification Using Natural Language Processing: A Content Analysis
title_full_unstemmed Suicide Note Classification Using Natural Language Processing: A Content Analysis
title_short Suicide Note Classification Using Natural Language Processing: A Content Analysis
title_sort suicide note classification using natural language processing: a content analysis
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3107011/
https://www.ncbi.nlm.nih.gov/pubmed/21643548
http://dx.doi.org/10.4137/BII.S4706
work_keys_str_mv AT pestianjohn suicidenoteclassificationusingnaturallanguageprocessingacontentanalysis
AT nasrallahhenry suicidenoteclassificationusingnaturallanguageprocessingacontentanalysis
AT matykiewiczpawel suicidenoteclassificationusingnaturallanguageprocessingacontentanalysis
AT bennettaurora suicidenoteclassificationusingnaturallanguageprocessingacontentanalysis
AT leenaarsantoon suicidenoteclassificationusingnaturallanguageprocessingacontentanalysis