Cargando…
Suicide Note Classification Using Natural Language Processing: A Content Analysis
Suicide is the second leading cause of death among 25–34 year olds and the third leading cause of death among 15–25 year olds in the United States. In the Emergency Department, where suicidal patients often present, estimating the risk of repeated attempts is generally left to clinical judgment. Thi...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
SAGE Publications
2010
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3107011/ https://www.ncbi.nlm.nih.gov/pubmed/21643548 http://dx.doi.org/10.4137/BII.S4706 |
_version_ | 1782205192687583232 |
---|---|
author | Pestian, John Nasrallah, Henry Matykiewicz, Pawel Bennett, Aurora Leenaars, Antoon |
author_facet | Pestian, John Nasrallah, Henry Matykiewicz, Pawel Bennett, Aurora Leenaars, Antoon |
author_sort | Pestian, John |
collection | PubMed |
description | Suicide is the second leading cause of death among 25–34 year olds and the third leading cause of death among 15–25 year olds in the United States. In the Emergency Department, where suicidal patients often present, estimating the risk of repeated attempts is generally left to clinical judgment. This paper presents our second attempt to determine the role of computational algorithms in understanding a suicidal patient's thoughts, as represented by suicide notes. We focus on developing methods of natural language processing that distinguish between genuine and elicited suicide notes. We hypothesize that machine learning algorithms can categorize suicide notes as well as mental health professionals and psychiatric physician trainees do. The data used are comprised of suicide notes from 33 suicide completers and matched to 33 elicited notes from healthy control group members. Eleven mental health professionals and 31 psychiatric trainees were asked to decide if a note was genuine or elicited. Their decisions were compared to nine different machine-learning algorithms. The results indicate that trainees accurately classified notes 49% of the time, mental health professionals accurately classified notes 63% of the time, and the best machine learning algorithm accurately classified the notes 78% of the time. This is an important step in developing an evidence-based predictor of repeated suicide attempts because it shows that natural language processing can aid in distinguishing between classes of suicidal notes. |
format | Online Article Text |
id | pubmed-3107011 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2010 |
publisher | SAGE Publications |
record_format | MEDLINE/PubMed |
spelling | pubmed-31070112011-06-02 Suicide Note Classification Using Natural Language Processing: A Content Analysis Pestian, John Nasrallah, Henry Matykiewicz, Pawel Bennett, Aurora Leenaars, Antoon Biomed Inform Insights Original Research Suicide is the second leading cause of death among 25–34 year olds and the third leading cause of death among 15–25 year olds in the United States. In the Emergency Department, where suicidal patients often present, estimating the risk of repeated attempts is generally left to clinical judgment. This paper presents our second attempt to determine the role of computational algorithms in understanding a suicidal patient's thoughts, as represented by suicide notes. We focus on developing methods of natural language processing that distinguish between genuine and elicited suicide notes. We hypothesize that machine learning algorithms can categorize suicide notes as well as mental health professionals and psychiatric physician trainees do. The data used are comprised of suicide notes from 33 suicide completers and matched to 33 elicited notes from healthy control group members. Eleven mental health professionals and 31 psychiatric trainees were asked to decide if a note was genuine or elicited. Their decisions were compared to nine different machine-learning algorithms. The results indicate that trainees accurately classified notes 49% of the time, mental health professionals accurately classified notes 63% of the time, and the best machine learning algorithm accurately classified the notes 78% of the time. This is an important step in developing an evidence-based predictor of repeated suicide attempts because it shows that natural language processing can aid in distinguishing between classes of suicidal notes. SAGE Publications 2010-08-04 /pmc/articles/PMC3107011/ /pubmed/21643548 http://dx.doi.org/10.4137/BII.S4706 Text en © 2010 SAGE Publications. http://creativecommons.org/licenses/by-nc/3.0/ This article is distributed under the terms of the Creative Commons Attribution-NonCommercial 3.0 License (http://www.creativecommons.org/licenses/by-nc/3.0/) which permits non-commercial use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access page (https://us.sagepub.com/en-us/nam/open-access-at-sage). |
spellingShingle | Original Research Pestian, John Nasrallah, Henry Matykiewicz, Pawel Bennett, Aurora Leenaars, Antoon Suicide Note Classification Using Natural Language Processing: A Content Analysis |
title | Suicide Note Classification Using Natural Language Processing: A
Content Analysis |
title_full | Suicide Note Classification Using Natural Language Processing: A
Content Analysis |
title_fullStr | Suicide Note Classification Using Natural Language Processing: A
Content Analysis |
title_full_unstemmed | Suicide Note Classification Using Natural Language Processing: A
Content Analysis |
title_short | Suicide Note Classification Using Natural Language Processing: A
Content Analysis |
title_sort | suicide note classification using natural language processing: a
content analysis |
topic | Original Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3107011/ https://www.ncbi.nlm.nih.gov/pubmed/21643548 http://dx.doi.org/10.4137/BII.S4706 |
work_keys_str_mv | AT pestianjohn suicidenoteclassificationusingnaturallanguageprocessingacontentanalysis AT nasrallahhenry suicidenoteclassificationusingnaturallanguageprocessingacontentanalysis AT matykiewiczpawel suicidenoteclassificationusingnaturallanguageprocessingacontentanalysis AT bennettaurora suicidenoteclassificationusingnaturallanguageprocessingacontentanalysis AT leenaarsantoon suicidenoteclassificationusingnaturallanguageprocessingacontentanalysis |