Cargando…

Identifying and Predicting Intentional Self-Harm in Electronic Health Record Clinical Notes: Deep Learning Approach

BACKGROUND: Suicide is an important public health concern in the United States and around the world. There has been significant work examining machine learning approaches to identify and predict intentional self-harm and suicide using existing data sets. With recent advances in computing, deep learn...

Descripción completa

Detalles Bibliográficos
Autores principales: Obeid, Jihad S, Dahne, Jennifer, Christensen, Sean, Howard, Samuel, Crawford, Tami, Frey, Lewis J, Stecker, Tracy, Bunnell, Brian E
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7426805/
https://www.ncbi.nlm.nih.gov/pubmed/32729840
http://dx.doi.org/10.2196/17784
_version_ 1783570759707262976
author Obeid, Jihad S
Dahne, Jennifer
Christensen, Sean
Howard, Samuel
Crawford, Tami
Frey, Lewis J
Stecker, Tracy
Bunnell, Brian E
author_facet Obeid, Jihad S
Dahne, Jennifer
Christensen, Sean
Howard, Samuel
Crawford, Tami
Frey, Lewis J
Stecker, Tracy
Bunnell, Brian E
author_sort Obeid, Jihad S
collection PubMed
description BACKGROUND: Suicide is an important public health concern in the United States and around the world. There has been significant work examining machine learning approaches to identify and predict intentional self-harm and suicide using existing data sets. With recent advances in computing, deep learning applications in health care are gaining momentum. OBJECTIVE: This study aimed to leverage the information in clinical notes using deep neural networks (DNNs) to (1) improve the identification of patients treated for intentional self-harm and (2) predict future self-harm events. METHODS: We extracted clinical text notes from electronic health records (EHRs) of 835 patients with International Classification of Diseases (ICD) codes for intentional self-harm and 1670 matched controls who never had any intentional self-harm ICD codes. The data were divided into training and holdout test sets. We tested a number of algorithms on clinical notes associated with the intentional self-harm codes using the training set, including several traditional bag-of-words–based models and 2 DNN models: a convolutional neural network (CNN) and a long short-term memory model. We also evaluated the predictive performance of the DNNs on a subset of patients who had clinical notes 1 to 6 months before the first intentional self-harm event. Finally, we evaluated the impact of a pretrained model using Word2vec (W2V) on performance. RESULTS: The area under the receiver operating characteristic curve (AUC) for the CNN on the phenotyping task, that is, the detection of intentional self-harm in clinical notes concurrent with the events was 0.999, with an F1 score of 0.985. In the predictive task, the CNN achieved the highest performance with an AUC of 0.882 and an F1 score of 0.769. Although pretraining with W2V shortened the DNN training time, it did not improve performance. CONCLUSIONS: The strong performance on the first task, namely, phenotyping based on clinical notes, suggests that such models could be used effectively for surveillance of intentional self-harm in clinical text in an EHR. The modest performance on the predictive task notwithstanding, the results using DNN models on clinical text alone are competitive with other reports in the literature using risk factors from structured EHR data.
format Online
Article
Text
id pubmed-7426805
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-74268052020-08-24 Identifying and Predicting Intentional Self-Harm in Electronic Health Record Clinical Notes: Deep Learning Approach Obeid, Jihad S Dahne, Jennifer Christensen, Sean Howard, Samuel Crawford, Tami Frey, Lewis J Stecker, Tracy Bunnell, Brian E JMIR Med Inform Original Paper BACKGROUND: Suicide is an important public health concern in the United States and around the world. There has been significant work examining machine learning approaches to identify and predict intentional self-harm and suicide using existing data sets. With recent advances in computing, deep learning applications in health care are gaining momentum. OBJECTIVE: This study aimed to leverage the information in clinical notes using deep neural networks (DNNs) to (1) improve the identification of patients treated for intentional self-harm and (2) predict future self-harm events. METHODS: We extracted clinical text notes from electronic health records (EHRs) of 835 patients with International Classification of Diseases (ICD) codes for intentional self-harm and 1670 matched controls who never had any intentional self-harm ICD codes. The data were divided into training and holdout test sets. We tested a number of algorithms on clinical notes associated with the intentional self-harm codes using the training set, including several traditional bag-of-words–based models and 2 DNN models: a convolutional neural network (CNN) and a long short-term memory model. We also evaluated the predictive performance of the DNNs on a subset of patients who had clinical notes 1 to 6 months before the first intentional self-harm event. Finally, we evaluated the impact of a pretrained model using Word2vec (W2V) on performance. RESULTS: The area under the receiver operating characteristic curve (AUC) for the CNN on the phenotyping task, that is, the detection of intentional self-harm in clinical notes concurrent with the events was 0.999, with an F1 score of 0.985. In the predictive task, the CNN achieved the highest performance with an AUC of 0.882 and an F1 score of 0.769. Although pretraining with W2V shortened the DNN training time, it did not improve performance. CONCLUSIONS: The strong performance on the first task, namely, phenotyping based on clinical notes, suggests that such models could be used effectively for surveillance of intentional self-harm in clinical text in an EHR. The modest performance on the predictive task notwithstanding, the results using DNN models on clinical text alone are competitive with other reports in the literature using risk factors from structured EHR data. JMIR Publications 2020-07-30 /pmc/articles/PMC7426805/ /pubmed/32729840 http://dx.doi.org/10.2196/17784 Text en ©Jihad S Obeid, Jennifer Dahne, Sean Christensen, Samuel Howard, Tami Crawford, Lewis J Frey, Tracy Stecker, Brian E Bunnell. Originally published in JMIR Medical Informatics (http://medinform.jmir.org), 30.07.2020. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Informatics, is properly cited. The complete bibliographic information, a link to the original publication on http://medinform.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Original Paper
Obeid, Jihad S
Dahne, Jennifer
Christensen, Sean
Howard, Samuel
Crawford, Tami
Frey, Lewis J
Stecker, Tracy
Bunnell, Brian E
Identifying and Predicting Intentional Self-Harm in Electronic Health Record Clinical Notes: Deep Learning Approach
title Identifying and Predicting Intentional Self-Harm in Electronic Health Record Clinical Notes: Deep Learning Approach
title_full Identifying and Predicting Intentional Self-Harm in Electronic Health Record Clinical Notes: Deep Learning Approach
title_fullStr Identifying and Predicting Intentional Self-Harm in Electronic Health Record Clinical Notes: Deep Learning Approach
title_full_unstemmed Identifying and Predicting Intentional Self-Harm in Electronic Health Record Clinical Notes: Deep Learning Approach
title_short Identifying and Predicting Intentional Self-Harm in Electronic Health Record Clinical Notes: Deep Learning Approach
title_sort identifying and predicting intentional self-harm in electronic health record clinical notes: deep learning approach
topic Original Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7426805/
https://www.ncbi.nlm.nih.gov/pubmed/32729840
http://dx.doi.org/10.2196/17784
work_keys_str_mv AT obeidjihads identifyingandpredictingintentionalselfharminelectronichealthrecordclinicalnotesdeeplearningapproach
AT dahnejennifer identifyingandpredictingintentionalselfharminelectronichealthrecordclinicalnotesdeeplearningapproach
AT christensensean identifyingandpredictingintentionalselfharminelectronichealthrecordclinicalnotesdeeplearningapproach
AT howardsamuel identifyingandpredictingintentionalselfharminelectronichealthrecordclinicalnotesdeeplearningapproach
AT crawfordtami identifyingandpredictingintentionalselfharminelectronichealthrecordclinicalnotesdeeplearningapproach
AT freylewisj identifyingandpredictingintentionalselfharminelectronichealthrecordclinicalnotesdeeplearningapproach
AT steckertracy identifyingandpredictingintentionalselfharminelectronichealthrecordclinicalnotesdeeplearningapproach
AT bunnellbriane identifyingandpredictingintentionalselfharminelectronichealthrecordclinicalnotesdeeplearningapproach