Cargando…
Identifying and Predicting Intentional Self-Harm in Electronic Health Record Clinical Notes: Deep Learning Approach
BACKGROUND: Suicide is an important public health concern in the United States and around the world. There has been significant work examining machine learning approaches to identify and predict intentional self-harm and suicide using existing data sets. With recent advances in computing, deep learn...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
JMIR Publications
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7426805/ https://www.ncbi.nlm.nih.gov/pubmed/32729840 http://dx.doi.org/10.2196/17784 |
_version_ | 1783570759707262976 |
---|---|
author | Obeid, Jihad S Dahne, Jennifer Christensen, Sean Howard, Samuel Crawford, Tami Frey, Lewis J Stecker, Tracy Bunnell, Brian E |
author_facet | Obeid, Jihad S Dahne, Jennifer Christensen, Sean Howard, Samuel Crawford, Tami Frey, Lewis J Stecker, Tracy Bunnell, Brian E |
author_sort | Obeid, Jihad S |
collection | PubMed |
description | BACKGROUND: Suicide is an important public health concern in the United States and around the world. There has been significant work examining machine learning approaches to identify and predict intentional self-harm and suicide using existing data sets. With recent advances in computing, deep learning applications in health care are gaining momentum. OBJECTIVE: This study aimed to leverage the information in clinical notes using deep neural networks (DNNs) to (1) improve the identification of patients treated for intentional self-harm and (2) predict future self-harm events. METHODS: We extracted clinical text notes from electronic health records (EHRs) of 835 patients with International Classification of Diseases (ICD) codes for intentional self-harm and 1670 matched controls who never had any intentional self-harm ICD codes. The data were divided into training and holdout test sets. We tested a number of algorithms on clinical notes associated with the intentional self-harm codes using the training set, including several traditional bag-of-words–based models and 2 DNN models: a convolutional neural network (CNN) and a long short-term memory model. We also evaluated the predictive performance of the DNNs on a subset of patients who had clinical notes 1 to 6 months before the first intentional self-harm event. Finally, we evaluated the impact of a pretrained model using Word2vec (W2V) on performance. RESULTS: The area under the receiver operating characteristic curve (AUC) for the CNN on the phenotyping task, that is, the detection of intentional self-harm in clinical notes concurrent with the events was 0.999, with an F1 score of 0.985. In the predictive task, the CNN achieved the highest performance with an AUC of 0.882 and an F1 score of 0.769. Although pretraining with W2V shortened the DNN training time, it did not improve performance. CONCLUSIONS: The strong performance on the first task, namely, phenotyping based on clinical notes, suggests that such models could be used effectively for surveillance of intentional self-harm in clinical text in an EHR. The modest performance on the predictive task notwithstanding, the results using DNN models on clinical text alone are competitive with other reports in the literature using risk factors from structured EHR data. |
format | Online Article Text |
id | pubmed-7426805 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | JMIR Publications |
record_format | MEDLINE/PubMed |
spelling | pubmed-74268052020-08-24 Identifying and Predicting Intentional Self-Harm in Electronic Health Record Clinical Notes: Deep Learning Approach Obeid, Jihad S Dahne, Jennifer Christensen, Sean Howard, Samuel Crawford, Tami Frey, Lewis J Stecker, Tracy Bunnell, Brian E JMIR Med Inform Original Paper BACKGROUND: Suicide is an important public health concern in the United States and around the world. There has been significant work examining machine learning approaches to identify and predict intentional self-harm and suicide using existing data sets. With recent advances in computing, deep learning applications in health care are gaining momentum. OBJECTIVE: This study aimed to leverage the information in clinical notes using deep neural networks (DNNs) to (1) improve the identification of patients treated for intentional self-harm and (2) predict future self-harm events. METHODS: We extracted clinical text notes from electronic health records (EHRs) of 835 patients with International Classification of Diseases (ICD) codes for intentional self-harm and 1670 matched controls who never had any intentional self-harm ICD codes. The data were divided into training and holdout test sets. We tested a number of algorithms on clinical notes associated with the intentional self-harm codes using the training set, including several traditional bag-of-words–based models and 2 DNN models: a convolutional neural network (CNN) and a long short-term memory model. We also evaluated the predictive performance of the DNNs on a subset of patients who had clinical notes 1 to 6 months before the first intentional self-harm event. Finally, we evaluated the impact of a pretrained model using Word2vec (W2V) on performance. RESULTS: The area under the receiver operating characteristic curve (AUC) for the CNN on the phenotyping task, that is, the detection of intentional self-harm in clinical notes concurrent with the events was 0.999, with an F1 score of 0.985. In the predictive task, the CNN achieved the highest performance with an AUC of 0.882 and an F1 score of 0.769. Although pretraining with W2V shortened the DNN training time, it did not improve performance. CONCLUSIONS: The strong performance on the first task, namely, phenotyping based on clinical notes, suggests that such models could be used effectively for surveillance of intentional self-harm in clinical text in an EHR. The modest performance on the predictive task notwithstanding, the results using DNN models on clinical text alone are competitive with other reports in the literature using risk factors from structured EHR data. JMIR Publications 2020-07-30 /pmc/articles/PMC7426805/ /pubmed/32729840 http://dx.doi.org/10.2196/17784 Text en ©Jihad S Obeid, Jennifer Dahne, Sean Christensen, Samuel Howard, Tami Crawford, Lewis J Frey, Tracy Stecker, Brian E Bunnell. Originally published in JMIR Medical Informatics (http://medinform.jmir.org), 30.07.2020. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Informatics, is properly cited. The complete bibliographic information, a link to the original publication on http://medinform.jmir.org/, as well as this copyright and license information must be included. |
spellingShingle | Original Paper Obeid, Jihad S Dahne, Jennifer Christensen, Sean Howard, Samuel Crawford, Tami Frey, Lewis J Stecker, Tracy Bunnell, Brian E Identifying and Predicting Intentional Self-Harm in Electronic Health Record Clinical Notes: Deep Learning Approach |
title | Identifying and Predicting Intentional Self-Harm in Electronic Health Record Clinical Notes: Deep Learning Approach |
title_full | Identifying and Predicting Intentional Self-Harm in Electronic Health Record Clinical Notes: Deep Learning Approach |
title_fullStr | Identifying and Predicting Intentional Self-Harm in Electronic Health Record Clinical Notes: Deep Learning Approach |
title_full_unstemmed | Identifying and Predicting Intentional Self-Harm in Electronic Health Record Clinical Notes: Deep Learning Approach |
title_short | Identifying and Predicting Intentional Self-Harm in Electronic Health Record Clinical Notes: Deep Learning Approach |
title_sort | identifying and predicting intentional self-harm in electronic health record clinical notes: deep learning approach |
topic | Original Paper |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7426805/ https://www.ncbi.nlm.nih.gov/pubmed/32729840 http://dx.doi.org/10.2196/17784 |
work_keys_str_mv | AT obeidjihads identifyingandpredictingintentionalselfharminelectronichealthrecordclinicalnotesdeeplearningapproach AT dahnejennifer identifyingandpredictingintentionalselfharminelectronichealthrecordclinicalnotesdeeplearningapproach AT christensensean identifyingandpredictingintentionalselfharminelectronichealthrecordclinicalnotesdeeplearningapproach AT howardsamuel identifyingandpredictingintentionalselfharminelectronichealthrecordclinicalnotesdeeplearningapproach AT crawfordtami identifyingandpredictingintentionalselfharminelectronichealthrecordclinicalnotesdeeplearningapproach AT freylewisj identifyingandpredictingintentionalselfharminelectronichealthrecordclinicalnotesdeeplearningapproach AT steckertracy identifyingandpredictingintentionalselfharminelectronichealthrecordclinicalnotesdeeplearningapproach AT bunnellbriane identifyingandpredictingintentionalselfharminelectronichealthrecordclinicalnotesdeeplearningapproach |