Cargando…

Applying deep neural networks to unstructured text notes in electronic medical records for phenotyping youth depression

BACKGROUND: We report a study of machine learning applied to the phenotyping of psychiatric diagnosis for research recruitment in youth depression, conducted with 861 labelled electronic medical records (EMRs) documents. A model was built that could accurately identify individuals who were suitable...

Descripción completa

Detalles Bibliográficos
Autores principales: Geraci, Joseph, Wilansky, Pamela, de Luca, Vincenzo, Roy, Anvesh, Kennedy, James L, Strauss, John
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BMJ Publishing Group 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5566092/
https://www.ncbi.nlm.nih.gov/pubmed/28739578
http://dx.doi.org/10.1136/eb-2017-102688
_version_ 1783258484786069504
author Geraci, Joseph
Wilansky, Pamela
de Luca, Vincenzo
Roy, Anvesh
Kennedy, James L
Strauss, John
author_facet Geraci, Joseph
Wilansky, Pamela
de Luca, Vincenzo
Roy, Anvesh
Kennedy, James L
Strauss, John
author_sort Geraci, Joseph
collection PubMed
description BACKGROUND: We report a study of machine learning applied to the phenotyping of psychiatric diagnosis for research recruitment in youth depression, conducted with 861 labelled electronic medical records (EMRs) documents. A model was built that could accurately identify individuals who were suitable candidates for a study on youth depression. OBJECTIVE: Our objective was a model to identify individuals who meet inclusion criteria as well as unsuitable patients who would require exclusion. METHODS: Our methods included applying a system that coded the EMR documents by removing personally identifying information, using two psychiatrists who labelled a set of EMR documents (from which the 861 came), using a brute force search and training a deep neural network for this task. FINDINGS: According to a cross-validation evaluation, we describe a model that had a specificity of 97% and a sensitivity of 45% and a second model with a specificity of 53% and a sensitivity of 89%. We combined these two models into a third one (sensitivity 93.5%; specificity 68%; positive predictive value (precision) 77%) to generate a list of most suitable candidates in support of research recruitment. CONCLUSION: Our efforts are meant to demonstrate the potential for this type of approach for patient recruitment purposes but it should be noted that a larger sample size is required to build a truly reliable recommendation system. CLINICAL IMPLICATIONS: Future efforts will employ alternate neural network algorithms available and other machine learning methods.
format Online
Article
Text
id pubmed-5566092
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BMJ Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-55660922017-08-28 Applying deep neural networks to unstructured text notes in electronic medical records for phenotyping youth depression Geraci, Joseph Wilansky, Pamela de Luca, Vincenzo Roy, Anvesh Kennedy, James L Strauss, John Evid Based Ment Health Original Article BACKGROUND: We report a study of machine learning applied to the phenotyping of psychiatric diagnosis for research recruitment in youth depression, conducted with 861 labelled electronic medical records (EMRs) documents. A model was built that could accurately identify individuals who were suitable candidates for a study on youth depression. OBJECTIVE: Our objective was a model to identify individuals who meet inclusion criteria as well as unsuitable patients who would require exclusion. METHODS: Our methods included applying a system that coded the EMR documents by removing personally identifying information, using two psychiatrists who labelled a set of EMR documents (from which the 861 came), using a brute force search and training a deep neural network for this task. FINDINGS: According to a cross-validation evaluation, we describe a model that had a specificity of 97% and a sensitivity of 45% and a second model with a specificity of 53% and a sensitivity of 89%. We combined these two models into a third one (sensitivity 93.5%; specificity 68%; positive predictive value (precision) 77%) to generate a list of most suitable candidates in support of research recruitment. CONCLUSION: Our efforts are meant to demonstrate the potential for this type of approach for patient recruitment purposes but it should be noted that a larger sample size is required to build a truly reliable recommendation system. CLINICAL IMPLICATIONS: Future efforts will employ alternate neural network algorithms available and other machine learning methods. BMJ Publishing Group 2017-08 2017-07-24 /pmc/articles/PMC5566092/ /pubmed/28739578 http://dx.doi.org/10.1136/eb-2017-102688 Text en © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2017. All rights reserved. No commercial use is permitted unless otherwise expressly granted. This is an Open Access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/
spellingShingle Original Article
Geraci, Joseph
Wilansky, Pamela
de Luca, Vincenzo
Roy, Anvesh
Kennedy, James L
Strauss, John
Applying deep neural networks to unstructured text notes in electronic medical records for phenotyping youth depression
title Applying deep neural networks to unstructured text notes in electronic medical records for phenotyping youth depression
title_full Applying deep neural networks to unstructured text notes in electronic medical records for phenotyping youth depression
title_fullStr Applying deep neural networks to unstructured text notes in electronic medical records for phenotyping youth depression
title_full_unstemmed Applying deep neural networks to unstructured text notes in electronic medical records for phenotyping youth depression
title_short Applying deep neural networks to unstructured text notes in electronic medical records for phenotyping youth depression
title_sort applying deep neural networks to unstructured text notes in electronic medical records for phenotyping youth depression
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5566092/
https://www.ncbi.nlm.nih.gov/pubmed/28739578
http://dx.doi.org/10.1136/eb-2017-102688
work_keys_str_mv AT geracijoseph applyingdeepneuralnetworkstounstructuredtextnotesinelectronicmedicalrecordsforphenotypingyouthdepression
AT wilanskypamela applyingdeepneuralnetworkstounstructuredtextnotesinelectronicmedicalrecordsforphenotypingyouthdepression
AT delucavincenzo applyingdeepneuralnetworkstounstructuredtextnotesinelectronicmedicalrecordsforphenotypingyouthdepression
AT royanvesh applyingdeepneuralnetworkstounstructuredtextnotesinelectronicmedicalrecordsforphenotypingyouthdepression
AT kennedyjamesl applyingdeepneuralnetworkstounstructuredtextnotesinelectronicmedicalrecordsforphenotypingyouthdepression
AT straussjohn applyingdeepneuralnetworkstounstructuredtextnotesinelectronicmedicalrecordsforphenotypingyouthdepression