Cargando…
Applying deep neural networks to unstructured text notes in electronic medical records for phenotyping youth depression
BACKGROUND: We report a study of machine learning applied to the phenotyping of psychiatric diagnosis for research recruitment in youth depression, conducted with 861 labelled electronic medical records (EMRs) documents. A model was built that could accurately identify individuals who were suitable...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BMJ Publishing Group
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5566092/ https://www.ncbi.nlm.nih.gov/pubmed/28739578 http://dx.doi.org/10.1136/eb-2017-102688 |
_version_ | 1783258484786069504 |
---|---|
author | Geraci, Joseph Wilansky, Pamela de Luca, Vincenzo Roy, Anvesh Kennedy, James L Strauss, John |
author_facet | Geraci, Joseph Wilansky, Pamela de Luca, Vincenzo Roy, Anvesh Kennedy, James L Strauss, John |
author_sort | Geraci, Joseph |
collection | PubMed |
description | BACKGROUND: We report a study of machine learning applied to the phenotyping of psychiatric diagnosis for research recruitment in youth depression, conducted with 861 labelled electronic medical records (EMRs) documents. A model was built that could accurately identify individuals who were suitable candidates for a study on youth depression. OBJECTIVE: Our objective was a model to identify individuals who meet inclusion criteria as well as unsuitable patients who would require exclusion. METHODS: Our methods included applying a system that coded the EMR documents by removing personally identifying information, using two psychiatrists who labelled a set of EMR documents (from which the 861 came), using a brute force search and training a deep neural network for this task. FINDINGS: According to a cross-validation evaluation, we describe a model that had a specificity of 97% and a sensitivity of 45% and a second model with a specificity of 53% and a sensitivity of 89%. We combined these two models into a third one (sensitivity 93.5%; specificity 68%; positive predictive value (precision) 77%) to generate a list of most suitable candidates in support of research recruitment. CONCLUSION: Our efforts are meant to demonstrate the potential for this type of approach for patient recruitment purposes but it should be noted that a larger sample size is required to build a truly reliable recommendation system. CLINICAL IMPLICATIONS: Future efforts will employ alternate neural network algorithms available and other machine learning methods. |
format | Online Article Text |
id | pubmed-5566092 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | BMJ Publishing Group |
record_format | MEDLINE/PubMed |
spelling | pubmed-55660922017-08-28 Applying deep neural networks to unstructured text notes in electronic medical records for phenotyping youth depression Geraci, Joseph Wilansky, Pamela de Luca, Vincenzo Roy, Anvesh Kennedy, James L Strauss, John Evid Based Ment Health Original Article BACKGROUND: We report a study of machine learning applied to the phenotyping of psychiatric diagnosis for research recruitment in youth depression, conducted with 861 labelled electronic medical records (EMRs) documents. A model was built that could accurately identify individuals who were suitable candidates for a study on youth depression. OBJECTIVE: Our objective was a model to identify individuals who meet inclusion criteria as well as unsuitable patients who would require exclusion. METHODS: Our methods included applying a system that coded the EMR documents by removing personally identifying information, using two psychiatrists who labelled a set of EMR documents (from which the 861 came), using a brute force search and training a deep neural network for this task. FINDINGS: According to a cross-validation evaluation, we describe a model that had a specificity of 97% and a sensitivity of 45% and a second model with a specificity of 53% and a sensitivity of 89%. We combined these two models into a third one (sensitivity 93.5%; specificity 68%; positive predictive value (precision) 77%) to generate a list of most suitable candidates in support of research recruitment. CONCLUSION: Our efforts are meant to demonstrate the potential for this type of approach for patient recruitment purposes but it should be noted that a larger sample size is required to build a truly reliable recommendation system. CLINICAL IMPLICATIONS: Future efforts will employ alternate neural network algorithms available and other machine learning methods. BMJ Publishing Group 2017-08 2017-07-24 /pmc/articles/PMC5566092/ /pubmed/28739578 http://dx.doi.org/10.1136/eb-2017-102688 Text en © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2017. All rights reserved. No commercial use is permitted unless otherwise expressly granted. This is an Open Access article distributed in accordance with the Creative Commons Attribution Non Commercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited and the use is non-commercial. See: http://creativecommons.org/licenses/by-nc/4.0/ |
spellingShingle | Original Article Geraci, Joseph Wilansky, Pamela de Luca, Vincenzo Roy, Anvesh Kennedy, James L Strauss, John Applying deep neural networks to unstructured text notes in electronic medical records for phenotyping youth depression |
title | Applying deep neural networks to unstructured text notes in electronic medical records for phenotyping youth depression |
title_full | Applying deep neural networks to unstructured text notes in electronic medical records for phenotyping youth depression |
title_fullStr | Applying deep neural networks to unstructured text notes in electronic medical records for phenotyping youth depression |
title_full_unstemmed | Applying deep neural networks to unstructured text notes in electronic medical records for phenotyping youth depression |
title_short | Applying deep neural networks to unstructured text notes in electronic medical records for phenotyping youth depression |
title_sort | applying deep neural networks to unstructured text notes in electronic medical records for phenotyping youth depression |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5566092/ https://www.ncbi.nlm.nih.gov/pubmed/28739578 http://dx.doi.org/10.1136/eb-2017-102688 |
work_keys_str_mv | AT geracijoseph applyingdeepneuralnetworkstounstructuredtextnotesinelectronicmedicalrecordsforphenotypingyouthdepression AT wilanskypamela applyingdeepneuralnetworkstounstructuredtextnotesinelectronicmedicalrecordsforphenotypingyouthdepression AT delucavincenzo applyingdeepneuralnetworkstounstructuredtextnotesinelectronicmedicalrecordsforphenotypingyouthdepression AT royanvesh applyingdeepneuralnetworkstounstructuredtextnotesinelectronicmedicalrecordsforphenotypingyouthdepression AT kennedyjamesl applyingdeepneuralnetworkstounstructuredtextnotesinelectronicmedicalrecordsforphenotypingyouthdepression AT straussjohn applyingdeepneuralnetworkstounstructuredtextnotesinelectronicmedicalrecordsforphenotypingyouthdepression |