Cargando…

Applying deep neural networks to unstructured text notes in electronic medical records for phenotyping youth depression

BACKGROUND: We report a study of machine learning applied to the phenotyping of psychiatric diagnosis for research recruitment in youth depression, conducted with 861 labelled electronic medical records (EMRs) documents. A model was built that could accurately identify individuals who were suitable...

Descripción completa

Detalles Bibliográficos
Autores principales: Geraci, Joseph, Wilansky, Pamela, de Luca, Vincenzo, Roy, Anvesh, Kennedy, James L, Strauss, John
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BMJ Publishing Group 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5566092/
https://www.ncbi.nlm.nih.gov/pubmed/28739578
http://dx.doi.org/10.1136/eb-2017-102688
Descripción
Sumario:BACKGROUND: We report a study of machine learning applied to the phenotyping of psychiatric diagnosis for research recruitment in youth depression, conducted with 861 labelled electronic medical records (EMRs) documents. A model was built that could accurately identify individuals who were suitable candidates for a study on youth depression. OBJECTIVE: Our objective was a model to identify individuals who meet inclusion criteria as well as unsuitable patients who would require exclusion. METHODS: Our methods included applying a system that coded the EMR documents by removing personally identifying information, using two psychiatrists who labelled a set of EMR documents (from which the 861 came), using a brute force search and training a deep neural network for this task. FINDINGS: According to a cross-validation evaluation, we describe a model that had a specificity of 97% and a sensitivity of 45% and a second model with a specificity of 53% and a sensitivity of 89%. We combined these two models into a third one (sensitivity 93.5%; specificity 68%; positive predictive value (precision) 77%) to generate a list of most suitable candidates in support of research recruitment. CONCLUSION: Our efforts are meant to demonstrate the potential for this type of approach for patient recruitment purposes but it should be noted that a larger sample size is required to build a truly reliable recommendation system. CLINICAL IMPLICATIONS: Future efforts will employ alternate neural network algorithms available and other machine learning methods.