Cargando…

Predicting early psychiatric readmission with natural language processing of narrative discharge summaries

The ability to predict psychiatric readmission would facilitate the development of interventions to reduce this risk, a major driver of psychiatric health-care costs. The symptoms or characteristics of illness course necessary to develop reliable predictors are not available in coded billing data, b...

Descripción completa

Detalles Bibliográficos
Autores principales: Rumshisky, A, Ghassemi, M, Naumann, T, Szolovits, P, Castro, V M, McCoy, T H, Perlis, R H
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5315537/
https://www.ncbi.nlm.nih.gov/pubmed/27754482
http://dx.doi.org/10.1038/tp.2015.182
_version_ 1782508697740640256
author Rumshisky, A
Ghassemi, M
Naumann, T
Szolovits, P
Castro, V M
McCoy, T H
Perlis, R H
author_facet Rumshisky, A
Ghassemi, M
Naumann, T
Szolovits, P
Castro, V M
McCoy, T H
Perlis, R H
author_sort Rumshisky, A
collection PubMed
description The ability to predict psychiatric readmission would facilitate the development of interventions to reduce this risk, a major driver of psychiatric health-care costs. The symptoms or characteristics of illness course necessary to develop reliable predictors are not available in coded billing data, but may be present in narrative electronic health record (EHR) discharge summaries. We identified a cohort of individuals admitted to a psychiatric inpatient unit between 1994 and 2012 with a principal diagnosis of major depressive disorder, and extracted inpatient psychiatric discharge narrative notes. Using these data, we trained a 75-topic Latent Dirichlet Allocation (LDA) model, a form of natural language processing, which identifies groups of words associated with topics discussed in a document collection. The cohort was randomly split to derive a training (70%) and testing (30%) data set, and we trained separate support vector machine models for baseline clinical features alone, baseline features plus common individual words and the above plus topics identified from the 75-topic LDA model. Of 4687 patients with inpatient discharge summaries, 470 were readmitted within 30 days. The 75-topic LDA model included topics linked to psychiatric symptoms (suicide, severe depression, anxiety, trauma, eating/weight and panic) and major depressive disorder comorbidities (infection, postpartum, brain tumor, diarrhea and pulmonary disease). By including LDA topics, prediction of readmission, as measured by area under receiver-operating characteristic curves in the testing data set, was improved from baseline (area under the curve 0.618) to baseline+1000 words (0.682) to baseline+75 topics (0.784). Inclusion of topics derived from narrative notes allows more accurate discrimination of individuals at high risk for psychiatric readmission in this cohort. Topic modeling and related approaches offer the potential to improve prediction using EHRs, if generalizability can be established in other clinical cohorts.
format Online
Article
Text
id pubmed-5315537
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-53155372017-02-27 Predicting early psychiatric readmission with natural language processing of narrative discharge summaries Rumshisky, A Ghassemi, M Naumann, T Szolovits, P Castro, V M McCoy, T H Perlis, R H Transl Psychiatry Original Article The ability to predict psychiatric readmission would facilitate the development of interventions to reduce this risk, a major driver of psychiatric health-care costs. The symptoms or characteristics of illness course necessary to develop reliable predictors are not available in coded billing data, but may be present in narrative electronic health record (EHR) discharge summaries. We identified a cohort of individuals admitted to a psychiatric inpatient unit between 1994 and 2012 with a principal diagnosis of major depressive disorder, and extracted inpatient psychiatric discharge narrative notes. Using these data, we trained a 75-topic Latent Dirichlet Allocation (LDA) model, a form of natural language processing, which identifies groups of words associated with topics discussed in a document collection. The cohort was randomly split to derive a training (70%) and testing (30%) data set, and we trained separate support vector machine models for baseline clinical features alone, baseline features plus common individual words and the above plus topics identified from the 75-topic LDA model. Of 4687 patients with inpatient discharge summaries, 470 were readmitted within 30 days. The 75-topic LDA model included topics linked to psychiatric symptoms (suicide, severe depression, anxiety, trauma, eating/weight and panic) and major depressive disorder comorbidities (infection, postpartum, brain tumor, diarrhea and pulmonary disease). By including LDA topics, prediction of readmission, as measured by area under receiver-operating characteristic curves in the testing data set, was improved from baseline (area under the curve 0.618) to baseline+1000 words (0.682) to baseline+75 topics (0.784). Inclusion of topics derived from narrative notes allows more accurate discrimination of individuals at high risk for psychiatric readmission in this cohort. Topic modeling and related approaches offer the potential to improve prediction using EHRs, if generalizability can be established in other clinical cohorts. Nature Publishing Group 2016-10 2016-10-18 /pmc/articles/PMC5315537/ /pubmed/27754482 http://dx.doi.org/10.1038/tp.2015.182 Text en Copyright © 2016 The Author(s) http://creativecommons.org/licenses/by-nc-nd/4.0/ This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivs 4.0 International License. The images or other third party material in this article are included in the article's Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-nd/4.0/
spellingShingle Original Article
Rumshisky, A
Ghassemi, M
Naumann, T
Szolovits, P
Castro, V M
McCoy, T H
Perlis, R H
Predicting early psychiatric readmission with natural language processing of narrative discharge summaries
title Predicting early psychiatric readmission with natural language processing of narrative discharge summaries
title_full Predicting early psychiatric readmission with natural language processing of narrative discharge summaries
title_fullStr Predicting early psychiatric readmission with natural language processing of narrative discharge summaries
title_full_unstemmed Predicting early psychiatric readmission with natural language processing of narrative discharge summaries
title_short Predicting early psychiatric readmission with natural language processing of narrative discharge summaries
title_sort predicting early psychiatric readmission with natural language processing of narrative discharge summaries
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5315537/
https://www.ncbi.nlm.nih.gov/pubmed/27754482
http://dx.doi.org/10.1038/tp.2015.182
work_keys_str_mv AT rumshiskya predictingearlypsychiatricreadmissionwithnaturallanguageprocessingofnarrativedischargesummaries
AT ghassemim predictingearlypsychiatricreadmissionwithnaturallanguageprocessingofnarrativedischargesummaries
AT naumannt predictingearlypsychiatricreadmissionwithnaturallanguageprocessingofnarrativedischargesummaries
AT szolovitsp predictingearlypsychiatricreadmissionwithnaturallanguageprocessingofnarrativedischargesummaries
AT castrovm predictingearlypsychiatricreadmissionwithnaturallanguageprocessingofnarrativedischargesummaries
AT mccoyth predictingearlypsychiatricreadmissionwithnaturallanguageprocessingofnarrativedischargesummaries
AT perlisrh predictingearlypsychiatricreadmissionwithnaturallanguageprocessingofnarrativedischargesummaries