Cargando…

Analysis of risk factor domains in psychosis patient health records

BACKGROUND: Readmission after discharge from a hospital is disruptive and costly, regardless of the reason. However, it can be particularly problematic for psychiatric patients, so predicting which patients may be readmitted is critically important but also very difficult. Clinical narratives in psy...

Descripción completa

Detalles Bibliográficos
Autores principales: Holderness, Eben, Miller, Nicholas, Cawkwell, Philip, Bolton, Kirsten, Meteer, Marie, Pustejovsky, James, Hall, Mei-Hua
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6823956/
https://www.ncbi.nlm.nih.gov/pubmed/31672168
http://dx.doi.org/10.1186/s13326-019-0210-8
_version_ 1783464627727761408
author Holderness, Eben
Miller, Nicholas
Cawkwell, Philip
Bolton, Kirsten
Meteer, Marie
Pustejovsky, James
Hall, Mei-Hua
author_facet Holderness, Eben
Miller, Nicholas
Cawkwell, Philip
Bolton, Kirsten
Meteer, Marie
Pustejovsky, James
Hall, Mei-Hua
author_sort Holderness, Eben
collection PubMed
description BACKGROUND: Readmission after discharge from a hospital is disruptive and costly, regardless of the reason. However, it can be particularly problematic for psychiatric patients, so predicting which patients may be readmitted is critically important but also very difficult. Clinical narratives in psychiatric electronic health records (EHRs) span a wide range of topics and vocabulary; therefore, a psychiatric readmission prediction model must begin with a robust and interpretable topic extraction component. RESULTS: We designed and evaluated multiple multilayer perceptron and radial basis function neural networks to predict the sentences in a patient’s EHR that are associated with one or more of seven readmission risk factor domains that we identified. In contrast to our baseline cosine similarity model that is based on the methodologies of prior works, our deep learning approaches achieved considerably better F1 scores (0.83 vs 0.66) while also being more scalable and computationally efficient with large volumes of data. Additionally, we found that integrating clinically relevant multiword expressions during preprocessing improves the accuracy of our models and allows for identifying a wider scope of training data in a semi-supervised setting. CONCLUSION: We created a data pipeline for using document vector similarity metrics to perform topic extraction on psychiatric EHR data in service of our long-term goal of creating a readmission risk classifier. We show results for our topic extraction model and identify additional features we will be incorporating in the future.
format Online
Article
Text
id pubmed-6823956
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-68239562019-11-06 Analysis of risk factor domains in psychosis patient health records Holderness, Eben Miller, Nicholas Cawkwell, Philip Bolton, Kirsten Meteer, Marie Pustejovsky, James Hall, Mei-Hua J Biomed Semantics Research BACKGROUND: Readmission after discharge from a hospital is disruptive and costly, regardless of the reason. However, it can be particularly problematic for psychiatric patients, so predicting which patients may be readmitted is critically important but also very difficult. Clinical narratives in psychiatric electronic health records (EHRs) span a wide range of topics and vocabulary; therefore, a psychiatric readmission prediction model must begin with a robust and interpretable topic extraction component. RESULTS: We designed and evaluated multiple multilayer perceptron and radial basis function neural networks to predict the sentences in a patient’s EHR that are associated with one or more of seven readmission risk factor domains that we identified. In contrast to our baseline cosine similarity model that is based on the methodologies of prior works, our deep learning approaches achieved considerably better F1 scores (0.83 vs 0.66) while also being more scalable and computationally efficient with large volumes of data. Additionally, we found that integrating clinically relevant multiword expressions during preprocessing improves the accuracy of our models and allows for identifying a wider scope of training data in a semi-supervised setting. CONCLUSION: We created a data pipeline for using document vector similarity metrics to perform topic extraction on psychiatric EHR data in service of our long-term goal of creating a readmission risk classifier. We show results for our topic extraction model and identify additional features we will be incorporating in the future. BioMed Central 2019-10-31 /pmc/articles/PMC6823956/ /pubmed/31672168 http://dx.doi.org/10.1186/s13326-019-0210-8 Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Holderness, Eben
Miller, Nicholas
Cawkwell, Philip
Bolton, Kirsten
Meteer, Marie
Pustejovsky, James
Hall, Mei-Hua
Analysis of risk factor domains in psychosis patient health records
title Analysis of risk factor domains in psychosis patient health records
title_full Analysis of risk factor domains in psychosis patient health records
title_fullStr Analysis of risk factor domains in psychosis patient health records
title_full_unstemmed Analysis of risk factor domains in psychosis patient health records
title_short Analysis of risk factor domains in psychosis patient health records
title_sort analysis of risk factor domains in psychosis patient health records
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6823956/
https://www.ncbi.nlm.nih.gov/pubmed/31672168
http://dx.doi.org/10.1186/s13326-019-0210-8
work_keys_str_mv AT holdernesseben analysisofriskfactordomainsinpsychosispatienthealthrecords
AT millernicholas analysisofriskfactordomainsinpsychosispatienthealthrecords
AT cawkwellphilip analysisofriskfactordomainsinpsychosispatienthealthrecords
AT boltonkirsten analysisofriskfactordomainsinpsychosispatienthealthrecords
AT meteermarie analysisofriskfactordomainsinpsychosispatienthealthrecords
AT pustejovskyjames analysisofriskfactordomainsinpsychosispatienthealthrecords
AT hallmeihua analysisofriskfactordomainsinpsychosispatienthealthrecords