Cargando…

Incorporating medical code descriptions for diagnosis prediction in healthcare

BACKGROUND: Diagnosis aims to predict the future health status of patients according to their historical electronic health records (EHR), which is an important yet challenging task in healthcare informatics. Existing diagnosis prediction approaches mainly employ recurrent neural networks (RNN) with...

Descripción completa

Detalles Bibliográficos
Autores principales: Ma, Fenglong, Wang, Yaqing, Xiao, Houping, Yuan, Ye, Chitta, Radha, Zhou, Jing, Gao, Jing
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6921390/
https://www.ncbi.nlm.nih.gov/pubmed/31856806
http://dx.doi.org/10.1186/s12911-019-0961-2
_version_ 1783481150875893760
author Ma, Fenglong
Wang, Yaqing
Xiao, Houping
Yuan, Ye
Chitta, Radha
Zhou, Jing
Gao, Jing
author_facet Ma, Fenglong
Wang, Yaqing
Xiao, Houping
Yuan, Ye
Chitta, Radha
Zhou, Jing
Gao, Jing
author_sort Ma, Fenglong
collection PubMed
description BACKGROUND: Diagnosis aims to predict the future health status of patients according to their historical electronic health records (EHR), which is an important yet challenging task in healthcare informatics. Existing diagnosis prediction approaches mainly employ recurrent neural networks (RNN) with attention mechanisms to make predictions. However, these approaches ignore the importance of code descriptions, i.e., the medical definitions of diagnosis codes. We believe that taking diagnosis code descriptions into account can help the state-of-the-art models not only to learn meaning code representations, but also to improve the predictive performance, especially when the EHR data are insufficient. METHODS: We propose a simple, but general diagnosis prediction framework, which includes two basic components: diagnosis code embedding and predictive model. To learn the interpretable code embeddings, we apply convolutional neural networks (CNN) to model medical descriptions of diagnosis codes extracted from online medical websites. The learned medical embedding matrix is used to embed the input visits into vector representations, which are fed into the predictive models. Any existing diagnosis prediction approach (referred to as the base model) can be cast into the proposed framework as the predictive model (called the enhanced model). RESULTS: We conduct experiments on two real medical datasets: the MIMIC-III dataset and the Heart Failure claim dataset. Experimental results show that the enhanced diagnosis prediction approaches significantly improve the prediction performance. Moreover, we validate the effectiveness of the proposed framework with insufficient EHR data. Finally, we visualize the learned medical code embeddings to show the interpretability of the proposed framework. CONCLUSIONS: Given the historical visit records of a patient, the proposed framework is able to predict the next visit information by incorporating medical code descriptions.
format Online
Article
Text
id pubmed-6921390
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-69213902019-12-30 Incorporating medical code descriptions for diagnosis prediction in healthcare Ma, Fenglong Wang, Yaqing Xiao, Houping Yuan, Ye Chitta, Radha Zhou, Jing Gao, Jing BMC Med Inform Decis Mak Research BACKGROUND: Diagnosis aims to predict the future health status of patients according to their historical electronic health records (EHR), which is an important yet challenging task in healthcare informatics. Existing diagnosis prediction approaches mainly employ recurrent neural networks (RNN) with attention mechanisms to make predictions. However, these approaches ignore the importance of code descriptions, i.e., the medical definitions of diagnosis codes. We believe that taking diagnosis code descriptions into account can help the state-of-the-art models not only to learn meaning code representations, but also to improve the predictive performance, especially when the EHR data are insufficient. METHODS: We propose a simple, but general diagnosis prediction framework, which includes two basic components: diagnosis code embedding and predictive model. To learn the interpretable code embeddings, we apply convolutional neural networks (CNN) to model medical descriptions of diagnosis codes extracted from online medical websites. The learned medical embedding matrix is used to embed the input visits into vector representations, which are fed into the predictive models. Any existing diagnosis prediction approach (referred to as the base model) can be cast into the proposed framework as the predictive model (called the enhanced model). RESULTS: We conduct experiments on two real medical datasets: the MIMIC-III dataset and the Heart Failure claim dataset. Experimental results show that the enhanced diagnosis prediction approaches significantly improve the prediction performance. Moreover, we validate the effectiveness of the proposed framework with insufficient EHR data. Finally, we visualize the learned medical code embeddings to show the interpretability of the proposed framework. CONCLUSIONS: Given the historical visit records of a patient, the proposed framework is able to predict the next visit information by incorporating medical code descriptions. BioMed Central 2019-12-19 /pmc/articles/PMC6921390/ /pubmed/31856806 http://dx.doi.org/10.1186/s12911-019-0961-2 Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Ma, Fenglong
Wang, Yaqing
Xiao, Houping
Yuan, Ye
Chitta, Radha
Zhou, Jing
Gao, Jing
Incorporating medical code descriptions for diagnosis prediction in healthcare
title Incorporating medical code descriptions for diagnosis prediction in healthcare
title_full Incorporating medical code descriptions for diagnosis prediction in healthcare
title_fullStr Incorporating medical code descriptions for diagnosis prediction in healthcare
title_full_unstemmed Incorporating medical code descriptions for diagnosis prediction in healthcare
title_short Incorporating medical code descriptions for diagnosis prediction in healthcare
title_sort incorporating medical code descriptions for diagnosis prediction in healthcare
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6921390/
https://www.ncbi.nlm.nih.gov/pubmed/31856806
http://dx.doi.org/10.1186/s12911-019-0961-2
work_keys_str_mv AT mafenglong incorporatingmedicalcodedescriptionsfordiagnosispredictioninhealthcare
AT wangyaqing incorporatingmedicalcodedescriptionsfordiagnosispredictioninhealthcare
AT xiaohouping incorporatingmedicalcodedescriptionsfordiagnosispredictioninhealthcare
AT yuanye incorporatingmedicalcodedescriptionsfordiagnosispredictioninhealthcare
AT chittaradha incorporatingmedicalcodedescriptionsfordiagnosispredictioninhealthcare
AT zhoujing incorporatingmedicalcodedescriptionsfordiagnosispredictioninhealthcare
AT gaojing incorporatingmedicalcodedescriptionsfordiagnosispredictioninhealthcare