Cargando…
Incorporating medical code descriptions for diagnosis prediction in healthcare
BACKGROUND: Diagnosis aims to predict the future health status of patients according to their historical electronic health records (EHR), which is an important yet challenging task in healthcare informatics. Existing diagnosis prediction approaches mainly employ recurrent neural networks (RNN) with...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6921390/ https://www.ncbi.nlm.nih.gov/pubmed/31856806 http://dx.doi.org/10.1186/s12911-019-0961-2 |
_version_ | 1783481150875893760 |
---|---|
author | Ma, Fenglong Wang, Yaqing Xiao, Houping Yuan, Ye Chitta, Radha Zhou, Jing Gao, Jing |
author_facet | Ma, Fenglong Wang, Yaqing Xiao, Houping Yuan, Ye Chitta, Radha Zhou, Jing Gao, Jing |
author_sort | Ma, Fenglong |
collection | PubMed |
description | BACKGROUND: Diagnosis aims to predict the future health status of patients according to their historical electronic health records (EHR), which is an important yet challenging task in healthcare informatics. Existing diagnosis prediction approaches mainly employ recurrent neural networks (RNN) with attention mechanisms to make predictions. However, these approaches ignore the importance of code descriptions, i.e., the medical definitions of diagnosis codes. We believe that taking diagnosis code descriptions into account can help the state-of-the-art models not only to learn meaning code representations, but also to improve the predictive performance, especially when the EHR data are insufficient. METHODS: We propose a simple, but general diagnosis prediction framework, which includes two basic components: diagnosis code embedding and predictive model. To learn the interpretable code embeddings, we apply convolutional neural networks (CNN) to model medical descriptions of diagnosis codes extracted from online medical websites. The learned medical embedding matrix is used to embed the input visits into vector representations, which are fed into the predictive models. Any existing diagnosis prediction approach (referred to as the base model) can be cast into the proposed framework as the predictive model (called the enhanced model). RESULTS: We conduct experiments on two real medical datasets: the MIMIC-III dataset and the Heart Failure claim dataset. Experimental results show that the enhanced diagnosis prediction approaches significantly improve the prediction performance. Moreover, we validate the effectiveness of the proposed framework with insufficient EHR data. Finally, we visualize the learned medical code embeddings to show the interpretability of the proposed framework. CONCLUSIONS: Given the historical visit records of a patient, the proposed framework is able to predict the next visit information by incorporating medical code descriptions. |
format | Online Article Text |
id | pubmed-6921390 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-69213902019-12-30 Incorporating medical code descriptions for diagnosis prediction in healthcare Ma, Fenglong Wang, Yaqing Xiao, Houping Yuan, Ye Chitta, Radha Zhou, Jing Gao, Jing BMC Med Inform Decis Mak Research BACKGROUND: Diagnosis aims to predict the future health status of patients according to their historical electronic health records (EHR), which is an important yet challenging task in healthcare informatics. Existing diagnosis prediction approaches mainly employ recurrent neural networks (RNN) with attention mechanisms to make predictions. However, these approaches ignore the importance of code descriptions, i.e., the medical definitions of diagnosis codes. We believe that taking diagnosis code descriptions into account can help the state-of-the-art models not only to learn meaning code representations, but also to improve the predictive performance, especially when the EHR data are insufficient. METHODS: We propose a simple, but general diagnosis prediction framework, which includes two basic components: diagnosis code embedding and predictive model. To learn the interpretable code embeddings, we apply convolutional neural networks (CNN) to model medical descriptions of diagnosis codes extracted from online medical websites. The learned medical embedding matrix is used to embed the input visits into vector representations, which are fed into the predictive models. Any existing diagnosis prediction approach (referred to as the base model) can be cast into the proposed framework as the predictive model (called the enhanced model). RESULTS: We conduct experiments on two real medical datasets: the MIMIC-III dataset and the Heart Failure claim dataset. Experimental results show that the enhanced diagnosis prediction approaches significantly improve the prediction performance. Moreover, we validate the effectiveness of the proposed framework with insufficient EHR data. Finally, we visualize the learned medical code embeddings to show the interpretability of the proposed framework. CONCLUSIONS: Given the historical visit records of a patient, the proposed framework is able to predict the next visit information by incorporating medical code descriptions. BioMed Central 2019-12-19 /pmc/articles/PMC6921390/ /pubmed/31856806 http://dx.doi.org/10.1186/s12911-019-0961-2 Text en © The Author(s) 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Ma, Fenglong Wang, Yaqing Xiao, Houping Yuan, Ye Chitta, Radha Zhou, Jing Gao, Jing Incorporating medical code descriptions for diagnosis prediction in healthcare |
title | Incorporating medical code descriptions for diagnosis prediction in healthcare |
title_full | Incorporating medical code descriptions for diagnosis prediction in healthcare |
title_fullStr | Incorporating medical code descriptions for diagnosis prediction in healthcare |
title_full_unstemmed | Incorporating medical code descriptions for diagnosis prediction in healthcare |
title_short | Incorporating medical code descriptions for diagnosis prediction in healthcare |
title_sort | incorporating medical code descriptions for diagnosis prediction in healthcare |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6921390/ https://www.ncbi.nlm.nih.gov/pubmed/31856806 http://dx.doi.org/10.1186/s12911-019-0961-2 |
work_keys_str_mv | AT mafenglong incorporatingmedicalcodedescriptionsfordiagnosispredictioninhealthcare AT wangyaqing incorporatingmedicalcodedescriptionsfordiagnosispredictioninhealthcare AT xiaohouping incorporatingmedicalcodedescriptionsfordiagnosispredictioninhealthcare AT yuanye incorporatingmedicalcodedescriptionsfordiagnosispredictioninhealthcare AT chittaradha incorporatingmedicalcodedescriptionsfordiagnosispredictioninhealthcare AT zhoujing incorporatingmedicalcodedescriptionsfordiagnosispredictioninhealthcare AT gaojing incorporatingmedicalcodedescriptionsfordiagnosispredictioninhealthcare |