Cargando…
Medical code prediction via capsule networks and ICD knowledge
BACKGROUND: Clinical notes record the health status, clinical manifestations and other detailed information of each patient. The International Classification of Diseases (ICD) codes are important labels for electronic health records. Automatic medical codes assignment to clinical notes through the d...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8323200/ https://www.ncbi.nlm.nih.gov/pubmed/34330264 http://dx.doi.org/10.1186/s12911-021-01426-9 |
_version_ | 1783731195068022784 |
---|---|
author | Bao, Weidong Lin, Hongfei Zhang, Yijia Wang, Jian Zhang, Shaowu |
author_facet | Bao, Weidong Lin, Hongfei Zhang, Yijia Wang, Jian Zhang, Shaowu |
author_sort | Bao, Weidong |
collection | PubMed |
description | BACKGROUND: Clinical notes record the health status, clinical manifestations and other detailed information of each patient. The International Classification of Diseases (ICD) codes are important labels for electronic health records. Automatic medical codes assignment to clinical notes through the deep learning model can not only improve work efficiency and accelerate the development of medical informatization but also facilitate the resolution of many issues related to medical insurance. Recently, neural network-based methods have been proposed for the automatic medical code assignment. However, in the medical field, clinical notes are usually long documents and contain many complex sentences, most of the current methods cannot effective in learning the representation of potential features from document text. METHODS: In this paper, we propose a hybrid capsule network model. Specifically, we use bi-directional LSTM (Bi-LSTM) with forwarding and backward directions to merge the information from both sides of the sequence. The label embedding framework embeds the text and labels together to leverage the label information. We then use a dynamic routing algorithm in the capsule network to extract valuable features for medical code prediction task. RESULTS: We applied our model to the task of automatic medical codes assignment to clinical notes and conducted a series of experiments based on MIMIC-III data. The experimental results show that our method achieves a micro F1-score of 67.5% on MIMIC-III dataset, which outperforms the other state-of-the-art methods. CONCLUSIONS: The proposed model employed the dynamic routing algorithm and label embedding framework can effectively capture the important features across sentences. Both Capsule networks and domain knowledge are helpful for medical code prediction task. |
format | Online Article Text |
id | pubmed-8323200 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-83232002021-07-30 Medical code prediction via capsule networks and ICD knowledge Bao, Weidong Lin, Hongfei Zhang, Yijia Wang, Jian Zhang, Shaowu BMC Med Inform Decis Mak Research BACKGROUND: Clinical notes record the health status, clinical manifestations and other detailed information of each patient. The International Classification of Diseases (ICD) codes are important labels for electronic health records. Automatic medical codes assignment to clinical notes through the deep learning model can not only improve work efficiency and accelerate the development of medical informatization but also facilitate the resolution of many issues related to medical insurance. Recently, neural network-based methods have been proposed for the automatic medical code assignment. However, in the medical field, clinical notes are usually long documents and contain many complex sentences, most of the current methods cannot effective in learning the representation of potential features from document text. METHODS: In this paper, we propose a hybrid capsule network model. Specifically, we use bi-directional LSTM (Bi-LSTM) with forwarding and backward directions to merge the information from both sides of the sequence. The label embedding framework embeds the text and labels together to leverage the label information. We then use a dynamic routing algorithm in the capsule network to extract valuable features for medical code prediction task. RESULTS: We applied our model to the task of automatic medical codes assignment to clinical notes and conducted a series of experiments based on MIMIC-III data. The experimental results show that our method achieves a micro F1-score of 67.5% on MIMIC-III dataset, which outperforms the other state-of-the-art methods. CONCLUSIONS: The proposed model employed the dynamic routing algorithm and label embedding framework can effectively capture the important features across sentences. Both Capsule networks and domain knowledge are helpful for medical code prediction task. BioMed Central 2021-07-30 /pmc/articles/PMC8323200/ /pubmed/34330264 http://dx.doi.org/10.1186/s12911-021-01426-9 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Bao, Weidong Lin, Hongfei Zhang, Yijia Wang, Jian Zhang, Shaowu Medical code prediction via capsule networks and ICD knowledge |
title | Medical code prediction via capsule networks and ICD knowledge |
title_full | Medical code prediction via capsule networks and ICD knowledge |
title_fullStr | Medical code prediction via capsule networks and ICD knowledge |
title_full_unstemmed | Medical code prediction via capsule networks and ICD knowledge |
title_short | Medical code prediction via capsule networks and ICD knowledge |
title_sort | medical code prediction via capsule networks and icd knowledge |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8323200/ https://www.ncbi.nlm.nih.gov/pubmed/34330264 http://dx.doi.org/10.1186/s12911-021-01426-9 |
work_keys_str_mv | AT baoweidong medicalcodepredictionviacapsulenetworksandicdknowledge AT linhongfei medicalcodepredictionviacapsulenetworksandicdknowledge AT zhangyijia medicalcodepredictionviacapsulenetworksandicdknowledge AT wangjian medicalcodepredictionviacapsulenetworksandicdknowledge AT zhangshaowu medicalcodepredictionviacapsulenetworksandicdknowledge |