Cargando…

Medical code prediction via capsule networks and ICD knowledge

BACKGROUND: Clinical notes record the health status, clinical manifestations and other detailed information of each patient. The International Classification of Diseases (ICD) codes are important labels for electronic health records. Automatic medical codes assignment to clinical notes through the d...

Descripción completa

Detalles Bibliográficos
Autores principales: Bao, Weidong, Lin, Hongfei, Zhang, Yijia, Wang, Jian, Zhang, Shaowu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8323200/
https://www.ncbi.nlm.nih.gov/pubmed/34330264
http://dx.doi.org/10.1186/s12911-021-01426-9
_version_ 1783731195068022784
author Bao, Weidong
Lin, Hongfei
Zhang, Yijia
Wang, Jian
Zhang, Shaowu
author_facet Bao, Weidong
Lin, Hongfei
Zhang, Yijia
Wang, Jian
Zhang, Shaowu
author_sort Bao, Weidong
collection PubMed
description BACKGROUND: Clinical notes record the health status, clinical manifestations and other detailed information of each patient. The International Classification of Diseases (ICD) codes are important labels for electronic health records. Automatic medical codes assignment to clinical notes through the deep learning model can not only improve work efficiency and accelerate the development of medical informatization but also facilitate the resolution of many issues related to medical insurance. Recently, neural network-based methods have been proposed for the automatic medical code assignment. However, in the medical field, clinical notes are usually long documents and contain many complex sentences, most of the current methods cannot effective in learning the representation of potential features from document text. METHODS: In this paper, we propose a hybrid capsule network model. Specifically, we use bi-directional LSTM (Bi-LSTM) with forwarding and backward directions to merge the information from both sides of the sequence. The label embedding framework embeds the text and labels together to leverage the label information. We then use a dynamic routing algorithm in the capsule network to extract valuable features for medical code prediction task. RESULTS: We applied our model to the task of automatic medical codes assignment to clinical notes and conducted a series of experiments based on MIMIC-III data. The experimental results show that our method achieves a micro F1-score of 67.5% on MIMIC-III dataset, which outperforms the other state-of-the-art methods. CONCLUSIONS: The proposed model employed the dynamic routing algorithm and label embedding framework can effectively capture the important features across sentences. Both Capsule networks and domain knowledge are helpful for medical code prediction task.
format Online
Article
Text
id pubmed-8323200
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-83232002021-07-30 Medical code prediction via capsule networks and ICD knowledge Bao, Weidong Lin, Hongfei Zhang, Yijia Wang, Jian Zhang, Shaowu BMC Med Inform Decis Mak Research BACKGROUND: Clinical notes record the health status, clinical manifestations and other detailed information of each patient. The International Classification of Diseases (ICD) codes are important labels for electronic health records. Automatic medical codes assignment to clinical notes through the deep learning model can not only improve work efficiency and accelerate the development of medical informatization but also facilitate the resolution of many issues related to medical insurance. Recently, neural network-based methods have been proposed for the automatic medical code assignment. However, in the medical field, clinical notes are usually long documents and contain many complex sentences, most of the current methods cannot effective in learning the representation of potential features from document text. METHODS: In this paper, we propose a hybrid capsule network model. Specifically, we use bi-directional LSTM (Bi-LSTM) with forwarding and backward directions to merge the information from both sides of the sequence. The label embedding framework embeds the text and labels together to leverage the label information. We then use a dynamic routing algorithm in the capsule network to extract valuable features for medical code prediction task. RESULTS: We applied our model to the task of automatic medical codes assignment to clinical notes and conducted a series of experiments based on MIMIC-III data. The experimental results show that our method achieves a micro F1-score of 67.5% on MIMIC-III dataset, which outperforms the other state-of-the-art methods. CONCLUSIONS: The proposed model employed the dynamic routing algorithm and label embedding framework can effectively capture the important features across sentences. Both Capsule networks and domain knowledge are helpful for medical code prediction task. BioMed Central 2021-07-30 /pmc/articles/PMC8323200/ /pubmed/34330264 http://dx.doi.org/10.1186/s12911-021-01426-9 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Bao, Weidong
Lin, Hongfei
Zhang, Yijia
Wang, Jian
Zhang, Shaowu
Medical code prediction via capsule networks and ICD knowledge
title Medical code prediction via capsule networks and ICD knowledge
title_full Medical code prediction via capsule networks and ICD knowledge
title_fullStr Medical code prediction via capsule networks and ICD knowledge
title_full_unstemmed Medical code prediction via capsule networks and ICD knowledge
title_short Medical code prediction via capsule networks and ICD knowledge
title_sort medical code prediction via capsule networks and icd knowledge
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8323200/
https://www.ncbi.nlm.nih.gov/pubmed/34330264
http://dx.doi.org/10.1186/s12911-021-01426-9
work_keys_str_mv AT baoweidong medicalcodepredictionviacapsulenetworksandicdknowledge
AT linhongfei medicalcodepredictionviacapsulenetworksandicdknowledge
AT zhangyijia medicalcodepredictionviacapsulenetworksandicdknowledge
AT wangjian medicalcodepredictionviacapsulenetworksandicdknowledge
AT zhangshaowu medicalcodepredictionviacapsulenetworksandicdknowledge