Cargando…

Automatic ICD-10 Coding and Training System: Deep Neural Network Based on Supervised Learning

BACKGROUND: The International Classification of Diseases (ICD) code is widely used as the reference in medical system and billing purposes. However, classifying diseases into ICD codes still mainly relies on humans reading a large amount of written material as the basis for coding. Coding is both la...

Descripción completa

Detalles Bibliográficos
Autores principales:	Chen, Pei-Fu, Wang, Ssu-Ming, Liao, Wei-Chih, Kuo, Lu-Cheng, Chen, Kuan-Chih, Lin, Yu-Cheng, Yang, Chi-Yu, Chiu, Chi-Hao, Chang, Shu-Chih, Lai, Feipei
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	JMIR Publications 2021
Materias:	Original Paper
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8441604/ https://www.ncbi.nlm.nih.gov/pubmed/34463639 http://dx.doi.org/10.2196/23230

_version_	1783752902252167168
author	Chen, Pei-Fu Wang, Ssu-Ming Liao, Wei-Chih Kuo, Lu-Cheng Chen, Kuan-Chih Lin, Yu-Cheng Yang, Chi-Yu Chiu, Chi-Hao Chang, Shu-Chih Lai, Feipei
author_facet	Chen, Pei-Fu Wang, Ssu-Ming Liao, Wei-Chih Kuo, Lu-Cheng Chen, Kuan-Chih Lin, Yu-Cheng Yang, Chi-Yu Chiu, Chi-Hao Chang, Shu-Chih Lai, Feipei
author_sort	Chen, Pei-Fu
collection	PubMed
description	BACKGROUND: The International Classification of Diseases (ICD) code is widely used as the reference in medical system and billing purposes. However, classifying diseases into ICD codes still mainly relies on humans reading a large amount of written material as the basis for coding. Coding is both laborious and time-consuming. Since the conversion of ICD-9 to ICD-10, the coding task became much more complicated, and deep learning– and natural language processing–related approaches have been studied to assist disease coders. OBJECTIVE: This paper aims at constructing a deep learning model for ICD-10 coding, where the model is meant to automatically determine the corresponding diagnosis and procedure codes based solely on free-text medical notes to improve accuracy and reduce human effort. METHODS: We used diagnosis records of the National Taiwan University Hospital as resources and apply natural language processing techniques, including global vectors, word to vectors, embeddings from language models, bidirectional encoder representations from transformers, and single head attention recurrent neural network, on the deep neural network architecture to implement ICD-10 auto-coding. Besides, we introduced the attention mechanism into the classification model to extract the keywords from diagnoses and visualize the coding reference for training freshmen in ICD-10. Sixty discharge notes were randomly selected to examine the change in the F(1)-score and the coding time by coders before and after using our model. RESULTS: In experiments on the medical data set of National Taiwan University Hospital, our prediction results revealed F(1)-scores of 0.715 and 0.618 for the ICD-10 Clinical Modification code and Procedure Coding System code, respectively, with a bidirectional encoder representations from transformers embedding approach in the Gated Recurrent Unit classification model. The well-trained models were applied on the ICD-10 web service for coding and training to ICD-10 users. With this service, coders can code with the F(1)-score significantly increased from a median of 0.832 to 0.922 (P<.05), but not in a reduced interval. CONCLUSIONS: The proposed model significantly improved the F(1)-score but did not decrease the time consumed in coding by disease coders.
format	Online Article Text
id	pubmed-8441604
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	JMIR Publications
record_format	MEDLINE/PubMed
spelling	pubmed-84416042021-09-28 Automatic ICD-10 Coding and Training System: Deep Neural Network Based on Supervised Learning Chen, Pei-Fu Wang, Ssu-Ming Liao, Wei-Chih Kuo, Lu-Cheng Chen, Kuan-Chih Lin, Yu-Cheng Yang, Chi-Yu Chiu, Chi-Hao Chang, Shu-Chih Lai, Feipei JMIR Med Inform Original Paper BACKGROUND: The International Classification of Diseases (ICD) code is widely used as the reference in medical system and billing purposes. However, classifying diseases into ICD codes still mainly relies on humans reading a large amount of written material as the basis for coding. Coding is both laborious and time-consuming. Since the conversion of ICD-9 to ICD-10, the coding task became much more complicated, and deep learning– and natural language processing–related approaches have been studied to assist disease coders. OBJECTIVE: This paper aims at constructing a deep learning model for ICD-10 coding, where the model is meant to automatically determine the corresponding diagnosis and procedure codes based solely on free-text medical notes to improve accuracy and reduce human effort. METHODS: We used diagnosis records of the National Taiwan University Hospital as resources and apply natural language processing techniques, including global vectors, word to vectors, embeddings from language models, bidirectional encoder representations from transformers, and single head attention recurrent neural network, on the deep neural network architecture to implement ICD-10 auto-coding. Besides, we introduced the attention mechanism into the classification model to extract the keywords from diagnoses and visualize the coding reference for training freshmen in ICD-10. Sixty discharge notes were randomly selected to examine the change in the F(1)-score and the coding time by coders before and after using our model. RESULTS: In experiments on the medical data set of National Taiwan University Hospital, our prediction results revealed F(1)-scores of 0.715 and 0.618 for the ICD-10 Clinical Modification code and Procedure Coding System code, respectively, with a bidirectional encoder representations from transformers embedding approach in the Gated Recurrent Unit classification model. The well-trained models were applied on the ICD-10 web service for coding and training to ICD-10 users. With this service, coders can code with the F(1)-score significantly increased from a median of 0.832 to 0.922 (P<.05), but not in a reduced interval. CONCLUSIONS: The proposed model significantly improved the F(1)-score but did not decrease the time consumed in coding by disease coders. JMIR Publications 2021-08-31 /pmc/articles/PMC8441604/ /pubmed/34463639 http://dx.doi.org/10.2196/23230 Text en ©Pei-Fu Chen, Ssu-Ming Wang, Wei-Chih Liao, Lu-Cheng Kuo, Kuan-Chih Chen, Yu-Cheng Lin, Chi-Yu Yang, Chi-Hao Chiu, Shu-Chih Chang, Feipei Lai. Originally published in JMIR Medical Informatics (https://medinform.jmir.org), 31.08.2021. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Informatics, is properly cited. The complete bibliographic information, a link to the original publication on https://medinform.jmir.org/, as well as this copyright and license information must be included.
spellingShingle	Original Paper Chen, Pei-Fu Wang, Ssu-Ming Liao, Wei-Chih Kuo, Lu-Cheng Chen, Kuan-Chih Lin, Yu-Cheng Yang, Chi-Yu Chiu, Chi-Hao Chang, Shu-Chih Lai, Feipei Automatic ICD-10 Coding and Training System: Deep Neural Network Based on Supervised Learning
title	Automatic ICD-10 Coding and Training System: Deep Neural Network Based on Supervised Learning
title_full	Automatic ICD-10 Coding and Training System: Deep Neural Network Based on Supervised Learning
title_fullStr	Automatic ICD-10 Coding and Training System: Deep Neural Network Based on Supervised Learning
title_full_unstemmed	Automatic ICD-10 Coding and Training System: Deep Neural Network Based on Supervised Learning
title_short	Automatic ICD-10 Coding and Training System: Deep Neural Network Based on Supervised Learning
title_sort	automatic icd-10 coding and training system: deep neural network based on supervised learning
topic	Original Paper
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8441604/ https://www.ncbi.nlm.nih.gov/pubmed/34463639 http://dx.doi.org/10.2196/23230
work_keys_str_mv	AT chenpeifu automaticicd10codingandtrainingsystemdeepneuralnetworkbasedonsupervisedlearning AT wangssuming automaticicd10codingandtrainingsystemdeepneuralnetworkbasedonsupervisedlearning AT liaoweichih automaticicd10codingandtrainingsystemdeepneuralnetworkbasedonsupervisedlearning AT kuolucheng automaticicd10codingandtrainingsystemdeepneuralnetworkbasedonsupervisedlearning AT chenkuanchih automaticicd10codingandtrainingsystemdeepneuralnetworkbasedonsupervisedlearning AT linyucheng automaticicd10codingandtrainingsystemdeepneuralnetworkbasedonsupervisedlearning AT yangchiyu automaticicd10codingandtrainingsystemdeepneuralnetworkbasedonsupervisedlearning AT chiuchihao automaticicd10codingandtrainingsystemdeepneuralnetworkbasedonsupervisedlearning AT changshuchih automaticicd10codingandtrainingsystemdeepneuralnetworkbasedonsupervisedlearning AT laifeipei automaticicd10codingandtrainingsystemdeepneuralnetworkbasedonsupervisedlearning

Automatic ICD-10 Coding and Training System: Deep Neural Network Based on Supervised Learning

Ejemplares similares