Cargando…

Dual Model Medical Invoices Recognition

Hospitals need to invest a lot of manpower to manually input the contents of medical invoices (nearly 300,000,000 medical invoices a year) into the medical system. In order to help the hospital save money and stabilize work efficiency, this paper designed a system to complete the complicated work us...

Descripción completa

Detalles Bibliográficos
Autores principales: Yi, Fei, Zhao, Yi-Fei, Sheng, Guan-Qun, Xie, Kai, Wen, Chang, Tang, Xin-Gong, Qi, Xuan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6832594/
https://www.ncbi.nlm.nih.gov/pubmed/31658617
http://dx.doi.org/10.3390/s19204370
_version_ 1783466209810841600
author Yi, Fei
Zhao, Yi-Fei
Sheng, Guan-Qun
Xie, Kai
Wen, Chang
Tang, Xin-Gong
Qi, Xuan
author_facet Yi, Fei
Zhao, Yi-Fei
Sheng, Guan-Qun
Xie, Kai
Wen, Chang
Tang, Xin-Gong
Qi, Xuan
author_sort Yi, Fei
collection PubMed
description Hospitals need to invest a lot of manpower to manually input the contents of medical invoices (nearly 300,000,000 medical invoices a year) into the medical system. In order to help the hospital save money and stabilize work efficiency, this paper designed a system to complete the complicated work using a Gaussian blur and smoothing–convolutional neural network combined with a recurrent neural network (GBS-CR) method. Gaussian blur and smoothing (GBS) is a novel preprocessing method that can fix the breakpoint font in medical invoices. The combination of convolutional neural network (CNN) and recurrent neural network (RNN) was used to raise the recognition rate of the breakpoint font in medical invoices. RNN was designed to be the semantic revision module. In the aspect of image preprocessing, Gaussian blur and smoothing were used to fix the breakpoint font. In the period of making the self-built dataset, a certain proportion of the breakpoint font (the font of breakpoint is 3, the original font is 7) was added, in this paper, so as to optimize the Alexnet–Adam–CNN (AA-CNN) model, which is more suitable for the recognition of the breakpoint font than the traditional CNN model. In terms of the identification methods, we not only adopted the optimized AA-CNN for identification, but also combined RNN to carry out the semantic revisions of the identified results of CNN, meanwhile further improving the recognition rate of the medical invoices. The experimental results show that compared with the state-of-art invoice recognition method, the method presented in this paper has an average increase of 10 to 15 percentage points in recognition rate.
format Online
Article
Text
id pubmed-6832594
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-68325942019-11-25 Dual Model Medical Invoices Recognition Yi, Fei Zhao, Yi-Fei Sheng, Guan-Qun Xie, Kai Wen, Chang Tang, Xin-Gong Qi, Xuan Sensors (Basel) Article Hospitals need to invest a lot of manpower to manually input the contents of medical invoices (nearly 300,000,000 medical invoices a year) into the medical system. In order to help the hospital save money and stabilize work efficiency, this paper designed a system to complete the complicated work using a Gaussian blur and smoothing–convolutional neural network combined with a recurrent neural network (GBS-CR) method. Gaussian blur and smoothing (GBS) is a novel preprocessing method that can fix the breakpoint font in medical invoices. The combination of convolutional neural network (CNN) and recurrent neural network (RNN) was used to raise the recognition rate of the breakpoint font in medical invoices. RNN was designed to be the semantic revision module. In the aspect of image preprocessing, Gaussian blur and smoothing were used to fix the breakpoint font. In the period of making the self-built dataset, a certain proportion of the breakpoint font (the font of breakpoint is 3, the original font is 7) was added, in this paper, so as to optimize the Alexnet–Adam–CNN (AA-CNN) model, which is more suitable for the recognition of the breakpoint font than the traditional CNN model. In terms of the identification methods, we not only adopted the optimized AA-CNN for identification, but also combined RNN to carry out the semantic revisions of the identified results of CNN, meanwhile further improving the recognition rate of the medical invoices. The experimental results show that compared with the state-of-art invoice recognition method, the method presented in this paper has an average increase of 10 to 15 percentage points in recognition rate. MDPI 2019-10-10 /pmc/articles/PMC6832594/ /pubmed/31658617 http://dx.doi.org/10.3390/s19204370 Text en © 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Yi, Fei
Zhao, Yi-Fei
Sheng, Guan-Qun
Xie, Kai
Wen, Chang
Tang, Xin-Gong
Qi, Xuan
Dual Model Medical Invoices Recognition
title Dual Model Medical Invoices Recognition
title_full Dual Model Medical Invoices Recognition
title_fullStr Dual Model Medical Invoices Recognition
title_full_unstemmed Dual Model Medical Invoices Recognition
title_short Dual Model Medical Invoices Recognition
title_sort dual model medical invoices recognition
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6832594/
https://www.ncbi.nlm.nih.gov/pubmed/31658617
http://dx.doi.org/10.3390/s19204370
work_keys_str_mv AT yifei dualmodelmedicalinvoicesrecognition
AT zhaoyifei dualmodelmedicalinvoicesrecognition
AT shengguanqun dualmodelmedicalinvoicesrecognition
AT xiekai dualmodelmedicalinvoicesrecognition
AT wenchang dualmodelmedicalinvoicesrecognition
AT tangxingong dualmodelmedicalinvoicesrecognition
AT qixuan dualmodelmedicalinvoicesrecognition