Cargando…
Dual Model Medical Invoices Recognition
Hospitals need to invest a lot of manpower to manually input the contents of medical invoices (nearly 300,000,000 medical invoices a year) into the medical system. In order to help the hospital save money and stabilize work efficiency, this paper designed a system to complete the complicated work us...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6832594/ https://www.ncbi.nlm.nih.gov/pubmed/31658617 http://dx.doi.org/10.3390/s19204370 |
_version_ | 1783466209810841600 |
---|---|
author | Yi, Fei Zhao, Yi-Fei Sheng, Guan-Qun Xie, Kai Wen, Chang Tang, Xin-Gong Qi, Xuan |
author_facet | Yi, Fei Zhao, Yi-Fei Sheng, Guan-Qun Xie, Kai Wen, Chang Tang, Xin-Gong Qi, Xuan |
author_sort | Yi, Fei |
collection | PubMed |
description | Hospitals need to invest a lot of manpower to manually input the contents of medical invoices (nearly 300,000,000 medical invoices a year) into the medical system. In order to help the hospital save money and stabilize work efficiency, this paper designed a system to complete the complicated work using a Gaussian blur and smoothing–convolutional neural network combined with a recurrent neural network (GBS-CR) method. Gaussian blur and smoothing (GBS) is a novel preprocessing method that can fix the breakpoint font in medical invoices. The combination of convolutional neural network (CNN) and recurrent neural network (RNN) was used to raise the recognition rate of the breakpoint font in medical invoices. RNN was designed to be the semantic revision module. In the aspect of image preprocessing, Gaussian blur and smoothing were used to fix the breakpoint font. In the period of making the self-built dataset, a certain proportion of the breakpoint font (the font of breakpoint is 3, the original font is 7) was added, in this paper, so as to optimize the Alexnet–Adam–CNN (AA-CNN) model, which is more suitable for the recognition of the breakpoint font than the traditional CNN model. In terms of the identification methods, we not only adopted the optimized AA-CNN for identification, but also combined RNN to carry out the semantic revisions of the identified results of CNN, meanwhile further improving the recognition rate of the medical invoices. The experimental results show that compared with the state-of-art invoice recognition method, the method presented in this paper has an average increase of 10 to 15 percentage points in recognition rate. |
format | Online Article Text |
id | pubmed-6832594 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-68325942019-11-25 Dual Model Medical Invoices Recognition Yi, Fei Zhao, Yi-Fei Sheng, Guan-Qun Xie, Kai Wen, Chang Tang, Xin-Gong Qi, Xuan Sensors (Basel) Article Hospitals need to invest a lot of manpower to manually input the contents of medical invoices (nearly 300,000,000 medical invoices a year) into the medical system. In order to help the hospital save money and stabilize work efficiency, this paper designed a system to complete the complicated work using a Gaussian blur and smoothing–convolutional neural network combined with a recurrent neural network (GBS-CR) method. Gaussian blur and smoothing (GBS) is a novel preprocessing method that can fix the breakpoint font in medical invoices. The combination of convolutional neural network (CNN) and recurrent neural network (RNN) was used to raise the recognition rate of the breakpoint font in medical invoices. RNN was designed to be the semantic revision module. In the aspect of image preprocessing, Gaussian blur and smoothing were used to fix the breakpoint font. In the period of making the self-built dataset, a certain proportion of the breakpoint font (the font of breakpoint is 3, the original font is 7) was added, in this paper, so as to optimize the Alexnet–Adam–CNN (AA-CNN) model, which is more suitable for the recognition of the breakpoint font than the traditional CNN model. In terms of the identification methods, we not only adopted the optimized AA-CNN for identification, but also combined RNN to carry out the semantic revisions of the identified results of CNN, meanwhile further improving the recognition rate of the medical invoices. The experimental results show that compared with the state-of-art invoice recognition method, the method presented in this paper has an average increase of 10 to 15 percentage points in recognition rate. MDPI 2019-10-10 /pmc/articles/PMC6832594/ /pubmed/31658617 http://dx.doi.org/10.3390/s19204370 Text en © 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Yi, Fei Zhao, Yi-Fei Sheng, Guan-Qun Xie, Kai Wen, Chang Tang, Xin-Gong Qi, Xuan Dual Model Medical Invoices Recognition |
title | Dual Model Medical Invoices Recognition |
title_full | Dual Model Medical Invoices Recognition |
title_fullStr | Dual Model Medical Invoices Recognition |
title_full_unstemmed | Dual Model Medical Invoices Recognition |
title_short | Dual Model Medical Invoices Recognition |
title_sort | dual model medical invoices recognition |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6832594/ https://www.ncbi.nlm.nih.gov/pubmed/31658617 http://dx.doi.org/10.3390/s19204370 |
work_keys_str_mv | AT yifei dualmodelmedicalinvoicesrecognition AT zhaoyifei dualmodelmedicalinvoicesrecognition AT shengguanqun dualmodelmedicalinvoicesrecognition AT xiekai dualmodelmedicalinvoicesrecognition AT wenchang dualmodelmedicalinvoicesrecognition AT tangxingong dualmodelmedicalinvoicesrecognition AT qixuan dualmodelmedicalinvoicesrecognition |