Cargando…
iDocChip: A Configurable Hardware Accelerator for an End-to-End Historical Document Image Processing
In recent years, there has been an increasing demand to digitize and electronically access historical records. Optical character recognition (OCR) is typically applied to scanned historical archives to transcribe them from document images into machine-readable texts. Many libraries offer special sta...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8467298/ https://www.ncbi.nlm.nih.gov/pubmed/34564101 http://dx.doi.org/10.3390/jimaging7090175 |
_version_ | 1784573361648566272 |
---|---|
author | Tekleyohannes, Menbere Kina Rybalkin, Vladimir Ghaffar, Muhammad Mohsin Varela, Javier Alejandro Wehn, Norbert Dengel, Andreas |
author_facet | Tekleyohannes, Menbere Kina Rybalkin, Vladimir Ghaffar, Muhammad Mohsin Varela, Javier Alejandro Wehn, Norbert Dengel, Andreas |
author_sort | Tekleyohannes, Menbere Kina |
collection | PubMed |
description | In recent years, there has been an increasing demand to digitize and electronically access historical records. Optical character recognition (OCR) is typically applied to scanned historical archives to transcribe them from document images into machine-readable texts. Many libraries offer special stationary equipment for scanning historical documents. However, to digitize these records without removing them from where they are archived, portable devices that combine scanning and OCR capabilities are required. An existing end-to-end OCR software called anyOCR achieves high recognition accuracy for historical documents. However, it is unsuitable for portable devices, as it exhibits high computational complexity resulting in long runtime and high power consumption. Therefore, we have designed and implemented a configurable hardware-software programmable SoC called iDocChip that makes use of anyOCR techniques to achieve high accuracy. As a low-power and energy-efficient system with real-time capabilities, the iDocChip delivers the required portability. In this paper, we present the hybrid CPU-FPGA architecture of iDocChip along with the optimized software implementations of the anyOCR. We demonstrate our results on multiple platforms with respect to runtime and power consumption. The iDocChip system outperforms the existing anyOCR by [Formula: see text] while achieving [Formula: see text] higher energy efficiency and a [Formula: see text] increase in recognition accuracy. |
format | Online Article Text |
id | pubmed-8467298 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-84672982021-10-28 iDocChip: A Configurable Hardware Accelerator for an End-to-End Historical Document Image Processing Tekleyohannes, Menbere Kina Rybalkin, Vladimir Ghaffar, Muhammad Mohsin Varela, Javier Alejandro Wehn, Norbert Dengel, Andreas J Imaging Article In recent years, there has been an increasing demand to digitize and electronically access historical records. Optical character recognition (OCR) is typically applied to scanned historical archives to transcribe them from document images into machine-readable texts. Many libraries offer special stationary equipment for scanning historical documents. However, to digitize these records without removing them from where they are archived, portable devices that combine scanning and OCR capabilities are required. An existing end-to-end OCR software called anyOCR achieves high recognition accuracy for historical documents. However, it is unsuitable for portable devices, as it exhibits high computational complexity resulting in long runtime and high power consumption. Therefore, we have designed and implemented a configurable hardware-software programmable SoC called iDocChip that makes use of anyOCR techniques to achieve high accuracy. As a low-power and energy-efficient system with real-time capabilities, the iDocChip delivers the required portability. In this paper, we present the hybrid CPU-FPGA architecture of iDocChip along with the optimized software implementations of the anyOCR. We demonstrate our results on multiple platforms with respect to runtime and power consumption. The iDocChip system outperforms the existing anyOCR by [Formula: see text] while achieving [Formula: see text] higher energy efficiency and a [Formula: see text] increase in recognition accuracy. MDPI 2021-09-03 /pmc/articles/PMC8467298/ /pubmed/34564101 http://dx.doi.org/10.3390/jimaging7090175 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Tekleyohannes, Menbere Kina Rybalkin, Vladimir Ghaffar, Muhammad Mohsin Varela, Javier Alejandro Wehn, Norbert Dengel, Andreas iDocChip: A Configurable Hardware Accelerator for an End-to-End Historical Document Image Processing |
title | iDocChip: A Configurable Hardware Accelerator for an End-to-End Historical Document Image Processing |
title_full | iDocChip: A Configurable Hardware Accelerator for an End-to-End Historical Document Image Processing |
title_fullStr | iDocChip: A Configurable Hardware Accelerator for an End-to-End Historical Document Image Processing |
title_full_unstemmed | iDocChip: A Configurable Hardware Accelerator for an End-to-End Historical Document Image Processing |
title_short | iDocChip: A Configurable Hardware Accelerator for an End-to-End Historical Document Image Processing |
title_sort | idocchip: a configurable hardware accelerator for an end-to-end historical document image processing |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8467298/ https://www.ncbi.nlm.nih.gov/pubmed/34564101 http://dx.doi.org/10.3390/jimaging7090175 |
work_keys_str_mv | AT tekleyohannesmenberekina idocchipaconfigurablehardwareacceleratorforanendtoendhistoricaldocumentimageprocessing AT rybalkinvladimir idocchipaconfigurablehardwareacceleratorforanendtoendhistoricaldocumentimageprocessing AT ghaffarmuhammadmohsin idocchipaconfigurablehardwareacceleratorforanendtoendhistoricaldocumentimageprocessing AT varelajavieralejandro idocchipaconfigurablehardwareacceleratorforanendtoendhistoricaldocumentimageprocessing AT wehnnorbert idocchipaconfigurablehardwareacceleratorforanendtoendhistoricaldocumentimageprocessing AT dengelandreas idocchipaconfigurablehardwareacceleratorforanendtoendhistoricaldocumentimageprocessing |