Cargando…

iDocChip: A Configurable Hardware Accelerator for an End-to-End Historical Document Image Processing

In recent years, there has been an increasing demand to digitize and electronically access historical records. Optical character recognition (OCR) is typically applied to scanned historical archives to transcribe them from document images into machine-readable texts. Many libraries offer special sta...

Descripción completa

Detalles Bibliográficos
Autores principales: Tekleyohannes, Menbere Kina, Rybalkin, Vladimir, Ghaffar, Muhammad Mohsin, Varela, Javier Alejandro, Wehn, Norbert, Dengel, Andreas
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8467298/
https://www.ncbi.nlm.nih.gov/pubmed/34564101
http://dx.doi.org/10.3390/jimaging7090175
_version_ 1784573361648566272
author Tekleyohannes, Menbere Kina
Rybalkin, Vladimir
Ghaffar, Muhammad Mohsin
Varela, Javier Alejandro
Wehn, Norbert
Dengel, Andreas
author_facet Tekleyohannes, Menbere Kina
Rybalkin, Vladimir
Ghaffar, Muhammad Mohsin
Varela, Javier Alejandro
Wehn, Norbert
Dengel, Andreas
author_sort Tekleyohannes, Menbere Kina
collection PubMed
description In recent years, there has been an increasing demand to digitize and electronically access historical records. Optical character recognition (OCR) is typically applied to scanned historical archives to transcribe them from document images into machine-readable texts. Many libraries offer special stationary equipment for scanning historical documents. However, to digitize these records without removing them from where they are archived, portable devices that combine scanning and OCR capabilities are required. An existing end-to-end OCR software called anyOCR achieves high recognition accuracy for historical documents. However, it is unsuitable for portable devices, as it exhibits high computational complexity resulting in long runtime and high power consumption. Therefore, we have designed and implemented a configurable hardware-software programmable SoC called iDocChip that makes use of anyOCR techniques to achieve high accuracy. As a low-power and energy-efficient system with real-time capabilities, the iDocChip delivers the required portability. In this paper, we present the hybrid CPU-FPGA architecture of iDocChip along with the optimized software implementations of the anyOCR. We demonstrate our results on multiple platforms with respect to runtime and power consumption. The iDocChip system outperforms the existing anyOCR by [Formula: see text] while achieving [Formula: see text] higher energy efficiency and a [Formula: see text] increase in recognition accuracy.
format Online
Article
Text
id pubmed-8467298
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-84672982021-10-28 iDocChip: A Configurable Hardware Accelerator for an End-to-End Historical Document Image Processing Tekleyohannes, Menbere Kina Rybalkin, Vladimir Ghaffar, Muhammad Mohsin Varela, Javier Alejandro Wehn, Norbert Dengel, Andreas J Imaging Article In recent years, there has been an increasing demand to digitize and electronically access historical records. Optical character recognition (OCR) is typically applied to scanned historical archives to transcribe them from document images into machine-readable texts. Many libraries offer special stationary equipment for scanning historical documents. However, to digitize these records without removing them from where they are archived, portable devices that combine scanning and OCR capabilities are required. An existing end-to-end OCR software called anyOCR achieves high recognition accuracy for historical documents. However, it is unsuitable for portable devices, as it exhibits high computational complexity resulting in long runtime and high power consumption. Therefore, we have designed and implemented a configurable hardware-software programmable SoC called iDocChip that makes use of anyOCR techniques to achieve high accuracy. As a low-power and energy-efficient system with real-time capabilities, the iDocChip delivers the required portability. In this paper, we present the hybrid CPU-FPGA architecture of iDocChip along with the optimized software implementations of the anyOCR. We demonstrate our results on multiple platforms with respect to runtime and power consumption. The iDocChip system outperforms the existing anyOCR by [Formula: see text] while achieving [Formula: see text] higher energy efficiency and a [Formula: see text] increase in recognition accuracy. MDPI 2021-09-03 /pmc/articles/PMC8467298/ /pubmed/34564101 http://dx.doi.org/10.3390/jimaging7090175 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Tekleyohannes, Menbere Kina
Rybalkin, Vladimir
Ghaffar, Muhammad Mohsin
Varela, Javier Alejandro
Wehn, Norbert
Dengel, Andreas
iDocChip: A Configurable Hardware Accelerator for an End-to-End Historical Document Image Processing
title iDocChip: A Configurable Hardware Accelerator for an End-to-End Historical Document Image Processing
title_full iDocChip: A Configurable Hardware Accelerator for an End-to-End Historical Document Image Processing
title_fullStr iDocChip: A Configurable Hardware Accelerator for an End-to-End Historical Document Image Processing
title_full_unstemmed iDocChip: A Configurable Hardware Accelerator for an End-to-End Historical Document Image Processing
title_short iDocChip: A Configurable Hardware Accelerator for an End-to-End Historical Document Image Processing
title_sort idocchip: a configurable hardware accelerator for an end-to-end historical document image processing
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8467298/
https://www.ncbi.nlm.nih.gov/pubmed/34564101
http://dx.doi.org/10.3390/jimaging7090175
work_keys_str_mv AT tekleyohannesmenberekina idocchipaconfigurablehardwareacceleratorforanendtoendhistoricaldocumentimageprocessing
AT rybalkinvladimir idocchipaconfigurablehardwareacceleratorforanendtoendhistoricaldocumentimageprocessing
AT ghaffarmuhammadmohsin idocchipaconfigurablehardwareacceleratorforanendtoendhistoricaldocumentimageprocessing
AT varelajavieralejandro idocchipaconfigurablehardwareacceleratorforanendtoendhistoricaldocumentimageprocessing
AT wehnnorbert idocchipaconfigurablehardwareacceleratorforanendtoendhistoricaldocumentimageprocessing
AT dengelandreas idocchipaconfigurablehardwareacceleratorforanendtoendhistoricaldocumentimageprocessing