Cargando…

A Novel Adaptive Deskewing Algorithm for Document Images

Document scanning often suffers from skewing, which may seriously influence the efficiency of Optical Character Recognition (OCR). Therefore, it is necessary to correct the skewed document before document image information analysis. In this article, we propose a novel adaptive deskewing algorithm fo...

Descripción completa

Detalles Bibliográficos
Autores principales: Bao, Wuzhida, Yang, Cihui, Wen, Shiping, Zeng, Mengjie, Guo, Jianyong, Zhong, Jingting, Xu, Xingmiao
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9610931/
https://www.ncbi.nlm.nih.gov/pubmed/36298294
http://dx.doi.org/10.3390/s22207944
_version_ 1784819400542519296
author Bao, Wuzhida
Yang, Cihui
Wen, Shiping
Zeng, Mengjie
Guo, Jianyong
Zhong, Jingting
Xu, Xingmiao
author_facet Bao, Wuzhida
Yang, Cihui
Wen, Shiping
Zeng, Mengjie
Guo, Jianyong
Zhong, Jingting
Xu, Xingmiao
author_sort Bao, Wuzhida
collection PubMed
description Document scanning often suffers from skewing, which may seriously influence the efficiency of Optical Character Recognition (OCR). Therefore, it is necessary to correct the skewed document before document image information analysis. In this article, we propose a novel adaptive deskewing algorithm for document images, which mainly includes Skeleton Line Detection (SKLD), Piecewise Projection Profile (PPP), Morphological Clustering (MC), and the image classification method. The image type is determined firstly based on the image’s layout feature. Thus, adaptive correcting is applied to deskew the image according to its type. Our method maintains high accuracy on the Document Image Skew Estimation Contest (DISEC’2013) and PubLayNet datasets, which achieved 97.6% and 80.1% accuracy, respectively. Meanwhile, extensive experiments show the superiority of the proposed algorithm.
format Online
Article
Text
id pubmed-9610931
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-96109312022-10-28 A Novel Adaptive Deskewing Algorithm for Document Images Bao, Wuzhida Yang, Cihui Wen, Shiping Zeng, Mengjie Guo, Jianyong Zhong, Jingting Xu, Xingmiao Sensors (Basel) Article Document scanning often suffers from skewing, which may seriously influence the efficiency of Optical Character Recognition (OCR). Therefore, it is necessary to correct the skewed document before document image information analysis. In this article, we propose a novel adaptive deskewing algorithm for document images, which mainly includes Skeleton Line Detection (SKLD), Piecewise Projection Profile (PPP), Morphological Clustering (MC), and the image classification method. The image type is determined firstly based on the image’s layout feature. Thus, adaptive correcting is applied to deskew the image according to its type. Our method maintains high accuracy on the Document Image Skew Estimation Contest (DISEC’2013) and PubLayNet datasets, which achieved 97.6% and 80.1% accuracy, respectively. Meanwhile, extensive experiments show the superiority of the proposed algorithm. MDPI 2022-10-18 /pmc/articles/PMC9610931/ /pubmed/36298294 http://dx.doi.org/10.3390/s22207944 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Bao, Wuzhida
Yang, Cihui
Wen, Shiping
Zeng, Mengjie
Guo, Jianyong
Zhong, Jingting
Xu, Xingmiao
A Novel Adaptive Deskewing Algorithm for Document Images
title A Novel Adaptive Deskewing Algorithm for Document Images
title_full A Novel Adaptive Deskewing Algorithm for Document Images
title_fullStr A Novel Adaptive Deskewing Algorithm for Document Images
title_full_unstemmed A Novel Adaptive Deskewing Algorithm for Document Images
title_short A Novel Adaptive Deskewing Algorithm for Document Images
title_sort novel adaptive deskewing algorithm for document images
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9610931/
https://www.ncbi.nlm.nih.gov/pubmed/36298294
http://dx.doi.org/10.3390/s22207944
work_keys_str_mv AT baowuzhida anoveladaptivedeskewingalgorithmfordocumentimages
AT yangcihui anoveladaptivedeskewingalgorithmfordocumentimages
AT wenshiping anoveladaptivedeskewingalgorithmfordocumentimages
AT zengmengjie anoveladaptivedeskewingalgorithmfordocumentimages
AT guojianyong anoveladaptivedeskewingalgorithmfordocumentimages
AT zhongjingting anoveladaptivedeskewingalgorithmfordocumentimages
AT xuxingmiao anoveladaptivedeskewingalgorithmfordocumentimages
AT baowuzhida noveladaptivedeskewingalgorithmfordocumentimages
AT yangcihui noveladaptivedeskewingalgorithmfordocumentimages
AT wenshiping noveladaptivedeskewingalgorithmfordocumentimages
AT zengmengjie noveladaptivedeskewingalgorithmfordocumentimages
AT guojianyong noveladaptivedeskewingalgorithmfordocumentimages
AT zhongjingting noveladaptivedeskewingalgorithmfordocumentimages
AT xuxingmiao noveladaptivedeskewingalgorithmfordocumentimages