Cargando…
A Novel Adaptive Deskewing Algorithm for Document Images
Document scanning often suffers from skewing, which may seriously influence the efficiency of Optical Character Recognition (OCR). Therefore, it is necessary to correct the skewed document before document image information analysis. In this article, we propose a novel adaptive deskewing algorithm fo...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9610931/ https://www.ncbi.nlm.nih.gov/pubmed/36298294 http://dx.doi.org/10.3390/s22207944 |
_version_ | 1784819400542519296 |
---|---|
author | Bao, Wuzhida Yang, Cihui Wen, Shiping Zeng, Mengjie Guo, Jianyong Zhong, Jingting Xu, Xingmiao |
author_facet | Bao, Wuzhida Yang, Cihui Wen, Shiping Zeng, Mengjie Guo, Jianyong Zhong, Jingting Xu, Xingmiao |
author_sort | Bao, Wuzhida |
collection | PubMed |
description | Document scanning often suffers from skewing, which may seriously influence the efficiency of Optical Character Recognition (OCR). Therefore, it is necessary to correct the skewed document before document image information analysis. In this article, we propose a novel adaptive deskewing algorithm for document images, which mainly includes Skeleton Line Detection (SKLD), Piecewise Projection Profile (PPP), Morphological Clustering (MC), and the image classification method. The image type is determined firstly based on the image’s layout feature. Thus, adaptive correcting is applied to deskew the image according to its type. Our method maintains high accuracy on the Document Image Skew Estimation Contest (DISEC’2013) and PubLayNet datasets, which achieved 97.6% and 80.1% accuracy, respectively. Meanwhile, extensive experiments show the superiority of the proposed algorithm. |
format | Online Article Text |
id | pubmed-9610931 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-96109312022-10-28 A Novel Adaptive Deskewing Algorithm for Document Images Bao, Wuzhida Yang, Cihui Wen, Shiping Zeng, Mengjie Guo, Jianyong Zhong, Jingting Xu, Xingmiao Sensors (Basel) Article Document scanning often suffers from skewing, which may seriously influence the efficiency of Optical Character Recognition (OCR). Therefore, it is necessary to correct the skewed document before document image information analysis. In this article, we propose a novel adaptive deskewing algorithm for document images, which mainly includes Skeleton Line Detection (SKLD), Piecewise Projection Profile (PPP), Morphological Clustering (MC), and the image classification method. The image type is determined firstly based on the image’s layout feature. Thus, adaptive correcting is applied to deskew the image according to its type. Our method maintains high accuracy on the Document Image Skew Estimation Contest (DISEC’2013) and PubLayNet datasets, which achieved 97.6% and 80.1% accuracy, respectively. Meanwhile, extensive experiments show the superiority of the proposed algorithm. MDPI 2022-10-18 /pmc/articles/PMC9610931/ /pubmed/36298294 http://dx.doi.org/10.3390/s22207944 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Bao, Wuzhida Yang, Cihui Wen, Shiping Zeng, Mengjie Guo, Jianyong Zhong, Jingting Xu, Xingmiao A Novel Adaptive Deskewing Algorithm for Document Images |
title | A Novel Adaptive Deskewing Algorithm for Document Images |
title_full | A Novel Adaptive Deskewing Algorithm for Document Images |
title_fullStr | A Novel Adaptive Deskewing Algorithm for Document Images |
title_full_unstemmed | A Novel Adaptive Deskewing Algorithm for Document Images |
title_short | A Novel Adaptive Deskewing Algorithm for Document Images |
title_sort | novel adaptive deskewing algorithm for document images |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9610931/ https://www.ncbi.nlm.nih.gov/pubmed/36298294 http://dx.doi.org/10.3390/s22207944 |
work_keys_str_mv | AT baowuzhida anoveladaptivedeskewingalgorithmfordocumentimages AT yangcihui anoveladaptivedeskewingalgorithmfordocumentimages AT wenshiping anoveladaptivedeskewingalgorithmfordocumentimages AT zengmengjie anoveladaptivedeskewingalgorithmfordocumentimages AT guojianyong anoveladaptivedeskewingalgorithmfordocumentimages AT zhongjingting anoveladaptivedeskewingalgorithmfordocumentimages AT xuxingmiao anoveladaptivedeskewingalgorithmfordocumentimages AT baowuzhida noveladaptivedeskewingalgorithmfordocumentimages AT yangcihui noveladaptivedeskewingalgorithmfordocumentimages AT wenshiping noveladaptivedeskewingalgorithmfordocumentimages AT zengmengjie noveladaptivedeskewingalgorithmfordocumentimages AT guojianyong noveladaptivedeskewingalgorithmfordocumentimages AT zhongjingting noveladaptivedeskewingalgorithmfordocumentimages AT xuxingmiao noveladaptivedeskewingalgorithmfordocumentimages |