Cargando…

Historical Text Image Enhancement Using Image Scaling and Generative Adversarial Networks

Historical documents such as newspapers, invoices, contract papers are often difficult to read due to degraded text quality. These documents may be damaged or degraded due to a variety of factors such as aging, distortion, stamps, watermarks, ink stains, and so on. Text image enhancement is essentia...

Descripción completa

Detalles Bibliográficos
Autores principales: Khan, Sajid Ullah, Ullah, Imdad, Khan, Faheem, Lee, Youngmoon, Ullah, Shahid
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10142040/
https://www.ncbi.nlm.nih.gov/pubmed/37112344
http://dx.doi.org/10.3390/s23084003
_version_ 1785033518431076352
author Khan, Sajid Ullah
Ullah, Imdad
Khan, Faheem
Lee, Youngmoon
Ullah, Shahid
author_facet Khan, Sajid Ullah
Ullah, Imdad
Khan, Faheem
Lee, Youngmoon
Ullah, Shahid
author_sort Khan, Sajid Ullah
collection PubMed
description Historical documents such as newspapers, invoices, contract papers are often difficult to read due to degraded text quality. These documents may be damaged or degraded due to a variety of factors such as aging, distortion, stamps, watermarks, ink stains, and so on. Text image enhancement is essential for several document recognition and analysis tasks. In this era of technology, it is important to enhance these degraded text documents for proper use. To address these issues, a new bi-cubic interpolation of Lifting Wavelet Transform (LWT) and Stationary Wavelet Transform (SWT) is proposed to enhance image resolution. Then a generative adversarial network (GAN) is used to extract the spectral and spatial features in historical text images. The proposed method consists of two parts. In the first part, the transformation method is used to de-noise and de-blur the images, and to increase the resolution effects, whereas in the second part, the GAN architecture is used to fuse the original and the resulting image obtained from part one in order to improve the spectral and spatial features of a historical text image. Experiment results show that the proposed model outperforms the current deep learning methods.
format Online
Article
Text
id pubmed-10142040
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-101420402023-04-29 Historical Text Image Enhancement Using Image Scaling and Generative Adversarial Networks Khan, Sajid Ullah Ullah, Imdad Khan, Faheem Lee, Youngmoon Ullah, Shahid Sensors (Basel) Communication Historical documents such as newspapers, invoices, contract papers are often difficult to read due to degraded text quality. These documents may be damaged or degraded due to a variety of factors such as aging, distortion, stamps, watermarks, ink stains, and so on. Text image enhancement is essential for several document recognition and analysis tasks. In this era of technology, it is important to enhance these degraded text documents for proper use. To address these issues, a new bi-cubic interpolation of Lifting Wavelet Transform (LWT) and Stationary Wavelet Transform (SWT) is proposed to enhance image resolution. Then a generative adversarial network (GAN) is used to extract the spectral and spatial features in historical text images. The proposed method consists of two parts. In the first part, the transformation method is used to de-noise and de-blur the images, and to increase the resolution effects, whereas in the second part, the GAN architecture is used to fuse the original and the resulting image obtained from part one in order to improve the spectral and spatial features of a historical text image. Experiment results show that the proposed model outperforms the current deep learning methods. MDPI 2023-04-14 /pmc/articles/PMC10142040/ /pubmed/37112344 http://dx.doi.org/10.3390/s23084003 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Communication
Khan, Sajid Ullah
Ullah, Imdad
Khan, Faheem
Lee, Youngmoon
Ullah, Shahid
Historical Text Image Enhancement Using Image Scaling and Generative Adversarial Networks
title Historical Text Image Enhancement Using Image Scaling and Generative Adversarial Networks
title_full Historical Text Image Enhancement Using Image Scaling and Generative Adversarial Networks
title_fullStr Historical Text Image Enhancement Using Image Scaling and Generative Adversarial Networks
title_full_unstemmed Historical Text Image Enhancement Using Image Scaling and Generative Adversarial Networks
title_short Historical Text Image Enhancement Using Image Scaling and Generative Adversarial Networks
title_sort historical text image enhancement using image scaling and generative adversarial networks
topic Communication
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10142040/
https://www.ncbi.nlm.nih.gov/pubmed/37112344
http://dx.doi.org/10.3390/s23084003
work_keys_str_mv AT khansajidullah historicaltextimageenhancementusingimagescalingandgenerativeadversarialnetworks
AT ullahimdad historicaltextimageenhancementusingimagescalingandgenerativeadversarialnetworks
AT khanfaheem historicaltextimageenhancementusingimagescalingandgenerativeadversarialnetworks
AT leeyoungmoon historicaltextimageenhancementusingimagescalingandgenerativeadversarialnetworks
AT ullahshahid historicaltextimageenhancementusingimagescalingandgenerativeadversarialnetworks