Cargando…
Historical Text Image Enhancement Using Image Scaling and Generative Adversarial Networks
Historical documents such as newspapers, invoices, contract papers are often difficult to read due to degraded text quality. These documents may be damaged or degraded due to a variety of factors such as aging, distortion, stamps, watermarks, ink stains, and so on. Text image enhancement is essentia...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10142040/ https://www.ncbi.nlm.nih.gov/pubmed/37112344 http://dx.doi.org/10.3390/s23084003 |
_version_ | 1785033518431076352 |
---|---|
author | Khan, Sajid Ullah Ullah, Imdad Khan, Faheem Lee, Youngmoon Ullah, Shahid |
author_facet | Khan, Sajid Ullah Ullah, Imdad Khan, Faheem Lee, Youngmoon Ullah, Shahid |
author_sort | Khan, Sajid Ullah |
collection | PubMed |
description | Historical documents such as newspapers, invoices, contract papers are often difficult to read due to degraded text quality. These documents may be damaged or degraded due to a variety of factors such as aging, distortion, stamps, watermarks, ink stains, and so on. Text image enhancement is essential for several document recognition and analysis tasks. In this era of technology, it is important to enhance these degraded text documents for proper use. To address these issues, a new bi-cubic interpolation of Lifting Wavelet Transform (LWT) and Stationary Wavelet Transform (SWT) is proposed to enhance image resolution. Then a generative adversarial network (GAN) is used to extract the spectral and spatial features in historical text images. The proposed method consists of two parts. In the first part, the transformation method is used to de-noise and de-blur the images, and to increase the resolution effects, whereas in the second part, the GAN architecture is used to fuse the original and the resulting image obtained from part one in order to improve the spectral and spatial features of a historical text image. Experiment results show that the proposed model outperforms the current deep learning methods. |
format | Online Article Text |
id | pubmed-10142040 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-101420402023-04-29 Historical Text Image Enhancement Using Image Scaling and Generative Adversarial Networks Khan, Sajid Ullah Ullah, Imdad Khan, Faheem Lee, Youngmoon Ullah, Shahid Sensors (Basel) Communication Historical documents such as newspapers, invoices, contract papers are often difficult to read due to degraded text quality. These documents may be damaged or degraded due to a variety of factors such as aging, distortion, stamps, watermarks, ink stains, and so on. Text image enhancement is essential for several document recognition and analysis tasks. In this era of technology, it is important to enhance these degraded text documents for proper use. To address these issues, a new bi-cubic interpolation of Lifting Wavelet Transform (LWT) and Stationary Wavelet Transform (SWT) is proposed to enhance image resolution. Then a generative adversarial network (GAN) is used to extract the spectral and spatial features in historical text images. The proposed method consists of two parts. In the first part, the transformation method is used to de-noise and de-blur the images, and to increase the resolution effects, whereas in the second part, the GAN architecture is used to fuse the original and the resulting image obtained from part one in order to improve the spectral and spatial features of a historical text image. Experiment results show that the proposed model outperforms the current deep learning methods. MDPI 2023-04-14 /pmc/articles/PMC10142040/ /pubmed/37112344 http://dx.doi.org/10.3390/s23084003 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Communication Khan, Sajid Ullah Ullah, Imdad Khan, Faheem Lee, Youngmoon Ullah, Shahid Historical Text Image Enhancement Using Image Scaling and Generative Adversarial Networks |
title | Historical Text Image Enhancement Using Image Scaling and Generative Adversarial Networks |
title_full | Historical Text Image Enhancement Using Image Scaling and Generative Adversarial Networks |
title_fullStr | Historical Text Image Enhancement Using Image Scaling and Generative Adversarial Networks |
title_full_unstemmed | Historical Text Image Enhancement Using Image Scaling and Generative Adversarial Networks |
title_short | Historical Text Image Enhancement Using Image Scaling and Generative Adversarial Networks |
title_sort | historical text image enhancement using image scaling and generative adversarial networks |
topic | Communication |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10142040/ https://www.ncbi.nlm.nih.gov/pubmed/37112344 http://dx.doi.org/10.3390/s23084003 |
work_keys_str_mv | AT khansajidullah historicaltextimageenhancementusingimagescalingandgenerativeadversarialnetworks AT ullahimdad historicaltextimageenhancementusingimagescalingandgenerativeadversarialnetworks AT khanfaheem historicaltextimageenhancementusingimagescalingandgenerativeadversarialnetworks AT leeyoungmoon historicaltextimageenhancementusingimagescalingandgenerativeadversarialnetworks AT ullahshahid historicaltextimageenhancementusingimagescalingandgenerativeadversarialnetworks |