Cargando…
DCNet: Noise-Robust Convolutional Neural Networks for Degradation Classification on Ancient Documents
Analysis of degraded ancient documents is challenging due to the severity and combination of degradation present in a single image. Ancient documents also suffer from additional noise during the digitalization process, particularly when digitalization is done using low-specification devices and/or u...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8321348/ http://dx.doi.org/10.3390/jimaging7070114 |
_version_ | 1783730831309668352 |
---|---|
author | Arnia, Fitri Saddami, Khairun Munadi, Khairul |
author_facet | Arnia, Fitri Saddami, Khairun Munadi, Khairul |
author_sort | Arnia, Fitri |
collection | PubMed |
description | Analysis of degraded ancient documents is challenging due to the severity and combination of degradation present in a single image. Ancient documents also suffer from additional noise during the digitalization process, particularly when digitalization is done using low-specification devices and/or under poor illumination conditions. The noises over the degraded ancient documents certainly cause a troublesome document analysis. In this paper, we propose a new noise-robust convolutional neural network (CNN) architecture for degradation classification of noisy ancient documents, which is called a degradation classification network (DCNet). DCNet was constructed based on the ResNet101, MobileNetV2, and ShuffleNet architectures. Furthermore, we propose a new self-transition layer following DCNet. We trained the DCNet using (1) noise-free document images and (2) heavy-noise (zero mean Gaussian noise (ZMGN) and speckle) document images. Then, we tested the resulted models with document images containing different levels of ZMGN and speckle noise. We compared our results to three CNN benchmarking architectures, namely MobileNet, ShuffleNet, and ResNet101. In general, the proposed architecture performed better than MobileNet, ShuffleNet, ResNet101, and conventional machine learning (support vector machine and random forest), particularly for documents with heavy noise. |
format | Online Article Text |
id | pubmed-8321348 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-83213482021-08-26 DCNet: Noise-Robust Convolutional Neural Networks for Degradation Classification on Ancient Documents Arnia, Fitri Saddami, Khairun Munadi, Khairul J Imaging Article Analysis of degraded ancient documents is challenging due to the severity and combination of degradation present in a single image. Ancient documents also suffer from additional noise during the digitalization process, particularly when digitalization is done using low-specification devices and/or under poor illumination conditions. The noises over the degraded ancient documents certainly cause a troublesome document analysis. In this paper, we propose a new noise-robust convolutional neural network (CNN) architecture for degradation classification of noisy ancient documents, which is called a degradation classification network (DCNet). DCNet was constructed based on the ResNet101, MobileNetV2, and ShuffleNet architectures. Furthermore, we propose a new self-transition layer following DCNet. We trained the DCNet using (1) noise-free document images and (2) heavy-noise (zero mean Gaussian noise (ZMGN) and speckle) document images. Then, we tested the resulted models with document images containing different levels of ZMGN and speckle noise. We compared our results to three CNN benchmarking architectures, namely MobileNet, ShuffleNet, and ResNet101. In general, the proposed architecture performed better than MobileNet, ShuffleNet, ResNet101, and conventional machine learning (support vector machine and random forest), particularly for documents with heavy noise. MDPI 2021-07-12 /pmc/articles/PMC8321348/ http://dx.doi.org/10.3390/jimaging7070114 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Arnia, Fitri Saddami, Khairun Munadi, Khairul DCNet: Noise-Robust Convolutional Neural Networks for Degradation Classification on Ancient Documents |
title | DCNet: Noise-Robust Convolutional Neural Networks for Degradation Classification on Ancient Documents |
title_full | DCNet: Noise-Robust Convolutional Neural Networks for Degradation Classification on Ancient Documents |
title_fullStr | DCNet: Noise-Robust Convolutional Neural Networks for Degradation Classification on Ancient Documents |
title_full_unstemmed | DCNet: Noise-Robust Convolutional Neural Networks for Degradation Classification on Ancient Documents |
title_short | DCNet: Noise-Robust Convolutional Neural Networks for Degradation Classification on Ancient Documents |
title_sort | dcnet: noise-robust convolutional neural networks for degradation classification on ancient documents |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8321348/ http://dx.doi.org/10.3390/jimaging7070114 |
work_keys_str_mv | AT arniafitri dcnetnoiserobustconvolutionalneuralnetworksfordegradationclassificationonancientdocuments AT saddamikhairun dcnetnoiserobustconvolutionalneuralnetworksfordegradationclassificationonancientdocuments AT munadikhairul dcnetnoiserobustconvolutionalneuralnetworksfordegradationclassificationonancientdocuments |