Cargando…

DCNet: Noise-Robust Convolutional Neural Networks for Degradation Classification on Ancient Documents

Analysis of degraded ancient documents is challenging due to the severity and combination of degradation present in a single image. Ancient documents also suffer from additional noise during the digitalization process, particularly when digitalization is done using low-specification devices and/or u...

Descripción completa

Detalles Bibliográficos
Autores principales: Arnia, Fitri, Saddami, Khairun, Munadi, Khairul
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8321348/
http://dx.doi.org/10.3390/jimaging7070114
_version_ 1783730831309668352
author Arnia, Fitri
Saddami, Khairun
Munadi, Khairul
author_facet Arnia, Fitri
Saddami, Khairun
Munadi, Khairul
author_sort Arnia, Fitri
collection PubMed
description Analysis of degraded ancient documents is challenging due to the severity and combination of degradation present in a single image. Ancient documents also suffer from additional noise during the digitalization process, particularly when digitalization is done using low-specification devices and/or under poor illumination conditions. The noises over the degraded ancient documents certainly cause a troublesome document analysis. In this paper, we propose a new noise-robust convolutional neural network (CNN) architecture for degradation classification of noisy ancient documents, which is called a degradation classification network (DCNet). DCNet was constructed based on the ResNet101, MobileNetV2, and ShuffleNet architectures. Furthermore, we propose a new self-transition layer following DCNet. We trained the DCNet using (1) noise-free document images and (2) heavy-noise (zero mean Gaussian noise (ZMGN) and speckle) document images. Then, we tested the resulted models with document images containing different levels of ZMGN and speckle noise. We compared our results to three CNN benchmarking architectures, namely MobileNet, ShuffleNet, and ResNet101. In general, the proposed architecture performed better than MobileNet, ShuffleNet, ResNet101, and conventional machine learning (support vector machine and random forest), particularly for documents with heavy noise.
format Online
Article
Text
id pubmed-8321348
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-83213482021-08-26 DCNet: Noise-Robust Convolutional Neural Networks for Degradation Classification on Ancient Documents Arnia, Fitri Saddami, Khairun Munadi, Khairul J Imaging Article Analysis of degraded ancient documents is challenging due to the severity and combination of degradation present in a single image. Ancient documents also suffer from additional noise during the digitalization process, particularly when digitalization is done using low-specification devices and/or under poor illumination conditions. The noises over the degraded ancient documents certainly cause a troublesome document analysis. In this paper, we propose a new noise-robust convolutional neural network (CNN) architecture for degradation classification of noisy ancient documents, which is called a degradation classification network (DCNet). DCNet was constructed based on the ResNet101, MobileNetV2, and ShuffleNet architectures. Furthermore, we propose a new self-transition layer following DCNet. We trained the DCNet using (1) noise-free document images and (2) heavy-noise (zero mean Gaussian noise (ZMGN) and speckle) document images. Then, we tested the resulted models with document images containing different levels of ZMGN and speckle noise. We compared our results to three CNN benchmarking architectures, namely MobileNet, ShuffleNet, and ResNet101. In general, the proposed architecture performed better than MobileNet, ShuffleNet, ResNet101, and conventional machine learning (support vector machine and random forest), particularly for documents with heavy noise. MDPI 2021-07-12 /pmc/articles/PMC8321348/ http://dx.doi.org/10.3390/jimaging7070114 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Arnia, Fitri
Saddami, Khairun
Munadi, Khairul
DCNet: Noise-Robust Convolutional Neural Networks for Degradation Classification on Ancient Documents
title DCNet: Noise-Robust Convolutional Neural Networks for Degradation Classification on Ancient Documents
title_full DCNet: Noise-Robust Convolutional Neural Networks for Degradation Classification on Ancient Documents
title_fullStr DCNet: Noise-Robust Convolutional Neural Networks for Degradation Classification on Ancient Documents
title_full_unstemmed DCNet: Noise-Robust Convolutional Neural Networks for Degradation Classification on Ancient Documents
title_short DCNet: Noise-Robust Convolutional Neural Networks for Degradation Classification on Ancient Documents
title_sort dcnet: noise-robust convolutional neural networks for degradation classification on ancient documents
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8321348/
http://dx.doi.org/10.3390/jimaging7070114
work_keys_str_mv AT arniafitri dcnetnoiserobustconvolutionalneuralnetworksfordegradationclassificationonancientdocuments
AT saddamikhairun dcnetnoiserobustconvolutionalneuralnetworksfordegradationclassificationonancientdocuments
AT munadikhairul dcnetnoiserobustconvolutionalneuralnetworksfordegradationclassificationonancientdocuments