Cargando…

Improving Accuracy and Speeding Up Document Image Classification Through Parallel Systems

This paper presents a study showing the benefits of the EfficientNet models compared with heavier Convolutional Neural Networks (CNNs) in the Document Classification task, essential problem in the digitalization process of institutions. We show in the RVL-CDIP dataset that we can improve previous re...

Descripción completa

Detalles Bibliográficos
Autores principales: Ferrando, Javier, Domínguez, Juan Luis, Torres, Jordi, García, Raúl, García, David, Garrido, Daniel, Cortada, Jordi, Valero, Mateo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7302855/
http://dx.doi.org/10.1007/978-3-030-50417-5_29
_version_ 1783547936626442240
author Ferrando, Javier
Domínguez, Juan Luis
Torres, Jordi
García, Raúl
García, David
Garrido, Daniel
Cortada, Jordi
Valero, Mateo
author_facet Ferrando, Javier
Domínguez, Juan Luis
Torres, Jordi
García, Raúl
García, David
Garrido, Daniel
Cortada, Jordi
Valero, Mateo
author_sort Ferrando, Javier
collection PubMed
description This paper presents a study showing the benefits of the EfficientNet models compared with heavier Convolutional Neural Networks (CNNs) in the Document Classification task, essential problem in the digitalization process of institutions. We show in the RVL-CDIP dataset that we can improve previous results with a much lighter model and present its transfer learning capabilities on a smaller in-domain dataset such as Tobacco3482. Moreover, we present an ensemble pipeline which is able to boost solely image input by combining image model predictions with the ones generated by BERT model on extracted text by OCR. We also show that the batch size can be effectively increased without hindering its accuracy so that the training process can be sped up by parallelizing throughout multiple GPUs, decreasing the computational time needed. Lastly, we expose the training performance differences between PyTorch and Tensorflow Deep Learning frameworks.
format Online
Article
Text
id pubmed-7302855
institution National Center for Biotechnology Information
language English
publishDate 2020
record_format MEDLINE/PubMed
spelling pubmed-73028552020-06-19 Improving Accuracy and Speeding Up Document Image Classification Through Parallel Systems Ferrando, Javier Domínguez, Juan Luis Torres, Jordi García, Raúl García, David Garrido, Daniel Cortada, Jordi Valero, Mateo Computational Science – ICCS 2020 Article This paper presents a study showing the benefits of the EfficientNet models compared with heavier Convolutional Neural Networks (CNNs) in the Document Classification task, essential problem in the digitalization process of institutions. We show in the RVL-CDIP dataset that we can improve previous results with a much lighter model and present its transfer learning capabilities on a smaller in-domain dataset such as Tobacco3482. Moreover, we present an ensemble pipeline which is able to boost solely image input by combining image model predictions with the ones generated by BERT model on extracted text by OCR. We also show that the batch size can be effectively increased without hindering its accuracy so that the training process can be sped up by parallelizing throughout multiple GPUs, decreasing the computational time needed. Lastly, we expose the training performance differences between PyTorch and Tensorflow Deep Learning frameworks. 2020-06-15 /pmc/articles/PMC7302855/ http://dx.doi.org/10.1007/978-3-030-50417-5_29 Text en © Springer Nature Switzerland AG 2020 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Article
Ferrando, Javier
Domínguez, Juan Luis
Torres, Jordi
García, Raúl
García, David
Garrido, Daniel
Cortada, Jordi
Valero, Mateo
Improving Accuracy and Speeding Up Document Image Classification Through Parallel Systems
title Improving Accuracy and Speeding Up Document Image Classification Through Parallel Systems
title_full Improving Accuracy and Speeding Up Document Image Classification Through Parallel Systems
title_fullStr Improving Accuracy and Speeding Up Document Image Classification Through Parallel Systems
title_full_unstemmed Improving Accuracy and Speeding Up Document Image Classification Through Parallel Systems
title_short Improving Accuracy and Speeding Up Document Image Classification Through Parallel Systems
title_sort improving accuracy and speeding up document image classification through parallel systems
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7302855/
http://dx.doi.org/10.1007/978-3-030-50417-5_29
work_keys_str_mv AT ferrandojavier improvingaccuracyandspeedingupdocumentimageclassificationthroughparallelsystems
AT dominguezjuanluis improvingaccuracyandspeedingupdocumentimageclassificationthroughparallelsystems
AT torresjordi improvingaccuracyandspeedingupdocumentimageclassificationthroughparallelsystems
AT garciaraul improvingaccuracyandspeedingupdocumentimageclassificationthroughparallelsystems
AT garciadavid improvingaccuracyandspeedingupdocumentimageclassificationthroughparallelsystems
AT garridodaniel improvingaccuracyandspeedingupdocumentimageclassificationthroughparallelsystems
AT cortadajordi improvingaccuracyandspeedingupdocumentimageclassificationthroughparallelsystems
AT valeromateo improvingaccuracyandspeedingupdocumentimageclassificationthroughparallelsystems