Cargando…

DCTable: A Dilated CNN with Optimizing Anchors for Accurate Table Detection

With the widespread use of deep learning in leading systems, it has become the mainstream in the table detection field. Some tables are difficult to detect because of the likely figure layout or the small size. As a solution to the underlined problem, we propose a novel method, called DCTable, to im...

Descripción completa

Detalles Bibliográficos
Autores principales: Kazdar, Takwa, Mseddi, Wided Souidene, Akhloufi, Moulay A., Agrebi, Ala, Jmal, Marwa, Attia, Rabah
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10055874/
https://www.ncbi.nlm.nih.gov/pubmed/36976113
http://dx.doi.org/10.3390/jimaging9030062
_version_ 1785015981106528256
author Kazdar, Takwa
Mseddi, Wided Souidene
Akhloufi, Moulay A.
Agrebi, Ala
Jmal, Marwa
Attia, Rabah
author_facet Kazdar, Takwa
Mseddi, Wided Souidene
Akhloufi, Moulay A.
Agrebi, Ala
Jmal, Marwa
Attia, Rabah
author_sort Kazdar, Takwa
collection PubMed
description With the widespread use of deep learning in leading systems, it has become the mainstream in the table detection field. Some tables are difficult to detect because of the likely figure layout or the small size. As a solution to the underlined problem, we propose a novel method, called DCTable, to improve Faster R-CNN for table detection. DCTable came up to extract more discriminative features using a backbone with dilated convolutions in order to improve the quality of region proposals. Another main contribution of this paper is the anchors optimization using the Intersection over Union (IoU)-balanced loss to train the RPN and reduce the false positive rate. This is followed by a RoI Align layer, instead of the ROI pooling, to improve the accuracy during mapping table proposal candidates by eliminating the coarse misalignment and introducing the bilinear interpolation in mapping region proposal candidates. Training and testing on a public dataset showed the effectiveness of the algorithm and a considerable improvement of the F1-score on ICDAR 2017-Pod, ICDAR-2019, Marmot and RVL CDIP datasets.
format Online
Article
Text
id pubmed-10055874
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-100558742023-03-30 DCTable: A Dilated CNN with Optimizing Anchors for Accurate Table Detection Kazdar, Takwa Mseddi, Wided Souidene Akhloufi, Moulay A. Agrebi, Ala Jmal, Marwa Attia, Rabah J Imaging Article With the widespread use of deep learning in leading systems, it has become the mainstream in the table detection field. Some tables are difficult to detect because of the likely figure layout or the small size. As a solution to the underlined problem, we propose a novel method, called DCTable, to improve Faster R-CNN for table detection. DCTable came up to extract more discriminative features using a backbone with dilated convolutions in order to improve the quality of region proposals. Another main contribution of this paper is the anchors optimization using the Intersection over Union (IoU)-balanced loss to train the RPN and reduce the false positive rate. This is followed by a RoI Align layer, instead of the ROI pooling, to improve the accuracy during mapping table proposal candidates by eliminating the coarse misalignment and introducing the bilinear interpolation in mapping region proposal candidates. Training and testing on a public dataset showed the effectiveness of the algorithm and a considerable improvement of the F1-score on ICDAR 2017-Pod, ICDAR-2019, Marmot and RVL CDIP datasets. MDPI 2023-03-07 /pmc/articles/PMC10055874/ /pubmed/36976113 http://dx.doi.org/10.3390/jimaging9030062 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Kazdar, Takwa
Mseddi, Wided Souidene
Akhloufi, Moulay A.
Agrebi, Ala
Jmal, Marwa
Attia, Rabah
DCTable: A Dilated CNN with Optimizing Anchors for Accurate Table Detection
title DCTable: A Dilated CNN with Optimizing Anchors for Accurate Table Detection
title_full DCTable: A Dilated CNN with Optimizing Anchors for Accurate Table Detection
title_fullStr DCTable: A Dilated CNN with Optimizing Anchors for Accurate Table Detection
title_full_unstemmed DCTable: A Dilated CNN with Optimizing Anchors for Accurate Table Detection
title_short DCTable: A Dilated CNN with Optimizing Anchors for Accurate Table Detection
title_sort dctable: a dilated cnn with optimizing anchors for accurate table detection
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10055874/
https://www.ncbi.nlm.nih.gov/pubmed/36976113
http://dx.doi.org/10.3390/jimaging9030062
work_keys_str_mv AT kazdartakwa dctableadilatedcnnwithoptimizinganchorsforaccuratetabledetection
AT mseddiwidedsouidene dctableadilatedcnnwithoptimizinganchorsforaccuratetabledetection
AT akhloufimoulaya dctableadilatedcnnwithoptimizinganchorsforaccuratetabledetection
AT agrebiala dctableadilatedcnnwithoptimizinganchorsforaccuratetabledetection
AT jmalmarwa dctableadilatedcnnwithoptimizinganchorsforaccuratetabledetection
AT attiarabah dctableadilatedcnnwithoptimizinganchorsforaccuratetabledetection