Cargando…
DCTable: A Dilated CNN with Optimizing Anchors for Accurate Table Detection
With the widespread use of deep learning in leading systems, it has become the mainstream in the table detection field. Some tables are difficult to detect because of the likely figure layout or the small size. As a solution to the underlined problem, we propose a novel method, called DCTable, to im...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10055874/ https://www.ncbi.nlm.nih.gov/pubmed/36976113 http://dx.doi.org/10.3390/jimaging9030062 |
_version_ | 1785015981106528256 |
---|---|
author | Kazdar, Takwa Mseddi, Wided Souidene Akhloufi, Moulay A. Agrebi, Ala Jmal, Marwa Attia, Rabah |
author_facet | Kazdar, Takwa Mseddi, Wided Souidene Akhloufi, Moulay A. Agrebi, Ala Jmal, Marwa Attia, Rabah |
author_sort | Kazdar, Takwa |
collection | PubMed |
description | With the widespread use of deep learning in leading systems, it has become the mainstream in the table detection field. Some tables are difficult to detect because of the likely figure layout or the small size. As a solution to the underlined problem, we propose a novel method, called DCTable, to improve Faster R-CNN for table detection. DCTable came up to extract more discriminative features using a backbone with dilated convolutions in order to improve the quality of region proposals. Another main contribution of this paper is the anchors optimization using the Intersection over Union (IoU)-balanced loss to train the RPN and reduce the false positive rate. This is followed by a RoI Align layer, instead of the ROI pooling, to improve the accuracy during mapping table proposal candidates by eliminating the coarse misalignment and introducing the bilinear interpolation in mapping region proposal candidates. Training and testing on a public dataset showed the effectiveness of the algorithm and a considerable improvement of the F1-score on ICDAR 2017-Pod, ICDAR-2019, Marmot and RVL CDIP datasets. |
format | Online Article Text |
id | pubmed-10055874 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-100558742023-03-30 DCTable: A Dilated CNN with Optimizing Anchors for Accurate Table Detection Kazdar, Takwa Mseddi, Wided Souidene Akhloufi, Moulay A. Agrebi, Ala Jmal, Marwa Attia, Rabah J Imaging Article With the widespread use of deep learning in leading systems, it has become the mainstream in the table detection field. Some tables are difficult to detect because of the likely figure layout or the small size. As a solution to the underlined problem, we propose a novel method, called DCTable, to improve Faster R-CNN for table detection. DCTable came up to extract more discriminative features using a backbone with dilated convolutions in order to improve the quality of region proposals. Another main contribution of this paper is the anchors optimization using the Intersection over Union (IoU)-balanced loss to train the RPN and reduce the false positive rate. This is followed by a RoI Align layer, instead of the ROI pooling, to improve the accuracy during mapping table proposal candidates by eliminating the coarse misalignment and introducing the bilinear interpolation in mapping region proposal candidates. Training and testing on a public dataset showed the effectiveness of the algorithm and a considerable improvement of the F1-score on ICDAR 2017-Pod, ICDAR-2019, Marmot and RVL CDIP datasets. MDPI 2023-03-07 /pmc/articles/PMC10055874/ /pubmed/36976113 http://dx.doi.org/10.3390/jimaging9030062 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Kazdar, Takwa Mseddi, Wided Souidene Akhloufi, Moulay A. Agrebi, Ala Jmal, Marwa Attia, Rabah DCTable: A Dilated CNN with Optimizing Anchors for Accurate Table Detection |
title | DCTable: A Dilated CNN with Optimizing Anchors for Accurate Table Detection |
title_full | DCTable: A Dilated CNN with Optimizing Anchors for Accurate Table Detection |
title_fullStr | DCTable: A Dilated CNN with Optimizing Anchors for Accurate Table Detection |
title_full_unstemmed | DCTable: A Dilated CNN with Optimizing Anchors for Accurate Table Detection |
title_short | DCTable: A Dilated CNN with Optimizing Anchors for Accurate Table Detection |
title_sort | dctable: a dilated cnn with optimizing anchors for accurate table detection |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10055874/ https://www.ncbi.nlm.nih.gov/pubmed/36976113 http://dx.doi.org/10.3390/jimaging9030062 |
work_keys_str_mv | AT kazdartakwa dctableadilatedcnnwithoptimizinganchorsforaccuratetabledetection AT mseddiwidedsouidene dctableadilatedcnnwithoptimizinganchorsforaccuratetabledetection AT akhloufimoulaya dctableadilatedcnnwithoptimizinganchorsforaccuratetabledetection AT agrebiala dctableadilatedcnnwithoptimizinganchorsforaccuratetabledetection AT jmalmarwa dctableadilatedcnnwithoptimizinganchorsforaccuratetabledetection AT attiarabah dctableadilatedcnnwithoptimizinganchorsforaccuratetabledetection |