Cargando…

SEMPANet: A Modified Path Aggregation Network with Squeeze-Excitation for Scene Text Detection

Recently, various object detection frameworks have been applied to text detection tasks and have achieved good performance in the final detection. With the further expansion of text detection application scenarios, the research value of text detection topics has gradually increased. Text detection i...

Descripción completa

Detalles Bibliográficos
Autores principales:	Li, Shuangshuang, Cao, Wenming
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2021
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8069456/ https://www.ncbi.nlm.nih.gov/pubmed/33918964 http://dx.doi.org/10.3390/s21082657

_version_	1783683241211854848
author	Li, Shuangshuang Cao, Wenming
author_facet	Li, Shuangshuang Cao, Wenming
author_sort	Li, Shuangshuang
collection	PubMed
description	Recently, various object detection frameworks have been applied to text detection tasks and have achieved good performance in the final detection. With the further expansion of text detection application scenarios, the research value of text detection topics has gradually increased. Text detection in natural scenes is more challenging for horizontal text based on a quadrilateral detection box and for curved text of any shape. Most networks have a good effect on the balancing of target samples in text detection, but it is challenging to deal with small targets and solve extremely unbalanced data. We continued to use PSENet to deal with such problems in this work. On the other hand, we studied the problem that most of the existing scene text detection methods use ResNet and FPN as the backbone of feature extraction, and improved the ResNet and FPN network parts of PSENet to make it more conducive to the combination of feature extraction in the early stage. A SEMPANet framework without an anchor and in one stage is proposed to implement a lightweight model, which is embodied in the training time of about 24 h. Finally, we selected the two most representative datasets for oriented text and curved text to conduct experiments. On ICDAR2015, the improved network’s latest results further verify its effectiveness; it reached 1.01% in F-measure compared with PSENet-1s. On CTW1500, the improved network performed better than the original network on average.
format	Online Article Text
id	pubmed-8069456
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-80694562021-04-26 SEMPANet: A Modified Path Aggregation Network with Squeeze-Excitation for Scene Text Detection Li, Shuangshuang Cao, Wenming Sensors (Basel) Article Recently, various object detection frameworks have been applied to text detection tasks and have achieved good performance in the final detection. With the further expansion of text detection application scenarios, the research value of text detection topics has gradually increased. Text detection in natural scenes is more challenging for horizontal text based on a quadrilateral detection box and for curved text of any shape. Most networks have a good effect on the balancing of target samples in text detection, but it is challenging to deal with small targets and solve extremely unbalanced data. We continued to use PSENet to deal with such problems in this work. On the other hand, we studied the problem that most of the existing scene text detection methods use ResNet and FPN as the backbone of feature extraction, and improved the ResNet and FPN network parts of PSENet to make it more conducive to the combination of feature extraction in the early stage. A SEMPANet framework without an anchor and in one stage is proposed to implement a lightweight model, which is embodied in the training time of about 24 h. Finally, we selected the two most representative datasets for oriented text and curved text to conduct experiments. On ICDAR2015, the improved network’s latest results further verify its effectiveness; it reached 1.01% in F-measure compared with PSENet-1s. On CTW1500, the improved network performed better than the original network on average. MDPI 2021-04-09 /pmc/articles/PMC8069456/ /pubmed/33918964 http://dx.doi.org/10.3390/s21082657 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Li, Shuangshuang Cao, Wenming SEMPANet: A Modified Path Aggregation Network with Squeeze-Excitation for Scene Text Detection
title	SEMPANet: A Modified Path Aggregation Network with Squeeze-Excitation for Scene Text Detection
title_full	SEMPANet: A Modified Path Aggregation Network with Squeeze-Excitation for Scene Text Detection
title_fullStr	SEMPANet: A Modified Path Aggregation Network with Squeeze-Excitation for Scene Text Detection
title_full_unstemmed	SEMPANet: A Modified Path Aggregation Network with Squeeze-Excitation for Scene Text Detection
title_short	SEMPANet: A Modified Path Aggregation Network with Squeeze-Excitation for Scene Text Detection
title_sort	sempanet: a modified path aggregation network with squeeze-excitation for scene text detection
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8069456/ https://www.ncbi.nlm.nih.gov/pubmed/33918964 http://dx.doi.org/10.3390/s21082657
work_keys_str_mv	AT lishuangshuang sempanetamodifiedpathaggregationnetworkwithsqueezeexcitationforscenetextdetection AT caowenming sempanetamodifiedpathaggregationnetworkwithsqueezeexcitationforscenetextdetection

SEMPANet: A Modified Path Aggregation Network with Squeeze-Excitation for Scene Text Detection

Ejemplares similares