Cargando…

Efficient algorithm for directed text detection based on rotation decoupled bounding box

A more effective directed text detection algorithm is proposed for the problem of low accuracy in detecting text with multiple sources, dense distribution, large aspect ratio and arbitrary alignment direction in the industrial intelligence process. The algorithm is based on the YOLOv5 model architec...

Descripción completa

Detalles Bibliográficos
Autores principales: Wei, Songma, Lu, Minrui, Chen, Bingsan, Zhang, Tengjian, Zhang, Fujiang, Peng, Xiaodong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10280560/
https://www.ncbi.nlm.nih.gov/pubmed/37346620
http://dx.doi.org/10.7717/peerj-cs.1352
_version_ 1785060822501818368
author Wei, Songma
Lu, Minrui
Chen, Bingsan
Zhang, Tengjian
Zhang, Fujiang
Peng, Xiaodong
author_facet Wei, Songma
Lu, Minrui
Chen, Bingsan
Zhang, Tengjian
Zhang, Fujiang
Peng, Xiaodong
author_sort Wei, Songma
collection PubMed
description A more effective directed text detection algorithm is proposed for the problem of low accuracy in detecting text with multiple sources, dense distribution, large aspect ratio and arbitrary alignment direction in the industrial intelligence process. The algorithm is based on the YOLOv5 model architecture, inspired by the idea of DenseNet dense connection, a parallel cross-scale feature fusion method is proposed to overcome the problem of blurring the underlying feature semantic information and deep location information caused by the sequential stacking approach and to improve the multiscale feature information extraction capability. Furthermore, a rotational decoupling border detection module, which decouples the rotational bounding box into horizontal bounding box during positive sample matching, is provided, overcoming the angular instability in the process of matching the rotational bounding box with the horizontal anchor to obtain higher-quality regression samples and improve the precision of directed text detection. The MSRA-TD500 and ICDAR2015 datasets are used to evaluate the method, and results show that the algorithm measured precision and F(1)-score of 89.2% and 88.1% on the MSRA-TD500 dataset, respectively, and accuracy and F(1)-score of 90.6% and 89.3% on the ICDAR2015 dataset, respectively. The proposed algorithm has better competitive ability than the SOTA text detection algorithm.
format Online
Article
Text
id pubmed-10280560
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-102805602023-06-21 Efficient algorithm for directed text detection based on rotation decoupled bounding box Wei, Songma Lu, Minrui Chen, Bingsan Zhang, Tengjian Zhang, Fujiang Peng, Xiaodong PeerJ Comput Sci Artificial Intelligence A more effective directed text detection algorithm is proposed for the problem of low accuracy in detecting text with multiple sources, dense distribution, large aspect ratio and arbitrary alignment direction in the industrial intelligence process. The algorithm is based on the YOLOv5 model architecture, inspired by the idea of DenseNet dense connection, a parallel cross-scale feature fusion method is proposed to overcome the problem of blurring the underlying feature semantic information and deep location information caused by the sequential stacking approach and to improve the multiscale feature information extraction capability. Furthermore, a rotational decoupling border detection module, which decouples the rotational bounding box into horizontal bounding box during positive sample matching, is provided, overcoming the angular instability in the process of matching the rotational bounding box with the horizontal anchor to obtain higher-quality regression samples and improve the precision of directed text detection. The MSRA-TD500 and ICDAR2015 datasets are used to evaluate the method, and results show that the algorithm measured precision and F(1)-score of 89.2% and 88.1% on the MSRA-TD500 dataset, respectively, and accuracy and F(1)-score of 90.6% and 89.3% on the ICDAR2015 dataset, respectively. The proposed algorithm has better competitive ability than the SOTA text detection algorithm. PeerJ Inc. 2023-05-09 /pmc/articles/PMC10280560/ /pubmed/37346620 http://dx.doi.org/10.7717/peerj-cs.1352 Text en ©2023 Wei et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited.
spellingShingle Artificial Intelligence
Wei, Songma
Lu, Minrui
Chen, Bingsan
Zhang, Tengjian
Zhang, Fujiang
Peng, Xiaodong
Efficient algorithm for directed text detection based on rotation decoupled bounding box
title Efficient algorithm for directed text detection based on rotation decoupled bounding box
title_full Efficient algorithm for directed text detection based on rotation decoupled bounding box
title_fullStr Efficient algorithm for directed text detection based on rotation decoupled bounding box
title_full_unstemmed Efficient algorithm for directed text detection based on rotation decoupled bounding box
title_short Efficient algorithm for directed text detection based on rotation decoupled bounding box
title_sort efficient algorithm for directed text detection based on rotation decoupled bounding box
topic Artificial Intelligence
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10280560/
https://www.ncbi.nlm.nih.gov/pubmed/37346620
http://dx.doi.org/10.7717/peerj-cs.1352
work_keys_str_mv AT weisongma efficientalgorithmfordirectedtextdetectionbasedonrotationdecoupledboundingbox
AT luminrui efficientalgorithmfordirectedtextdetectionbasedonrotationdecoupledboundingbox
AT chenbingsan efficientalgorithmfordirectedtextdetectionbasedonrotationdecoupledboundingbox
AT zhangtengjian efficientalgorithmfordirectedtextdetectionbasedonrotationdecoupledboundingbox
AT zhangfujiang efficientalgorithmfordirectedtextdetectionbasedonrotationdecoupledboundingbox
AT pengxiaodong efficientalgorithmfordirectedtextdetectionbasedonrotationdecoupledboundingbox