Cargando…
Efficient algorithm for directed text detection based on rotation decoupled bounding box
A more effective directed text detection algorithm is proposed for the problem of low accuracy in detecting text with multiple sources, dense distribution, large aspect ratio and arbitrary alignment direction in the industrial intelligence process. The algorithm is based on the YOLOv5 model architec...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
PeerJ Inc.
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10280560/ https://www.ncbi.nlm.nih.gov/pubmed/37346620 http://dx.doi.org/10.7717/peerj-cs.1352 |
_version_ | 1785060822501818368 |
---|---|
author | Wei, Songma Lu, Minrui Chen, Bingsan Zhang, Tengjian Zhang, Fujiang Peng, Xiaodong |
author_facet | Wei, Songma Lu, Minrui Chen, Bingsan Zhang, Tengjian Zhang, Fujiang Peng, Xiaodong |
author_sort | Wei, Songma |
collection | PubMed |
description | A more effective directed text detection algorithm is proposed for the problem of low accuracy in detecting text with multiple sources, dense distribution, large aspect ratio and arbitrary alignment direction in the industrial intelligence process. The algorithm is based on the YOLOv5 model architecture, inspired by the idea of DenseNet dense connection, a parallel cross-scale feature fusion method is proposed to overcome the problem of blurring the underlying feature semantic information and deep location information caused by the sequential stacking approach and to improve the multiscale feature information extraction capability. Furthermore, a rotational decoupling border detection module, which decouples the rotational bounding box into horizontal bounding box during positive sample matching, is provided, overcoming the angular instability in the process of matching the rotational bounding box with the horizontal anchor to obtain higher-quality regression samples and improve the precision of directed text detection. The MSRA-TD500 and ICDAR2015 datasets are used to evaluate the method, and results show that the algorithm measured precision and F(1)-score of 89.2% and 88.1% on the MSRA-TD500 dataset, respectively, and accuracy and F(1)-score of 90.6% and 89.3% on the ICDAR2015 dataset, respectively. The proposed algorithm has better competitive ability than the SOTA text detection algorithm. |
format | Online Article Text |
id | pubmed-10280560 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | PeerJ Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-102805602023-06-21 Efficient algorithm for directed text detection based on rotation decoupled bounding box Wei, Songma Lu, Minrui Chen, Bingsan Zhang, Tengjian Zhang, Fujiang Peng, Xiaodong PeerJ Comput Sci Artificial Intelligence A more effective directed text detection algorithm is proposed for the problem of low accuracy in detecting text with multiple sources, dense distribution, large aspect ratio and arbitrary alignment direction in the industrial intelligence process. The algorithm is based on the YOLOv5 model architecture, inspired by the idea of DenseNet dense connection, a parallel cross-scale feature fusion method is proposed to overcome the problem of blurring the underlying feature semantic information and deep location information caused by the sequential stacking approach and to improve the multiscale feature information extraction capability. Furthermore, a rotational decoupling border detection module, which decouples the rotational bounding box into horizontal bounding box during positive sample matching, is provided, overcoming the angular instability in the process of matching the rotational bounding box with the horizontal anchor to obtain higher-quality regression samples and improve the precision of directed text detection. The MSRA-TD500 and ICDAR2015 datasets are used to evaluate the method, and results show that the algorithm measured precision and F(1)-score of 89.2% and 88.1% on the MSRA-TD500 dataset, respectively, and accuracy and F(1)-score of 90.6% and 89.3% on the ICDAR2015 dataset, respectively. The proposed algorithm has better competitive ability than the SOTA text detection algorithm. PeerJ Inc. 2023-05-09 /pmc/articles/PMC10280560/ /pubmed/37346620 http://dx.doi.org/10.7717/peerj-cs.1352 Text en ©2023 Wei et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited. |
spellingShingle | Artificial Intelligence Wei, Songma Lu, Minrui Chen, Bingsan Zhang, Tengjian Zhang, Fujiang Peng, Xiaodong Efficient algorithm for directed text detection based on rotation decoupled bounding box |
title | Efficient algorithm for directed text detection based on rotation decoupled bounding box |
title_full | Efficient algorithm for directed text detection based on rotation decoupled bounding box |
title_fullStr | Efficient algorithm for directed text detection based on rotation decoupled bounding box |
title_full_unstemmed | Efficient algorithm for directed text detection based on rotation decoupled bounding box |
title_short | Efficient algorithm for directed text detection based on rotation decoupled bounding box |
title_sort | efficient algorithm for directed text detection based on rotation decoupled bounding box |
topic | Artificial Intelligence |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10280560/ https://www.ncbi.nlm.nih.gov/pubmed/37346620 http://dx.doi.org/10.7717/peerj-cs.1352 |
work_keys_str_mv | AT weisongma efficientalgorithmfordirectedtextdetectionbasedonrotationdecoupledboundingbox AT luminrui efficientalgorithmfordirectedtextdetectionbasedonrotationdecoupledboundingbox AT chenbingsan efficientalgorithmfordirectedtextdetectionbasedonrotationdecoupledboundingbox AT zhangtengjian efficientalgorithmfordirectedtextdetectionbasedonrotationdecoupledboundingbox AT zhangfujiang efficientalgorithmfordirectedtextdetectionbasedonrotationdecoupledboundingbox AT pengxiaodong efficientalgorithmfordirectedtextdetectionbasedonrotationdecoupledboundingbox |