Cargando…
Multi-Directional Scene Text Detection Based on Improved YOLOv3
To address the problem of low detection rate caused by the close alignment and multi-directional position of text words in practical application and the need to improve the detection speed of the algorithm, this paper proposes a multi-directional text detection algorithm based on improved YOLOv3, an...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8309843/ https://www.ncbi.nlm.nih.gov/pubmed/34300607 http://dx.doi.org/10.3390/s21144870 |
_version_ | 1783728618541678592 |
---|---|
author | Xiao, Liyun Zhou, Peng Xu, Ke Zhao, Xiaofang |
author_facet | Xiao, Liyun Zhou, Peng Xu, Ke Zhao, Xiaofang |
author_sort | Xiao, Liyun |
collection | PubMed |
description | To address the problem of low detection rate caused by the close alignment and multi-directional position of text words in practical application and the need to improve the detection speed of the algorithm, this paper proposes a multi-directional text detection algorithm based on improved YOLOv3, and applies it to natural text detection. To detect text in multiple directions, this paper introduces a method of box definition based on sliding vertices. Then, a new rotating box loss function MD-Closs based on CIOU is proposed to improve the detection accuracy. In addition, a step-by-step NMS method is used to further reduce the amount of calculation. Experimental results show that on the ICDAR 2015 data set, the accuracy rate is 86.2%, the recall rate is 81.9%, and the timeliness is 21.3 fps, which shows that the proposed algorithm has a good detection effect on text detection in natural scenes. |
format | Online Article Text |
id | pubmed-8309843 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-83098432021-07-25 Multi-Directional Scene Text Detection Based on Improved YOLOv3 Xiao, Liyun Zhou, Peng Xu, Ke Zhao, Xiaofang Sensors (Basel) Communication To address the problem of low detection rate caused by the close alignment and multi-directional position of text words in practical application and the need to improve the detection speed of the algorithm, this paper proposes a multi-directional text detection algorithm based on improved YOLOv3, and applies it to natural text detection. To detect text in multiple directions, this paper introduces a method of box definition based on sliding vertices. Then, a new rotating box loss function MD-Closs based on CIOU is proposed to improve the detection accuracy. In addition, a step-by-step NMS method is used to further reduce the amount of calculation. Experimental results show that on the ICDAR 2015 data set, the accuracy rate is 86.2%, the recall rate is 81.9%, and the timeliness is 21.3 fps, which shows that the proposed algorithm has a good detection effect on text detection in natural scenes. MDPI 2021-07-16 /pmc/articles/PMC8309843/ /pubmed/34300607 http://dx.doi.org/10.3390/s21144870 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Communication Xiao, Liyun Zhou, Peng Xu, Ke Zhao, Xiaofang Multi-Directional Scene Text Detection Based on Improved YOLOv3 |
title | Multi-Directional Scene Text Detection Based on Improved YOLOv3 |
title_full | Multi-Directional Scene Text Detection Based on Improved YOLOv3 |
title_fullStr | Multi-Directional Scene Text Detection Based on Improved YOLOv3 |
title_full_unstemmed | Multi-Directional Scene Text Detection Based on Improved YOLOv3 |
title_short | Multi-Directional Scene Text Detection Based on Improved YOLOv3 |
title_sort | multi-directional scene text detection based on improved yolov3 |
topic | Communication |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8309843/ https://www.ncbi.nlm.nih.gov/pubmed/34300607 http://dx.doi.org/10.3390/s21144870 |
work_keys_str_mv | AT xiaoliyun multidirectionalscenetextdetectionbasedonimprovedyolov3 AT zhoupeng multidirectionalscenetextdetectionbasedonimprovedyolov3 AT xuke multidirectionalscenetextdetectionbasedonimprovedyolov3 AT zhaoxiaofang multidirectionalscenetextdetectionbasedonimprovedyolov3 |