Cargando…

Multi-Directional Scene Text Detection Based on Improved YOLOv3

To address the problem of low detection rate caused by the close alignment and multi-directional position of text words in practical application and the need to improve the detection speed of the algorithm, this paper proposes a multi-directional text detection algorithm based on improved YOLOv3, an...

Descripción completa

Detalles Bibliográficos
Autores principales: Xiao, Liyun, Zhou, Peng, Xu, Ke, Zhao, Xiaofang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8309843/
https://www.ncbi.nlm.nih.gov/pubmed/34300607
http://dx.doi.org/10.3390/s21144870
_version_ 1783728618541678592
author Xiao, Liyun
Zhou, Peng
Xu, Ke
Zhao, Xiaofang
author_facet Xiao, Liyun
Zhou, Peng
Xu, Ke
Zhao, Xiaofang
author_sort Xiao, Liyun
collection PubMed
description To address the problem of low detection rate caused by the close alignment and multi-directional position of text words in practical application and the need to improve the detection speed of the algorithm, this paper proposes a multi-directional text detection algorithm based on improved YOLOv3, and applies it to natural text detection. To detect text in multiple directions, this paper introduces a method of box definition based on sliding vertices. Then, a new rotating box loss function MD-Closs based on CIOU is proposed to improve the detection accuracy. In addition, a step-by-step NMS method is used to further reduce the amount of calculation. Experimental results show that on the ICDAR 2015 data set, the accuracy rate is 86.2%, the recall rate is 81.9%, and the timeliness is 21.3 fps, which shows that the proposed algorithm has a good detection effect on text detection in natural scenes.
format Online
Article
Text
id pubmed-8309843
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-83098432021-07-25 Multi-Directional Scene Text Detection Based on Improved YOLOv3 Xiao, Liyun Zhou, Peng Xu, Ke Zhao, Xiaofang Sensors (Basel) Communication To address the problem of low detection rate caused by the close alignment and multi-directional position of text words in practical application and the need to improve the detection speed of the algorithm, this paper proposes a multi-directional text detection algorithm based on improved YOLOv3, and applies it to natural text detection. To detect text in multiple directions, this paper introduces a method of box definition based on sliding vertices. Then, a new rotating box loss function MD-Closs based on CIOU is proposed to improve the detection accuracy. In addition, a step-by-step NMS method is used to further reduce the amount of calculation. Experimental results show that on the ICDAR 2015 data set, the accuracy rate is 86.2%, the recall rate is 81.9%, and the timeliness is 21.3 fps, which shows that the proposed algorithm has a good detection effect on text detection in natural scenes. MDPI 2021-07-16 /pmc/articles/PMC8309843/ /pubmed/34300607 http://dx.doi.org/10.3390/s21144870 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Communication
Xiao, Liyun
Zhou, Peng
Xu, Ke
Zhao, Xiaofang
Multi-Directional Scene Text Detection Based on Improved YOLOv3
title Multi-Directional Scene Text Detection Based on Improved YOLOv3
title_full Multi-Directional Scene Text Detection Based on Improved YOLOv3
title_fullStr Multi-Directional Scene Text Detection Based on Improved YOLOv3
title_full_unstemmed Multi-Directional Scene Text Detection Based on Improved YOLOv3
title_short Multi-Directional Scene Text Detection Based on Improved YOLOv3
title_sort multi-directional scene text detection based on improved yolov3
topic Communication
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8309843/
https://www.ncbi.nlm.nih.gov/pubmed/34300607
http://dx.doi.org/10.3390/s21144870
work_keys_str_mv AT xiaoliyun multidirectionalscenetextdetectionbasedonimprovedyolov3
AT zhoupeng multidirectionalscenetextdetectionbasedonimprovedyolov3
AT xuke multidirectionalscenetextdetectionbasedonimprovedyolov3
AT zhaoxiaofang multidirectionalscenetextdetectionbasedonimprovedyolov3