A fine-tuned YOLOv5 deep learning approach for real-time house number detection
Detection of small objects in natural scene images is a complicated problem due to the blur and depth found in the images. Detecting house numbers from the natural scene images in real-time is a computer vision problem. On the other hand, convolutional neural network (CNN) based deep learning method...
Main authors: | Taşyürek, Murat; Öztürk, Celal |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | PeerJ Inc. 2023 |
Subjects: | Computer Vision |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10403189/ https://www.ncbi.nlm.nih.gov/pubmed/37547390 http://dx.doi.org/10.7717/peerj-cs.1453 |
_version_ | 1785085014260580352 |
---|---|
author | Taşyürek, Murat Öztürk, Celal |
author_facet | Taşyürek, Murat Öztürk, Celal |
author_sort | Taşyürek, Murat |
collection | PubMed |
description | Detection of small objects in natural scene images is a complicated problem due to the blur and depth found in the images. Detecting house numbers from natural scene images in real-time is a computer vision problem. On the other hand, convolutional neural network (CNN) based deep learning methods have been widely used in object detection in recent years. In this study, firstly, a classical CNN-based approach is used to detect house numbers with locations from natural images in real-time. Faster R-CNN, MobileNet, YOLOv4, YOLOv5 and YOLOv7, which are among the commonly used CNN models, were applied. However, satisfactory results could not be obtained due to the small size and variable depth of the door plate objects. A new approach using the fine-tuning technique is proposed to improve the performance of CNN-based deep learning models. Experimental evaluations were made on real data from Kayseri province. Classic Faster R-CNN, MobileNet, YOLOv4, YOLOv5 and YOLOv7 methods yield f1 scores of 0.763, 0.677, 0.880, 0.943 and 0.842, respectively. The proposed fine-tuned Faster R-CNN, MobileNet, YOLOv4, YOLOv5, and YOLOv7 approaches achieved f1 scores of 0.845, 0.775, 0.932, 0.972 and 0.889, respectively. With the proposed fine-tuned approach, the f1 score of every model increased. Regarding the run time of the methods, classic Faster R-CNN detects objects in 0.603 seconds, while fine-tuned Faster R-CNN detects them in 0.633 seconds. Classic MobileNet detects objects in 0.046 seconds, while fine-tuned MobileNet detects them in 0.048 seconds. Classic YOLOv4 and fine-tuned YOLOv4 detect objects in 0.235 and 0.240 seconds, respectively. Classic YOLOv5 and fine-tuned YOLOv5 both detect objects in 0.015 seconds, and classic YOLOv7 and fine-tuned YOLOv7 both detect objects in 0.009 seconds. While the YOLOv7 model was the fastest running model with an average running time of 0.009 seconds, the proposed fine-tuned YOLOv5 approach achieved the highest performance with an f1 score of 0.972. |
format | Online Article Text |
id | pubmed-10403189 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | PeerJ Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-10403189 2023-08-05 A fine-tuned YOLOv5 deep learning approach for real-time house number detection Taşyürek, Murat Öztürk, Celal PeerJ Comput Sci Computer Vision Detection of small objects in natural scene images is a complicated problem due to the blur and depth found in the images. Detecting house numbers from natural scene images in real-time is a computer vision problem. On the other hand, convolutional neural network (CNN) based deep learning methods have been widely used in object detection in recent years. In this study, firstly, a classical CNN-based approach is used to detect house numbers with locations from natural images in real-time. Faster R-CNN, MobileNet, YOLOv4, YOLOv5 and YOLOv7, which are among the commonly used CNN models, were applied. However, satisfactory results could not be obtained due to the small size and variable depth of the door plate objects. A new approach using the fine-tuning technique is proposed to improve the performance of CNN-based deep learning models. Experimental evaluations were made on real data from Kayseri province. Classic Faster R-CNN, MobileNet, YOLOv4, YOLOv5 and YOLOv7 methods yield f1 scores of 0.763, 0.677, 0.880, 0.943 and 0.842, respectively. The proposed fine-tuned Faster R-CNN, MobileNet, YOLOv4, YOLOv5, and YOLOv7 approaches achieved f1 scores of 0.845, 0.775, 0.932, 0.972 and 0.889, respectively. With the proposed fine-tuned approach, the f1 score of every model increased. Regarding the run time of the methods, classic Faster R-CNN detects objects in 0.603 seconds, while fine-tuned Faster R-CNN detects them in 0.633 seconds. Classic MobileNet detects objects in 0.046 seconds, while fine-tuned MobileNet detects them in 0.048 seconds. Classic YOLOv4 and fine-tuned YOLOv4 detect objects in 0.235 and 0.240 seconds, respectively. Classic YOLOv5 and fine-tuned YOLOv5 both detect objects in 0.015 seconds, and classic YOLOv7 and fine-tuned YOLOv7 both detect objects in 0.009 seconds. While the YOLOv7 model was the fastest running model with an average running time of 0.009 seconds, the proposed fine-tuned YOLOv5 approach achieved the highest performance with an f1 score of 0.972. PeerJ Inc. 2023-07-03 /pmc/articles/PMC10403189/ /pubmed/37547390 http://dx.doi.org/10.7717/peerj-cs.1453 Text en ©2023 Taşyürek and Öztürk https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ Computer Science) and either DOI or URL of the article must be cited. |
spellingShingle | Computer Vision Taşyürek, Murat Öztürk, Celal A fine-tuned YOLOv5 deep learning approach for real-time house number detection |
title | A fine-tuned YOLOv5 deep learning approach for real-time house number detection |
title_full | A fine-tuned YOLOv5 deep learning approach for real-time house number detection |
title_fullStr | A fine-tuned YOLOv5 deep learning approach for real-time house number detection |
title_full_unstemmed | A fine-tuned YOLOv5 deep learning approach for real-time house number detection |
title_short | A fine-tuned YOLOv5 deep learning approach for real-time house number detection |
title_sort | fine-tuned yolov5 deep learning approach for real-time house number detection |
topic | Computer Vision |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10403189/ https://www.ncbi.nlm.nih.gov/pubmed/37547390 http://dx.doi.org/10.7717/peerj-cs.1453 |
work_keys_str_mv | AT tasyurekmurat afinetunedyolov5deeplearningapproachforrealtimehousenumberdetection AT ozturkcelal afinetunedyolov5deeplearningapproachforrealtimehousenumberdetection AT tasyurekmurat finetunedyolov5deeplearningapproachforrealtimehousenumberdetection AT ozturkcelal finetunedyolov5deeplearningapproachforrealtimehousenumberdetection |
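
The figures quoted in the description field can be compared side by side with a short Python sketch. The f1 scores and run times below are copied from the abstract. Everything else is an illustrative assumption that does not come from the article: the names `REPORTED`, `f1_score` and `summarize`, and the reading of each run time as seconds per image (which is what the derived images-per-second figure relies on). The `f1_score` helper uses the standard definition, the harmonic mean of precision and recall. The record does not report precision or recall, so the helper is included only to show what the metric measures.

```python
# Values copied from the abstract, per model:
# (classic f1, fine-tuned f1, classic seconds, fine-tuned seconds)
REPORTED = {
    "Faster R-CNN": (0.763, 0.845, 0.603, 0.633),
    "MobileNet": (0.677, 0.775, 0.046, 0.048),
    "YOLOv4": (0.880, 0.932, 0.235, 0.240),
    "YOLOv5": (0.943, 0.972, 0.015, 0.015),
    "YOLOv7": (0.842, 0.889, 0.009, 0.009),
}


def f1_score(precision: float, recall: float) -> float:
    """Standard f1 definition: harmonic mean of precision and recall."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


def summarize(reported: dict) -> list:
    """Derive the f1 gain and an approximate throughput for each model.

    Assumption: each run time is the average time per image, so 1 / time
    approximates images per second for the fine-tuned model.
    """
    rows = []
    for model, (f1_classic, f1_tuned, _t_classic, t_tuned) in reported.items():
        rows.append({
            "model": model,
            "f1_gain": round(f1_tuned - f1_classic, 3),
            "images_per_second": round(1.0 / t_tuned, 1),
        })
    return rows


# Model with the highest fine-tuned f1 score, and the fastest fine-tuned model.
best_f1 = max(REPORTED, key=lambda m: REPORTED[m][1])
fastest = min(REPORTED, key=lambda m: REPORTED[m][3])

if __name__ == "__main__":
    for row in summarize(REPORTED):
        print(row)
    print("highest fine-tuned f1:", best_f1)
    print("fastest fine-tuned model:", fastest)
```

Under these assumptions the sketch reproduces the abstract's two headline claims: fine-tuned YOLOv5 has the highest f1 score and YOLOv7 has the shortest run time.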