Cargando…

Object detection using YOLO: challenges, architectural successors, datasets and applications

Object detection is one of the predominant and challenging problems in computer vision. Over the decade, with the expeditious evolution of deep learning, researchers have extensively experimented and contributed in the performance enhancement of object detection and related tasks such as object clas...

Descripción completa

Detalles Bibliográficos
Autores principales:	Diwan, Tausif, Anirudh, G., Tembhurne, Jitendra V.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Springer US 2022
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9358372/ https://www.ncbi.nlm.nih.gov/pubmed/35968414 http://dx.doi.org/10.1007/s11042-022-13644-y

_version_	1784763916454199296
author	Diwan, Tausif Anirudh, G. Tembhurne, Jitendra V.
author_facet	Diwan, Tausif Anirudh, G. Tembhurne, Jitendra V.
author_sort	Diwan, Tausif
collection	PubMed
description	Object detection is one of the predominant and challenging problems in computer vision. Over the decade, with the expeditious evolution of deep learning, researchers have extensively experimented and contributed in the performance enhancement of object detection and related tasks such as object classification, localization, and segmentation using underlying deep models. Broadly, object detectors are classified into two categories viz. two stage and single stage object detectors. Two stage detectors mainly focus on selective region proposals strategy via complex architecture; however, single stage detectors focus on all the spatial region proposals for the possible detection of objects via relatively simpler architecture in one shot. Performance of any object detector is evaluated through detection accuracy and inference time. Generally, the detection accuracy of two stage detectors outperforms single stage object detectors. However, the inference time of single stage detectors is better compared to its counterparts. Moreover, with the advent of YOLO (You Only Look Once) and its architectural successors, the detection accuracy is improving significantly and sometime it is better than two stage detectors. YOLOs are adopted in various applications majorly due to their faster inferences rather than considering detection accuracy. As an example, detection accuracies are 63.4 and 70 for YOLO and Fast-RCNN respectively, however, inference time is around 300 times faster in case of YOLO. In this paper, we present a comprehensive review of single stage object detectors specially YOLOs, regression formulation, their architecture advancements, and performance statistics. Moreover, we summarize the comparative illustration between two stage and single stage object detectors, among different versions of YOLOs, applications based on two stage detectors, and different versions of YOLOs along with the future research directions.
format	Online Article Text
id	pubmed-9358372
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	Springer US
record_format	MEDLINE/PubMed
spelling	pubmed-93583722022-08-09 Object detection using YOLO: challenges, architectural successors, datasets and applications Diwan, Tausif Anirudh, G. Tembhurne, Jitendra V. Multimed Tools Appl Article Object detection is one of the predominant and challenging problems in computer vision. Over the decade, with the expeditious evolution of deep learning, researchers have extensively experimented and contributed in the performance enhancement of object detection and related tasks such as object classification, localization, and segmentation using underlying deep models. Broadly, object detectors are classified into two categories viz. two stage and single stage object detectors. Two stage detectors mainly focus on selective region proposals strategy via complex architecture; however, single stage detectors focus on all the spatial region proposals for the possible detection of objects via relatively simpler architecture in one shot. Performance of any object detector is evaluated through detection accuracy and inference time. Generally, the detection accuracy of two stage detectors outperforms single stage object detectors. However, the inference time of single stage detectors is better compared to its counterparts. Moreover, with the advent of YOLO (You Only Look Once) and its architectural successors, the detection accuracy is improving significantly and sometime it is better than two stage detectors. YOLOs are adopted in various applications majorly due to their faster inferences rather than considering detection accuracy. As an example, detection accuracies are 63.4 and 70 for YOLO and Fast-RCNN respectively, however, inference time is around 300 times faster in case of YOLO. In this paper, we present a comprehensive review of single stage object detectors specially YOLOs, regression formulation, their architecture advancements, and performance statistics. Moreover, we summarize the comparative illustration between two stage and single stage object detectors, among different versions of YOLOs, applications based on two stage detectors, and different versions of YOLOs along with the future research directions. Springer US 2022-08-08 2023 /pmc/articles/PMC9358372/ /pubmed/35968414 http://dx.doi.org/10.1007/s11042-022-13644-y Text en © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2022, Springer Nature or its licensor holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law. This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle	Article Diwan, Tausif Anirudh, G. Tembhurne, Jitendra V. Object detection using YOLO: challenges, architectural successors, datasets and applications
title	Object detection using YOLO: challenges, architectural successors, datasets and applications
title_full	Object detection using YOLO: challenges, architectural successors, datasets and applications
title_fullStr	Object detection using YOLO: challenges, architectural successors, datasets and applications
title_full_unstemmed	Object detection using YOLO: challenges, architectural successors, datasets and applications
title_short	Object detection using YOLO: challenges, architectural successors, datasets and applications
title_sort	object detection using yolo: challenges, architectural successors, datasets and applications
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9358372/ https://www.ncbi.nlm.nih.gov/pubmed/35968414 http://dx.doi.org/10.1007/s11042-022-13644-y
work_keys_str_mv	AT diwantausif objectdetectionusingyolochallengesarchitecturalsuccessorsdatasetsandapplications AT anirudhg objectdetectionusingyolochallengesarchitecturalsuccessorsdatasetsandapplications AT tembhurnejitendrav objectdetectionusingyolochallengesarchitecturalsuccessorsdatasetsandapplications

Object detection using YOLO: challenges, architectural successors, datasets and applications

Ejemplares similares