Cargando…

LEOD-Net: Learning Line-Encoded Bounding Boxes for Real-Time Object Detection

This paper proposes a learnable line encoding technique for bounding boxes commonly used in the object detection task. A bounding box is simply encoded using two main points: the top-left corner and the bottom-right corner of the bounding box; then, a lightweight convolutional neural network (CNN) i...

Descripción completa

Detalles Bibliográficos
Autores principales:	Ibrahem, Hatem, Salem, Ahmed, Kang, Hyun-Soo
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2022
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9144147/ https://www.ncbi.nlm.nih.gov/pubmed/35632108 http://dx.doi.org/10.3390/s22103699

_version_	1784715977879977984
author	Ibrahem, Hatem Salem, Ahmed Kang, Hyun-Soo
author_facet	Ibrahem, Hatem Salem, Ahmed Kang, Hyun-Soo
author_sort	Ibrahem, Hatem
collection	PubMed
description	This paper proposes a learnable line encoding technique for bounding boxes commonly used in the object detection task. A bounding box is simply encoded using two main points: the top-left corner and the bottom-right corner of the bounding box; then, a lightweight convolutional neural network (CNN) is employed to learn the lines and propose high-resolution line masks for each category of classes using a pixel-shuffle operation. Post-processing is applied to the predicted line masks to filtrate them and estimate clear lines based on a progressive probabilistic Hough transform. The proposed method was trained and evaluated on two common object detection benchmarks: Pascal VOC2007 and MS-COCO2017. The proposed model attains high mean average precision (mAP) values (78.8% for VOC2007 and 48.1% for COCO2017) while processing each frame in a few milliseconds (37 ms for PASCAL VOC and 47 ms for COCO). The strength of the proposed method lies in its simplicity and ease of implementation unlike the recent state-of-the-art methods in object detection, which include complex processing pipelines.
format	Online Article Text
id	pubmed-9144147
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-91441472022-05-29 LEOD-Net: Learning Line-Encoded Bounding Boxes for Real-Time Object Detection Ibrahem, Hatem Salem, Ahmed Kang, Hyun-Soo Sensors (Basel) Article This paper proposes a learnable line encoding technique for bounding boxes commonly used in the object detection task. A bounding box is simply encoded using two main points: the top-left corner and the bottom-right corner of the bounding box; then, a lightweight convolutional neural network (CNN) is employed to learn the lines and propose high-resolution line masks for each category of classes using a pixel-shuffle operation. Post-processing is applied to the predicted line masks to filtrate them and estimate clear lines based on a progressive probabilistic Hough transform. The proposed method was trained and evaluated on two common object detection benchmarks: Pascal VOC2007 and MS-COCO2017. The proposed model attains high mean average precision (mAP) values (78.8% for VOC2007 and 48.1% for COCO2017) while processing each frame in a few milliseconds (37 ms for PASCAL VOC and 47 ms for COCO). The strength of the proposed method lies in its simplicity and ease of implementation unlike the recent state-of-the-art methods in object detection, which include complex processing pipelines. MDPI 2022-05-12 /pmc/articles/PMC9144147/ /pubmed/35632108 http://dx.doi.org/10.3390/s22103699 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Ibrahem, Hatem Salem, Ahmed Kang, Hyun-Soo LEOD-Net: Learning Line-Encoded Bounding Boxes for Real-Time Object Detection
title	LEOD-Net: Learning Line-Encoded Bounding Boxes for Real-Time Object Detection
title_full	LEOD-Net: Learning Line-Encoded Bounding Boxes for Real-Time Object Detection
title_fullStr	LEOD-Net: Learning Line-Encoded Bounding Boxes for Real-Time Object Detection
title_full_unstemmed	LEOD-Net: Learning Line-Encoded Bounding Boxes for Real-Time Object Detection
title_short	LEOD-Net: Learning Line-Encoded Bounding Boxes for Real-Time Object Detection
title_sort	leod-net: learning line-encoded bounding boxes for real-time object detection
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9144147/ https://www.ncbi.nlm.nih.gov/pubmed/35632108 http://dx.doi.org/10.3390/s22103699
work_keys_str_mv	AT ibrahemhatem leodnetlearninglineencodedboundingboxesforrealtimeobjectdetection AT salemahmed leodnetlearninglineencodedboundingboxesforrealtimeobjectdetection AT kanghyunsoo leodnetlearninglineencodedboundingboxesforrealtimeobjectdetection

LEOD-Net: Learning Line-Encoded Bounding Boxes for Real-Time Object Detection

Ejemplares similares