IRDC-Net: Lightweight Semantic Segmentation Network Based on Monocular Camera for Mobile Robot Navigation

Computer vision plays a significant role in mobile robot navigation due to the wealth of information extracted from digital images. Mobile robots localize and move to the intended destination based on the captured images. Due to the complexity of the environment, obstacle avoidance still requires a...

Bibliographic Details
Main Authors:	Dang, Thai-Viet, Tran, Dinh-Manh-Cuong, Tan, Phan Xuan
Format:	Online Article Text
Language:	English
Published:	MDPI 2023
Subjects:	Article
Online Access:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10422405/ https://www.ncbi.nlm.nih.gov/pubmed/37571691 http://dx.doi.org/10.3390/s23156907

_version_	1785089201657610240
author	Dang, Thai-Viet Tran, Dinh-Manh-Cuong Tan, Phan Xuan
author_sort	Dang, Thai-Viet
collection	PubMed
description	Computer vision plays a significant role in mobile robot navigation due to the wealth of information extracted from digital images. Mobile robots localize and move to the intended destination based on the captured images. Due to the complexity of the environment, obstacle avoidance still requires a complex sensor system with a high computational efficiency requirement. This study offers a real-time solution to the problem of extracting corridor scenes from a single image, using a lightweight semantic segmentation model combined with quantization to reduce the number of training parameters and the computational cost. The proposed model consists of MobileNetV2 as the encoder (with multi-scale fusion) and an FCN as the decoder. This combination allows us to significantly reduce computation time while achieving high precision. Moreover, in this study, we also propose to use the Balance Cross-Entropy loss function to handle diverse datasets, especially those with class imbalances, and to integrate a number of techniques, for example, the Adam optimizer and Gaussian filters, to enhance segmentation performance. The results demonstrate that our model can outperform baselines across different datasets. Moreover, when applied in practical experiments with a real mobile robot, the proposed model's performance remains consistent, supporting optimal path planning and allowing the mobile robot to avoid obstacles efficiently and effectively.
format	Online Article Text
id	pubmed-10422405
institution	National Center for Biotechnology Information
language	English
publishDate	2023
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-10422405 2023-08-13 Sensors (Basel) Article MDPI 2023-08-03 /pmc/articles/PMC10422405/ /pubmed/37571691 http://dx.doi.org/10.3390/s23156907 Text en © 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
title	IRDC-Net: Lightweight Semantic Segmentation Network Based on Monocular Camera for Mobile Robot Navigation
topic	Article
