Cargando…

Real-Time Segmentation of Unstructured Environments by Combining Domain Generalization and Attention Mechanisms

This paper presents a focused investigation into real-time segmentation in unstructured environments, a crucial aspect for enabling autonomous navigation in off-road robots. To address this challenge, an improved variant of the DDRNet23-slim model is proposed, which includes a lightweight network ar...

Descripción completa

Detalles Bibliográficos
Autores principales: Lin, Nuanchen, Zhao, Wenfeng, Liang, Shenghao, Zhong, Minyue
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10346488/
https://www.ncbi.nlm.nih.gov/pubmed/37447855
http://dx.doi.org/10.3390/s23136008
_version_ 1785073325442072576
author Lin, Nuanchen
Zhao, Wenfeng
Liang, Shenghao
Zhong, Minyue
author_facet Lin, Nuanchen
Zhao, Wenfeng
Liang, Shenghao
Zhong, Minyue
author_sort Lin, Nuanchen
collection PubMed
description This paper presents a focused investigation into real-time segmentation in unstructured environments, a crucial aspect for enabling autonomous navigation in off-road robots. To address this challenge, an improved variant of the DDRNet23-slim model is proposed, which includes a lightweight network architecture and reclassifies ten different categories, including drivable roads, trees, high vegetation, obstacles, and buildings, based on the RUGD dataset. The model’s design includes the integration of the semantic-aware normalization and semantic-aware whitening (SAN–SAW) module into the main network to improve generalization ability beyond the visible domain. The model’s segmentation accuracy is improved through the fusion of channel attention and spatial attention mechanisms in the low-resolution branch to enhance its ability to capture fine details in complex scenes. Additionally, to tackle the issue of category imbalance in unstructured scene datasets, a rare class sampling strategy (RCS) is employed to mitigate the negative impact of low segmentation accuracy for rare classes on the overall performance of the model. Experimental results demonstrate that the improved model achieves a significant 14% increase mIoU in the invisible domain, indicating its strong generalization ability. With a parameter count of only 5.79M, the model achieves mAcc of 85.21% and mIoU of 77.75%. The model has been successfully deployed on a a Jetson Xavier NX ROS robot and tested in both real and simulated orchard environments. Speed optimization using TensorRT increased the segmentation speed to 30.17 FPS. The proposed model strikes a desirable balance between inference speed and accuracy and has good domain migration ability, making it applicable in various domains such as forestry rescue and intelligent agricultural orchard harvesting.
format Online
Article
Text
id pubmed-10346488
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-103464882023-07-15 Real-Time Segmentation of Unstructured Environments by Combining Domain Generalization and Attention Mechanisms Lin, Nuanchen Zhao, Wenfeng Liang, Shenghao Zhong, Minyue Sensors (Basel) Article This paper presents a focused investigation into real-time segmentation in unstructured environments, a crucial aspect for enabling autonomous navigation in off-road robots. To address this challenge, an improved variant of the DDRNet23-slim model is proposed, which includes a lightweight network architecture and reclassifies ten different categories, including drivable roads, trees, high vegetation, obstacles, and buildings, based on the RUGD dataset. The model’s design includes the integration of the semantic-aware normalization and semantic-aware whitening (SAN–SAW) module into the main network to improve generalization ability beyond the visible domain. The model’s segmentation accuracy is improved through the fusion of channel attention and spatial attention mechanisms in the low-resolution branch to enhance its ability to capture fine details in complex scenes. Additionally, to tackle the issue of category imbalance in unstructured scene datasets, a rare class sampling strategy (RCS) is employed to mitigate the negative impact of low segmentation accuracy for rare classes on the overall performance of the model. Experimental results demonstrate that the improved model achieves a significant 14% increase mIoU in the invisible domain, indicating its strong generalization ability. With a parameter count of only 5.79M, the model achieves mAcc of 85.21% and mIoU of 77.75%. The model has been successfully deployed on a a Jetson Xavier NX ROS robot and tested in both real and simulated orchard environments. Speed optimization using TensorRT increased the segmentation speed to 30.17 FPS. The proposed model strikes a desirable balance between inference speed and accuracy and has good domain migration ability, making it applicable in various domains such as forestry rescue and intelligent agricultural orchard harvesting. MDPI 2023-06-28 /pmc/articles/PMC10346488/ /pubmed/37447855 http://dx.doi.org/10.3390/s23136008 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Lin, Nuanchen
Zhao, Wenfeng
Liang, Shenghao
Zhong, Minyue
Real-Time Segmentation of Unstructured Environments by Combining Domain Generalization and Attention Mechanisms
title Real-Time Segmentation of Unstructured Environments by Combining Domain Generalization and Attention Mechanisms
title_full Real-Time Segmentation of Unstructured Environments by Combining Domain Generalization and Attention Mechanisms
title_fullStr Real-Time Segmentation of Unstructured Environments by Combining Domain Generalization and Attention Mechanisms
title_full_unstemmed Real-Time Segmentation of Unstructured Environments by Combining Domain Generalization and Attention Mechanisms
title_short Real-Time Segmentation of Unstructured Environments by Combining Domain Generalization and Attention Mechanisms
title_sort real-time segmentation of unstructured environments by combining domain generalization and attention mechanisms
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10346488/
https://www.ncbi.nlm.nih.gov/pubmed/37447855
http://dx.doi.org/10.3390/s23136008
work_keys_str_mv AT linnuanchen realtimesegmentationofunstructuredenvironmentsbycombiningdomaingeneralizationandattentionmechanisms
AT zhaowenfeng realtimesegmentationofunstructuredenvironmentsbycombiningdomaingeneralizationandattentionmechanisms
AT liangshenghao realtimesegmentationofunstructuredenvironmentsbycombiningdomaingeneralizationandattentionmechanisms
AT zhongminyue realtimesegmentationofunstructuredenvironmentsbycombiningdomaingeneralizationandattentionmechanisms