Cargando…

Interactive Attention Learning on Detection of Lane and Lane Marking on the Road by Monocular Camera Image

Vision-based identification of lane area and lane marking on the road is an indispensable function for intelligent driving vehicles, especially for localization, mapping and planning tasks. However, due to the increasing complexity of traffic scenes, such as occlusion and discontinuity, detecting la...

Descripción completa

Detalles Bibliográficos
Autores principales: Tian, Wei, Yu, Xianwang, Hu, Haohao
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10386617/
https://www.ncbi.nlm.nih.gov/pubmed/37514839
http://dx.doi.org/10.3390/s23146545
_version_ 1785081712051486720
author Tian, Wei
Yu, Xianwang
Hu, Haohao
author_facet Tian, Wei
Yu, Xianwang
Hu, Haohao
author_sort Tian, Wei
collection PubMed
description Vision-based identification of lane area and lane marking on the road is an indispensable function for intelligent driving vehicles, especially for localization, mapping and planning tasks. However, due to the increasing complexity of traffic scenes, such as occlusion and discontinuity, detecting lanes and lane markings from an image captured by a monocular camera becomes persistently challenging. The lanes and lane markings have a strong position correlation and are constrained by a spatial geometry prior to the driving scene. Most existing studies only explore a single task, i.e., either lane marking or lane detection, and do not consider the inherent connection or exploit the modeling of this kind of relationship between both elements to improve the detection performance of both tasks. In this paper, we establish a novel multi-task encoder–decoder framework for the simultaneous detection of lanes and lane markings. This approach deploys a dual-branch architecture to extract image information from different scales. By revealing the spatial constraints between lanes and lane markings, we propose an interactive attention learning for their feature information, which involves a Deformable Feature Fusion module for feature encoding, a Cross-Context module as information decoder, a Cross-IoU loss and a Focal-style loss weighting for robust training. Without bells and whistles, our method achieves state-of-the-art results on tasks of lane marking detection (with 32.53% on IoU, 81.61% on accuracy) and lane segmentation (with 91.72% on mIoU) of the BDD100K dataset, which showcases an improvement of 6.33% on IoU, 11.11% on accuracy in lane marking detection and 0.22% on mIoU in lane detection compared to the previous methods.
format Online
Article
Text
id pubmed-10386617
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-103866172023-07-30 Interactive Attention Learning on Detection of Lane and Lane Marking on the Road by Monocular Camera Image Tian, Wei Yu, Xianwang Hu, Haohao Sensors (Basel) Article Vision-based identification of lane area and lane marking on the road is an indispensable function for intelligent driving vehicles, especially for localization, mapping and planning tasks. However, due to the increasing complexity of traffic scenes, such as occlusion and discontinuity, detecting lanes and lane markings from an image captured by a monocular camera becomes persistently challenging. The lanes and lane markings have a strong position correlation and are constrained by a spatial geometry prior to the driving scene. Most existing studies only explore a single task, i.e., either lane marking or lane detection, and do not consider the inherent connection or exploit the modeling of this kind of relationship between both elements to improve the detection performance of both tasks. In this paper, we establish a novel multi-task encoder–decoder framework for the simultaneous detection of lanes and lane markings. This approach deploys a dual-branch architecture to extract image information from different scales. By revealing the spatial constraints between lanes and lane markings, we propose an interactive attention learning for their feature information, which involves a Deformable Feature Fusion module for feature encoding, a Cross-Context module as information decoder, a Cross-IoU loss and a Focal-style loss weighting for robust training. Without bells and whistles, our method achieves state-of-the-art results on tasks of lane marking detection (with 32.53% on IoU, 81.61% on accuracy) and lane segmentation (with 91.72% on mIoU) of the BDD100K dataset, which showcases an improvement of 6.33% on IoU, 11.11% on accuracy in lane marking detection and 0.22% on mIoU in lane detection compared to the previous methods. MDPI 2023-07-20 /pmc/articles/PMC10386617/ /pubmed/37514839 http://dx.doi.org/10.3390/s23146545 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Tian, Wei
Yu, Xianwang
Hu, Haohao
Interactive Attention Learning on Detection of Lane and Lane Marking on the Road by Monocular Camera Image
title Interactive Attention Learning on Detection of Lane and Lane Marking on the Road by Monocular Camera Image
title_full Interactive Attention Learning on Detection of Lane and Lane Marking on the Road by Monocular Camera Image
title_fullStr Interactive Attention Learning on Detection of Lane and Lane Marking on the Road by Monocular Camera Image
title_full_unstemmed Interactive Attention Learning on Detection of Lane and Lane Marking on the Road by Monocular Camera Image
title_short Interactive Attention Learning on Detection of Lane and Lane Marking on the Road by Monocular Camera Image
title_sort interactive attention learning on detection of lane and lane marking on the road by monocular camera image
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10386617/
https://www.ncbi.nlm.nih.gov/pubmed/37514839
http://dx.doi.org/10.3390/s23146545
work_keys_str_mv AT tianwei interactiveattentionlearningondetectionoflaneandlanemarkingontheroadbymonocularcameraimage
AT yuxianwang interactiveattentionlearningondetectionoflaneandlanemarkingontheroadbymonocularcameraimage
AT huhaohao interactiveattentionlearningondetectionoflaneandlanemarkingontheroadbymonocularcameraimage