Cargando…

Deep Learning Tone-Mapping and Demosaicing for Automotive Vision Systems

High dynamic range (HDR) imaging technology is increasingly being used in automated driving systems (ADS) for improving the safety of traffic participants in scenes with strong differences in illumination. Therefore, a combination of HDR video, that is video with details in all illumination regimes,...

Descripción completa

Detalles Bibliográficos
Autores principales: Stojkovic, Ana, Aelterman, Jan, Van Hamme, David, Shopovska, Ivana, Philips, Wilfried
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10611084/
https://www.ncbi.nlm.nih.gov/pubmed/37896600
http://dx.doi.org/10.3390/s23208507
_version_ 1785128408460558336
author Stojkovic, Ana
Aelterman, Jan
Van Hamme, David
Shopovska, Ivana
Philips, Wilfried
author_facet Stojkovic, Ana
Aelterman, Jan
Van Hamme, David
Shopovska, Ivana
Philips, Wilfried
author_sort Stojkovic, Ana
collection PubMed
description High dynamic range (HDR) imaging technology is increasingly being used in automated driving systems (ADS) for improving the safety of traffic participants in scenes with strong differences in illumination. Therefore, a combination of HDR video, that is video with details in all illumination regimes, and (HDR) object perception techniques that can deal with this variety in illumination is highly desirable. Although progress has been made in both HDR imaging solutions and object detection algorithms in the recent years, they have progressed independently of each other. This has led to a situation in which object detection algorithms are typically designed and constantly improved to operate on 8 bit per channel content. This makes these algorithms not ideally suited for use in HDR data processing, which natively encodes to a higher bit-depth (12 bits/16 bits per channel). In this paper, we present and evaluate two novel convolutional neural network (CNN) architectures that intelligently convert high bit depth HDR images into 8-bit images. We attempt to optimize reconstruction quality by focusing on ADS object detection quality. The first research novelty is to jointly perform tone-mapping with demosaicing by additionally successfully suppressing noise and demosaicing artifacts. The first CNN performs tone-mapping with noise suppression on a full-color HDR input, while the second performs joint demosaicing and tone-mapping with noise suppression on a raw HDR input. The focus is to increase the detectability of traffic-related objects in the reconstructed 8-bit content, while ensuring that the realism of the standard dynamic range (SDR) content in diverse conditions is preserved. The second research novelty is that for the first time, to the best of our knowledge, a thorough comparative analysis against the state-of-the-art tone-mapping and demosaicing methods is performed with respect to ADS object detection accuracy on traffic-related content that abounds with diverse challenging (i.e., boundary cases) scenes. The evaluation results show that the two proposed networks have better performance in object detection accuracy and image quality, than both SDR content and content obtained with the state-of-the-art tone-mapping and demosaicing algorithms.
format Online
Article
Text
id pubmed-10611084
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-106110842023-10-28 Deep Learning Tone-Mapping and Demosaicing for Automotive Vision Systems Stojkovic, Ana Aelterman, Jan Van Hamme, David Shopovska, Ivana Philips, Wilfried Sensors (Basel) Article High dynamic range (HDR) imaging technology is increasingly being used in automated driving systems (ADS) for improving the safety of traffic participants in scenes with strong differences in illumination. Therefore, a combination of HDR video, that is video with details in all illumination regimes, and (HDR) object perception techniques that can deal with this variety in illumination is highly desirable. Although progress has been made in both HDR imaging solutions and object detection algorithms in the recent years, they have progressed independently of each other. This has led to a situation in which object detection algorithms are typically designed and constantly improved to operate on 8 bit per channel content. This makes these algorithms not ideally suited for use in HDR data processing, which natively encodes to a higher bit-depth (12 bits/16 bits per channel). In this paper, we present and evaluate two novel convolutional neural network (CNN) architectures that intelligently convert high bit depth HDR images into 8-bit images. We attempt to optimize reconstruction quality by focusing on ADS object detection quality. The first research novelty is to jointly perform tone-mapping with demosaicing by additionally successfully suppressing noise and demosaicing artifacts. The first CNN performs tone-mapping with noise suppression on a full-color HDR input, while the second performs joint demosaicing and tone-mapping with noise suppression on a raw HDR input. The focus is to increase the detectability of traffic-related objects in the reconstructed 8-bit content, while ensuring that the realism of the standard dynamic range (SDR) content in diverse conditions is preserved. The second research novelty is that for the first time, to the best of our knowledge, a thorough comparative analysis against the state-of-the-art tone-mapping and demosaicing methods is performed with respect to ADS object detection accuracy on traffic-related content that abounds with diverse challenging (i.e., boundary cases) scenes. The evaluation results show that the two proposed networks have better performance in object detection accuracy and image quality, than both SDR content and content obtained with the state-of-the-art tone-mapping and demosaicing algorithms. MDPI 2023-10-17 /pmc/articles/PMC10611084/ /pubmed/37896600 http://dx.doi.org/10.3390/s23208507 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Stojkovic, Ana
Aelterman, Jan
Van Hamme, David
Shopovska, Ivana
Philips, Wilfried
Deep Learning Tone-Mapping and Demosaicing for Automotive Vision Systems
title Deep Learning Tone-Mapping and Demosaicing for Automotive Vision Systems
title_full Deep Learning Tone-Mapping and Demosaicing for Automotive Vision Systems
title_fullStr Deep Learning Tone-Mapping and Demosaicing for Automotive Vision Systems
title_full_unstemmed Deep Learning Tone-Mapping and Demosaicing for Automotive Vision Systems
title_short Deep Learning Tone-Mapping and Demosaicing for Automotive Vision Systems
title_sort deep learning tone-mapping and demosaicing for automotive vision systems
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10611084/
https://www.ncbi.nlm.nih.gov/pubmed/37896600
http://dx.doi.org/10.3390/s23208507
work_keys_str_mv AT stojkovicana deeplearningtonemappinganddemosaicingforautomotivevisionsystems
AT aeltermanjan deeplearningtonemappinganddemosaicingforautomotivevisionsystems
AT vanhammedavid deeplearningtonemappinganddemosaicingforautomotivevisionsystems
AT shopovskaivana deeplearningtonemappinganddemosaicingforautomotivevisionsystems
AT philipswilfried deeplearningtonemappinganddemosaicingforautomotivevisionsystems