Cargando…
Deep Learning Tone-Mapping and Demosaicing for Automotive Vision Systems
High dynamic range (HDR) imaging technology is increasingly being used in automated driving systems (ADS) for improving the safety of traffic participants in scenes with strong differences in illumination. Therefore, a combination of HDR video, that is video with details in all illumination regimes,...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10611084/ https://www.ncbi.nlm.nih.gov/pubmed/37896600 http://dx.doi.org/10.3390/s23208507 |
_version_ | 1785128408460558336 |
---|---|
author | Stojkovic, Ana Aelterman, Jan Van Hamme, David Shopovska, Ivana Philips, Wilfried |
author_facet | Stojkovic, Ana Aelterman, Jan Van Hamme, David Shopovska, Ivana Philips, Wilfried |
author_sort | Stojkovic, Ana |
collection | PubMed |
description | High dynamic range (HDR) imaging technology is increasingly being used in automated driving systems (ADS) for improving the safety of traffic participants in scenes with strong differences in illumination. Therefore, a combination of HDR video, that is video with details in all illumination regimes, and (HDR) object perception techniques that can deal with this variety in illumination is highly desirable. Although progress has been made in both HDR imaging solutions and object detection algorithms in the recent years, they have progressed independently of each other. This has led to a situation in which object detection algorithms are typically designed and constantly improved to operate on 8 bit per channel content. This makes these algorithms not ideally suited for use in HDR data processing, which natively encodes to a higher bit-depth (12 bits/16 bits per channel). In this paper, we present and evaluate two novel convolutional neural network (CNN) architectures that intelligently convert high bit depth HDR images into 8-bit images. We attempt to optimize reconstruction quality by focusing on ADS object detection quality. The first research novelty is to jointly perform tone-mapping with demosaicing by additionally successfully suppressing noise and demosaicing artifacts. The first CNN performs tone-mapping with noise suppression on a full-color HDR input, while the second performs joint demosaicing and tone-mapping with noise suppression on a raw HDR input. The focus is to increase the detectability of traffic-related objects in the reconstructed 8-bit content, while ensuring that the realism of the standard dynamic range (SDR) content in diverse conditions is preserved. The second research novelty is that for the first time, to the best of our knowledge, a thorough comparative analysis against the state-of-the-art tone-mapping and demosaicing methods is performed with respect to ADS object detection accuracy on traffic-related content that abounds with diverse challenging (i.e., boundary cases) scenes. The evaluation results show that the two proposed networks have better performance in object detection accuracy and image quality, than both SDR content and content obtained with the state-of-the-art tone-mapping and demosaicing algorithms. |
format | Online Article Text |
id | pubmed-10611084 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-106110842023-10-28 Deep Learning Tone-Mapping and Demosaicing for Automotive Vision Systems Stojkovic, Ana Aelterman, Jan Van Hamme, David Shopovska, Ivana Philips, Wilfried Sensors (Basel) Article High dynamic range (HDR) imaging technology is increasingly being used in automated driving systems (ADS) for improving the safety of traffic participants in scenes with strong differences in illumination. Therefore, a combination of HDR video, that is video with details in all illumination regimes, and (HDR) object perception techniques that can deal with this variety in illumination is highly desirable. Although progress has been made in both HDR imaging solutions and object detection algorithms in the recent years, they have progressed independently of each other. This has led to a situation in which object detection algorithms are typically designed and constantly improved to operate on 8 bit per channel content. This makes these algorithms not ideally suited for use in HDR data processing, which natively encodes to a higher bit-depth (12 bits/16 bits per channel). In this paper, we present and evaluate two novel convolutional neural network (CNN) architectures that intelligently convert high bit depth HDR images into 8-bit images. We attempt to optimize reconstruction quality by focusing on ADS object detection quality. The first research novelty is to jointly perform tone-mapping with demosaicing by additionally successfully suppressing noise and demosaicing artifacts. The first CNN performs tone-mapping with noise suppression on a full-color HDR input, while the second performs joint demosaicing and tone-mapping with noise suppression on a raw HDR input. The focus is to increase the detectability of traffic-related objects in the reconstructed 8-bit content, while ensuring that the realism of the standard dynamic range (SDR) content in diverse conditions is preserved. The second research novelty is that for the first time, to the best of our knowledge, a thorough comparative analysis against the state-of-the-art tone-mapping and demosaicing methods is performed with respect to ADS object detection accuracy on traffic-related content that abounds with diverse challenging (i.e., boundary cases) scenes. The evaluation results show that the two proposed networks have better performance in object detection accuracy and image quality, than both SDR content and content obtained with the state-of-the-art tone-mapping and demosaicing algorithms. MDPI 2023-10-17 /pmc/articles/PMC10611084/ /pubmed/37896600 http://dx.doi.org/10.3390/s23208507 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Stojkovic, Ana Aelterman, Jan Van Hamme, David Shopovska, Ivana Philips, Wilfried Deep Learning Tone-Mapping and Demosaicing for Automotive Vision Systems |
title | Deep Learning Tone-Mapping and Demosaicing for Automotive Vision Systems |
title_full | Deep Learning Tone-Mapping and Demosaicing for Automotive Vision Systems |
title_fullStr | Deep Learning Tone-Mapping and Demosaicing for Automotive Vision Systems |
title_full_unstemmed | Deep Learning Tone-Mapping and Demosaicing for Automotive Vision Systems |
title_short | Deep Learning Tone-Mapping and Demosaicing for Automotive Vision Systems |
title_sort | deep learning tone-mapping and demosaicing for automotive vision systems |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10611084/ https://www.ncbi.nlm.nih.gov/pubmed/37896600 http://dx.doi.org/10.3390/s23208507 |
work_keys_str_mv | AT stojkovicana deeplearningtonemappinganddemosaicingforautomotivevisionsystems AT aeltermanjan deeplearningtonemappinganddemosaicingforautomotivevisionsystems AT vanhammedavid deeplearningtonemappinganddemosaicingforautomotivevisionsystems AT shopovskaivana deeplearningtonemappinganddemosaicingforautomotivevisionsystems AT philipswilfried deeplearningtonemappinganddemosaicingforautomotivevisionsystems |