Cargando…

CVCC Model: Learning-Based Computer Vision Color Constancy with RiR-DSN Architecture

To achieve computer vision color constancy (CVCC), it is vital but challenging to estimate scene illumination from a digital image, which distorts the true color of an object. Estimating illumination as accurately as possible is fundamental to improving the quality of the image processing pipeline....

Descripción completa

Detalles Bibliográficos
Autor principal: Choi, Ho-Hyoung
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10256021/
https://www.ncbi.nlm.nih.gov/pubmed/37300068
http://dx.doi.org/10.3390/s23115341
_version_ 1785057013437300736
author Choi, Ho-Hyoung
author_facet Choi, Ho-Hyoung
author_sort Choi, Ho-Hyoung
collection PubMed
description To achieve computer vision color constancy (CVCC), it is vital but challenging to estimate scene illumination from a digital image, which distorts the true color of an object. Estimating illumination as accurately as possible is fundamental to improving the quality of the image processing pipeline. CVCC has a long history of research and has significantly advanced, but it has yet to overcome some limitations such as algorithm failure or accuracy decreasing under unusual circumstances. To cope with some of the bottlenecks, this article presents a novel CVCC approach that introduces a residual-in-residual dense selective kernel network (RiR-DSN). As its name implies, it has a residual network in a residual network (RiR) and the RiR houses a dense selective kernel network (DSN). A DSN is composed of selective kernel convolutional blocks (SKCBs). The SKCBs, or neurons herein, are interconnected in a feed-forward fashion. Every neuron receives input from all its preceding neurons and feeds the feature maps into all its subsequent neurons, which is how information flows in the proposed architecture. In addition, the architecture has incorporated a dynamic selection mechanism into each neuron to ensure that the neuron can modulate filter kernel sizes depending on varying intensities of stimuli. In a nutshell, the proposed RiR-DSN architecture features neurons called SKCBs and a residual block in a residual block, which brings several benefits such as alleviation of the vanishing gradients, enhancement of feature propagation, promotion of the reuse of features, modulation of receptive filter sizes depending on varying intensities of stimuli, and a dramatic drop in the number of parameters. Experimental results highlight that the RiR-DSN architecture performs well above its state-of-the-art counterparts, as well as proving to be camera- and illuminant-invariant.
format Online
Article
Text
id pubmed-10256021
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-102560212023-06-10 CVCC Model: Learning-Based Computer Vision Color Constancy with RiR-DSN Architecture Choi, Ho-Hyoung Sensors (Basel) Article To achieve computer vision color constancy (CVCC), it is vital but challenging to estimate scene illumination from a digital image, which distorts the true color of an object. Estimating illumination as accurately as possible is fundamental to improving the quality of the image processing pipeline. CVCC has a long history of research and has significantly advanced, but it has yet to overcome some limitations such as algorithm failure or accuracy decreasing under unusual circumstances. To cope with some of the bottlenecks, this article presents a novel CVCC approach that introduces a residual-in-residual dense selective kernel network (RiR-DSN). As its name implies, it has a residual network in a residual network (RiR) and the RiR houses a dense selective kernel network (DSN). A DSN is composed of selective kernel convolutional blocks (SKCBs). The SKCBs, or neurons herein, are interconnected in a feed-forward fashion. Every neuron receives input from all its preceding neurons and feeds the feature maps into all its subsequent neurons, which is how information flows in the proposed architecture. In addition, the architecture has incorporated a dynamic selection mechanism into each neuron to ensure that the neuron can modulate filter kernel sizes depending on varying intensities of stimuli. In a nutshell, the proposed RiR-DSN architecture features neurons called SKCBs and a residual block in a residual block, which brings several benefits such as alleviation of the vanishing gradients, enhancement of feature propagation, promotion of the reuse of features, modulation of receptive filter sizes depending on varying intensities of stimuli, and a dramatic drop in the number of parameters. Experimental results highlight that the RiR-DSN architecture performs well above its state-of-the-art counterparts, as well as proving to be camera- and illuminant-invariant. MDPI 2023-06-05 /pmc/articles/PMC10256021/ /pubmed/37300068 http://dx.doi.org/10.3390/s23115341 Text en © 2023 by the author. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Choi, Ho-Hyoung
CVCC Model: Learning-Based Computer Vision Color Constancy with RiR-DSN Architecture
title CVCC Model: Learning-Based Computer Vision Color Constancy with RiR-DSN Architecture
title_full CVCC Model: Learning-Based Computer Vision Color Constancy with RiR-DSN Architecture
title_fullStr CVCC Model: Learning-Based Computer Vision Color Constancy with RiR-DSN Architecture
title_full_unstemmed CVCC Model: Learning-Based Computer Vision Color Constancy with RiR-DSN Architecture
title_short CVCC Model: Learning-Based Computer Vision Color Constancy with RiR-DSN Architecture
title_sort cvcc model: learning-based computer vision color constancy with rir-dsn architecture
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10256021/
https://www.ncbi.nlm.nih.gov/pubmed/37300068
http://dx.doi.org/10.3390/s23115341
work_keys_str_mv AT choihohyoung cvccmodellearningbasedcomputervisioncolorconstancywithrirdsnarchitecture