Cargando…
CVCC Model: Learning-Based Computer Vision Color Constancy with RiR-DSN Architecture
To achieve computer vision color constancy (CVCC), it is vital but challenging to estimate scene illumination from a digital image, which distorts the true color of an object. Estimating illumination as accurately as possible is fundamental to improving the quality of the image processing pipeline....
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10256021/ https://www.ncbi.nlm.nih.gov/pubmed/37300068 http://dx.doi.org/10.3390/s23115341 |
_version_ | 1785057013437300736 |
---|---|
author | Choi, Ho-Hyoung |
author_facet | Choi, Ho-Hyoung |
author_sort | Choi, Ho-Hyoung |
collection | PubMed |
description | To achieve computer vision color constancy (CVCC), it is vital but challenging to estimate scene illumination from a digital image, which distorts the true color of an object. Estimating illumination as accurately as possible is fundamental to improving the quality of the image processing pipeline. CVCC has a long history of research and has significantly advanced, but it has yet to overcome some limitations such as algorithm failure or accuracy decreasing under unusual circumstances. To cope with some of the bottlenecks, this article presents a novel CVCC approach that introduces a residual-in-residual dense selective kernel network (RiR-DSN). As its name implies, it has a residual network in a residual network (RiR) and the RiR houses a dense selective kernel network (DSN). A DSN is composed of selective kernel convolutional blocks (SKCBs). The SKCBs, or neurons herein, are interconnected in a feed-forward fashion. Every neuron receives input from all its preceding neurons and feeds the feature maps into all its subsequent neurons, which is how information flows in the proposed architecture. In addition, the architecture has incorporated a dynamic selection mechanism into each neuron to ensure that the neuron can modulate filter kernel sizes depending on varying intensities of stimuli. In a nutshell, the proposed RiR-DSN architecture features neurons called SKCBs and a residual block in a residual block, which brings several benefits such as alleviation of the vanishing gradients, enhancement of feature propagation, promotion of the reuse of features, modulation of receptive filter sizes depending on varying intensities of stimuli, and a dramatic drop in the number of parameters. Experimental results highlight that the RiR-DSN architecture performs well above its state-of-the-art counterparts, as well as proving to be camera- and illuminant-invariant. |
format | Online Article Text |
id | pubmed-10256021 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-102560212023-06-10 CVCC Model: Learning-Based Computer Vision Color Constancy with RiR-DSN Architecture Choi, Ho-Hyoung Sensors (Basel) Article To achieve computer vision color constancy (CVCC), it is vital but challenging to estimate scene illumination from a digital image, which distorts the true color of an object. Estimating illumination as accurately as possible is fundamental to improving the quality of the image processing pipeline. CVCC has a long history of research and has significantly advanced, but it has yet to overcome some limitations such as algorithm failure or accuracy decreasing under unusual circumstances. To cope with some of the bottlenecks, this article presents a novel CVCC approach that introduces a residual-in-residual dense selective kernel network (RiR-DSN). As its name implies, it has a residual network in a residual network (RiR) and the RiR houses a dense selective kernel network (DSN). A DSN is composed of selective kernel convolutional blocks (SKCBs). The SKCBs, or neurons herein, are interconnected in a feed-forward fashion. Every neuron receives input from all its preceding neurons and feeds the feature maps into all its subsequent neurons, which is how information flows in the proposed architecture. In addition, the architecture has incorporated a dynamic selection mechanism into each neuron to ensure that the neuron can modulate filter kernel sizes depending on varying intensities of stimuli. In a nutshell, the proposed RiR-DSN architecture features neurons called SKCBs and a residual block in a residual block, which brings several benefits such as alleviation of the vanishing gradients, enhancement of feature propagation, promotion of the reuse of features, modulation of receptive filter sizes depending on varying intensities of stimuli, and a dramatic drop in the number of parameters. Experimental results highlight that the RiR-DSN architecture performs well above its state-of-the-art counterparts, as well as proving to be camera- and illuminant-invariant. MDPI 2023-06-05 /pmc/articles/PMC10256021/ /pubmed/37300068 http://dx.doi.org/10.3390/s23115341 Text en © 2023 by the author. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Choi, Ho-Hyoung CVCC Model: Learning-Based Computer Vision Color Constancy with RiR-DSN Architecture |
title | CVCC Model: Learning-Based Computer Vision Color Constancy with RiR-DSN Architecture |
title_full | CVCC Model: Learning-Based Computer Vision Color Constancy with RiR-DSN Architecture |
title_fullStr | CVCC Model: Learning-Based Computer Vision Color Constancy with RiR-DSN Architecture |
title_full_unstemmed | CVCC Model: Learning-Based Computer Vision Color Constancy with RiR-DSN Architecture |
title_short | CVCC Model: Learning-Based Computer Vision Color Constancy with RiR-DSN Architecture |
title_sort | cvcc model: learning-based computer vision color constancy with rir-dsn architecture |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10256021/ https://www.ncbi.nlm.nih.gov/pubmed/37300068 http://dx.doi.org/10.3390/s23115341 |
work_keys_str_mv | AT choihohyoung cvccmodellearningbasedcomputervisioncolorconstancywithrirdsnarchitecture |