Cargando…

Optimized Dropkey-Based Grad-CAM: Toward Accurate Image Feature Localization

Regarding the interpretable techniques in the field of image recognition, Grad-CAM is widely used for feature localization in images to reflect the logical decision-making information behind the neural network due to its high applicability. However, extensive experimentation on a customized dataset...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Yiwei, Tang, Luping, Liao, Chen, Zhang, Chun, Guo, Yingqing, Xia, Yixuan, Zhang, Yangyang, Yao, Sisi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10611172/
https://www.ncbi.nlm.nih.gov/pubmed/37896446
http://dx.doi.org/10.3390/s23208351
_version_ 1785128429126942720
author Liu, Yiwei
Tang, Luping
Liao, Chen
Zhang, Chun
Guo, Yingqing
Xia, Yixuan
Zhang, Yangyang
Yao, Sisi
author_facet Liu, Yiwei
Tang, Luping
Liao, Chen
Zhang, Chun
Guo, Yingqing
Xia, Yixuan
Zhang, Yangyang
Yao, Sisi
author_sort Liu, Yiwei
collection PubMed
description Regarding the interpretable techniques in the field of image recognition, Grad-CAM is widely used for feature localization in images to reflect the logical decision-making information behind the neural network due to its high applicability. However, extensive experimentation on a customized dataset revealed that the deep convolutional neural network (CNN) model based on Gradient-weighted Class Activation Mapping (Grad-CAM) technology cannot effectively resist the interference of large-scale noise. In this article, an optimization of the deep CNN model was proposed by incorporating the Dropkey and Dropout (as a comparison) algorithm. Compared with Grad-CAM, the improved Grad-CAM based on Dropkey applies an attention mechanism to the feature map before calculating the gradient, which can introduce randomness and eliminate some areas by applying a mask to the attention score. Experimental results show that the optimized Grad-CAM deep CNN model based on the Dropkey algorithm can effectively resist large-scale noise interference and achieve accurate localization of image features. For instance, under the interference of a noise variance of 0.6, the Dropkey-enhanced ResNet50 model achieves a confidence level of 0.878 in predicting results, while the other two models exhibit confidence levels of 0.766 and 0.481, respectively. Moreover, it exhibits excellent performance in visualizing tasks related to image features such as distortion, low contrast, and small object characteristics. Furthermore, it has promising prospects in practical computer vision applications. For instance, in the field of autonomous driving, it can assist in verifying whether deep learning models accurately understand and process crucial objects, road signs, pedestrians, or other elements in the environment.
format Online
Article
Text
id pubmed-10611172
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-106111722023-10-28 Optimized Dropkey-Based Grad-CAM: Toward Accurate Image Feature Localization Liu, Yiwei Tang, Luping Liao, Chen Zhang, Chun Guo, Yingqing Xia, Yixuan Zhang, Yangyang Yao, Sisi Sensors (Basel) Article Regarding the interpretable techniques in the field of image recognition, Grad-CAM is widely used for feature localization in images to reflect the logical decision-making information behind the neural network due to its high applicability. However, extensive experimentation on a customized dataset revealed that the deep convolutional neural network (CNN) model based on Gradient-weighted Class Activation Mapping (Grad-CAM) technology cannot effectively resist the interference of large-scale noise. In this article, an optimization of the deep CNN model was proposed by incorporating the Dropkey and Dropout (as a comparison) algorithm. Compared with Grad-CAM, the improved Grad-CAM based on Dropkey applies an attention mechanism to the feature map before calculating the gradient, which can introduce randomness and eliminate some areas by applying a mask to the attention score. Experimental results show that the optimized Grad-CAM deep CNN model based on the Dropkey algorithm can effectively resist large-scale noise interference and achieve accurate localization of image features. For instance, under the interference of a noise variance of 0.6, the Dropkey-enhanced ResNet50 model achieves a confidence level of 0.878 in predicting results, while the other two models exhibit confidence levels of 0.766 and 0.481, respectively. Moreover, it exhibits excellent performance in visualizing tasks related to image features such as distortion, low contrast, and small object characteristics. Furthermore, it has promising prospects in practical computer vision applications. For instance, in the field of autonomous driving, it can assist in verifying whether deep learning models accurately understand and process crucial objects, road signs, pedestrians, or other elements in the environment. MDPI 2023-10-10 /pmc/articles/PMC10611172/ /pubmed/37896446 http://dx.doi.org/10.3390/s23208351 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Liu, Yiwei
Tang, Luping
Liao, Chen
Zhang, Chun
Guo, Yingqing
Xia, Yixuan
Zhang, Yangyang
Yao, Sisi
Optimized Dropkey-Based Grad-CAM: Toward Accurate Image Feature Localization
title Optimized Dropkey-Based Grad-CAM: Toward Accurate Image Feature Localization
title_full Optimized Dropkey-Based Grad-CAM: Toward Accurate Image Feature Localization
title_fullStr Optimized Dropkey-Based Grad-CAM: Toward Accurate Image Feature Localization
title_full_unstemmed Optimized Dropkey-Based Grad-CAM: Toward Accurate Image Feature Localization
title_short Optimized Dropkey-Based Grad-CAM: Toward Accurate Image Feature Localization
title_sort optimized dropkey-based grad-cam: toward accurate image feature localization
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10611172/
https://www.ncbi.nlm.nih.gov/pubmed/37896446
http://dx.doi.org/10.3390/s23208351
work_keys_str_mv AT liuyiwei optimizeddropkeybasedgradcamtowardaccurateimagefeaturelocalization
AT tangluping optimizeddropkeybasedgradcamtowardaccurateimagefeaturelocalization
AT liaochen optimizeddropkeybasedgradcamtowardaccurateimagefeaturelocalization
AT zhangchun optimizeddropkeybasedgradcamtowardaccurateimagefeaturelocalization
AT guoyingqing optimizeddropkeybasedgradcamtowardaccurateimagefeaturelocalization
AT xiayixuan optimizeddropkeybasedgradcamtowardaccurateimagefeaturelocalization
AT zhangyangyang optimizeddropkeybasedgradcamtowardaccurateimagefeaturelocalization
AT yaosisi optimizeddropkeybasedgradcamtowardaccurateimagefeaturelocalization