Cargando…
Optimized Dropkey-Based Grad-CAM: Toward Accurate Image Feature Localization
Regarding the interpretable techniques in the field of image recognition, Grad-CAM is widely used for feature localization in images to reflect the logical decision-making information behind the neural network due to its high applicability. However, extensive experimentation on a customized dataset...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10611172/ https://www.ncbi.nlm.nih.gov/pubmed/37896446 http://dx.doi.org/10.3390/s23208351 |
_version_ | 1785128429126942720 |
---|---|
author | Liu, Yiwei Tang, Luping Liao, Chen Zhang, Chun Guo, Yingqing Xia, Yixuan Zhang, Yangyang Yao, Sisi |
author_facet | Liu, Yiwei Tang, Luping Liao, Chen Zhang, Chun Guo, Yingqing Xia, Yixuan Zhang, Yangyang Yao, Sisi |
author_sort | Liu, Yiwei |
collection | PubMed |
description | Regarding the interpretable techniques in the field of image recognition, Grad-CAM is widely used for feature localization in images to reflect the logical decision-making information behind the neural network due to its high applicability. However, extensive experimentation on a customized dataset revealed that the deep convolutional neural network (CNN) model based on Gradient-weighted Class Activation Mapping (Grad-CAM) technology cannot effectively resist the interference of large-scale noise. In this article, an optimization of the deep CNN model was proposed by incorporating the Dropkey and Dropout (as a comparison) algorithm. Compared with Grad-CAM, the improved Grad-CAM based on Dropkey applies an attention mechanism to the feature map before calculating the gradient, which can introduce randomness and eliminate some areas by applying a mask to the attention score. Experimental results show that the optimized Grad-CAM deep CNN model based on the Dropkey algorithm can effectively resist large-scale noise interference and achieve accurate localization of image features. For instance, under the interference of a noise variance of 0.6, the Dropkey-enhanced ResNet50 model achieves a confidence level of 0.878 in predicting results, while the other two models exhibit confidence levels of 0.766 and 0.481, respectively. Moreover, it exhibits excellent performance in visualizing tasks related to image features such as distortion, low contrast, and small object characteristics. Furthermore, it has promising prospects in practical computer vision applications. For instance, in the field of autonomous driving, it can assist in verifying whether deep learning models accurately understand and process crucial objects, road signs, pedestrians, or other elements in the environment. |
format | Online Article Text |
id | pubmed-10611172 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-106111722023-10-28 Optimized Dropkey-Based Grad-CAM: Toward Accurate Image Feature Localization Liu, Yiwei Tang, Luping Liao, Chen Zhang, Chun Guo, Yingqing Xia, Yixuan Zhang, Yangyang Yao, Sisi Sensors (Basel) Article Regarding the interpretable techniques in the field of image recognition, Grad-CAM is widely used for feature localization in images to reflect the logical decision-making information behind the neural network due to its high applicability. However, extensive experimentation on a customized dataset revealed that the deep convolutional neural network (CNN) model based on Gradient-weighted Class Activation Mapping (Grad-CAM) technology cannot effectively resist the interference of large-scale noise. In this article, an optimization of the deep CNN model was proposed by incorporating the Dropkey and Dropout (as a comparison) algorithm. Compared with Grad-CAM, the improved Grad-CAM based on Dropkey applies an attention mechanism to the feature map before calculating the gradient, which can introduce randomness and eliminate some areas by applying a mask to the attention score. Experimental results show that the optimized Grad-CAM deep CNN model based on the Dropkey algorithm can effectively resist large-scale noise interference and achieve accurate localization of image features. For instance, under the interference of a noise variance of 0.6, the Dropkey-enhanced ResNet50 model achieves a confidence level of 0.878 in predicting results, while the other two models exhibit confidence levels of 0.766 and 0.481, respectively. Moreover, it exhibits excellent performance in visualizing tasks related to image features such as distortion, low contrast, and small object characteristics. Furthermore, it has promising prospects in practical computer vision applications. For instance, in the field of autonomous driving, it can assist in verifying whether deep learning models accurately understand and process crucial objects, road signs, pedestrians, or other elements in the environment. MDPI 2023-10-10 /pmc/articles/PMC10611172/ /pubmed/37896446 http://dx.doi.org/10.3390/s23208351 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Liu, Yiwei Tang, Luping Liao, Chen Zhang, Chun Guo, Yingqing Xia, Yixuan Zhang, Yangyang Yao, Sisi Optimized Dropkey-Based Grad-CAM: Toward Accurate Image Feature Localization |
title | Optimized Dropkey-Based Grad-CAM: Toward Accurate Image Feature Localization |
title_full | Optimized Dropkey-Based Grad-CAM: Toward Accurate Image Feature Localization |
title_fullStr | Optimized Dropkey-Based Grad-CAM: Toward Accurate Image Feature Localization |
title_full_unstemmed | Optimized Dropkey-Based Grad-CAM: Toward Accurate Image Feature Localization |
title_short | Optimized Dropkey-Based Grad-CAM: Toward Accurate Image Feature Localization |
title_sort | optimized dropkey-based grad-cam: toward accurate image feature localization |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10611172/ https://www.ncbi.nlm.nih.gov/pubmed/37896446 http://dx.doi.org/10.3390/s23208351 |
work_keys_str_mv | AT liuyiwei optimizeddropkeybasedgradcamtowardaccurateimagefeaturelocalization AT tangluping optimizeddropkeybasedgradcamtowardaccurateimagefeaturelocalization AT liaochen optimizeddropkeybasedgradcamtowardaccurateimagefeaturelocalization AT zhangchun optimizeddropkeybasedgradcamtowardaccurateimagefeaturelocalization AT guoyingqing optimizeddropkeybasedgradcamtowardaccurateimagefeaturelocalization AT xiayixuan optimizeddropkeybasedgradcamtowardaccurateimagefeaturelocalization AT zhangyangyang optimizeddropkeybasedgradcamtowardaccurateimagefeaturelocalization AT yaosisi optimizeddropkeybasedgradcamtowardaccurateimagefeaturelocalization |