Cargando…

A Multi-Label Detection Deep Learning Model with Attention-Guided Image Enhancement for Retinal Images

At present, multi-disease fundus image classification tasks still have the problems of small data volumes, uneven distributions, and low classification accuracy. In order to solve the problem of large data demand of deep learning models, a multi-disease fundus image classification ensemble model bas...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Zhenwei, Xu, Mengying, Yang, Xiaoli, Han, Yanqi, Wang, Jiawen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10054796/
https://www.ncbi.nlm.nih.gov/pubmed/36985112
http://dx.doi.org/10.3390/mi14030705
_version_ 1785015757821706240
author Li, Zhenwei
Xu, Mengying
Yang, Xiaoli
Han, Yanqi
Wang, Jiawen
author_facet Li, Zhenwei
Xu, Mengying
Yang, Xiaoli
Han, Yanqi
Wang, Jiawen
author_sort Li, Zhenwei
collection PubMed
description At present, multi-disease fundus image classification tasks still have the problems of small data volumes, uneven distributions, and low classification accuracy. In order to solve the problem of large data demand of deep learning models, a multi-disease fundus image classification ensemble model based on gradient-weighted class activation mapping (Grad-CAM) is proposed. The model uses VGG19 and ResNet50 as the classification networks. Grad-CAM is a data augmentation module used to obtain a network convolutional layer output activation map. Both the augmented and the original data are used as the input of the model to achieve the classification goal. The data augmentation module can guide the model to learn the feature differences of lesions in the fundus and enhance the robustness of the classification model. Model fine tuning and transfer learning are used to improve the accuracy of multiple classifiers. The proposed method is based on the RFMiD (Retinal Fundus Multi-Disease Image Dataset) dataset, and an ablation experiment was performed. Compared with other methods, the accuracy, precision, and recall of this model are 97%, 92%, and 81%, respectively. The resulting activation graph shows the areas of interest for model classification, making it easier to understand the classification network.
format Online
Article
Text
id pubmed-10054796
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-100547962023-03-30 A Multi-Label Detection Deep Learning Model with Attention-Guided Image Enhancement for Retinal Images Li, Zhenwei Xu, Mengying Yang, Xiaoli Han, Yanqi Wang, Jiawen Micromachines (Basel) Article At present, multi-disease fundus image classification tasks still have the problems of small data volumes, uneven distributions, and low classification accuracy. In order to solve the problem of large data demand of deep learning models, a multi-disease fundus image classification ensemble model based on gradient-weighted class activation mapping (Grad-CAM) is proposed. The model uses VGG19 and ResNet50 as the classification networks. Grad-CAM is a data augmentation module used to obtain a network convolutional layer output activation map. Both the augmented and the original data are used as the input of the model to achieve the classification goal. The data augmentation module can guide the model to learn the feature differences of lesions in the fundus and enhance the robustness of the classification model. Model fine tuning and transfer learning are used to improve the accuracy of multiple classifiers. The proposed method is based on the RFMiD (Retinal Fundus Multi-Disease Image Dataset) dataset, and an ablation experiment was performed. Compared with other methods, the accuracy, precision, and recall of this model are 97%, 92%, and 81%, respectively. The resulting activation graph shows the areas of interest for model classification, making it easier to understand the classification network. MDPI 2023-03-22 /pmc/articles/PMC10054796/ /pubmed/36985112 http://dx.doi.org/10.3390/mi14030705 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Li, Zhenwei
Xu, Mengying
Yang, Xiaoli
Han, Yanqi
Wang, Jiawen
A Multi-Label Detection Deep Learning Model with Attention-Guided Image Enhancement for Retinal Images
title A Multi-Label Detection Deep Learning Model with Attention-Guided Image Enhancement for Retinal Images
title_full A Multi-Label Detection Deep Learning Model with Attention-Guided Image Enhancement for Retinal Images
title_fullStr A Multi-Label Detection Deep Learning Model with Attention-Guided Image Enhancement for Retinal Images
title_full_unstemmed A Multi-Label Detection Deep Learning Model with Attention-Guided Image Enhancement for Retinal Images
title_short A Multi-Label Detection Deep Learning Model with Attention-Guided Image Enhancement for Retinal Images
title_sort multi-label detection deep learning model with attention-guided image enhancement for retinal images
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10054796/
https://www.ncbi.nlm.nih.gov/pubmed/36985112
http://dx.doi.org/10.3390/mi14030705
work_keys_str_mv AT lizhenwei amultilabeldetectiondeeplearningmodelwithattentionguidedimageenhancementforretinalimages
AT xumengying amultilabeldetectiondeeplearningmodelwithattentionguidedimageenhancementforretinalimages
AT yangxiaoli amultilabeldetectiondeeplearningmodelwithattentionguidedimageenhancementforretinalimages
AT hanyanqi amultilabeldetectiondeeplearningmodelwithattentionguidedimageenhancementforretinalimages
AT wangjiawen amultilabeldetectiondeeplearningmodelwithattentionguidedimageenhancementforretinalimages
AT lizhenwei multilabeldetectiondeeplearningmodelwithattentionguidedimageenhancementforretinalimages
AT xumengying multilabeldetectiondeeplearningmodelwithattentionguidedimageenhancementforretinalimages
AT yangxiaoli multilabeldetectiondeeplearningmodelwithattentionguidedimageenhancementforretinalimages
AT hanyanqi multilabeldetectiondeeplearningmodelwithattentionguidedimageenhancementforretinalimages
AT wangjiawen multilabeldetectiondeeplearningmodelwithattentionguidedimageenhancementforretinalimages