Cargando…
A Multi-Label Detection Deep Learning Model with Attention-Guided Image Enhancement for Retinal Images
At present, multi-disease fundus image classification tasks still have the problems of small data volumes, uneven distributions, and low classification accuracy. In order to solve the problem of large data demand of deep learning models, a multi-disease fundus image classification ensemble model bas...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10054796/ https://www.ncbi.nlm.nih.gov/pubmed/36985112 http://dx.doi.org/10.3390/mi14030705 |
_version_ | 1785015757821706240 |
---|---|
author | Li, Zhenwei Xu, Mengying Yang, Xiaoli Han, Yanqi Wang, Jiawen |
author_facet | Li, Zhenwei Xu, Mengying Yang, Xiaoli Han, Yanqi Wang, Jiawen |
author_sort | Li, Zhenwei |
collection | PubMed |
description | At present, multi-disease fundus image classification tasks still have the problems of small data volumes, uneven distributions, and low classification accuracy. In order to solve the problem of large data demand of deep learning models, a multi-disease fundus image classification ensemble model based on gradient-weighted class activation mapping (Grad-CAM) is proposed. The model uses VGG19 and ResNet50 as the classification networks. Grad-CAM is a data augmentation module used to obtain a network convolutional layer output activation map. Both the augmented and the original data are used as the input of the model to achieve the classification goal. The data augmentation module can guide the model to learn the feature differences of lesions in the fundus and enhance the robustness of the classification model. Model fine tuning and transfer learning are used to improve the accuracy of multiple classifiers. The proposed method is based on the RFMiD (Retinal Fundus Multi-Disease Image Dataset) dataset, and an ablation experiment was performed. Compared with other methods, the accuracy, precision, and recall of this model are 97%, 92%, and 81%, respectively. The resulting activation graph shows the areas of interest for model classification, making it easier to understand the classification network. |
format | Online Article Text |
id | pubmed-10054796 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-100547962023-03-30 A Multi-Label Detection Deep Learning Model with Attention-Guided Image Enhancement for Retinal Images Li, Zhenwei Xu, Mengying Yang, Xiaoli Han, Yanqi Wang, Jiawen Micromachines (Basel) Article At present, multi-disease fundus image classification tasks still have the problems of small data volumes, uneven distributions, and low classification accuracy. In order to solve the problem of large data demand of deep learning models, a multi-disease fundus image classification ensemble model based on gradient-weighted class activation mapping (Grad-CAM) is proposed. The model uses VGG19 and ResNet50 as the classification networks. Grad-CAM is a data augmentation module used to obtain a network convolutional layer output activation map. Both the augmented and the original data are used as the input of the model to achieve the classification goal. The data augmentation module can guide the model to learn the feature differences of lesions in the fundus and enhance the robustness of the classification model. Model fine tuning and transfer learning are used to improve the accuracy of multiple classifiers. The proposed method is based on the RFMiD (Retinal Fundus Multi-Disease Image Dataset) dataset, and an ablation experiment was performed. Compared with other methods, the accuracy, precision, and recall of this model are 97%, 92%, and 81%, respectively. The resulting activation graph shows the areas of interest for model classification, making it easier to understand the classification network. MDPI 2023-03-22 /pmc/articles/PMC10054796/ /pubmed/36985112 http://dx.doi.org/10.3390/mi14030705 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Li, Zhenwei Xu, Mengying Yang, Xiaoli Han, Yanqi Wang, Jiawen A Multi-Label Detection Deep Learning Model with Attention-Guided Image Enhancement for Retinal Images |
title | A Multi-Label Detection Deep Learning Model with Attention-Guided Image Enhancement for Retinal Images |
title_full | A Multi-Label Detection Deep Learning Model with Attention-Guided Image Enhancement for Retinal Images |
title_fullStr | A Multi-Label Detection Deep Learning Model with Attention-Guided Image Enhancement for Retinal Images |
title_full_unstemmed | A Multi-Label Detection Deep Learning Model with Attention-Guided Image Enhancement for Retinal Images |
title_short | A Multi-Label Detection Deep Learning Model with Attention-Guided Image Enhancement for Retinal Images |
title_sort | multi-label detection deep learning model with attention-guided image enhancement for retinal images |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10054796/ https://www.ncbi.nlm.nih.gov/pubmed/36985112 http://dx.doi.org/10.3390/mi14030705 |
work_keys_str_mv | AT lizhenwei amultilabeldetectiondeeplearningmodelwithattentionguidedimageenhancementforretinalimages AT xumengying amultilabeldetectiondeeplearningmodelwithattentionguidedimageenhancementforretinalimages AT yangxiaoli amultilabeldetectiondeeplearningmodelwithattentionguidedimageenhancementforretinalimages AT hanyanqi amultilabeldetectiondeeplearningmodelwithattentionguidedimageenhancementforretinalimages AT wangjiawen amultilabeldetectiondeeplearningmodelwithattentionguidedimageenhancementforretinalimages AT lizhenwei multilabeldetectiondeeplearningmodelwithattentionguidedimageenhancementforretinalimages AT xumengying multilabeldetectiondeeplearningmodelwithattentionguidedimageenhancementforretinalimages AT yangxiaoli multilabeldetectiondeeplearningmodelwithattentionguidedimageenhancementforretinalimages AT hanyanqi multilabeldetectiondeeplearningmodelwithattentionguidedimageenhancementforretinalimages AT wangjiawen multilabeldetectiondeeplearningmodelwithattentionguidedimageenhancementforretinalimages |