Cargando…

Harnessing the power of diffusion models for plant disease image augmentation

INTRODUCTION: The challenges associated with data availability, class imbalance, and the need for data augmentation are well-recognized in the field of plant disease detection. The collection of large-scale datasets for plant diseases is particularly demanding due to seasonal and geographical constr...

Descripción completa

Detalles Bibliográficos
Autores principales: Muhammad, Abdullah, Salman, Zafar, Lee, Kiseong, Han, Dongil
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10669158/
https://www.ncbi.nlm.nih.gov/pubmed/38023884
http://dx.doi.org/10.3389/fpls.2023.1280496
_version_ 1785139629784039424
author Muhammad, Abdullah
Salman, Zafar
Lee, Kiseong
Han, Dongil
author_facet Muhammad, Abdullah
Salman, Zafar
Lee, Kiseong
Han, Dongil
author_sort Muhammad, Abdullah
collection PubMed
description INTRODUCTION: The challenges associated with data availability, class imbalance, and the need for data augmentation are well-recognized in the field of plant disease detection. The collection of large-scale datasets for plant diseases is particularly demanding due to seasonal and geographical constraints, leading to significant cost and time investments. Traditional data augmentation techniques, such as cropping, resizing, and rotation, have been largely supplanted by more advanced methods. In particular, the utilization of Generative Adversarial Networks (GANs) for the creation of realistic synthetic images has become a focal point of contemporary research, addressing issues related to data scarcity and class imbalance in the training of deep learning models. Recently, the emergence of diffusion models has captivated the scientific community, offering superior and realistic output compared to GANs. Despite these advancements, the application of diffusion models in the domain of plant science remains an unexplored frontier, presenting an opportunity for groundbreaking contributions. METHODS: In this study, we delve into the principles of diffusion technology, contrasting its methodology and performance with state-of-the-art GAN solutions, specifically examining the guided inference model of GANs, named InstaGAN, and a diffusion-based model, RePaint. Both models utilize segmentation masks to guide the generation process, albeit with distinct principles. For a fair comparison, a subset of the PlantVillage dataset is used, containing two disease classes of tomato leaves and three disease classes of grape leaf diseases, as results on these classes have been published in other publications. RESULTS: Quantitatively, RePaint demonstrated superior performance over InstaGAN, with average Fréchet Inception Distance (FID) score of 138.28 and Kernel Inception Distance (KID) score of 0.089 ± (0.002), compared to InstaGAN’s average FID and KID scores of 206.02 and 0.159 ± (0.004) respectively. Additionally, RePaint’s FID scores for grape leaf diseases were 69.05, outperforming other published methods such as DCGAN (309.376), LeafGAN (178.256), and InstaGAN (114.28). For tomato leaf diseases, RePaint achieved an FID score of 161.35, surpassing other methods like WGAN (226.08), SAGAN (229.7233), and InstaGAN (236.61). DISCUSSION: This study offers valuable insights into the potential of diffusion models for data augmentation in plant disease detection, paving the way for future research in this promising field.
format Online
Article
Text
id pubmed-10669158
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-106691582023-01-01 Harnessing the power of diffusion models for plant disease image augmentation Muhammad, Abdullah Salman, Zafar Lee, Kiseong Han, Dongil Front Plant Sci Plant Science INTRODUCTION: The challenges associated with data availability, class imbalance, and the need for data augmentation are well-recognized in the field of plant disease detection. The collection of large-scale datasets for plant diseases is particularly demanding due to seasonal and geographical constraints, leading to significant cost and time investments. Traditional data augmentation techniques, such as cropping, resizing, and rotation, have been largely supplanted by more advanced methods. In particular, the utilization of Generative Adversarial Networks (GANs) for the creation of realistic synthetic images has become a focal point of contemporary research, addressing issues related to data scarcity and class imbalance in the training of deep learning models. Recently, the emergence of diffusion models has captivated the scientific community, offering superior and realistic output compared to GANs. Despite these advancements, the application of diffusion models in the domain of plant science remains an unexplored frontier, presenting an opportunity for groundbreaking contributions. METHODS: In this study, we delve into the principles of diffusion technology, contrasting its methodology and performance with state-of-the-art GAN solutions, specifically examining the guided inference model of GANs, named InstaGAN, and a diffusion-based model, RePaint. Both models utilize segmentation masks to guide the generation process, albeit with distinct principles. For a fair comparison, a subset of the PlantVillage dataset is used, containing two disease classes of tomato leaves and three disease classes of grape leaf diseases, as results on these classes have been published in other publications. RESULTS: Quantitatively, RePaint demonstrated superior performance over InstaGAN, with average Fréchet Inception Distance (FID) score of 138.28 and Kernel Inception Distance (KID) score of 0.089 ± (0.002), compared to InstaGAN’s average FID and KID scores of 206.02 and 0.159 ± (0.004) respectively. Additionally, RePaint’s FID scores for grape leaf diseases were 69.05, outperforming other published methods such as DCGAN (309.376), LeafGAN (178.256), and InstaGAN (114.28). For tomato leaf diseases, RePaint achieved an FID score of 161.35, surpassing other methods like WGAN (226.08), SAGAN (229.7233), and InstaGAN (236.61). DISCUSSION: This study offers valuable insights into the potential of diffusion models for data augmentation in plant disease detection, paving the way for future research in this promising field. Frontiers Media S.A. 2023-11-07 /pmc/articles/PMC10669158/ /pubmed/38023884 http://dx.doi.org/10.3389/fpls.2023.1280496 Text en Copyright © 2023 Abdullah, Salman, Lee and Han https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Plant Science
Muhammad, Abdullah
Salman, Zafar
Lee, Kiseong
Han, Dongil
Harnessing the power of diffusion models for plant disease image augmentation
title Harnessing the power of diffusion models for plant disease image augmentation
title_full Harnessing the power of diffusion models for plant disease image augmentation
title_fullStr Harnessing the power of diffusion models for plant disease image augmentation
title_full_unstemmed Harnessing the power of diffusion models for plant disease image augmentation
title_short Harnessing the power of diffusion models for plant disease image augmentation
title_sort harnessing the power of diffusion models for plant disease image augmentation
topic Plant Science
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10669158/
https://www.ncbi.nlm.nih.gov/pubmed/38023884
http://dx.doi.org/10.3389/fpls.2023.1280496
work_keys_str_mv AT muhammadabdullah harnessingthepowerofdiffusionmodelsforplantdiseaseimageaugmentation
AT salmanzafar harnessingthepowerofdiffusionmodelsforplantdiseaseimageaugmentation
AT leekiseong harnessingthepowerofdiffusionmodelsforplantdiseaseimageaugmentation
AT handongil harnessingthepowerofdiffusionmodelsforplantdiseaseimageaugmentation