Cargando…
Harnessing the power of diffusion models for plant disease image augmentation
INTRODUCTION: The challenges associated with data availability, class imbalance, and the need for data augmentation are well-recognized in the field of plant disease detection. The collection of large-scale datasets for plant diseases is particularly demanding due to seasonal and geographical constr...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10669158/ https://www.ncbi.nlm.nih.gov/pubmed/38023884 http://dx.doi.org/10.3389/fpls.2023.1280496 |
_version_ | 1785139629784039424 |
---|---|
author | Muhammad, Abdullah Salman, Zafar Lee, Kiseong Han, Dongil |
author_facet | Muhammad, Abdullah Salman, Zafar Lee, Kiseong Han, Dongil |
author_sort | Muhammad, Abdullah |
collection | PubMed |
description | INTRODUCTION: The challenges associated with data availability, class imbalance, and the need for data augmentation are well-recognized in the field of plant disease detection. The collection of large-scale datasets for plant diseases is particularly demanding due to seasonal and geographical constraints, leading to significant cost and time investments. Traditional data augmentation techniques, such as cropping, resizing, and rotation, have been largely supplanted by more advanced methods. In particular, the utilization of Generative Adversarial Networks (GANs) for the creation of realistic synthetic images has become a focal point of contemporary research, addressing issues related to data scarcity and class imbalance in the training of deep learning models. Recently, the emergence of diffusion models has captivated the scientific community, offering superior and realistic output compared to GANs. Despite these advancements, the application of diffusion models in the domain of plant science remains an unexplored frontier, presenting an opportunity for groundbreaking contributions. METHODS: In this study, we delve into the principles of diffusion technology, contrasting its methodology and performance with state-of-the-art GAN solutions, specifically examining the guided inference model of GANs, named InstaGAN, and a diffusion-based model, RePaint. Both models utilize segmentation masks to guide the generation process, albeit with distinct principles. For a fair comparison, a subset of the PlantVillage dataset is used, containing two disease classes of tomato leaves and three disease classes of grape leaf diseases, as results on these classes have been published in other publications. RESULTS: Quantitatively, RePaint demonstrated superior performance over InstaGAN, with average Fréchet Inception Distance (FID) score of 138.28 and Kernel Inception Distance (KID) score of 0.089 ± (0.002), compared to InstaGAN’s average FID and KID scores of 206.02 and 0.159 ± (0.004) respectively. Additionally, RePaint’s FID scores for grape leaf diseases were 69.05, outperforming other published methods such as DCGAN (309.376), LeafGAN (178.256), and InstaGAN (114.28). For tomato leaf diseases, RePaint achieved an FID score of 161.35, surpassing other methods like WGAN (226.08), SAGAN (229.7233), and InstaGAN (236.61). DISCUSSION: This study offers valuable insights into the potential of diffusion models for data augmentation in plant disease detection, paving the way for future research in this promising field. |
format | Online Article Text |
id | pubmed-10669158 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-106691582023-01-01 Harnessing the power of diffusion models for plant disease image augmentation Muhammad, Abdullah Salman, Zafar Lee, Kiseong Han, Dongil Front Plant Sci Plant Science INTRODUCTION: The challenges associated with data availability, class imbalance, and the need for data augmentation are well-recognized in the field of plant disease detection. The collection of large-scale datasets for plant diseases is particularly demanding due to seasonal and geographical constraints, leading to significant cost and time investments. Traditional data augmentation techniques, such as cropping, resizing, and rotation, have been largely supplanted by more advanced methods. In particular, the utilization of Generative Adversarial Networks (GANs) for the creation of realistic synthetic images has become a focal point of contemporary research, addressing issues related to data scarcity and class imbalance in the training of deep learning models. Recently, the emergence of diffusion models has captivated the scientific community, offering superior and realistic output compared to GANs. Despite these advancements, the application of diffusion models in the domain of plant science remains an unexplored frontier, presenting an opportunity for groundbreaking contributions. METHODS: In this study, we delve into the principles of diffusion technology, contrasting its methodology and performance with state-of-the-art GAN solutions, specifically examining the guided inference model of GANs, named InstaGAN, and a diffusion-based model, RePaint. Both models utilize segmentation masks to guide the generation process, albeit with distinct principles. For a fair comparison, a subset of the PlantVillage dataset is used, containing two disease classes of tomato leaves and three disease classes of grape leaf diseases, as results on these classes have been published in other publications. RESULTS: Quantitatively, RePaint demonstrated superior performance over InstaGAN, with average Fréchet Inception Distance (FID) score of 138.28 and Kernel Inception Distance (KID) score of 0.089 ± (0.002), compared to InstaGAN’s average FID and KID scores of 206.02 and 0.159 ± (0.004) respectively. Additionally, RePaint’s FID scores for grape leaf diseases were 69.05, outperforming other published methods such as DCGAN (309.376), LeafGAN (178.256), and InstaGAN (114.28). For tomato leaf diseases, RePaint achieved an FID score of 161.35, surpassing other methods like WGAN (226.08), SAGAN (229.7233), and InstaGAN (236.61). DISCUSSION: This study offers valuable insights into the potential of diffusion models for data augmentation in plant disease detection, paving the way for future research in this promising field. Frontiers Media S.A. 2023-11-07 /pmc/articles/PMC10669158/ /pubmed/38023884 http://dx.doi.org/10.3389/fpls.2023.1280496 Text en Copyright © 2023 Abdullah, Salman, Lee and Han https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Plant Science Muhammad, Abdullah Salman, Zafar Lee, Kiseong Han, Dongil Harnessing the power of diffusion models for plant disease image augmentation |
title | Harnessing the power of diffusion models for plant disease image augmentation |
title_full | Harnessing the power of diffusion models for plant disease image augmentation |
title_fullStr | Harnessing the power of diffusion models for plant disease image augmentation |
title_full_unstemmed | Harnessing the power of diffusion models for plant disease image augmentation |
title_short | Harnessing the power of diffusion models for plant disease image augmentation |
title_sort | harnessing the power of diffusion models for plant disease image augmentation |
topic | Plant Science |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10669158/ https://www.ncbi.nlm.nih.gov/pubmed/38023884 http://dx.doi.org/10.3389/fpls.2023.1280496 |
work_keys_str_mv | AT muhammadabdullah harnessingthepowerofdiffusionmodelsforplantdiseaseimageaugmentation AT salmanzafar harnessingthepowerofdiffusionmodelsforplantdiseaseimageaugmentation AT leekiseong harnessingthepowerofdiffusionmodelsforplantdiseaseimageaugmentation AT handongil harnessingthepowerofdiffusionmodelsforplantdiseaseimageaugmentation |