Cargando…
Analysis of Few-Shot Techniques for Fungal Plant Disease Classification and Evaluation of Clustering Capabilities Over Real Datasets
Plant fungal diseases are one of the most important causes of crop yield losses. Therefore, plant disease identification algorithms have been seen as a useful tool to detect them at early stages to mitigate their effects. Although deep-learning based algorithms can achieve high detection accuracies,...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8959904/ https://www.ncbi.nlm.nih.gov/pubmed/35356111 http://dx.doi.org/10.3389/fpls.2022.813237 |
_version_ | 1784677266306891776 |
---|---|
author | Egusquiza, Itziar Picon, Artzai Irusta, Unai Bereciartua-Perez, Arantza Eggers, Till Klukas, Christian Aramendi, Elisabete Navarra-Mestre, Ramon |
author_facet | Egusquiza, Itziar Picon, Artzai Irusta, Unai Bereciartua-Perez, Arantza Eggers, Till Klukas, Christian Aramendi, Elisabete Navarra-Mestre, Ramon |
author_sort | Egusquiza, Itziar |
collection | PubMed |
description | Plant fungal diseases are one of the most important causes of crop yield losses. Therefore, plant disease identification algorithms have been seen as a useful tool to detect them at early stages to mitigate their effects. Although deep-learning based algorithms can achieve high detection accuracies, they require large and manually annotated image datasets that is not always accessible, specially for rare and new diseases. This study focuses on the development of a plant disease detection algorithm and strategy requiring few plant images (Few-shot learning algorithm). We extend previous work by using a novel challenging dataset containing more than 100,000 images. This dataset includes images of leaves, panicles and stems of five different crops (barley, corn, rape seed, rice, and wheat) for a total of 17 different diseases, where each disease is shown at different disease stages. In this study, we propose a deep metric learning based method to extract latent space representations from plant diseases with just few images by means of a Siamese network and triplet loss function. This enhances previous methods that require a support dataset containing a high number of annotated images to perform metric learning and few-shot classification. The proposed method was compared over a traditional network that was trained with the cross-entropy loss function. Exhaustive experiments have been performed for validating and measuring the benefits of metric learning techniques over classical methods. Results show that the features extracted by the metric learning based approach present better discriminative and clustering properties. Davis-Bouldin index and Silhouette score values have shown that triplet loss network improves the clustering properties with respect to the categorical-cross entropy loss. Overall, triplet loss approach improves the DB index value by 22.7% and Silhouette score value by 166.7% compared to the categorical cross-entropy loss model. Moreover, the F-score parameter obtained from the Siamese network with the triplet loss performs better than classical approaches when there are few images for training, obtaining a 6% improvement in the F-score mean value. Siamese networks with triplet loss have improved the ability to learn different plant diseases using few images of each class. These networks based on metric learning techniques improve clustering and classification results over traditional categorical cross-entropy loss networks for plant disease identification. |
format | Online Article Text |
id | pubmed-8959904 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-89599042022-03-29 Analysis of Few-Shot Techniques for Fungal Plant Disease Classification and Evaluation of Clustering Capabilities Over Real Datasets Egusquiza, Itziar Picon, Artzai Irusta, Unai Bereciartua-Perez, Arantza Eggers, Till Klukas, Christian Aramendi, Elisabete Navarra-Mestre, Ramon Front Plant Sci Plant Science Plant fungal diseases are one of the most important causes of crop yield losses. Therefore, plant disease identification algorithms have been seen as a useful tool to detect them at early stages to mitigate their effects. Although deep-learning based algorithms can achieve high detection accuracies, they require large and manually annotated image datasets that is not always accessible, specially for rare and new diseases. This study focuses on the development of a plant disease detection algorithm and strategy requiring few plant images (Few-shot learning algorithm). We extend previous work by using a novel challenging dataset containing more than 100,000 images. This dataset includes images of leaves, panicles and stems of five different crops (barley, corn, rape seed, rice, and wheat) for a total of 17 different diseases, where each disease is shown at different disease stages. In this study, we propose a deep metric learning based method to extract latent space representations from plant diseases with just few images by means of a Siamese network and triplet loss function. This enhances previous methods that require a support dataset containing a high number of annotated images to perform metric learning and few-shot classification. The proposed method was compared over a traditional network that was trained with the cross-entropy loss function. Exhaustive experiments have been performed for validating and measuring the benefits of metric learning techniques over classical methods. Results show that the features extracted by the metric learning based approach present better discriminative and clustering properties. Davis-Bouldin index and Silhouette score values have shown that triplet loss network improves the clustering properties with respect to the categorical-cross entropy loss. Overall, triplet loss approach improves the DB index value by 22.7% and Silhouette score value by 166.7% compared to the categorical cross-entropy loss model. Moreover, the F-score parameter obtained from the Siamese network with the triplet loss performs better than classical approaches when there are few images for training, obtaining a 6% improvement in the F-score mean value. Siamese networks with triplet loss have improved the ability to learn different plant diseases using few images of each class. These networks based on metric learning techniques improve clustering and classification results over traditional categorical cross-entropy loss networks for plant disease identification. Frontiers Media S.A. 2022-03-07 /pmc/articles/PMC8959904/ /pubmed/35356111 http://dx.doi.org/10.3389/fpls.2022.813237 Text en Copyright © 2022 Egusquiza, Picon, Irusta, Bereciartua-Perez, Eggers, Klukas, Aramendi and Navarra-Mestre. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Plant Science Egusquiza, Itziar Picon, Artzai Irusta, Unai Bereciartua-Perez, Arantza Eggers, Till Klukas, Christian Aramendi, Elisabete Navarra-Mestre, Ramon Analysis of Few-Shot Techniques for Fungal Plant Disease Classification and Evaluation of Clustering Capabilities Over Real Datasets |
title | Analysis of Few-Shot Techniques for Fungal Plant Disease Classification and Evaluation of Clustering Capabilities Over Real Datasets |
title_full | Analysis of Few-Shot Techniques for Fungal Plant Disease Classification and Evaluation of Clustering Capabilities Over Real Datasets |
title_fullStr | Analysis of Few-Shot Techniques for Fungal Plant Disease Classification and Evaluation of Clustering Capabilities Over Real Datasets |
title_full_unstemmed | Analysis of Few-Shot Techniques for Fungal Plant Disease Classification and Evaluation of Clustering Capabilities Over Real Datasets |
title_short | Analysis of Few-Shot Techniques for Fungal Plant Disease Classification and Evaluation of Clustering Capabilities Over Real Datasets |
title_sort | analysis of few-shot techniques for fungal plant disease classification and evaluation of clustering capabilities over real datasets |
topic | Plant Science |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8959904/ https://www.ncbi.nlm.nih.gov/pubmed/35356111 http://dx.doi.org/10.3389/fpls.2022.813237 |
work_keys_str_mv | AT egusquizaitziar analysisoffewshottechniquesforfungalplantdiseaseclassificationandevaluationofclusteringcapabilitiesoverrealdatasets AT piconartzai analysisoffewshottechniquesforfungalplantdiseaseclassificationandevaluationofclusteringcapabilitiesoverrealdatasets AT irustaunai analysisoffewshottechniquesforfungalplantdiseaseclassificationandevaluationofclusteringcapabilitiesoverrealdatasets AT bereciartuaperezarantza analysisoffewshottechniquesforfungalplantdiseaseclassificationandevaluationofclusteringcapabilitiesoverrealdatasets AT eggerstill analysisoffewshottechniquesforfungalplantdiseaseclassificationandevaluationofclusteringcapabilitiesoverrealdatasets AT klukaschristian analysisoffewshottechniquesforfungalplantdiseaseclassificationandevaluationofclusteringcapabilitiesoverrealdatasets AT aramendielisabete analysisoffewshottechniquesforfungalplantdiseaseclassificationandevaluationofclusteringcapabilitiesoverrealdatasets AT navarramestreramon analysisoffewshottechniquesforfungalplantdiseaseclassificationandevaluationofclusteringcapabilitiesoverrealdatasets |