Cargando…

Analysis of Few-Shot Techniques for Fungal Plant Disease Classification and Evaluation of Clustering Capabilities Over Real Datasets

Plant fungal diseases are one of the most important causes of crop yield losses. Therefore, plant disease identification algorithms have been seen as a useful tool to detect them at early stages to mitigate their effects. Although deep-learning based algorithms can achieve high detection accuracies,...

Descripción completa

Detalles Bibliográficos
Autores principales: Egusquiza, Itziar, Picon, Artzai, Irusta, Unai, Bereciartua-Perez, Arantza, Eggers, Till, Klukas, Christian, Aramendi, Elisabete, Navarra-Mestre, Ramon
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8959904/
https://www.ncbi.nlm.nih.gov/pubmed/35356111
http://dx.doi.org/10.3389/fpls.2022.813237
_version_ 1784677266306891776
author Egusquiza, Itziar
Picon, Artzai
Irusta, Unai
Bereciartua-Perez, Arantza
Eggers, Till
Klukas, Christian
Aramendi, Elisabete
Navarra-Mestre, Ramon
author_facet Egusquiza, Itziar
Picon, Artzai
Irusta, Unai
Bereciartua-Perez, Arantza
Eggers, Till
Klukas, Christian
Aramendi, Elisabete
Navarra-Mestre, Ramon
author_sort Egusquiza, Itziar
collection PubMed
description Plant fungal diseases are one of the most important causes of crop yield losses. Therefore, plant disease identification algorithms have been seen as a useful tool to detect them at early stages to mitigate their effects. Although deep-learning based algorithms can achieve high detection accuracies, they require large and manually annotated image datasets that is not always accessible, specially for rare and new diseases. This study focuses on the development of a plant disease detection algorithm and strategy requiring few plant images (Few-shot learning algorithm). We extend previous work by using a novel challenging dataset containing more than 100,000 images. This dataset includes images of leaves, panicles and stems of five different crops (barley, corn, rape seed, rice, and wheat) for a total of 17 different diseases, where each disease is shown at different disease stages. In this study, we propose a deep metric learning based method to extract latent space representations from plant diseases with just few images by means of a Siamese network and triplet loss function. This enhances previous methods that require a support dataset containing a high number of annotated images to perform metric learning and few-shot classification. The proposed method was compared over a traditional network that was trained with the cross-entropy loss function. Exhaustive experiments have been performed for validating and measuring the benefits of metric learning techniques over classical methods. Results show that the features extracted by the metric learning based approach present better discriminative and clustering properties. Davis-Bouldin index and Silhouette score values have shown that triplet loss network improves the clustering properties with respect to the categorical-cross entropy loss. Overall, triplet loss approach improves the DB index value by 22.7% and Silhouette score value by 166.7% compared to the categorical cross-entropy loss model. Moreover, the F-score parameter obtained from the Siamese network with the triplet loss performs better than classical approaches when there are few images for training, obtaining a 6% improvement in the F-score mean value. Siamese networks with triplet loss have improved the ability to learn different plant diseases using few images of each class. These networks based on metric learning techniques improve clustering and classification results over traditional categorical cross-entropy loss networks for plant disease identification.
format Online
Article
Text
id pubmed-8959904
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-89599042022-03-29 Analysis of Few-Shot Techniques for Fungal Plant Disease Classification and Evaluation of Clustering Capabilities Over Real Datasets Egusquiza, Itziar Picon, Artzai Irusta, Unai Bereciartua-Perez, Arantza Eggers, Till Klukas, Christian Aramendi, Elisabete Navarra-Mestre, Ramon Front Plant Sci Plant Science Plant fungal diseases are one of the most important causes of crop yield losses. Therefore, plant disease identification algorithms have been seen as a useful tool to detect them at early stages to mitigate their effects. Although deep-learning based algorithms can achieve high detection accuracies, they require large and manually annotated image datasets that is not always accessible, specially for rare and new diseases. This study focuses on the development of a plant disease detection algorithm and strategy requiring few plant images (Few-shot learning algorithm). We extend previous work by using a novel challenging dataset containing more than 100,000 images. This dataset includes images of leaves, panicles and stems of five different crops (barley, corn, rape seed, rice, and wheat) for a total of 17 different diseases, where each disease is shown at different disease stages. In this study, we propose a deep metric learning based method to extract latent space representations from plant diseases with just few images by means of a Siamese network and triplet loss function. This enhances previous methods that require a support dataset containing a high number of annotated images to perform metric learning and few-shot classification. The proposed method was compared over a traditional network that was trained with the cross-entropy loss function. Exhaustive experiments have been performed for validating and measuring the benefits of metric learning techniques over classical methods. Results show that the features extracted by the metric learning based approach present better discriminative and clustering properties. Davis-Bouldin index and Silhouette score values have shown that triplet loss network improves the clustering properties with respect to the categorical-cross entropy loss. Overall, triplet loss approach improves the DB index value by 22.7% and Silhouette score value by 166.7% compared to the categorical cross-entropy loss model. Moreover, the F-score parameter obtained from the Siamese network with the triplet loss performs better than classical approaches when there are few images for training, obtaining a 6% improvement in the F-score mean value. Siamese networks with triplet loss have improved the ability to learn different plant diseases using few images of each class. These networks based on metric learning techniques improve clustering and classification results over traditional categorical cross-entropy loss networks for plant disease identification. Frontiers Media S.A. 2022-03-07 /pmc/articles/PMC8959904/ /pubmed/35356111 http://dx.doi.org/10.3389/fpls.2022.813237 Text en Copyright © 2022 Egusquiza, Picon, Irusta, Bereciartua-Perez, Eggers, Klukas, Aramendi and Navarra-Mestre. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Plant Science
Egusquiza, Itziar
Picon, Artzai
Irusta, Unai
Bereciartua-Perez, Arantza
Eggers, Till
Klukas, Christian
Aramendi, Elisabete
Navarra-Mestre, Ramon
Analysis of Few-Shot Techniques for Fungal Plant Disease Classification and Evaluation of Clustering Capabilities Over Real Datasets
title Analysis of Few-Shot Techniques for Fungal Plant Disease Classification and Evaluation of Clustering Capabilities Over Real Datasets
title_full Analysis of Few-Shot Techniques for Fungal Plant Disease Classification and Evaluation of Clustering Capabilities Over Real Datasets
title_fullStr Analysis of Few-Shot Techniques for Fungal Plant Disease Classification and Evaluation of Clustering Capabilities Over Real Datasets
title_full_unstemmed Analysis of Few-Shot Techniques for Fungal Plant Disease Classification and Evaluation of Clustering Capabilities Over Real Datasets
title_short Analysis of Few-Shot Techniques for Fungal Plant Disease Classification and Evaluation of Clustering Capabilities Over Real Datasets
title_sort analysis of few-shot techniques for fungal plant disease classification and evaluation of clustering capabilities over real datasets
topic Plant Science
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8959904/
https://www.ncbi.nlm.nih.gov/pubmed/35356111
http://dx.doi.org/10.3389/fpls.2022.813237
work_keys_str_mv AT egusquizaitziar analysisoffewshottechniquesforfungalplantdiseaseclassificationandevaluationofclusteringcapabilitiesoverrealdatasets
AT piconartzai analysisoffewshottechniquesforfungalplantdiseaseclassificationandevaluationofclusteringcapabilitiesoverrealdatasets
AT irustaunai analysisoffewshottechniquesforfungalplantdiseaseclassificationandevaluationofclusteringcapabilitiesoverrealdatasets
AT bereciartuaperezarantza analysisoffewshottechniquesforfungalplantdiseaseclassificationandevaluationofclusteringcapabilitiesoverrealdatasets
AT eggerstill analysisoffewshottechniquesforfungalplantdiseaseclassificationandevaluationofclusteringcapabilitiesoverrealdatasets
AT klukaschristian analysisoffewshottechniquesforfungalplantdiseaseclassificationandevaluationofclusteringcapabilitiesoverrealdatasets
AT aramendielisabete analysisoffewshottechniquesforfungalplantdiseaseclassificationandevaluationofclusteringcapabilitiesoverrealdatasets
AT navarramestreramon analysisoffewshottechniquesforfungalplantdiseaseclassificationandevaluationofclusteringcapabilitiesoverrealdatasets