Cargando…

The Plant Pathology Challenge 2020 data set to classify foliar disease of apples

PREMISE: Apple orchards in the United States are under constant threat from a large number of pathogens and insects. Appropriate and timely deployment of disease management depends on early disease detection. Incorrect and delayed diagnosis can result in either excessive or inadequate use of chemica...

Descripción completa

Detalles Bibliográficos
Autores principales: Thapa, Ranjita, Zhang, Kai, Snavely, Noah, Belongie, Serge, Khan, Awais
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7526434/
https://www.ncbi.nlm.nih.gov/pubmed/33014634
http://dx.doi.org/10.1002/aps3.11390
_version_ 1783588873502195712
author Thapa, Ranjita
Zhang, Kai
Snavely, Noah
Belongie, Serge
Khan, Awais
author_facet Thapa, Ranjita
Zhang, Kai
Snavely, Noah
Belongie, Serge
Khan, Awais
author_sort Thapa, Ranjita
collection PubMed
description PREMISE: Apple orchards in the United States are under constant threat from a large number of pathogens and insects. Appropriate and timely deployment of disease management depends on early disease detection. Incorrect and delayed diagnosis can result in either excessive or inadequate use of chemicals, with increased production costs and increased environmental and health impacts. METHODS AND RESULTS: We have manually captured 3651 high‐quality, real‐life symptom images of multiple apple foliar diseases, with variable illumination, angles, surfaces, and noise. A subset of images, expert‐annotated to create a pilot data set for apple scab, cedar apple rust, and healthy leaves, was made available to the Kaggle community for the Plant Pathology Challenge as part of the Fine‐Grained Visual Categorization (FGVC) workshop at the 2020 Computer Vision and Pattern Recognition conference (CVPR 2020). Participants were asked to use the image data set to train a machine learning model to classify disease categories and develop an algorithm for disease severity quantification. The top three area under the ROC curve (AUC) values submitted to the private leaderboard were 0.98445, 0.98182, and 0.98089. We also trained an off‐the‐shelf convolutional neural network on this data for disease classification and achieved 97% accuracy on a held‐out test set. DISCUSSION: This data set will contribute toward development and deployment of machine learning–based automated plant disease classification algorithms to ultimately realize fast and accurate disease detection. We will continue to add images to the pilot data set for a larger, more comprehensive expert‐annotated data set for future Kaggle competitions and to explore more advanced methods for disease classification and quantification.
format Online
Article
Text
id pubmed-7526434
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-75264342020-10-02 The Plant Pathology Challenge 2020 data set to classify foliar disease of apples Thapa, Ranjita Zhang, Kai Snavely, Noah Belongie, Serge Khan, Awais Appl Plant Sci Application Articles PREMISE: Apple orchards in the United States are under constant threat from a large number of pathogens and insects. Appropriate and timely deployment of disease management depends on early disease detection. Incorrect and delayed diagnosis can result in either excessive or inadequate use of chemicals, with increased production costs and increased environmental and health impacts. METHODS AND RESULTS: We have manually captured 3651 high‐quality, real‐life symptom images of multiple apple foliar diseases, with variable illumination, angles, surfaces, and noise. A subset of images, expert‐annotated to create a pilot data set for apple scab, cedar apple rust, and healthy leaves, was made available to the Kaggle community for the Plant Pathology Challenge as part of the Fine‐Grained Visual Categorization (FGVC) workshop at the 2020 Computer Vision and Pattern Recognition conference (CVPR 2020). Participants were asked to use the image data set to train a machine learning model to classify disease categories and develop an algorithm for disease severity quantification. The top three area under the ROC curve (AUC) values submitted to the private leaderboard were 0.98445, 0.98182, and 0.98089. We also trained an off‐the‐shelf convolutional neural network on this data for disease classification and achieved 97% accuracy on a held‐out test set. DISCUSSION: This data set will contribute toward development and deployment of machine learning–based automated plant disease classification algorithms to ultimately realize fast and accurate disease detection. We will continue to add images to the pilot data set for a larger, more comprehensive expert‐annotated data set for future Kaggle competitions and to explore more advanced methods for disease classification and quantification. John Wiley and Sons Inc. 2020-09-28 /pmc/articles/PMC7526434/ /pubmed/33014634 http://dx.doi.org/10.1002/aps3.11390 Text en © 2020 Thapa et al. Applications in Plant Sciences published by Wiley Periodicals LLC on behalf of Botanical Society of America This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Application Articles
Thapa, Ranjita
Zhang, Kai
Snavely, Noah
Belongie, Serge
Khan, Awais
The Plant Pathology Challenge 2020 data set to classify foliar disease of apples
title The Plant Pathology Challenge 2020 data set to classify foliar disease of apples
title_full The Plant Pathology Challenge 2020 data set to classify foliar disease of apples
title_fullStr The Plant Pathology Challenge 2020 data set to classify foliar disease of apples
title_full_unstemmed The Plant Pathology Challenge 2020 data set to classify foliar disease of apples
title_short The Plant Pathology Challenge 2020 data set to classify foliar disease of apples
title_sort plant pathology challenge 2020 data set to classify foliar disease of apples
topic Application Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7526434/
https://www.ncbi.nlm.nih.gov/pubmed/33014634
http://dx.doi.org/10.1002/aps3.11390
work_keys_str_mv AT thaparanjita theplantpathologychallenge2020datasettoclassifyfoliardiseaseofapples
AT zhangkai theplantpathologychallenge2020datasettoclassifyfoliardiseaseofapples
AT snavelynoah theplantpathologychallenge2020datasettoclassifyfoliardiseaseofapples
AT belongieserge theplantpathologychallenge2020datasettoclassifyfoliardiseaseofapples
AT khanawais theplantpathologychallenge2020datasettoclassifyfoliardiseaseofapples
AT thaparanjita plantpathologychallenge2020datasettoclassifyfoliardiseaseofapples
AT zhangkai plantpathologychallenge2020datasettoclassifyfoliardiseaseofapples
AT snavelynoah plantpathologychallenge2020datasettoclassifyfoliardiseaseofapples
AT belongieserge plantpathologychallenge2020datasettoclassifyfoliardiseaseofapples
AT khanawais plantpathologychallenge2020datasettoclassifyfoliardiseaseofapples