Cargando…
Predicting sensitivity of recently harvested tomatoes and tomato sepals to future fungal infections
Tomato is an important commercial product which is perishable by nature and highly susceptible to fungal incidence once it is harvested. Not all tomatoes are equally vulnerable to pathogenic fungi, and an early detection of the vulnerable ones can help in taking timely preventive actions, ranging fr...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8633320/ https://www.ncbi.nlm.nih.gov/pubmed/34848748 http://dx.doi.org/10.1038/s41598-021-02302-2 |
_version_ | 1784607903574917120 |
---|---|
author | Brdar, Sanja Panić, Marko Hogeveen-van Echtelt, Esther Mensink, Manon Grbović, Željana Woltering, Ernst Chauhan, Aneesh |
author_facet | Brdar, Sanja Panić, Marko Hogeveen-van Echtelt, Esther Mensink, Manon Grbović, Željana Woltering, Ernst Chauhan, Aneesh |
author_sort | Brdar, Sanja |
collection | PubMed |
description | Tomato is an important commercial product which is perishable by nature and highly susceptible to fungal incidence once it is harvested. Not all tomatoes are equally vulnerable to pathogenic fungi, and an early detection of the vulnerable ones can help in taking timely preventive actions, ranging from isolating tomato batches to adjusting storage conditions, but also in making right business decisions like dynamic pricing based on quality or better shelf life estimate. More importantly, early detection of vulnerable produce can help in taking timely actions to minimize potential post-harvest losses. This paper investigates Near-infrared (NIR) hyperspectral imaging (1000–1700 nm) and machine learning to build models to automatically predict the susceptibility of sepals of recently harvested tomatoes to future fungal infections. Hyperspectral images of newly harvested tomatoes (cultivar Brioso) from 5 different growers were acquired before the onset of any visible fungal infection. After imaging, the tomatoes were placed under controlled conditions suited for fungal germination and growth for a 4-day period, and then imaged using normal color cameras. All sepals in the color images were ranked for fungal severity using crowdsourcing, and the final severity of each sepal was fused using principal component analysis. A novel hyperspectral data processing pipeline is presented which was used to automatically segment the tomato sepals from spectral images with multiple tomatoes connected via a truss. The key modelling question addressed in this research is whether there is a correlation between the hyperspectral data captured at harvest and the fungal infection observed 4 days later. Using 10-fold and group k-fold cross-validation, XG-Boost and Random Forest based regression models were trained on the features derived from the hyperspectral data corresponding to each sepal in the training set and tested on hold out test set. The best model found a Pearson correlation of 0.837, showing that there is strong linear correlation between the NIR spectra and the future fungal severity of the sepal. The sepal specific predictions were aggregated to predict the susceptibility of individual tomatoes, and a correlation of 0.92 was found. Besides modelling, focus is also on model interpretation, particularly to understand which spectral features are most relevant to model prediction. Two approaches to model interpretation were explored, feature importance and SHAP (SHapley Additive exPlanations), resulting in similar conclusions that the NIR range between 1390–1420 nm contributes most to the model’s final decision. |
format | Online Article Text |
id | pubmed-8633320 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-86333202021-12-03 Predicting sensitivity of recently harvested tomatoes and tomato sepals to future fungal infections Brdar, Sanja Panić, Marko Hogeveen-van Echtelt, Esther Mensink, Manon Grbović, Željana Woltering, Ernst Chauhan, Aneesh Sci Rep Article Tomato is an important commercial product which is perishable by nature and highly susceptible to fungal incidence once it is harvested. Not all tomatoes are equally vulnerable to pathogenic fungi, and an early detection of the vulnerable ones can help in taking timely preventive actions, ranging from isolating tomato batches to adjusting storage conditions, but also in making right business decisions like dynamic pricing based on quality or better shelf life estimate. More importantly, early detection of vulnerable produce can help in taking timely actions to minimize potential post-harvest losses. This paper investigates Near-infrared (NIR) hyperspectral imaging (1000–1700 nm) and machine learning to build models to automatically predict the susceptibility of sepals of recently harvested tomatoes to future fungal infections. Hyperspectral images of newly harvested tomatoes (cultivar Brioso) from 5 different growers were acquired before the onset of any visible fungal infection. After imaging, the tomatoes were placed under controlled conditions suited for fungal germination and growth for a 4-day period, and then imaged using normal color cameras. All sepals in the color images were ranked for fungal severity using crowdsourcing, and the final severity of each sepal was fused using principal component analysis. A novel hyperspectral data processing pipeline is presented which was used to automatically segment the tomato sepals from spectral images with multiple tomatoes connected via a truss. The key modelling question addressed in this research is whether there is a correlation between the hyperspectral data captured at harvest and the fungal infection observed 4 days later. Using 10-fold and group k-fold cross-validation, XG-Boost and Random Forest based regression models were trained on the features derived from the hyperspectral data corresponding to each sepal in the training set and tested on hold out test set. The best model found a Pearson correlation of 0.837, showing that there is strong linear correlation between the NIR spectra and the future fungal severity of the sepal. The sepal specific predictions were aggregated to predict the susceptibility of individual tomatoes, and a correlation of 0.92 was found. Besides modelling, focus is also on model interpretation, particularly to understand which spectral features are most relevant to model prediction. Two approaches to model interpretation were explored, feature importance and SHAP (SHapley Additive exPlanations), resulting in similar conclusions that the NIR range between 1390–1420 nm contributes most to the model’s final decision. Nature Publishing Group UK 2021-11-30 /pmc/articles/PMC8633320/ /pubmed/34848748 http://dx.doi.org/10.1038/s41598-021-02302-2 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Brdar, Sanja Panić, Marko Hogeveen-van Echtelt, Esther Mensink, Manon Grbović, Željana Woltering, Ernst Chauhan, Aneesh Predicting sensitivity of recently harvested tomatoes and tomato sepals to future fungal infections |
title | Predicting sensitivity of recently harvested tomatoes and tomato sepals to future fungal infections |
title_full | Predicting sensitivity of recently harvested tomatoes and tomato sepals to future fungal infections |
title_fullStr | Predicting sensitivity of recently harvested tomatoes and tomato sepals to future fungal infections |
title_full_unstemmed | Predicting sensitivity of recently harvested tomatoes and tomato sepals to future fungal infections |
title_short | Predicting sensitivity of recently harvested tomatoes and tomato sepals to future fungal infections |
title_sort | predicting sensitivity of recently harvested tomatoes and tomato sepals to future fungal infections |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8633320/ https://www.ncbi.nlm.nih.gov/pubmed/34848748 http://dx.doi.org/10.1038/s41598-021-02302-2 |
work_keys_str_mv | AT brdarsanja predictingsensitivityofrecentlyharvestedtomatoesandtomatosepalstofuturefungalinfections AT panicmarko predictingsensitivityofrecentlyharvestedtomatoesandtomatosepalstofuturefungalinfections AT hogeveenvanechteltesther predictingsensitivityofrecentlyharvestedtomatoesandtomatosepalstofuturefungalinfections AT mensinkmanon predictingsensitivityofrecentlyharvestedtomatoesandtomatosepalstofuturefungalinfections AT grboviczeljana predictingsensitivityofrecentlyharvestedtomatoesandtomatosepalstofuturefungalinfections AT wolteringernst predictingsensitivityofrecentlyharvestedtomatoesandtomatosepalstofuturefungalinfections AT chauhananeesh predictingsensitivityofrecentlyharvestedtomatoesandtomatosepalstofuturefungalinfections |