Cargando…

Can Satellites Predict Yield? Ensemble Machine Learning and Statistical Analysis of Sentinel-2 Imagery for Processing Tomato Yield Prediction

In this paper, we propose an innovative approach for robust prediction of processing tomato yield using open-source AutoML techniques and statistical analysis. Sentinel-2 satellite imagery was deployed to obtain values of five (5) selected vegetation indices (VIs) during the growing season of 2021 (...

Descripción completa

Detalles Bibliográficos
Autores principales: Darra, Nicoleta, Espejo-Garcia, Borja, Kasimati, Aikaterini, Kriezi, Olga, Psomiadis, Emmanouil, Fountas, Spyros
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10007672/
https://www.ncbi.nlm.nih.gov/pubmed/36904790
http://dx.doi.org/10.3390/s23052586
_version_ 1784905581121765376
author Darra, Nicoleta
Espejo-Garcia, Borja
Kasimati, Aikaterini
Kriezi, Olga
Psomiadis, Emmanouil
Fountas, Spyros
author_facet Darra, Nicoleta
Espejo-Garcia, Borja
Kasimati, Aikaterini
Kriezi, Olga
Psomiadis, Emmanouil
Fountas, Spyros
author_sort Darra, Nicoleta
collection PubMed
description In this paper, we propose an innovative approach for robust prediction of processing tomato yield using open-source AutoML techniques and statistical analysis. Sentinel-2 satellite imagery was deployed to obtain values of five (5) selected vegetation indices (VIs) during the growing season of 2021 (April to September) at 5-day intervals. Actual recorded yields were collected across 108 fields, corresponding to a total area of 410.10 ha of processing tomato in central Greece, to assess the performance of Vis at different temporal scales. In addition, VIs were connected with the crop phenology to establish the annual dynamics of the crop. The highest Pearson coefficient (r) values occurred during a period of 80 to 90 days, indicating the strong relationship between the VIs and the yield. Specifically, RVI presented the highest correlation values of the growing season at 80 (r = 0.72) and 90 days (r = 0.75), while NDVI performed better at 85 days (r = 0.72). This output was confirmed by the AutoML technique, which also indicated the highest performance of the VIs during the same period, with the values of the adjusted R(2) ranging from 0.60 to 0.72. The most precise results were obtained with the combination of ARD regression and SVR, which was the most successful combination for building an ensemble (adj. R(2) = 0.67 ± 0.02).
format Online
Article
Text
id pubmed-10007672
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-100076722023-03-12 Can Satellites Predict Yield? Ensemble Machine Learning and Statistical Analysis of Sentinel-2 Imagery for Processing Tomato Yield Prediction Darra, Nicoleta Espejo-Garcia, Borja Kasimati, Aikaterini Kriezi, Olga Psomiadis, Emmanouil Fountas, Spyros Sensors (Basel) Article In this paper, we propose an innovative approach for robust prediction of processing tomato yield using open-source AutoML techniques and statistical analysis. Sentinel-2 satellite imagery was deployed to obtain values of five (5) selected vegetation indices (VIs) during the growing season of 2021 (April to September) at 5-day intervals. Actual recorded yields were collected across 108 fields, corresponding to a total area of 410.10 ha of processing tomato in central Greece, to assess the performance of Vis at different temporal scales. In addition, VIs were connected with the crop phenology to establish the annual dynamics of the crop. The highest Pearson coefficient (r) values occurred during a period of 80 to 90 days, indicating the strong relationship between the VIs and the yield. Specifically, RVI presented the highest correlation values of the growing season at 80 (r = 0.72) and 90 days (r = 0.75), while NDVI performed better at 85 days (r = 0.72). This output was confirmed by the AutoML technique, which also indicated the highest performance of the VIs during the same period, with the values of the adjusted R(2) ranging from 0.60 to 0.72. The most precise results were obtained with the combination of ARD regression and SVR, which was the most successful combination for building an ensemble (adj. R(2) = 0.67 ± 0.02). MDPI 2023-02-26 /pmc/articles/PMC10007672/ /pubmed/36904790 http://dx.doi.org/10.3390/s23052586 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Darra, Nicoleta
Espejo-Garcia, Borja
Kasimati, Aikaterini
Kriezi, Olga
Psomiadis, Emmanouil
Fountas, Spyros
Can Satellites Predict Yield? Ensemble Machine Learning and Statistical Analysis of Sentinel-2 Imagery for Processing Tomato Yield Prediction
title Can Satellites Predict Yield? Ensemble Machine Learning and Statistical Analysis of Sentinel-2 Imagery for Processing Tomato Yield Prediction
title_full Can Satellites Predict Yield? Ensemble Machine Learning and Statistical Analysis of Sentinel-2 Imagery for Processing Tomato Yield Prediction
title_fullStr Can Satellites Predict Yield? Ensemble Machine Learning and Statistical Analysis of Sentinel-2 Imagery for Processing Tomato Yield Prediction
title_full_unstemmed Can Satellites Predict Yield? Ensemble Machine Learning and Statistical Analysis of Sentinel-2 Imagery for Processing Tomato Yield Prediction
title_short Can Satellites Predict Yield? Ensemble Machine Learning and Statistical Analysis of Sentinel-2 Imagery for Processing Tomato Yield Prediction
title_sort can satellites predict yield? ensemble machine learning and statistical analysis of sentinel-2 imagery for processing tomato yield prediction
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10007672/
https://www.ncbi.nlm.nih.gov/pubmed/36904790
http://dx.doi.org/10.3390/s23052586
work_keys_str_mv AT darranicoleta cansatellitespredictyieldensemblemachinelearningandstatisticalanalysisofsentinel2imageryforprocessingtomatoyieldprediction
AT espejogarciaborja cansatellitespredictyieldensemblemachinelearningandstatisticalanalysisofsentinel2imageryforprocessingtomatoyieldprediction
AT kasimatiaikaterini cansatellitespredictyieldensemblemachinelearningandstatisticalanalysisofsentinel2imageryforprocessingtomatoyieldprediction
AT krieziolga cansatellitespredictyieldensemblemachinelearningandstatisticalanalysisofsentinel2imageryforprocessingtomatoyieldprediction
AT psomiadisemmanouil cansatellitespredictyieldensemblemachinelearningandstatisticalanalysisofsentinel2imageryforprocessingtomatoyieldprediction
AT fountasspyros cansatellitespredictyieldensemblemachinelearningandstatisticalanalysisofsentinel2imageryforprocessingtomatoyieldprediction