Cargando…
Can Satellites Predict Yield? Ensemble Machine Learning and Statistical Analysis of Sentinel-2 Imagery for Processing Tomato Yield Prediction
In this paper, we propose an innovative approach for robust prediction of processing tomato yield using open-source AutoML techniques and statistical analysis. Sentinel-2 satellite imagery was deployed to obtain values of five (5) selected vegetation indices (VIs) during the growing season of 2021 (...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10007672/ https://www.ncbi.nlm.nih.gov/pubmed/36904790 http://dx.doi.org/10.3390/s23052586 |
_version_ | 1784905581121765376 |
---|---|
author | Darra, Nicoleta Espejo-Garcia, Borja Kasimati, Aikaterini Kriezi, Olga Psomiadis, Emmanouil Fountas, Spyros |
author_facet | Darra, Nicoleta Espejo-Garcia, Borja Kasimati, Aikaterini Kriezi, Olga Psomiadis, Emmanouil Fountas, Spyros |
author_sort | Darra, Nicoleta |
collection | PubMed |
description | In this paper, we propose an innovative approach for robust prediction of processing tomato yield using open-source AutoML techniques and statistical analysis. Sentinel-2 satellite imagery was deployed to obtain values of five (5) selected vegetation indices (VIs) during the growing season of 2021 (April to September) at 5-day intervals. Actual recorded yields were collected across 108 fields, corresponding to a total area of 410.10 ha of processing tomato in central Greece, to assess the performance of Vis at different temporal scales. In addition, VIs were connected with the crop phenology to establish the annual dynamics of the crop. The highest Pearson coefficient (r) values occurred during a period of 80 to 90 days, indicating the strong relationship between the VIs and the yield. Specifically, RVI presented the highest correlation values of the growing season at 80 (r = 0.72) and 90 days (r = 0.75), while NDVI performed better at 85 days (r = 0.72). This output was confirmed by the AutoML technique, which also indicated the highest performance of the VIs during the same period, with the values of the adjusted R(2) ranging from 0.60 to 0.72. The most precise results were obtained with the combination of ARD regression and SVR, which was the most successful combination for building an ensemble (adj. R(2) = 0.67 ± 0.02). |
format | Online Article Text |
id | pubmed-10007672 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-100076722023-03-12 Can Satellites Predict Yield? Ensemble Machine Learning and Statistical Analysis of Sentinel-2 Imagery for Processing Tomato Yield Prediction Darra, Nicoleta Espejo-Garcia, Borja Kasimati, Aikaterini Kriezi, Olga Psomiadis, Emmanouil Fountas, Spyros Sensors (Basel) Article In this paper, we propose an innovative approach for robust prediction of processing tomato yield using open-source AutoML techniques and statistical analysis. Sentinel-2 satellite imagery was deployed to obtain values of five (5) selected vegetation indices (VIs) during the growing season of 2021 (April to September) at 5-day intervals. Actual recorded yields were collected across 108 fields, corresponding to a total area of 410.10 ha of processing tomato in central Greece, to assess the performance of Vis at different temporal scales. In addition, VIs were connected with the crop phenology to establish the annual dynamics of the crop. The highest Pearson coefficient (r) values occurred during a period of 80 to 90 days, indicating the strong relationship between the VIs and the yield. Specifically, RVI presented the highest correlation values of the growing season at 80 (r = 0.72) and 90 days (r = 0.75), while NDVI performed better at 85 days (r = 0.72). This output was confirmed by the AutoML technique, which also indicated the highest performance of the VIs during the same period, with the values of the adjusted R(2) ranging from 0.60 to 0.72. The most precise results were obtained with the combination of ARD regression and SVR, which was the most successful combination for building an ensemble (adj. R(2) = 0.67 ± 0.02). MDPI 2023-02-26 /pmc/articles/PMC10007672/ /pubmed/36904790 http://dx.doi.org/10.3390/s23052586 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Darra, Nicoleta Espejo-Garcia, Borja Kasimati, Aikaterini Kriezi, Olga Psomiadis, Emmanouil Fountas, Spyros Can Satellites Predict Yield? Ensemble Machine Learning and Statistical Analysis of Sentinel-2 Imagery for Processing Tomato Yield Prediction |
title | Can Satellites Predict Yield? Ensemble Machine Learning and Statistical Analysis of Sentinel-2 Imagery for Processing Tomato Yield Prediction |
title_full | Can Satellites Predict Yield? Ensemble Machine Learning and Statistical Analysis of Sentinel-2 Imagery for Processing Tomato Yield Prediction |
title_fullStr | Can Satellites Predict Yield? Ensemble Machine Learning and Statistical Analysis of Sentinel-2 Imagery for Processing Tomato Yield Prediction |
title_full_unstemmed | Can Satellites Predict Yield? Ensemble Machine Learning and Statistical Analysis of Sentinel-2 Imagery for Processing Tomato Yield Prediction |
title_short | Can Satellites Predict Yield? Ensemble Machine Learning and Statistical Analysis of Sentinel-2 Imagery for Processing Tomato Yield Prediction |
title_sort | can satellites predict yield? ensemble machine learning and statistical analysis of sentinel-2 imagery for processing tomato yield prediction |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10007672/ https://www.ncbi.nlm.nih.gov/pubmed/36904790 http://dx.doi.org/10.3390/s23052586 |
work_keys_str_mv | AT darranicoleta cansatellitespredictyieldensemblemachinelearningandstatisticalanalysisofsentinel2imageryforprocessingtomatoyieldprediction AT espejogarciaborja cansatellitespredictyieldensemblemachinelearningandstatisticalanalysisofsentinel2imageryforprocessingtomatoyieldprediction AT kasimatiaikaterini cansatellitespredictyieldensemblemachinelearningandstatisticalanalysisofsentinel2imageryforprocessingtomatoyieldprediction AT krieziolga cansatellitespredictyieldensemblemachinelearningandstatisticalanalysisofsentinel2imageryforprocessingtomatoyieldprediction AT psomiadisemmanouil cansatellitespredictyieldensemblemachinelearningandstatisticalanalysisofsentinel2imageryforprocessingtomatoyieldprediction AT fountasspyros cansatellitespredictyieldensemblemachinelearningandstatisticalanalysisofsentinel2imageryforprocessingtomatoyieldprediction |