Cargando…

County-scale crop yield prediction by integrating crop simulation with machine learning models

Crop yield prediction is of great importance for decision making, yet it remains an ongoing scientific challenge. Interactions among different genetic, environmental, and management factors and uncertainty in input values are making crop yield prediction complex. Building upon a previous work in whi...

Descripción completa

Detalles Bibliográficos
Autores principales: Sajid, Saiara Samira, Shahhosseini, Mohsen, Huber, Isaiah, Hu, Guiping, Archontoulis, Sotirios V.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9742473/
https://www.ncbi.nlm.nih.gov/pubmed/36518505
http://dx.doi.org/10.3389/fpls.2022.1000224
_version_ 1784848527221850112
author Sajid, Saiara Samira
Shahhosseini, Mohsen
Huber, Isaiah
Hu, Guiping
Archontoulis, Sotirios V.
author_facet Sajid, Saiara Samira
Shahhosseini, Mohsen
Huber, Isaiah
Hu, Guiping
Archontoulis, Sotirios V.
author_sort Sajid, Saiara Samira
collection PubMed
description Crop yield prediction is of great importance for decision making, yet it remains an ongoing scientific challenge. Interactions among different genetic, environmental, and management factors and uncertainty in input values are making crop yield prediction complex. Building upon a previous work in which we coupled crop modeling with machine learning (ML) models to predict maize yields for three US Corn Belt states, here, we expand the concept to the entire US Corn Belt (12 states). More specifically, we built five new ML models and their ensemble models, considering the scenarios with and without crop modeling variables. Additional input values in our models are soil, weather, management, and historical yield data. A unique aspect of our work is the spatial analysis to investigate causes for low or high model prediction errors. Our results indicated that the prediction accuracy increases by coupling crop modeling with machine learning. The ensemble model overperformed the individual ML models, having a relative root mean square error (RRMSE) of about 9% for the test years (2018, 2019, and 2020), which is comparable to previous studies. In addition, analysis of the sources of error revealed that counties and crop reporting districts with low cropland ratios have high RRMSE. Furthermore, we found that soil input data and extreme weather events were responsible for high errors in some regions. The proposed models can be deployed for large-scale prediction at the county level and, contingent upon data availability, can be utilized for field level prediction.
format Online
Article
Text
id pubmed-9742473
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-97424732022-12-13 County-scale crop yield prediction by integrating crop simulation with machine learning models Sajid, Saiara Samira Shahhosseini, Mohsen Huber, Isaiah Hu, Guiping Archontoulis, Sotirios V. Front Plant Sci Plant Science Crop yield prediction is of great importance for decision making, yet it remains an ongoing scientific challenge. Interactions among different genetic, environmental, and management factors and uncertainty in input values are making crop yield prediction complex. Building upon a previous work in which we coupled crop modeling with machine learning (ML) models to predict maize yields for three US Corn Belt states, here, we expand the concept to the entire US Corn Belt (12 states). More specifically, we built five new ML models and their ensemble models, considering the scenarios with and without crop modeling variables. Additional input values in our models are soil, weather, management, and historical yield data. A unique aspect of our work is the spatial analysis to investigate causes for low or high model prediction errors. Our results indicated that the prediction accuracy increases by coupling crop modeling with machine learning. The ensemble model overperformed the individual ML models, having a relative root mean square error (RRMSE) of about 9% for the test years (2018, 2019, and 2020), which is comparable to previous studies. In addition, analysis of the sources of error revealed that counties and crop reporting districts with low cropland ratios have high RRMSE. Furthermore, we found that soil input data and extreme weather events were responsible for high errors in some regions. The proposed models can be deployed for large-scale prediction at the county level and, contingent upon data availability, can be utilized for field level prediction. Frontiers Media S.A. 2022-11-28 /pmc/articles/PMC9742473/ /pubmed/36518505 http://dx.doi.org/10.3389/fpls.2022.1000224 Text en Copyright © 2022 Sajid, Shahhosseini, Huber, Hu and Archontoulis https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Plant Science
Sajid, Saiara Samira
Shahhosseini, Mohsen
Huber, Isaiah
Hu, Guiping
Archontoulis, Sotirios V.
County-scale crop yield prediction by integrating crop simulation with machine learning models
title County-scale crop yield prediction by integrating crop simulation with machine learning models
title_full County-scale crop yield prediction by integrating crop simulation with machine learning models
title_fullStr County-scale crop yield prediction by integrating crop simulation with machine learning models
title_full_unstemmed County-scale crop yield prediction by integrating crop simulation with machine learning models
title_short County-scale crop yield prediction by integrating crop simulation with machine learning models
title_sort county-scale crop yield prediction by integrating crop simulation with machine learning models
topic Plant Science
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9742473/
https://www.ncbi.nlm.nih.gov/pubmed/36518505
http://dx.doi.org/10.3389/fpls.2022.1000224
work_keys_str_mv AT sajidsaiarasamira countyscalecropyieldpredictionbyintegratingcropsimulationwithmachinelearningmodels
AT shahhosseinimohsen countyscalecropyieldpredictionbyintegratingcropsimulationwithmachinelearningmodels
AT huberisaiah countyscalecropyieldpredictionbyintegratingcropsimulationwithmachinelearningmodels
AT huguiping countyscalecropyieldpredictionbyintegratingcropsimulationwithmachinelearningmodels
AT archontoulissotiriosv countyscalecropyieldpredictionbyintegratingcropsimulationwithmachinelearningmodels