Cargando…

Random Forests for Global and Regional Crop Yield Predictions

Accurate predictions of crop yield are critical for developing effective agricultural and food policies at the regional and global scales. We evaluated a machine-learning method, Random Forests (RF), for its ability to predict crop yield responses to climate and biophysical variables at global and r...

Descripción completa

Detalles Bibliográficos
Autores principales: Jeong, Jig Han, Resop, Jonathan P., Mueller, Nathaniel D., Fleisher, David H., Yun, Kyungdahm, Butler, Ethan E., Timlin, Dennis J., Shim, Kyo-Moon, Gerber, James S., Reddy, Vangimalla R., Kim, Soo-Hyung
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4892571/
https://www.ncbi.nlm.nih.gov/pubmed/27257967
http://dx.doi.org/10.1371/journal.pone.0156571
_version_ 1782435414734274560
author Jeong, Jig Han
Resop, Jonathan P.
Mueller, Nathaniel D.
Fleisher, David H.
Yun, Kyungdahm
Butler, Ethan E.
Timlin, Dennis J.
Shim, Kyo-Moon
Gerber, James S.
Reddy, Vangimalla R.
Kim, Soo-Hyung
author_facet Jeong, Jig Han
Resop, Jonathan P.
Mueller, Nathaniel D.
Fleisher, David H.
Yun, Kyungdahm
Butler, Ethan E.
Timlin, Dennis J.
Shim, Kyo-Moon
Gerber, James S.
Reddy, Vangimalla R.
Kim, Soo-Hyung
author_sort Jeong, Jig Han
collection PubMed
description Accurate predictions of crop yield are critical for developing effective agricultural and food policies at the regional and global scales. We evaluated a machine-learning method, Random Forests (RF), for its ability to predict crop yield responses to climate and biophysical variables at global and regional scales in wheat, maize, and potato in comparison with multiple linear regressions (MLR) serving as a benchmark. We used crop yield data from various sources and regions for model training and testing: 1) gridded global wheat grain yield, 2) maize grain yield from US counties over thirty years, and 3) potato tuber and maize silage yield from the northeastern seaboard region. RF was found highly capable of predicting crop yields and outperformed MLR benchmarks in all performance statistics that were compared. For example, the root mean square errors (RMSE) ranged between 6 and 14% of the average observed yield with RF models in all test cases whereas these values ranged from 14% to 49% for MLR models. Our results show that RF is an effective and versatile machine-learning method for crop yield predictions at regional and global scales for its high accuracy and precision, ease of use, and utility in data analysis. RF may result in a loss of accuracy when predicting the extreme ends or responses beyond the boundaries of the training data.
format Online
Article
Text
id pubmed-4892571
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-48925712016-06-16 Random Forests for Global and Regional Crop Yield Predictions Jeong, Jig Han Resop, Jonathan P. Mueller, Nathaniel D. Fleisher, David H. Yun, Kyungdahm Butler, Ethan E. Timlin, Dennis J. Shim, Kyo-Moon Gerber, James S. Reddy, Vangimalla R. Kim, Soo-Hyung PLoS One Research Article Accurate predictions of crop yield are critical for developing effective agricultural and food policies at the regional and global scales. We evaluated a machine-learning method, Random Forests (RF), for its ability to predict crop yield responses to climate and biophysical variables at global and regional scales in wheat, maize, and potato in comparison with multiple linear regressions (MLR) serving as a benchmark. We used crop yield data from various sources and regions for model training and testing: 1) gridded global wheat grain yield, 2) maize grain yield from US counties over thirty years, and 3) potato tuber and maize silage yield from the northeastern seaboard region. RF was found highly capable of predicting crop yields and outperformed MLR benchmarks in all performance statistics that were compared. For example, the root mean square errors (RMSE) ranged between 6 and 14% of the average observed yield with RF models in all test cases whereas these values ranged from 14% to 49% for MLR models. Our results show that RF is an effective and versatile machine-learning method for crop yield predictions at regional and global scales for its high accuracy and precision, ease of use, and utility in data analysis. RF may result in a loss of accuracy when predicting the extreme ends or responses beyond the boundaries of the training data. Public Library of Science 2016-06-03 /pmc/articles/PMC4892571/ /pubmed/27257967 http://dx.doi.org/10.1371/journal.pone.0156571 Text en https://creativecommons.org/publicdomain/zero/1.0/ This is an open access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 (https://creativecommons.org/publicdomain/zero/1.0/) public domain dedication.
spellingShingle Research Article
Jeong, Jig Han
Resop, Jonathan P.
Mueller, Nathaniel D.
Fleisher, David H.
Yun, Kyungdahm
Butler, Ethan E.
Timlin, Dennis J.
Shim, Kyo-Moon
Gerber, James S.
Reddy, Vangimalla R.
Kim, Soo-Hyung
Random Forests for Global and Regional Crop Yield Predictions
title Random Forests for Global and Regional Crop Yield Predictions
title_full Random Forests for Global and Regional Crop Yield Predictions
title_fullStr Random Forests for Global and Regional Crop Yield Predictions
title_full_unstemmed Random Forests for Global and Regional Crop Yield Predictions
title_short Random Forests for Global and Regional Crop Yield Predictions
title_sort random forests for global and regional crop yield predictions
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4892571/
https://www.ncbi.nlm.nih.gov/pubmed/27257967
http://dx.doi.org/10.1371/journal.pone.0156571
work_keys_str_mv AT jeongjighan randomforestsforglobalandregionalcropyieldpredictions
AT resopjonathanp randomforestsforglobalandregionalcropyieldpredictions
AT muellernathanield randomforestsforglobalandregionalcropyieldpredictions
AT fleisherdavidh randomforestsforglobalandregionalcropyieldpredictions
AT yunkyungdahm randomforestsforglobalandregionalcropyieldpredictions
AT butlerethane randomforestsforglobalandregionalcropyieldpredictions
AT timlindennisj randomforestsforglobalandregionalcropyieldpredictions
AT shimkyomoon randomforestsforglobalandregionalcropyieldpredictions
AT gerberjamess randomforestsforglobalandregionalcropyieldpredictions
AT reddyvangimallar randomforestsforglobalandregionalcropyieldpredictions
AT kimsoohyung randomforestsforglobalandregionalcropyieldpredictions