Cargando…
A tree based eXtreme Gradient Boosting (XGBoost) machine learning model to forecast the annual rice production in Bangladesh
In this study, we attempt to anticipate annual rice production in Bangladesh (1961–2020) using both the Autoregressive Integrated Moving Average (ARIMA) and the eXtreme Gradient Boosting (XGBoost) methods and compare their respective performances. On the basis of the lowest Corrected Akaike Informat...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10042373/ https://www.ncbi.nlm.nih.gov/pubmed/36972270 http://dx.doi.org/10.1371/journal.pone.0283452 |
_version_ | 1784912920567611392 |
---|---|
author | Noorunnahar, Mst Chowdhury, Arman Hossain Mila, Farhana Arefeen |
author_facet | Noorunnahar, Mst Chowdhury, Arman Hossain Mila, Farhana Arefeen |
author_sort | Noorunnahar, Mst |
collection | PubMed |
description | In this study, we attempt to anticipate annual rice production in Bangladesh (1961–2020) using both the Autoregressive Integrated Moving Average (ARIMA) and the eXtreme Gradient Boosting (XGBoost) methods and compare their respective performances. On the basis of the lowest Corrected Akaike Information Criteria (AICc) values, a significant ARIMA (0, 1, 1) model with drift was chosen based on the findings. The drift parameter value shows that the production of rice positively trends upward. Thus, the ARIMA (0, 1, 1) model with drift was found to be significant. On the other hand, the XGBoost model for time series data was developed by changing the tunning parameters frequently with the greatest result. The four prominent error measures, such as mean absolute error (MAE), mean percentage error (MPE), root mean square error (RMSE), and mean absolute percentage error (MAPE), were used to assess the predictive performance of each model. We found that the error measures of the XGBoost model in the test set were comparatively lower than those of the ARIMA model. Comparatively, the MAPE value of the test set of the XGBoost model (5.38%) was lower than that of the ARIMA model (7.23%), indicating that XGBoost performs better than ARIMA at predicting the annual rice production in Bangladesh. Hence, the XGBoost model performs better than the ARIMA model in predicting the annual rice production in Bangladesh. Therefore, based on the better performance, the study forecasted the annual rice production for the next 10 years using the XGBoost model. According to our predictions, the annual rice production in Bangladesh will vary from 57,850,318 tons in 2021 to 82,256,944 tons in 2030. The forecast indicated that the amount of rice produced annually in Bangladesh will increase in the years to come. |
format | Online Article Text |
id | pubmed-10042373 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-100423732023-03-28 A tree based eXtreme Gradient Boosting (XGBoost) machine learning model to forecast the annual rice production in Bangladesh Noorunnahar, Mst Chowdhury, Arman Hossain Mila, Farhana Arefeen PLoS One Research Article In this study, we attempt to anticipate annual rice production in Bangladesh (1961–2020) using both the Autoregressive Integrated Moving Average (ARIMA) and the eXtreme Gradient Boosting (XGBoost) methods and compare their respective performances. On the basis of the lowest Corrected Akaike Information Criteria (AICc) values, a significant ARIMA (0, 1, 1) model with drift was chosen based on the findings. The drift parameter value shows that the production of rice positively trends upward. Thus, the ARIMA (0, 1, 1) model with drift was found to be significant. On the other hand, the XGBoost model for time series data was developed by changing the tunning parameters frequently with the greatest result. The four prominent error measures, such as mean absolute error (MAE), mean percentage error (MPE), root mean square error (RMSE), and mean absolute percentage error (MAPE), were used to assess the predictive performance of each model. We found that the error measures of the XGBoost model in the test set were comparatively lower than those of the ARIMA model. Comparatively, the MAPE value of the test set of the XGBoost model (5.38%) was lower than that of the ARIMA model (7.23%), indicating that XGBoost performs better than ARIMA at predicting the annual rice production in Bangladesh. Hence, the XGBoost model performs better than the ARIMA model in predicting the annual rice production in Bangladesh. Therefore, based on the better performance, the study forecasted the annual rice production for the next 10 years using the XGBoost model. According to our predictions, the annual rice production in Bangladesh will vary from 57,850,318 tons in 2021 to 82,256,944 tons in 2030. The forecast indicated that the amount of rice produced annually in Bangladesh will increase in the years to come. Public Library of Science 2023-03-27 /pmc/articles/PMC10042373/ /pubmed/36972270 http://dx.doi.org/10.1371/journal.pone.0283452 Text en © 2023 Noorunnahar et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Noorunnahar, Mst Chowdhury, Arman Hossain Mila, Farhana Arefeen A tree based eXtreme Gradient Boosting (XGBoost) machine learning model to forecast the annual rice production in Bangladesh |
title | A tree based eXtreme Gradient Boosting (XGBoost) machine learning model to forecast the annual rice production in Bangladesh |
title_full | A tree based eXtreme Gradient Boosting (XGBoost) machine learning model to forecast the annual rice production in Bangladesh |
title_fullStr | A tree based eXtreme Gradient Boosting (XGBoost) machine learning model to forecast the annual rice production in Bangladesh |
title_full_unstemmed | A tree based eXtreme Gradient Boosting (XGBoost) machine learning model to forecast the annual rice production in Bangladesh |
title_short | A tree based eXtreme Gradient Boosting (XGBoost) machine learning model to forecast the annual rice production in Bangladesh |
title_sort | tree based extreme gradient boosting (xgboost) machine learning model to forecast the annual rice production in bangladesh |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10042373/ https://www.ncbi.nlm.nih.gov/pubmed/36972270 http://dx.doi.org/10.1371/journal.pone.0283452 |
work_keys_str_mv | AT noorunnaharmst atreebasedextremegradientboostingxgboostmachinelearningmodeltoforecasttheannualriceproductioninbangladesh AT chowdhuryarmanhossain atreebasedextremegradientboostingxgboostmachinelearningmodeltoforecasttheannualriceproductioninbangladesh AT milafarhanaarefeen atreebasedextremegradientboostingxgboostmachinelearningmodeltoforecasttheannualriceproductioninbangladesh AT noorunnaharmst treebasedextremegradientboostingxgboostmachinelearningmodeltoforecasttheannualriceproductioninbangladesh AT chowdhuryarmanhossain treebasedextremegradientboostingxgboostmachinelearningmodeltoforecasttheannualriceproductioninbangladesh AT milafarhanaarefeen treebasedextremegradientboostingxgboostmachinelearningmodeltoforecasttheannualriceproductioninbangladesh |