Cargando…

Gradient boosting machine learning model to predict aflatoxins in Iowa corn

INTRODUCTION: Aflatoxin (AFL), a secondary metabolite produced from filamentous fungi, contaminates corn, posing significant health and safety hazards for humans and livestock through toxigenic and carcinogenic effects. Corn is widely used as an essential commodity for food, feed, fuel, and export m...

Descripción completa

Detalles Bibliográficos
Autores principales: Branstad-Spates, Emily H., Castano-Duque, Lina, Mosher, Gretchen A., Hurburgh, Charles R., Owens, Phillip, Winzeler, Edwin, Rajasekaran, Kanniah, Bowers, Erin L.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10502509/
https://www.ncbi.nlm.nih.gov/pubmed/37720139
http://dx.doi.org/10.3389/fmicb.2023.1248772
_version_ 1785106337826340864
author Branstad-Spates, Emily H.
Castano-Duque, Lina
Mosher, Gretchen A.
Hurburgh, Charles R.
Owens, Phillip
Winzeler, Edwin
Rajasekaran, Kanniah
Bowers, Erin L.
author_facet Branstad-Spates, Emily H.
Castano-Duque, Lina
Mosher, Gretchen A.
Hurburgh, Charles R.
Owens, Phillip
Winzeler, Edwin
Rajasekaran, Kanniah
Bowers, Erin L.
author_sort Branstad-Spates, Emily H.
collection PubMed
description INTRODUCTION: Aflatoxin (AFL), a secondary metabolite produced from filamentous fungi, contaminates corn, posing significant health and safety hazards for humans and livestock through toxigenic and carcinogenic effects. Corn is widely used as an essential commodity for food, feed, fuel, and export markets; therefore, AFL mitigation is necessary to ensure food and feed safety within the United States (US) and elsewhere in the world. In this case study, an Iowa-centric model was developed to predict AFL contamination using historical corn contamination, meteorological, satellite, and soil property data in the largest corn-producing state in the US. METHODS: We evaluated the performance of AFL prediction with gradient boosting machine (GBM) learning and feature engineering in Iowa corn for two AFL risk thresholds for high contamination events: 20-ppb and 5-ppb. A 90%–10% training-to-testing ratio was utilized in 2010, 2011, 2012, and 2021 (n = 630), with independent validation using the year 2020 (n = 376). RESULTS: The GBM model had an overall accuracy of 96.77% for AFL with a balanced accuracy of 50.00% for a 20-ppb risk threshold, whereas GBM had an overall accuracy of 90.32% with a balanced accuracy of 64.88% for a 5-ppb threshold. The GBM model had a low power to detect high AFL contamination events, resulting in a low sensitivity rate. Analyses for AFL showed satellite-acquired vegetative index during August significantly improved the prediction of corn contamination at the end of the growing season for both risk thresholds. Prediction of high AFL contamination levels was linked to aflatoxin risk indices (ARI) in May. However, ARI in July was an influential factor for the 5-ppb threshold but not for the 20-ppb threshold. Similarly, latitude was an influential factor for the 20-ppb threshold but not the 5-ppb threshold. Furthermore, soil-saturated hydraulic conductivity (Ksat) influenced both risk thresholds. DISCUSSION: Developing these AFL prediction models is practical and implementable in commodity grain handling environments to achieve the goal of preventative rather than reactive mitigations. Finding predictors that influence AFL risk annually is an important cost-effective risk tool and, therefore, is a high priority to ensure hazard management and optimal grain utilization to maximize the utility of the nation’s corn crop.
format Online
Article
Text
id pubmed-10502509
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-105025092023-09-16 Gradient boosting machine learning model to predict aflatoxins in Iowa corn Branstad-Spates, Emily H. Castano-Duque, Lina Mosher, Gretchen A. Hurburgh, Charles R. Owens, Phillip Winzeler, Edwin Rajasekaran, Kanniah Bowers, Erin L. Front Microbiol Microbiology INTRODUCTION: Aflatoxin (AFL), a secondary metabolite produced from filamentous fungi, contaminates corn, posing significant health and safety hazards for humans and livestock through toxigenic and carcinogenic effects. Corn is widely used as an essential commodity for food, feed, fuel, and export markets; therefore, AFL mitigation is necessary to ensure food and feed safety within the United States (US) and elsewhere in the world. In this case study, an Iowa-centric model was developed to predict AFL contamination using historical corn contamination, meteorological, satellite, and soil property data in the largest corn-producing state in the US. METHODS: We evaluated the performance of AFL prediction with gradient boosting machine (GBM) learning and feature engineering in Iowa corn for two AFL risk thresholds for high contamination events: 20-ppb and 5-ppb. A 90%–10% training-to-testing ratio was utilized in 2010, 2011, 2012, and 2021 (n = 630), with independent validation using the year 2020 (n = 376). RESULTS: The GBM model had an overall accuracy of 96.77% for AFL with a balanced accuracy of 50.00% for a 20-ppb risk threshold, whereas GBM had an overall accuracy of 90.32% with a balanced accuracy of 64.88% for a 5-ppb threshold. The GBM model had a low power to detect high AFL contamination events, resulting in a low sensitivity rate. Analyses for AFL showed satellite-acquired vegetative index during August significantly improved the prediction of corn contamination at the end of the growing season for both risk thresholds. Prediction of high AFL contamination levels was linked to aflatoxin risk indices (ARI) in May. However, ARI in July was an influential factor for the 5-ppb threshold but not for the 20-ppb threshold. Similarly, latitude was an influential factor for the 20-ppb threshold but not the 5-ppb threshold. Furthermore, soil-saturated hydraulic conductivity (Ksat) influenced both risk thresholds. DISCUSSION: Developing these AFL prediction models is practical and implementable in commodity grain handling environments to achieve the goal of preventative rather than reactive mitigations. Finding predictors that influence AFL risk annually is an important cost-effective risk tool and, therefore, is a high priority to ensure hazard management and optimal grain utilization to maximize the utility of the nation’s corn crop. Frontiers Media S.A. 2023-09-01 /pmc/articles/PMC10502509/ /pubmed/37720139 http://dx.doi.org/10.3389/fmicb.2023.1248772 Text en Copyright © 2023 Branstad-Spates, Castano-Duque, Mosher, Hurburgh, Owens, Winzeler, Rajasekaran and Bowers. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Microbiology
Branstad-Spates, Emily H.
Castano-Duque, Lina
Mosher, Gretchen A.
Hurburgh, Charles R.
Owens, Phillip
Winzeler, Edwin
Rajasekaran, Kanniah
Bowers, Erin L.
Gradient boosting machine learning model to predict aflatoxins in Iowa corn
title Gradient boosting machine learning model to predict aflatoxins in Iowa corn
title_full Gradient boosting machine learning model to predict aflatoxins in Iowa corn
title_fullStr Gradient boosting machine learning model to predict aflatoxins in Iowa corn
title_full_unstemmed Gradient boosting machine learning model to predict aflatoxins in Iowa corn
title_short Gradient boosting machine learning model to predict aflatoxins in Iowa corn
title_sort gradient boosting machine learning model to predict aflatoxins in iowa corn
topic Microbiology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10502509/
https://www.ncbi.nlm.nih.gov/pubmed/37720139
http://dx.doi.org/10.3389/fmicb.2023.1248772
work_keys_str_mv AT branstadspatesemilyh gradientboostingmachinelearningmodeltopredictaflatoxinsiniowacorn
AT castanoduquelina gradientboostingmachinelearningmodeltopredictaflatoxinsiniowacorn
AT moshergretchena gradientboostingmachinelearningmodeltopredictaflatoxinsiniowacorn
AT hurburghcharlesr gradientboostingmachinelearningmodeltopredictaflatoxinsiniowacorn
AT owensphillip gradientboostingmachinelearningmodeltopredictaflatoxinsiniowacorn
AT winzeleredwin gradientboostingmachinelearningmodeltopredictaflatoxinsiniowacorn
AT rajasekarankanniah gradientboostingmachinelearningmodeltopredictaflatoxinsiniowacorn
AT bowerserinl gradientboostingmachinelearningmodeltopredictaflatoxinsiniowacorn