Cargando…

Application of machine learning and genetic optimization algorithms for modeling and optimizing soybean yield using its component traits

Improving genetic yield potential in major food grade crops such as soybean (Glycine max L.) is the most sustainable way to address the growing global food demand and its security concerns. Yield is a complex trait and reliant on various related variables called yield components. In this study, the...

Descripción completa

Detalles Bibliográficos
Autores principales:	Yoosefzadeh-Najafabadi, Mohsen, Tulpan, Dan, Eskandari, Milad
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Public Library of Science 2021
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8087002/ https://www.ncbi.nlm.nih.gov/pubmed/33930039 http://dx.doi.org/10.1371/journal.pone.0250665

_version_	1783686597386960896
author	Yoosefzadeh-Najafabadi, Mohsen Tulpan, Dan Eskandari, Milad
author_facet	Yoosefzadeh-Najafabadi, Mohsen Tulpan, Dan Eskandari, Milad
author_sort	Yoosefzadeh-Najafabadi, Mohsen
collection	PubMed
description	Improving genetic yield potential in major food grade crops such as soybean (Glycine max L.) is the most sustainable way to address the growing global food demand and its security concerns. Yield is a complex trait and reliant on various related variables called yield components. In this study, the five most important yield component traits in soybean were measured using a panel of 250 genotypes grown in four environments. These traits were the number of nodes per plant (NP), number of non-reproductive nodes per plant (NRNP), number of reproductive nodes per plant (RNP), number of pods per plant (PP), and the ratio of number of pods to number of nodes per plant (P/N). These data were used for predicting the total soybean seed yield using the Multilayer Perceptron (MLP), Radial Basis Function (RBF), and Random Forest (RF), machine learning (ML) algorithms, individually and collectively through an ensemble method based on bagging strategy (E-B). The RBF algorithm with highest Coefficient of Determination (R(2)) value of 0.81 and the lowest Mean Absolute Errors (MAE) and Root Mean Square Error (RMSE) values of 148.61 kg.ha(-1), and 185.31 kg.ha(-1), respectively, was the most accurate algorithm and, therefore, selected as the metaClassifier for the E-B algorithm. Using the E-B algorithm, we were able to increase the prediction accuracy by improving the values of R(2), MAE, and RMSE by 0.1, 0.24 kg.ha(-1), and 0.96 kg.ha(-1), respectively. Furthermore, for the first time in this study, we allied the E-B with the genetic algorithm (GA) to model the optimum values of yield components in an ideotype genotype in which the yield is maximized. The results revealed a better understanding of the relationships between soybean yield and its components, which can be used for selecting parental lines and designing promising crosses for developing cultivars with improved genetic yield potential.
format	Online Article Text
id	pubmed-8087002
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	Public Library of Science
record_format	MEDLINE/PubMed
spelling	pubmed-80870022021-05-06 Application of machine learning and genetic optimization algorithms for modeling and optimizing soybean yield using its component traits Yoosefzadeh-Najafabadi, Mohsen Tulpan, Dan Eskandari, Milad PLoS One Research Article Improving genetic yield potential in major food grade crops such as soybean (Glycine max L.) is the most sustainable way to address the growing global food demand and its security concerns. Yield is a complex trait and reliant on various related variables called yield components. In this study, the five most important yield component traits in soybean were measured using a panel of 250 genotypes grown in four environments. These traits were the number of nodes per plant (NP), number of non-reproductive nodes per plant (NRNP), number of reproductive nodes per plant (RNP), number of pods per plant (PP), and the ratio of number of pods to number of nodes per plant (P/N). These data were used for predicting the total soybean seed yield using the Multilayer Perceptron (MLP), Radial Basis Function (RBF), and Random Forest (RF), machine learning (ML) algorithms, individually and collectively through an ensemble method based on bagging strategy (E-B). The RBF algorithm with highest Coefficient of Determination (R(2)) value of 0.81 and the lowest Mean Absolute Errors (MAE) and Root Mean Square Error (RMSE) values of 148.61 kg.ha(-1), and 185.31 kg.ha(-1), respectively, was the most accurate algorithm and, therefore, selected as the metaClassifier for the E-B algorithm. Using the E-B algorithm, we were able to increase the prediction accuracy by improving the values of R(2), MAE, and RMSE by 0.1, 0.24 kg.ha(-1), and 0.96 kg.ha(-1), respectively. Furthermore, for the first time in this study, we allied the E-B with the genetic algorithm (GA) to model the optimum values of yield components in an ideotype genotype in which the yield is maximized. The results revealed a better understanding of the relationships between soybean yield and its components, which can be used for selecting parental lines and designing promising crosses for developing cultivars with improved genetic yield potential. Public Library of Science 2021-04-30 /pmc/articles/PMC8087002/ /pubmed/33930039 http://dx.doi.org/10.1371/journal.pone.0250665 Text en © 2021 Yoosefzadeh-Najafabadi et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle	Research Article Yoosefzadeh-Najafabadi, Mohsen Tulpan, Dan Eskandari, Milad Application of machine learning and genetic optimization algorithms for modeling and optimizing soybean yield using its component traits
title	Application of machine learning and genetic optimization algorithms for modeling and optimizing soybean yield using its component traits
title_full	Application of machine learning and genetic optimization algorithms for modeling and optimizing soybean yield using its component traits
title_fullStr	Application of machine learning and genetic optimization algorithms for modeling and optimizing soybean yield using its component traits
title_full_unstemmed	Application of machine learning and genetic optimization algorithms for modeling and optimizing soybean yield using its component traits
title_short	Application of machine learning and genetic optimization algorithms for modeling and optimizing soybean yield using its component traits
title_sort	application of machine learning and genetic optimization algorithms for modeling and optimizing soybean yield using its component traits
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8087002/ https://www.ncbi.nlm.nih.gov/pubmed/33930039 http://dx.doi.org/10.1371/journal.pone.0250665
work_keys_str_mv	AT yoosefzadehnajafabadimohsen applicationofmachinelearningandgeneticoptimizationalgorithmsformodelingandoptimizingsoybeanyieldusingitscomponenttraits AT tulpandan applicationofmachinelearningandgeneticoptimizationalgorithmsformodelingandoptimizingsoybeanyieldusingitscomponenttraits AT eskandarimilad applicationofmachinelearningandgeneticoptimizationalgorithmsformodelingandoptimizingsoybeanyieldusingitscomponenttraits

Application of machine learning and genetic optimization algorithms for modeling and optimizing soybean yield using its component traits

Ejemplares similares