Cargando…

Machine learning-assisted analysis for agronomic dataset of 49 Balangu (Lallemantia iberica L.) ecotypes from different regions of Iran

The Balangu (Lallemantia iberica) species have a high gastronomical impact in the Middle East and Balkan region. It is widely used in the local food industry, such as confectionery, edible oil, and protein food. In this study, 49 ecotypes were collected from different regions of Iran. 37 agronomic t...

Descripción completa

Detalles Bibliográficos
Autores principales: Shafagh-Kolvanagh, Jalil, Dehghanian, Hassan, Mohammadi-Nassab, Adel Dabbagh, Moghaddam, Mohammad, Raei, Yaegoob, Salmasi, Saeid Zehtab, Samimifar, Peyvand, Abdoli, Soheila, Gholizadeh-Khajeh, Behnam
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9649721/
https://www.ncbi.nlm.nih.gov/pubmed/36357455
http://dx.doi.org/10.1038/s41598-022-23335-1
_version_ 1784827858441469952
author Shafagh-Kolvanagh, Jalil
Dehghanian, Hassan
Mohammadi-Nassab, Adel Dabbagh
Moghaddam, Mohammad
Raei, Yaegoob
Salmasi, Saeid Zehtab
Samimifar, Peyvand
Abdoli, Soheila
Gholizadeh-Khajeh, Behnam
author_facet Shafagh-Kolvanagh, Jalil
Dehghanian, Hassan
Mohammadi-Nassab, Adel Dabbagh
Moghaddam, Mohammad
Raei, Yaegoob
Salmasi, Saeid Zehtab
Samimifar, Peyvand
Abdoli, Soheila
Gholizadeh-Khajeh, Behnam
author_sort Shafagh-Kolvanagh, Jalil
collection PubMed
description The Balangu (Lallemantia iberica) species have a high gastronomical impact in the Middle East and Balkan region. It is widely used in the local food industry, such as confectionery, edible oil, and protein food. In this study, 49 ecotypes were collected from different regions of Iran. 37 agronomic traits were measured during the growing season and at harvest time. To find the correlation between the grain yield per unit area, grain yield per single plant (GYSP), oil percent (OP), and protein percent (PP) with other measured traits, which these were utilized as the labels of different machine learning (ML) procedures including Linear Regression (LR), Support Vector Regression (SVR), Random Forest Regression (RFR), and Gradient Boosting Decision Tree Regression (GBDTR). It was observed that there is a linear relationship between the measured agronomic traits and the considered labels. So, the LR, RFR, and GBDTR models showed the lowest mean absolute error, mean square error, and root mean square error than SVR models and good prediction ability of the test data. Although, the RFR and GBDTR have naturally lower bias than other methods in this study, but the GBDTR scheme is preferred because of the over-fitting shortcoming of the RFR technique. The GBDTR method showed better results rather than the other ML regression methods according to the RMSE 3.302, 0.040, 0.028, and 0.060 for GYUA, GYSP, OP, and PP, respectively.
format Online
Article
Text
id pubmed-9649721
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-96497212022-11-15 Machine learning-assisted analysis for agronomic dataset of 49 Balangu (Lallemantia iberica L.) ecotypes from different regions of Iran Shafagh-Kolvanagh, Jalil Dehghanian, Hassan Mohammadi-Nassab, Adel Dabbagh Moghaddam, Mohammad Raei, Yaegoob Salmasi, Saeid Zehtab Samimifar, Peyvand Abdoli, Soheila Gholizadeh-Khajeh, Behnam Sci Rep Article The Balangu (Lallemantia iberica) species have a high gastronomical impact in the Middle East and Balkan region. It is widely used in the local food industry, such as confectionery, edible oil, and protein food. In this study, 49 ecotypes were collected from different regions of Iran. 37 agronomic traits were measured during the growing season and at harvest time. To find the correlation between the grain yield per unit area, grain yield per single plant (GYSP), oil percent (OP), and protein percent (PP) with other measured traits, which these were utilized as the labels of different machine learning (ML) procedures including Linear Regression (LR), Support Vector Regression (SVR), Random Forest Regression (RFR), and Gradient Boosting Decision Tree Regression (GBDTR). It was observed that there is a linear relationship between the measured agronomic traits and the considered labels. So, the LR, RFR, and GBDTR models showed the lowest mean absolute error, mean square error, and root mean square error than SVR models and good prediction ability of the test data. Although, the RFR and GBDTR have naturally lower bias than other methods in this study, but the GBDTR scheme is preferred because of the over-fitting shortcoming of the RFR technique. The GBDTR method showed better results rather than the other ML regression methods according to the RMSE 3.302, 0.040, 0.028, and 0.060 for GYUA, GYSP, OP, and PP, respectively. Nature Publishing Group UK 2022-11-10 /pmc/articles/PMC9649721/ /pubmed/36357455 http://dx.doi.org/10.1038/s41598-022-23335-1 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Shafagh-Kolvanagh, Jalil
Dehghanian, Hassan
Mohammadi-Nassab, Adel Dabbagh
Moghaddam, Mohammad
Raei, Yaegoob
Salmasi, Saeid Zehtab
Samimifar, Peyvand
Abdoli, Soheila
Gholizadeh-Khajeh, Behnam
Machine learning-assisted analysis for agronomic dataset of 49 Balangu (Lallemantia iberica L.) ecotypes from different regions of Iran
title Machine learning-assisted analysis for agronomic dataset of 49 Balangu (Lallemantia iberica L.) ecotypes from different regions of Iran
title_full Machine learning-assisted analysis for agronomic dataset of 49 Balangu (Lallemantia iberica L.) ecotypes from different regions of Iran
title_fullStr Machine learning-assisted analysis for agronomic dataset of 49 Balangu (Lallemantia iberica L.) ecotypes from different regions of Iran
title_full_unstemmed Machine learning-assisted analysis for agronomic dataset of 49 Balangu (Lallemantia iberica L.) ecotypes from different regions of Iran
title_short Machine learning-assisted analysis for agronomic dataset of 49 Balangu (Lallemantia iberica L.) ecotypes from different regions of Iran
title_sort machine learning-assisted analysis for agronomic dataset of 49 balangu (lallemantia iberica l.) ecotypes from different regions of iran
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9649721/
https://www.ncbi.nlm.nih.gov/pubmed/36357455
http://dx.doi.org/10.1038/s41598-022-23335-1
work_keys_str_mv AT shafaghkolvanaghjalil machinelearningassistedanalysisforagronomicdatasetof49balangulallemantiaibericalecotypesfromdifferentregionsofiran
AT dehghanianhassan machinelearningassistedanalysisforagronomicdatasetof49balangulallemantiaibericalecotypesfromdifferentregionsofiran
AT mohammadinassabadeldabbagh machinelearningassistedanalysisforagronomicdatasetof49balangulallemantiaibericalecotypesfromdifferentregionsofiran
AT moghaddammohammad machinelearningassistedanalysisforagronomicdatasetof49balangulallemantiaibericalecotypesfromdifferentregionsofiran
AT raeiyaegoob machinelearningassistedanalysisforagronomicdatasetof49balangulallemantiaibericalecotypesfromdifferentregionsofiran
AT salmasisaeidzehtab machinelearningassistedanalysisforagronomicdatasetof49balangulallemantiaibericalecotypesfromdifferentregionsofiran
AT samimifarpeyvand machinelearningassistedanalysisforagronomicdatasetof49balangulallemantiaibericalecotypesfromdifferentregionsofiran
AT abdolisoheila machinelearningassistedanalysisforagronomicdatasetof49balangulallemantiaibericalecotypesfromdifferentregionsofiran
AT gholizadehkhajehbehnam machinelearningassistedanalysisforagronomicdatasetof49balangulallemantiaibericalecotypesfromdifferentregionsofiran