Cargando…

Oil yield prediction for sunflower hybrid selection using different machine learning algorithms

Due to the increased demand for sunflower production, its breeding assignment is the intensification of the development of highly productive oil seed hybrids to satisfy the edible oil industry. Sunflower Oil Yield Prediction (SOYP) can help breeders to identify desirable new hybrids with high oil yi...

Descripción completa

Detalles Bibliográficos
Autores principales: Cvejić, Sandra, Hrnjaković, Olivera, Jocković, Milan, Kupusinac, Aleksandar, Doroslovački, Ksenija, Gvozdenac, Sonja, Jocić, Siniša, Miladinović, Dragana
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10582183/
https://www.ncbi.nlm.nih.gov/pubmed/37848668
http://dx.doi.org/10.1038/s41598-023-44999-3
_version_ 1785122274185052160
author Cvejić, Sandra
Hrnjaković, Olivera
Jocković, Milan
Kupusinac, Aleksandar
Doroslovački, Ksenija
Gvozdenac, Sonja
Jocić, Siniša
Miladinović, Dragana
author_facet Cvejić, Sandra
Hrnjaković, Olivera
Jocković, Milan
Kupusinac, Aleksandar
Doroslovački, Ksenija
Gvozdenac, Sonja
Jocić, Siniša
Miladinović, Dragana
author_sort Cvejić, Sandra
collection PubMed
description Due to the increased demand for sunflower production, its breeding assignment is the intensification of the development of highly productive oil seed hybrids to satisfy the edible oil industry. Sunflower Oil Yield Prediction (SOYP) can help breeders to identify desirable new hybrids with high oil yield and their characteristics using machine learning (ML) algorithms. In this study, we developed ML models to predict oil yield using two sets of features. Moreover, we evaluated the most relevant features for accurate SOYP. ML algorithms that were used and compared were Artificial Neural Network (ANN), Support Vector Regression, K-Nearest Neighbour, and Random Forest Regressor (RFR). The dataset consisted of samples for 1250 hybrids of which 70% were randomly selected and were used to train the model and 30% were used to test the model and assess its performance. Employing MAE, MSE, RMSE and R2 evaluation metrics, RFR consistently outperformed in all datasets, achieving a peak of 0.92 for R2 in 2019. In contrast, ANN recorded the lowest MAE, reaching 65 in 2018 The paper revealed that in addition to seed yield, the following characteristics of hybrids were important for SOYP: resistance to broomrape (Or) and downy mildew (Pl) and maturity. It was also disclosed that the locality feature could be used for the estimation of sunflower oil yield but it is highly dependable on weather conditions that affect the oil content and seed yield. Up to our knowledge, this is the first study in which ML was used for sunflower oil yield prediction. The obtained results indicate that ML has great potential for application in oil yield prediction, but also selection of parental lines for hybrid production, RFR algorithm was found to be the most effective and along with locality feature is going to be further evaluated as an alternative method for genotypic selection.
format Online
Article
Text
id pubmed-10582183
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-105821832023-10-19 Oil yield prediction for sunflower hybrid selection using different machine learning algorithms Cvejić, Sandra Hrnjaković, Olivera Jocković, Milan Kupusinac, Aleksandar Doroslovački, Ksenija Gvozdenac, Sonja Jocić, Siniša Miladinović, Dragana Sci Rep Article Due to the increased demand for sunflower production, its breeding assignment is the intensification of the development of highly productive oil seed hybrids to satisfy the edible oil industry. Sunflower Oil Yield Prediction (SOYP) can help breeders to identify desirable new hybrids with high oil yield and their characteristics using machine learning (ML) algorithms. In this study, we developed ML models to predict oil yield using two sets of features. Moreover, we evaluated the most relevant features for accurate SOYP. ML algorithms that were used and compared were Artificial Neural Network (ANN), Support Vector Regression, K-Nearest Neighbour, and Random Forest Regressor (RFR). The dataset consisted of samples for 1250 hybrids of which 70% were randomly selected and were used to train the model and 30% were used to test the model and assess its performance. Employing MAE, MSE, RMSE and R2 evaluation metrics, RFR consistently outperformed in all datasets, achieving a peak of 0.92 for R2 in 2019. In contrast, ANN recorded the lowest MAE, reaching 65 in 2018 The paper revealed that in addition to seed yield, the following characteristics of hybrids were important for SOYP: resistance to broomrape (Or) and downy mildew (Pl) and maturity. It was also disclosed that the locality feature could be used for the estimation of sunflower oil yield but it is highly dependable on weather conditions that affect the oil content and seed yield. Up to our knowledge, this is the first study in which ML was used for sunflower oil yield prediction. The obtained results indicate that ML has great potential for application in oil yield prediction, but also selection of parental lines for hybrid production, RFR algorithm was found to be the most effective and along with locality feature is going to be further evaluated as an alternative method for genotypic selection. Nature Publishing Group UK 2023-10-17 /pmc/articles/PMC10582183/ /pubmed/37848668 http://dx.doi.org/10.1038/s41598-023-44999-3 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Cvejić, Sandra
Hrnjaković, Olivera
Jocković, Milan
Kupusinac, Aleksandar
Doroslovački, Ksenija
Gvozdenac, Sonja
Jocić, Siniša
Miladinović, Dragana
Oil yield prediction for sunflower hybrid selection using different machine learning algorithms
title Oil yield prediction for sunflower hybrid selection using different machine learning algorithms
title_full Oil yield prediction for sunflower hybrid selection using different machine learning algorithms
title_fullStr Oil yield prediction for sunflower hybrid selection using different machine learning algorithms
title_full_unstemmed Oil yield prediction for sunflower hybrid selection using different machine learning algorithms
title_short Oil yield prediction for sunflower hybrid selection using different machine learning algorithms
title_sort oil yield prediction for sunflower hybrid selection using different machine learning algorithms
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10582183/
https://www.ncbi.nlm.nih.gov/pubmed/37848668
http://dx.doi.org/10.1038/s41598-023-44999-3
work_keys_str_mv AT cvejicsandra oilyieldpredictionforsunflowerhybridselectionusingdifferentmachinelearningalgorithms
AT hrnjakovicolivera oilyieldpredictionforsunflowerhybridselectionusingdifferentmachinelearningalgorithms
AT jockovicmilan oilyieldpredictionforsunflowerhybridselectionusingdifferentmachinelearningalgorithms
AT kupusinacaleksandar oilyieldpredictionforsunflowerhybridselectionusingdifferentmachinelearningalgorithms
AT doroslovackiksenija oilyieldpredictionforsunflowerhybridselectionusingdifferentmachinelearningalgorithms
AT gvozdenacsonja oilyieldpredictionforsunflowerhybridselectionusingdifferentmachinelearningalgorithms
AT jocicsinisa oilyieldpredictionforsunflowerhybridselectionusingdifferentmachinelearningalgorithms
AT miladinovicdragana oilyieldpredictionforsunflowerhybridselectionusingdifferentmachinelearningalgorithms