Cargando…

A Prediction Model Optimization Critiques through Centroid Clustering by Reducing the Sample Size, Integrating Statistical and Machine Learning Techniques for Wheat Productivity

Machine learning algorithms are rapidly deploying and have made manifold breakthroughs in various fields. The optimization of algorithms got abundant attention of researchers being a core component for deploying the machine learning model (MLM) abled to learn the parameters in significant ways for t...

Descripción completa

Detalles Bibliográficos
Autores principales: Islam, Muhammad, Shehzad, Farrukh
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8933067/
https://www.ncbi.nlm.nih.gov/pubmed/35310811
http://dx.doi.org/10.1155/2022/7271293
_version_ 1784671562419404800
author Islam, Muhammad
Shehzad, Farrukh
author_facet Islam, Muhammad
Shehzad, Farrukh
author_sort Islam, Muhammad
collection PubMed
description Machine learning algorithms are rapidly deploying and have made manifold breakthroughs in various fields. The optimization of algorithms got abundant attention of researchers being a core component for deploying the machine learning model (MLM) abled to learn the parameters in significant ways for the given data. Modeling crop productivity through innumerable agronomical constraints has become a crucial task for evolving sustainable agricultural policies. The cross-sectional datasets of 26430 (D1) crop-cut experiments are taken by 2nd-stage area frame sampling, collected from crop reporting service. This research is taken as follows: firstly three more effective numerical optimized datasets are generated (D1, D2, and D3) from D1 by taking the centroid points of features which decrease the sample size; secondly MLM is integrated with the traditional statistical models (TSMs) for multiple linear regression (MLR), and thirdly decision tree regression (DTR) and random forest regression (RFR) are deployed to get the optimized models able to predict the wheat productivity well with 75% datasets to train and 25% to test the model using the evaluation metrics (R(2), RMSE), information criterion (AIC) with weights (AIC(W)), evidence ration (E.R), and decompositions of prediction error. The MLR outperformed for MLM than TSM. The performance capability of MLM and TSM got upswing for generated datasets. RFR got optimized and superperformed for D1, D2, D3, and D4. This study demonstrated strong evidences for deploying MLM for prediction of wheat productivity as an alternative of traditional statistical modeling.
format Online
Article
Text
id pubmed-8933067
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-89330672022-03-19 A Prediction Model Optimization Critiques through Centroid Clustering by Reducing the Sample Size, Integrating Statistical and Machine Learning Techniques for Wheat Productivity Islam, Muhammad Shehzad, Farrukh Scientifica (Cairo) Research Article Machine learning algorithms are rapidly deploying and have made manifold breakthroughs in various fields. The optimization of algorithms got abundant attention of researchers being a core component for deploying the machine learning model (MLM) abled to learn the parameters in significant ways for the given data. Modeling crop productivity through innumerable agronomical constraints has become a crucial task for evolving sustainable agricultural policies. The cross-sectional datasets of 26430 (D1) crop-cut experiments are taken by 2nd-stage area frame sampling, collected from crop reporting service. This research is taken as follows: firstly three more effective numerical optimized datasets are generated (D1, D2, and D3) from D1 by taking the centroid points of features which decrease the sample size; secondly MLM is integrated with the traditional statistical models (TSMs) for multiple linear regression (MLR), and thirdly decision tree regression (DTR) and random forest regression (RFR) are deployed to get the optimized models able to predict the wheat productivity well with 75% datasets to train and 25% to test the model using the evaluation metrics (R(2), RMSE), information criterion (AIC) with weights (AIC(W)), evidence ration (E.R), and decompositions of prediction error. The MLR outperformed for MLM than TSM. The performance capability of MLM and TSM got upswing for generated datasets. RFR got optimized and superperformed for D1, D2, D3, and D4. This study demonstrated strong evidences for deploying MLM for prediction of wheat productivity as an alternative of traditional statistical modeling. Hindawi 2022-03-11 /pmc/articles/PMC8933067/ /pubmed/35310811 http://dx.doi.org/10.1155/2022/7271293 Text en Copyright © 2022 Muhammad Islam and Farrukh Shehzad. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Islam, Muhammad
Shehzad, Farrukh
A Prediction Model Optimization Critiques through Centroid Clustering by Reducing the Sample Size, Integrating Statistical and Machine Learning Techniques for Wheat Productivity
title A Prediction Model Optimization Critiques through Centroid Clustering by Reducing the Sample Size, Integrating Statistical and Machine Learning Techniques for Wheat Productivity
title_full A Prediction Model Optimization Critiques through Centroid Clustering by Reducing the Sample Size, Integrating Statistical and Machine Learning Techniques for Wheat Productivity
title_fullStr A Prediction Model Optimization Critiques through Centroid Clustering by Reducing the Sample Size, Integrating Statistical and Machine Learning Techniques for Wheat Productivity
title_full_unstemmed A Prediction Model Optimization Critiques through Centroid Clustering by Reducing the Sample Size, Integrating Statistical and Machine Learning Techniques for Wheat Productivity
title_short A Prediction Model Optimization Critiques through Centroid Clustering by Reducing the Sample Size, Integrating Statistical and Machine Learning Techniques for Wheat Productivity
title_sort prediction model optimization critiques through centroid clustering by reducing the sample size, integrating statistical and machine learning techniques for wheat productivity
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8933067/
https://www.ncbi.nlm.nih.gov/pubmed/35310811
http://dx.doi.org/10.1155/2022/7271293
work_keys_str_mv AT islammuhammad apredictionmodeloptimizationcritiquesthroughcentroidclusteringbyreducingthesamplesizeintegratingstatisticalandmachinelearningtechniquesforwheatproductivity
AT shehzadfarrukh apredictionmodeloptimizationcritiquesthroughcentroidclusteringbyreducingthesamplesizeintegratingstatisticalandmachinelearningtechniquesforwheatproductivity
AT islammuhammad predictionmodeloptimizationcritiquesthroughcentroidclusteringbyreducingthesamplesizeintegratingstatisticalandmachinelearningtechniquesforwheatproductivity
AT shehzadfarrukh predictionmodeloptimizationcritiquesthroughcentroidclusteringbyreducingthesamplesizeintegratingstatisticalandmachinelearningtechniquesforwheatproductivity