Cargando…
A Prediction Model Optimization Critiques through Centroid Clustering by Reducing the Sample Size, Integrating Statistical and Machine Learning Techniques for Wheat Productivity
Machine learning algorithms are rapidly deploying and have made manifold breakthroughs in various fields. The optimization of algorithms got abundant attention of researchers being a core component for deploying the machine learning model (MLM) abled to learn the parameters in significant ways for t...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8933067/ https://www.ncbi.nlm.nih.gov/pubmed/35310811 http://dx.doi.org/10.1155/2022/7271293 |
_version_ | 1784671562419404800 |
---|---|
author | Islam, Muhammad Shehzad, Farrukh |
author_facet | Islam, Muhammad Shehzad, Farrukh |
author_sort | Islam, Muhammad |
collection | PubMed |
description | Machine learning algorithms are rapidly deploying and have made manifold breakthroughs in various fields. The optimization of algorithms got abundant attention of researchers being a core component for deploying the machine learning model (MLM) abled to learn the parameters in significant ways for the given data. Modeling crop productivity through innumerable agronomical constraints has become a crucial task for evolving sustainable agricultural policies. The cross-sectional datasets of 26430 (D1) crop-cut experiments are taken by 2nd-stage area frame sampling, collected from crop reporting service. This research is taken as follows: firstly three more effective numerical optimized datasets are generated (D1, D2, and D3) from D1 by taking the centroid points of features which decrease the sample size; secondly MLM is integrated with the traditional statistical models (TSMs) for multiple linear regression (MLR), and thirdly decision tree regression (DTR) and random forest regression (RFR) are deployed to get the optimized models able to predict the wheat productivity well with 75% datasets to train and 25% to test the model using the evaluation metrics (R(2), RMSE), information criterion (AIC) with weights (AIC(W)), evidence ration (E.R), and decompositions of prediction error. The MLR outperformed for MLM than TSM. The performance capability of MLM and TSM got upswing for generated datasets. RFR got optimized and superperformed for D1, D2, D3, and D4. This study demonstrated strong evidences for deploying MLM for prediction of wheat productivity as an alternative of traditional statistical modeling. |
format | Online Article Text |
id | pubmed-8933067 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Hindawi |
record_format | MEDLINE/PubMed |
spelling | pubmed-89330672022-03-19 A Prediction Model Optimization Critiques through Centroid Clustering by Reducing the Sample Size, Integrating Statistical and Machine Learning Techniques for Wheat Productivity Islam, Muhammad Shehzad, Farrukh Scientifica (Cairo) Research Article Machine learning algorithms are rapidly deploying and have made manifold breakthroughs in various fields. The optimization of algorithms got abundant attention of researchers being a core component for deploying the machine learning model (MLM) abled to learn the parameters in significant ways for the given data. Modeling crop productivity through innumerable agronomical constraints has become a crucial task for evolving sustainable agricultural policies. The cross-sectional datasets of 26430 (D1) crop-cut experiments are taken by 2nd-stage area frame sampling, collected from crop reporting service. This research is taken as follows: firstly three more effective numerical optimized datasets are generated (D1, D2, and D3) from D1 by taking the centroid points of features which decrease the sample size; secondly MLM is integrated with the traditional statistical models (TSMs) for multiple linear regression (MLR), and thirdly decision tree regression (DTR) and random forest regression (RFR) are deployed to get the optimized models able to predict the wheat productivity well with 75% datasets to train and 25% to test the model using the evaluation metrics (R(2), RMSE), information criterion (AIC) with weights (AIC(W)), evidence ration (E.R), and decompositions of prediction error. The MLR outperformed for MLM than TSM. The performance capability of MLM and TSM got upswing for generated datasets. RFR got optimized and superperformed for D1, D2, D3, and D4. This study demonstrated strong evidences for deploying MLM for prediction of wheat productivity as an alternative of traditional statistical modeling. Hindawi 2022-03-11 /pmc/articles/PMC8933067/ /pubmed/35310811 http://dx.doi.org/10.1155/2022/7271293 Text en Copyright © 2022 Muhammad Islam and Farrukh Shehzad. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Islam, Muhammad Shehzad, Farrukh A Prediction Model Optimization Critiques through Centroid Clustering by Reducing the Sample Size, Integrating Statistical and Machine Learning Techniques for Wheat Productivity |
title | A Prediction Model Optimization Critiques through Centroid Clustering by Reducing the Sample Size, Integrating Statistical and Machine Learning Techniques for Wheat Productivity |
title_full | A Prediction Model Optimization Critiques through Centroid Clustering by Reducing the Sample Size, Integrating Statistical and Machine Learning Techniques for Wheat Productivity |
title_fullStr | A Prediction Model Optimization Critiques through Centroid Clustering by Reducing the Sample Size, Integrating Statistical and Machine Learning Techniques for Wheat Productivity |
title_full_unstemmed | A Prediction Model Optimization Critiques through Centroid Clustering by Reducing the Sample Size, Integrating Statistical and Machine Learning Techniques for Wheat Productivity |
title_short | A Prediction Model Optimization Critiques through Centroid Clustering by Reducing the Sample Size, Integrating Statistical and Machine Learning Techniques for Wheat Productivity |
title_sort | prediction model optimization critiques through centroid clustering by reducing the sample size, integrating statistical and machine learning techniques for wheat productivity |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8933067/ https://www.ncbi.nlm.nih.gov/pubmed/35310811 http://dx.doi.org/10.1155/2022/7271293 |
work_keys_str_mv | AT islammuhammad apredictionmodeloptimizationcritiquesthroughcentroidclusteringbyreducingthesamplesizeintegratingstatisticalandmachinelearningtechniquesforwheatproductivity AT shehzadfarrukh apredictionmodeloptimizationcritiquesthroughcentroidclusteringbyreducingthesamplesizeintegratingstatisticalandmachinelearningtechniquesforwheatproductivity AT islammuhammad predictionmodeloptimizationcritiquesthroughcentroidclusteringbyreducingthesamplesizeintegratingstatisticalandmachinelearningtechniquesforwheatproductivity AT shehzadfarrukh predictionmodeloptimizationcritiquesthroughcentroidclusteringbyreducingthesamplesizeintegratingstatisticalandmachinelearningtechniquesforwheatproductivity |