Cargando…

Attribute Selection Impact on Linear and Nonlinear Regression Models for Crop Yield Prediction

Efficient cropping requires yield estimation for each involved crop, where data-driven models are commonly applied. In recent years, some data-driven modeling technique comparisons have been made, looking for the best model to yield prediction. However, attributes are usually selected based on exper...

Descripción completa

Detalles Bibliográficos
Autores principales:	Gonzalez-Sanchez, Alberto, Frausto-Solis, Juan, Ojeda-Bustamante, Waldo
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Hindawi Publishing Corporation 2014
Materias:	Review Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4058283/ https://www.ncbi.nlm.nih.gov/pubmed/24977201 http://dx.doi.org/10.1155/2014/509429

_version_	1782321112462393344
author	Gonzalez-Sanchez, Alberto Frausto-Solis, Juan Ojeda-Bustamante, Waldo
author_facet	Gonzalez-Sanchez, Alberto Frausto-Solis, Juan Ojeda-Bustamante, Waldo
author_sort	Gonzalez-Sanchez, Alberto
collection	PubMed
description	Efficient cropping requires yield estimation for each involved crop, where data-driven models are commonly applied. In recent years, some data-driven modeling technique comparisons have been made, looking for the best model to yield prediction. However, attributes are usually selected based on expertise assessment or in dimensionality reduction algorithms. A fairer comparison should include the best subset of features for each regression technique; an evaluation including several crops is preferred. This paper evaluates the most common data-driven modeling techniques applied to yield prediction, using a complete method to define the best attribute subset for each model. Multiple linear regression, stepwise linear regression, M5′ regression trees, and artificial neural networks (ANN) were ranked. The models were built using real data of eight crops sowed in an irrigation module of Mexico. To validate the models, three accuracy metrics were used: the root relative square error (RRSE), relative mean absolute error (RMAE), and correlation factor (R). The results show that ANNs are more consistent in the best attribute subset composition between the learning and the training stages, obtaining the lowest average RRSE (86.04%), lowest average RMAE (8.75%), and the highest average correlation factor (0.63).
format	Online Article Text
id	pubmed-4058283
institution	National Center for Biotechnology Information
language	English
publishDate	2014
publisher	Hindawi Publishing Corporation
record_format	MEDLINE/PubMed
spelling	pubmed-40582832014-06-29 Attribute Selection Impact on Linear and Nonlinear Regression Models for Crop Yield Prediction Gonzalez-Sanchez, Alberto Frausto-Solis, Juan Ojeda-Bustamante, Waldo ScientificWorldJournal Review Article Efficient cropping requires yield estimation for each involved crop, where data-driven models are commonly applied. In recent years, some data-driven modeling technique comparisons have been made, looking for the best model to yield prediction. However, attributes are usually selected based on expertise assessment or in dimensionality reduction algorithms. A fairer comparison should include the best subset of features for each regression technique; an evaluation including several crops is preferred. This paper evaluates the most common data-driven modeling techniques applied to yield prediction, using a complete method to define the best attribute subset for each model. Multiple linear regression, stepwise linear regression, M5′ regression trees, and artificial neural networks (ANN) were ranked. The models were built using real data of eight crops sowed in an irrigation module of Mexico. To validate the models, three accuracy metrics were used: the root relative square error (RRSE), relative mean absolute error (RMAE), and correlation factor (R). The results show that ANNs are more consistent in the best attribute subset composition between the learning and the training stages, obtaining the lowest average RRSE (86.04%), lowest average RMAE (8.75%), and the highest average correlation factor (0.63). Hindawi Publishing Corporation 2014 2014-05-26 /pmc/articles/PMC4058283/ /pubmed/24977201 http://dx.doi.org/10.1155/2014/509429 Text en Copyright © 2014 Alberto Gonzalez-Sanchez et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle	Review Article Gonzalez-Sanchez, Alberto Frausto-Solis, Juan Ojeda-Bustamante, Waldo Attribute Selection Impact on Linear and Nonlinear Regression Models for Crop Yield Prediction
title	Attribute Selection Impact on Linear and Nonlinear Regression Models for Crop Yield Prediction
title_full	Attribute Selection Impact on Linear and Nonlinear Regression Models for Crop Yield Prediction
title_fullStr	Attribute Selection Impact on Linear and Nonlinear Regression Models for Crop Yield Prediction
title_full_unstemmed	Attribute Selection Impact on Linear and Nonlinear Regression Models for Crop Yield Prediction
title_short	Attribute Selection Impact on Linear and Nonlinear Regression Models for Crop Yield Prediction
title_sort	attribute selection impact on linear and nonlinear regression models for crop yield prediction
topic	Review Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4058283/ https://www.ncbi.nlm.nih.gov/pubmed/24977201 http://dx.doi.org/10.1155/2014/509429
work_keys_str_mv	AT gonzalezsanchezalberto attributeselectionimpactonlinearandnonlinearregressionmodelsforcropyieldprediction AT fraustosolisjuan attributeselectionimpactonlinearandnonlinearregressionmodelsforcropyieldprediction AT ojedabustamantewaldo attributeselectionimpactonlinearandnonlinearregressionmodelsforcropyieldprediction

Attribute Selection Impact on Linear and Nonlinear Regression Models for Crop Yield Prediction

Ejemplares similares