Cargando…

A methodology for the design of experiments in computational intelligence with multiple regression models

The design of experiments and the validation of the results achieved with them are vital in any research study. This paper focuses on the use of different Machine Learning approaches for regression tasks in the field of Computational Intelligence and especially on a correct comparison between the di...

Descripción completa

Detalles Bibliográficos
Autores principales:	Fernandez-Lozano, Carlos, Gestal, Marcos, Munteanu, Cristian R., Dorado, Julian, Pazos, Alejandro
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	PeerJ Inc. 2016
Materias:	Bioinformatics
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5136129/ https://www.ncbi.nlm.nih.gov/pubmed/27920952 http://dx.doi.org/10.7717/peerj.2721

_version_	1782471674835238912
author	Fernandez-Lozano, Carlos Gestal, Marcos Munteanu, Cristian R. Dorado, Julian Pazos, Alejandro
author_facet	Fernandez-Lozano, Carlos Gestal, Marcos Munteanu, Cristian R. Dorado, Julian Pazos, Alejandro
author_sort	Fernandez-Lozano, Carlos
collection	PubMed
description	The design of experiments and the validation of the results achieved with them are vital in any research study. This paper focuses on the use of different Machine Learning approaches for regression tasks in the field of Computational Intelligence and especially on a correct comparison between the different results provided for different methods, as those techniques are complex systems that require further study to be fully understood. A methodology commonly accepted in Computational intelligence is implemented in an R package called RRegrs. This package includes ten simple and complex regression models to carry out predictive modeling using Machine Learning and well-known regression algorithms. The framework for experimental design presented herein is evaluated and validated against RRegrs. Our results are different for three out of five state-of-the-art simple datasets and it can be stated that the selection of the best model according to our proposal is statistically significant and relevant. It is of relevance to use a statistical approach to indicate whether the differences are statistically significant using this kind of algorithms. Furthermore, our results with three real complex datasets report different best models than with the previously published methodology. Our final goal is to provide a complete methodology for the use of different steps in order to compare the results obtained in Computational Intelligence problems, as well as from other fields, such as for bioinformatics, cheminformatics, etc., given that our proposal is open and modifiable.
format	Online Article Text
id	pubmed-5136129
institution	National Center for Biotechnology Information
language	English
publishDate	2016
publisher	PeerJ Inc.
record_format	MEDLINE/PubMed
spelling	pubmed-51361292016-12-05 A methodology for the design of experiments in computational intelligence with multiple regression models Fernandez-Lozano, Carlos Gestal, Marcos Munteanu, Cristian R. Dorado, Julian Pazos, Alejandro PeerJ Bioinformatics The design of experiments and the validation of the results achieved with them are vital in any research study. This paper focuses on the use of different Machine Learning approaches for regression tasks in the field of Computational Intelligence and especially on a correct comparison between the different results provided for different methods, as those techniques are complex systems that require further study to be fully understood. A methodology commonly accepted in Computational intelligence is implemented in an R package called RRegrs. This package includes ten simple and complex regression models to carry out predictive modeling using Machine Learning and well-known regression algorithms. The framework for experimental design presented herein is evaluated and validated against RRegrs. Our results are different for three out of five state-of-the-art simple datasets and it can be stated that the selection of the best model according to our proposal is statistically significant and relevant. It is of relevance to use a statistical approach to indicate whether the differences are statistically significant using this kind of algorithms. Furthermore, our results with three real complex datasets report different best models than with the previously published methodology. Our final goal is to provide a complete methodology for the use of different steps in order to compare the results obtained in Computational Intelligence problems, as well as from other fields, such as for bioinformatics, cheminformatics, etc., given that our proposal is open and modifiable. PeerJ Inc. 2016-12-01 /pmc/articles/PMC5136129/ /pubmed/27920952 http://dx.doi.org/10.7717/peerj.2721 Text en ©2016 Fernandez-Lozano et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.
spellingShingle	Bioinformatics Fernandez-Lozano, Carlos Gestal, Marcos Munteanu, Cristian R. Dorado, Julian Pazos, Alejandro A methodology for the design of experiments in computational intelligence with multiple regression models
title	A methodology for the design of experiments in computational intelligence with multiple regression models
title_full	A methodology for the design of experiments in computational intelligence with multiple regression models
title_fullStr	A methodology for the design of experiments in computational intelligence with multiple regression models
title_full_unstemmed	A methodology for the design of experiments in computational intelligence with multiple regression models
title_short	A methodology for the design of experiments in computational intelligence with multiple regression models
title_sort	methodology for the design of experiments in computational intelligence with multiple regression models
topic	Bioinformatics
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5136129/ https://www.ncbi.nlm.nih.gov/pubmed/27920952 http://dx.doi.org/10.7717/peerj.2721
work_keys_str_mv	AT fernandezlozanocarlos amethodologyforthedesignofexperimentsincomputationalintelligencewithmultipleregressionmodels AT gestalmarcos amethodologyforthedesignofexperimentsincomputationalintelligencewithmultipleregressionmodels AT munteanucristianr amethodologyforthedesignofexperimentsincomputationalintelligencewithmultipleregressionmodels AT doradojulian amethodologyforthedesignofexperimentsincomputationalintelligencewithmultipleregressionmodels AT pazosalejandro amethodologyforthedesignofexperimentsincomputationalintelligencewithmultipleregressionmodels AT fernandezlozanocarlos methodologyforthedesignofexperimentsincomputationalintelligencewithmultipleregressionmodels AT gestalmarcos methodologyforthedesignofexperimentsincomputationalintelligencewithmultipleregressionmodels AT munteanucristianr methodologyforthedesignofexperimentsincomputationalintelligencewithmultipleregressionmodels AT doradojulian methodologyforthedesignofexperimentsincomputationalintelligencewithmultipleregressionmodels AT pazosalejandro methodologyforthedesignofexperimentsincomputationalintelligencewithmultipleregressionmodels

A methodology for the design of experiments in computational intelligence with multiple regression models

Ejemplares similares