Cargando…

A forecasting method with efficient selection of variables in multivariate data sets

Regression is a kind of data analysis technique in which the relationship between the independent variable(x) and dependent variable(y) is modeled and for polynomial regression it is up to the nth degree polynomial. Polynomial regression fits a nonlinear relationship between the value of x and the c...

Descripción completa

Detalles Bibliográficos
Autores principales: Sagar, Pinki, Gupta, Prinima, Kashyap, Indu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer Singapore 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7914390/
https://www.ncbi.nlm.nih.gov/pubmed/33681697
http://dx.doi.org/10.1007/s41870-021-00619-9
_version_ 1783656991839748096
author Sagar, Pinki
Gupta, Prinima
Kashyap, Indu
author_facet Sagar, Pinki
Gupta, Prinima
Kashyap, Indu
author_sort Sagar, Pinki
collection PubMed
description Regression is a kind of data analysis technique in which the relationship between the independent variable(x) and dependent variable(y) is modeled and for polynomial regression it is up to the nth degree polynomial. Polynomial regression fits a nonlinear relationship between the value of x and the corresponding conditional mean of y, denoted by E (y|x). In this paper polynomial regression analysis has been improved through efficient selection of variables that is coefficient of determination. Coefficient of determination is a square of the correlation between new predicted y values and actual y values and its values are in the range from 0 to 1. The main purpose of regression analysis is to discover the relationship among the independent and dependent variables or in other words it is an explanation of variation in one variable with another variable. In this paper, the main focus is on Multivariate data sets that have many attributes and it is not necessary that all variables are required for data analysis purposes. Using coefficient of determination (COD) irrelevant attributes get eliminated during analysis. The main objective of research is to reduce the cost of data maintenance, reduce the execution time and improve the prediction accuracy rate. COD helps in selecting suitable independent variables. It is a notch that is used in statistical analysis that assesses how well a model explains and forecasts upcoming outcomes. This method also helps in eliminating the irrelevant variables which are not required for the prediction model by this maintenance cost and size of data sets can be reduced.
format Online
Article
Text
id pubmed-7914390
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Springer Singapore
record_format MEDLINE/PubMed
spelling pubmed-79143902021-03-01 A forecasting method with efficient selection of variables in multivariate data sets Sagar, Pinki Gupta, Prinima Kashyap, Indu Int J Inf Technol Original Research Regression is a kind of data analysis technique in which the relationship between the independent variable(x) and dependent variable(y) is modeled and for polynomial regression it is up to the nth degree polynomial. Polynomial regression fits a nonlinear relationship between the value of x and the corresponding conditional mean of y, denoted by E (y|x). In this paper polynomial regression analysis has been improved through efficient selection of variables that is coefficient of determination. Coefficient of determination is a square of the correlation between new predicted y values and actual y values and its values are in the range from 0 to 1. The main purpose of regression analysis is to discover the relationship among the independent and dependent variables or in other words it is an explanation of variation in one variable with another variable. In this paper, the main focus is on Multivariate data sets that have many attributes and it is not necessary that all variables are required for data analysis purposes. Using coefficient of determination (COD) irrelevant attributes get eliminated during analysis. The main objective of research is to reduce the cost of data maintenance, reduce the execution time and improve the prediction accuracy rate. COD helps in selecting suitable independent variables. It is a notch that is used in statistical analysis that assesses how well a model explains and forecasts upcoming outcomes. This method also helps in eliminating the irrelevant variables which are not required for the prediction model by this maintenance cost and size of data sets can be reduced. Springer Singapore 2021-02-28 2021 /pmc/articles/PMC7914390/ /pubmed/33681697 http://dx.doi.org/10.1007/s41870-021-00619-9 Text en © Bharati Vidyapeeth's Institute of Computer Applications and Management 2021 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Original Research
Sagar, Pinki
Gupta, Prinima
Kashyap, Indu
A forecasting method with efficient selection of variables in multivariate data sets
title A forecasting method with efficient selection of variables in multivariate data sets
title_full A forecasting method with efficient selection of variables in multivariate data sets
title_fullStr A forecasting method with efficient selection of variables in multivariate data sets
title_full_unstemmed A forecasting method with efficient selection of variables in multivariate data sets
title_short A forecasting method with efficient selection of variables in multivariate data sets
title_sort forecasting method with efficient selection of variables in multivariate data sets
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7914390/
https://www.ncbi.nlm.nih.gov/pubmed/33681697
http://dx.doi.org/10.1007/s41870-021-00619-9
work_keys_str_mv AT sagarpinki aforecastingmethodwithefficientselectionofvariablesinmultivariatedatasets
AT guptaprinima aforecastingmethodwithefficientselectionofvariablesinmultivariatedatasets
AT kashyapindu aforecastingmethodwithefficientselectionofvariablesinmultivariatedatasets
AT sagarpinki forecastingmethodwithefficientselectionofvariablesinmultivariatedatasets
AT guptaprinima forecastingmethodwithefficientselectionofvariablesinmultivariatedatasets
AT kashyapindu forecastingmethodwithefficientselectionofvariablesinmultivariatedatasets