Cargando…
A forecasting method with efficient selection of variables in multivariate data sets
Regression is a kind of data analysis technique in which the relationship between the independent variable(x) and dependent variable(y) is modeled and for polynomial regression it is up to the nth degree polynomial. Polynomial regression fits a nonlinear relationship between the value of x and the c...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer Singapore
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7914390/ https://www.ncbi.nlm.nih.gov/pubmed/33681697 http://dx.doi.org/10.1007/s41870-021-00619-9 |
_version_ | 1783656991839748096 |
---|---|
author | Sagar, Pinki Gupta, Prinima Kashyap, Indu |
author_facet | Sagar, Pinki Gupta, Prinima Kashyap, Indu |
author_sort | Sagar, Pinki |
collection | PubMed |
description | Regression is a kind of data analysis technique in which the relationship between the independent variable(x) and dependent variable(y) is modeled and for polynomial regression it is up to the nth degree polynomial. Polynomial regression fits a nonlinear relationship between the value of x and the corresponding conditional mean of y, denoted by E (y|x). In this paper polynomial regression analysis has been improved through efficient selection of variables that is coefficient of determination. Coefficient of determination is a square of the correlation between new predicted y values and actual y values and its values are in the range from 0 to 1. The main purpose of regression analysis is to discover the relationship among the independent and dependent variables or in other words it is an explanation of variation in one variable with another variable. In this paper, the main focus is on Multivariate data sets that have many attributes and it is not necessary that all variables are required for data analysis purposes. Using coefficient of determination (COD) irrelevant attributes get eliminated during analysis. The main objective of research is to reduce the cost of data maintenance, reduce the execution time and improve the prediction accuracy rate. COD helps in selecting suitable independent variables. It is a notch that is used in statistical analysis that assesses how well a model explains and forecasts upcoming outcomes. This method also helps in eliminating the irrelevant variables which are not required for the prediction model by this maintenance cost and size of data sets can be reduced. |
format | Online Article Text |
id | pubmed-7914390 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Springer Singapore |
record_format | MEDLINE/PubMed |
spelling | pubmed-79143902021-03-01 A forecasting method with efficient selection of variables in multivariate data sets Sagar, Pinki Gupta, Prinima Kashyap, Indu Int J Inf Technol Original Research Regression is a kind of data analysis technique in which the relationship between the independent variable(x) and dependent variable(y) is modeled and for polynomial regression it is up to the nth degree polynomial. Polynomial regression fits a nonlinear relationship between the value of x and the corresponding conditional mean of y, denoted by E (y|x). In this paper polynomial regression analysis has been improved through efficient selection of variables that is coefficient of determination. Coefficient of determination is a square of the correlation between new predicted y values and actual y values and its values are in the range from 0 to 1. The main purpose of regression analysis is to discover the relationship among the independent and dependent variables or in other words it is an explanation of variation in one variable with another variable. In this paper, the main focus is on Multivariate data sets that have many attributes and it is not necessary that all variables are required for data analysis purposes. Using coefficient of determination (COD) irrelevant attributes get eliminated during analysis. The main objective of research is to reduce the cost of data maintenance, reduce the execution time and improve the prediction accuracy rate. COD helps in selecting suitable independent variables. It is a notch that is used in statistical analysis that assesses how well a model explains and forecasts upcoming outcomes. This method also helps in eliminating the irrelevant variables which are not required for the prediction model by this maintenance cost and size of data sets can be reduced. Springer Singapore 2021-02-28 2021 /pmc/articles/PMC7914390/ /pubmed/33681697 http://dx.doi.org/10.1007/s41870-021-00619-9 Text en © Bharati Vidyapeeth's Institute of Computer Applications and Management 2021 This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic. |
spellingShingle | Original Research Sagar, Pinki Gupta, Prinima Kashyap, Indu A forecasting method with efficient selection of variables in multivariate data sets |
title | A forecasting method with efficient selection of variables in multivariate data sets |
title_full | A forecasting method with efficient selection of variables in multivariate data sets |
title_fullStr | A forecasting method with efficient selection of variables in multivariate data sets |
title_full_unstemmed | A forecasting method with efficient selection of variables in multivariate data sets |
title_short | A forecasting method with efficient selection of variables in multivariate data sets |
title_sort | forecasting method with efficient selection of variables in multivariate data sets |
topic | Original Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7914390/ https://www.ncbi.nlm.nih.gov/pubmed/33681697 http://dx.doi.org/10.1007/s41870-021-00619-9 |
work_keys_str_mv | AT sagarpinki aforecastingmethodwithefficientselectionofvariablesinmultivariatedatasets AT guptaprinima aforecastingmethodwithefficientselectionofvariablesinmultivariatedatasets AT kashyapindu aforecastingmethodwithefficientselectionofvariablesinmultivariatedatasets AT sagarpinki forecastingmethodwithefficientselectionofvariablesinmultivariatedatasets AT guptaprinima forecastingmethodwithefficientselectionofvariablesinmultivariatedatasets AT kashyapindu forecastingmethodwithefficientselectionofvariablesinmultivariatedatasets |